WorldWideScience

Sample records for coexpressed disease gene

  1. Multiscale Embedded Gene Co-expression Network Analysis.

    Directory of Open Access Journals (Sweden)

    Won-Min Song

    2015-11-01

    Full Text Available Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3, the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA by: i introducing quality control of co-expression similarities, ii parallelizing embedded network construction, and iii developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs. We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA. MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  2. Multiscale Embedded Gene Co-expression Network Analysis.

    Science.gov (United States)

    Song, Won-Min; Zhang, Bin

    2015-11-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  3. Gene co-expression networks shed light into diseases of brain iron accumulation.

    Science.gov (United States)

    Bettencourt, Conceição; Forabosco, Paola; Wiethoff, Sarah; Heidari, Moones; Johnstone, Daniel M; Botía, Juan A; Collingwood, Joanna F; Hardy, John; Milward, Elizabeth A; Ryten, Mina; Houlden, Henry

    2016-03-01

    Aberrant brain iron deposition is observed in both common and rare neurodegenerative disorders, including those categorized as Neurodegeneration with Brain Iron Accumulation (NBIA), which are characterized by focal iron accumulation in the basal ganglia. Two NBIA genes are directly involved in iron metabolism, but whether other NBIA-related genes also regulate iron homeostasis in the human brain, and whether aberrant iron deposition contributes to neurodegenerative processes remains largely unknown. This study aims to expand our understanding of these iron overload diseases and identify relationships between known NBIA genes and their main interacting partners by using a systems biology approach. We used whole-transcriptome gene expression data from human brain samples originating from 101 neuropathologically normal individuals (10 brain regions) to generate weighted gene co-expression networks and cluster the 10 known NBIA genes in an unsupervised manner. We investigated NBIA-enriched networks for relevant cell types and pathways, and whether they are disrupted by iron loading in NBIA diseased tissue and in an in vivo mouse model. We identified two basal ganglia gene co-expression modules significantly enriched for NBIA genes, which resemble neuronal and oligodendrocytic signatures. These NBIA gene networks are enriched for iron-related genes, and implicate synapse and lipid metabolism related pathways. Our data also indicates that these networks are disrupted by excessive brain iron loading. We identified multiple cell types in the origin of NBIA disorders. We also found unforeseen links between NBIA networks and iron-related processes, and demonstrate convergent pathways connecting NBIAs and phenotypically overlapping diseases. Our results are of further relevance for these diseases by providing candidates for new causative genes and possible points for therapeutic intervention. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  4. Dissection of regulatory networks that are altered in disease via differential co-expression.

    Directory of Open Access Journals (Sweden)

    David Amar

    Full Text Available Comparing the gene-expression profiles of sick and healthy individuals can help in understanding disease. Such differential expression analysis is a well-established way to find gene sets whose expression is altered in the disease. Recent approaches to gene-expression analysis go a step further and seek differential co-expression patterns, wherein the level of co-expression of a set of genes differs markedly between disease and control samples. Such patterns can arise from a disease-related change in the regulatory mechanism governing that set of genes, and pinpoint dysfunctional regulatory networks. Here we present DICER, a new method for detecting differentially co-expressed gene sets using a novel probabilistic score for differential correlation. DICER goes beyond standard differential co-expression and detects pairs of modules showing differential co-expression. The expression profiles of genes within each module of the pair are correlated across all samples. The correlation between the two modules, however, differs markedly between the disease and normal samples. We show that DICER outperforms the state of the art in terms of significance and interpretability of the detected gene sets. Moreover, the gene sets discovered by DICER manifest regulation by disease-specific microRNA families. In a case study on Alzheimer's disease, DICER dissected biological processes and protein complexes into functional subunits that are differentially co-expressed, thereby revealing inner structures in disease regulatory networks.

  5. Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Jing; Ma, Zihao; Carr, Steven A.; Mertins, Philipp; Zhang, Hui; Zhang, Zhen; Chan, Daniel W.; Ellis, Matthew J. C.; Townsend, R. Reid; Smith, Richard D.; McDermott, Jason E.; Chen, Xian; Paulovich, Amanda G.; Boja, Emily S.; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Rodland, Karin D.; Liebler, Daniel C.; Zhang, Bing

    2016-11-11

    Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies

  6. Link-based quantitative methods to identify differentially coexpressed genes and gene Pairs

    Directory of Open Access Journals (Sweden)

    Ye Zhi-Qiang

    2011-08-01

    Full Text Available Abstract Background Differential coexpression analysis (DCEA is increasingly used for investigating the global transcriptional mechanisms underlying phenotypic changes. Current DCEA methods mostly adopt a gene connectivity-based strategy to estimate differential coexpression, which is characterized by comparing the numbers of gene neighbors in different coexpression networks. Although it simplifies the calculation, this strategy mixes up the identities of different coexpression neighbors of a gene, and fails to differentiate significant differential coexpression changes from those trivial ones. Especially, the correlation-reversal is easily missed although it probably indicates remarkable biological significance. Results We developed two link-based quantitative methods, DCp and DCe, to identify differentially coexpressed genes and gene pairs (links. Bearing the uniqueness of exploiting the quantitative coexpression change of each gene pair in the coexpression networks, both methods proved to be superior to currently popular methods in simulation studies. Re-mining of a publicly available type 2 diabetes (T2D expression dataset from the perspective of differential coexpression analysis led to additional discoveries than those from differential expression analysis. Conclusions This work pointed out the critical weakness of current popular DCEA methods, and proposed two link-based DCEA algorithms that will make contribution to the development of DCEA and help extend it to a broader spectrum.

  7. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  8. Characterization of chemically induced liver injuries using gene co-expression modules.

    Directory of Open Access Journals (Sweden)

    Gregory J Tawa

    Full Text Available Liver injuries due to ingestion or exposure to chemicals and industrial toxicants pose a serious health risk that may be hard to assess due to a lack of non-invasive diagnostic tests. Mapping chemical injuries to organ-specific damage and clinical outcomes via biomarkers or biomarker panels will provide the foundation for highly specific and robust diagnostic tests. Here, we have used DrugMatrix, a toxicogenomics database containing organ-specific gene expression data matched to dose-dependent chemical exposures and adverse clinical pathology assessments in Sprague Dawley rats, to identify groups of co-expressed genes (modules specific to injury endpoints in the liver. We identified 78 such gene co-expression modules associated with 25 diverse injury endpoints categorized from clinical pathology, organ weight changes, and histopathology. Using gene expression data associated with an injury condition, we showed that these modules exhibited different patterns of activation characteristic of each injury. We further showed that specific module genes mapped to 1 known biochemical pathways associated with liver injuries and 2 clinically used diagnostic tests for liver fibrosis. As such, the gene modules have characteristics of both generalized and specific toxic response pathways. Using these results, we proposed three gene signature sets characteristic of liver fibrosis, steatosis, and general liver injury based on genes from the co-expression modules. Out of all 92 identified genes, 18 (20% genes have well-documented relationships with liver disease, whereas the rest are novel and have not previously been associated with liver disease. In conclusion, identifying gene co-expression modules associated with chemically induced liver injuries aids in generating testable hypotheses and has the potential to identify putative biomarkers of adverse health effects.

  9. Gene coexpression network analysis as a source of functional annotation for rice genes.

    Directory of Open Access Journals (Sweden)

    Kevin L Childs

    Full Text Available With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found that gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional

  10. A comprehensive analysis on preservation patterns of gene co-expression networks during Alzheimer's disease progression.

    Science.gov (United States)

    Ray, Sumanta; Hossain, Sk Md Mosaddek; Khatun, Lutfunnesa; Mukhopadhyay, Anirban

    2017-12-20

    Alzheimer's disease (AD) is a chronic neuro-degenerative disruption of the brain which involves in large scale transcriptomic variation. The disease does not impact every regions of the brain at the same time, instead it progresses slowly involving somewhat sequential interaction with different regions. Analysis of the expression patterns of the genes in different regions of the brain influenced in AD surely contribute for a enhanced comprehension of AD pathogenesis and shed light on the early characterization of the disease. Here, we have proposed a framework to identify perturbation and preservation characteristics of gene expression patterns across six distinct regions of the brain ("EC", "HIP", "PC", "MTG", "SFG", and "VCX") affected in AD. Co-expression modules were discovered considering a couple of regions at once. These are then analyzed to know the preservation and perturbation characteristics. Different module preservation statistics and a rank aggregation mechanism have been adopted to detect the changes of expression patterns across brain regions. Gene ontology (GO) and pathway based analysis were also carried out to know the biological meaning of preserved and perturbed modules. In this article, we have extensively studied the preservation patterns of co-expressed modules in six distinct brain regions affected in AD. Some modules are emerged as the most preserved while some others are detected as perturbed between a pair of brain regions. Further investigation on the topological properties of preserved and non-preserved modules reveals a substantial association amongst "betweenness centrality" and "degree" of the involved genes. Our findings may render a deeper realization of the preservation characteristics of gene expression patterns in discrete brain regions affected by AD.

  11. GeneCAT--novel webtools that combine BLAST and co-expression analyses

    DEFF Research Database (Denmark)

    Mutwil, Marek; Obro, Jens; Willats, William G T

    2008-01-01

    The gene co-expression analysis toolbox (GeneCAT) introduces several novel microarray data analyzing tools. First, the multigene co-expression analysis, combined with co-expressed gene networks, provides a more powerful data mining technique than standard, single-gene co-expression analysis. Second...... orthologs in the plant model organisms Arabidopsis thaliana and Hordeum vulgare (Barley). GeneCAT is equipped with expression data for the model plant A. thaliana, and first to introduce co-expression mining tools for the monocot Barley. GeneCAT is available at http://genecat.mpg.de....

  12. Co-Expression of Neighboring Genes in the Zebrafish (Danio rerio Genome

    Directory of Open Access Journals (Sweden)

    Daryi Wang

    2009-08-01

    Full Text Available Neighboring genes in the eukaryotic genome have a tendency to express concurrently, and the proximity of two adjacent genes is often considered a possible explanation for their co-expression behavior. However, the actual contribution of the physical distance between two genes to their co-expression behavior has yet to be defined. To further investigate this issue, we studied the co-expression of neighboring genes in zebrafish, which has a compact genome and has experienced a whole genome duplication event. Our analysis shows that the proportion of highly co-expressed neighboring pairs (Pearson’s correlation coefficient R>0.7 is low (0.24% ~ 0.67%; however, it is still significantly higher than that of random pairs. In particular, the statistical result implies that the co-expression tendency of neighboring pairs is negatively correlated with their physical distance. Our findings therefore suggest that physical distance may play an important role in the co-expression of neighboring genes. Possible mechanisms related to the neighboring genes’ co-expression are also discussed.

  13. Weighted gene co-expression network analysis of the peripheral blood from Amyotrophic Lateral Sclerosis patients

    Directory of Open Access Journals (Sweden)

    DeYoung Joseph

    2009-08-01

    Full Text Available Abstract Background Amyotrophic Lateral Sclerosis (ALS is a lethal disorder characterized by progressive degeneration of motor neurons in the brain and spinal cord. Diagnosis is mainly based on clinical symptoms, and there is currently no therapy to stop the disease or slow its progression. Since access to spinal cord tissue is not possible at disease onset, we investigated changes in gene expression profiles in whole blood of ALS patients. Results Our transcriptional study showed dramatic changes in blood of ALS patients; 2,300 probes (9.4% showed significant differential expression in a discovery dataset consisting of 30 ALS patients and 30 healthy controls. Weighted gene co-expression network analysis (WGCNA was used to find disease-related networks (modules and disease related hub genes. Two large co-expression modules were found to be associated with ALS. Our findings were replicated in a second (30 patients and 30 controls and third dataset (63 patients and 63 controls, thereby demonstrating a highly significant and consistent association of two large co-expression modules with ALS disease status. Ingenuity Pathway Analysis of the ALS related module genes implicates enrichment of functional categories related to genetic disorders, neurodegeneration of the nervous system and inflammatory disease. The ALS related modules contain a number of candidate genes possibly involved in pathogenesis of ALS. Conclusion This first large-scale blood gene expression study in ALS observed distinct patterns between cases and controls which may provide opportunities for biomarker development as well as new insights into the molecular mechanisms of the disease.

  14. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    Science.gov (United States)

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  15. Using gene co-expression network analysis to predict biomarkers for chronic lymphocytic leukemia

    Directory of Open Access Journals (Sweden)

    Borlawsky Tara B

    2010-10-01

    Full Text Available Abstract Background Chronic lymphocytic leukemia (CLL is the most common adult leukemia. It is a highly heterogeneous disease, and can be divided roughly into indolent and progressive stages based on classic clinical markers. Immunoglobin heavy chain variable region (IgVH mutational status was found to be associated with patient survival outcome, and biomarkers linked to the IgVH status has been a focus in the CLL prognosis research field. However, biomarkers highly correlated with IgVH mutational status which can accurately predict the survival outcome are yet to be discovered. Results In this paper, we investigate the use of gene co-expression network analysis to identify potential biomarkers for CLL. Specifically we focused on the co-expression network involving ZAP70, a well characterized biomarker for CLL. We selected 23 microarray datasets corresponding to multiple types of cancer from the Gene Expression Omnibus (GEO and used the frequent network mining algorithm CODENSE to identify highly connected gene co-expression networks spanning the entire genome, then evaluated the genes in the co-expression network in which ZAP70 is involved. We then applied a set of feature selection methods to further select genes which are capable of predicting IgVH mutation status from the ZAP70 co-expression network. Conclusions We have identified a set of genes that are potential CLL prognostic biomarkers IL2RB, CD8A, CD247, LAG3 and KLRK1, which can predict CLL patient IgVH mutational status with high accuracies. Their prognostic capabilities were cross-validated by applying these biomarker candidates to classify patients into different outcome groups using a CLL microarray datasets with clinical information.

  16. Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways.

    Science.gov (United States)

    Obayashi, Takeshi; Kinoshita, Kengo

    2010-05-01

    Gene coexpression analyses are a powerful method to predict the function of genes and/or to identify genes that are functionally related to query genes. The basic idea of gene coexpression analyses is that genes with similar functions should have similar expression patterns under many different conditions. This approach is now widely used by many experimental researchers, especially in the field of plant biology. In this review, we will summarize recent successful examples obtained by using our gene coexpression database, ATTED-II. Specifically, the examples will describe the identification of new genes, such as the subunits of a complex protein, the enzymes in a metabolic pathway and transporters. In addition, we will discuss the discovery of a new intercellular signaling factor and new regulatory relationships between transcription factors and their target genes. In ATTED-II, we provide two basic views of gene coexpression, a gene list view and a gene network view, which can be used as guide gene approach and narrow-down approach, respectively. In addition, we will discuss the coexpression effectiveness for various types of gene sets.

  17. Characterization of differentially expressed genes using high-dimensional co-expression networks

    DEFF Research Database (Denmark)

    Coelho Goncalves de Abreu, Gabriel; Labouriau, Rodrigo S.

    2010-01-01

    We present a technique to characterize differentially expressed genes in terms of their position in a high-dimensional co-expression network. The set-up of Gaussian graphical models is used to construct representations of the co-expression network in such a way that redundancy and the propagation...... that allow to make effective inference in problems with high degree of complexity (e.g. several thousands of genes) and small number of observations (e.g. 10-100) as typically occurs in high throughput gene expression studies. Taking advantage of the internal structure of decomposable graphical models, we...... construct a compact representation of the co-expression network that allows to identify the regions with high concentration of differentially expressed genes. It is argued that differentially expressed genes located in highly interconnected regions of the co-expression network are less informative than...

  18. Large clusters of co-expressed genes in the Drosophila genome.

    Science.gov (United States)

    Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I

    2002-12-12

    Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.

  19. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

    Science.gov (United States)

    Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J

    2016-11-04

    Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types

  20. Identification of Transcriptional Modules and Key Genes in Chickens Infected with Salmonella enterica Serovar Pullorum Using Integrated Coexpression Analyses

    Directory of Open Access Journals (Sweden)

    Bao-Hong Liu

    2017-01-01

    Full Text Available Salmonella enterica Pullorum is one of the leading causes of mortality in poultry. Understanding the molecular response in chickens in response to the infection by S. enterica is important in revealing the mechanisms of pathogenesis and disease progress. There have been studies on identifying genes associated with Salmonella infection by differential expression analysis, but the relationships among regulated genes have not been investigated. In this study, we employed weighted gene coexpression network analysis (WGCNA and differential coexpression analysis (DCEA to identify coexpression modules by exploring microarray data derived from chicken splenic tissues in response to the S. enterica infection. A total of 19 modules from 13,538 genes were associated with the Jak-STAT signaling pathway, the extracellular matrix, cytoskeleton organization, the regulation of the actin cytoskeleton, G-protein coupled receptor activity, Toll-like receptor signaling pathways, and immune system processes; among them, 14 differentially coexpressed modules (DCMs and 2,856 differentially coexpressed genes (DCGs were identified. The global expression of module genes between infected and uninfected chickens showed slight differences but considerable changes for global coexpression. Furthermore, DCGs were consistently linked to the hubs of the modules. These results will help prioritize candidate genes for future studies of Salmonella infection.

  1. Gene coexpression measures in large heterogeneous samples using count statistics.

    Science.gov (United States)

    Wang, Y X Rachel; Waterman, Michael S; Huang, Haiyan

    2014-11-18

    With the advent of high-throughput technologies making large-scale gene expression data readily available, developing appropriate computational tools to process these data and distill insights into systems biology has been an important part of the "big data" challenge. Gene coexpression is one of the earliest techniques developed that is still widely in use for functional annotation, pathway analysis, and, most importantly, the reconstruction of gene regulatory networks, based on gene expression data. However, most coexpression measures do not specifically account for local features in expression profiles. For example, it is very likely that the patterns of gene association may change or only exist in a subset of the samples, especially when the samples are pooled from a range of experiments. We propose two new gene coexpression statistics based on counting local patterns of gene expression ranks to take into account the potentially diverse nature of gene interactions. In particular, one of our statistics is designed for time-course data with local dependence structures, such as time series coupled over a subregion of the time domain. We provide asymptotic analysis of their distributions and power, and evaluate their performance against a wide range of existing coexpression measures on simulated and real data. Our new statistics are fast to compute, robust against outliers, and show comparable and often better general performance.

  2. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    Science.gov (United States)

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and

  3. Novel Approach for Coexpression Analysis of E2F1–3 and MYC Target Genes in Chronic Myelogenous Leukemia

    Directory of Open Access Journals (Sweden)

    Fengfeng Wang

    2014-01-01

    Full Text Available Background. Chronic myelogenous leukemia (CML is characterized by tremendous amount of immature myeloid cells in the blood circulation. E2F1–3 and MYC are important transcription factors that form positive feedback loops by reciprocal regulation in their own transcription processes. Since genes regulated by E2F1–3 or MYC are related to cell proliferation and apoptosis, we wonder if there exists difference in the coexpression patterns of genes regulated concurrently by E2F1–3 and MYC between the normal and the CML states. Results. We proposed a method to explore the difference in the coexpression patterns of those candidate target genes between the normal and the CML groups. A disease-specific cutoff point for coexpression levels that classified the coexpressed gene pairs into strong and weak coexpression classes was identified. Our developed method effectively identified the coexpression pattern differences from the overall structure. Moreover, we found that genes related to the cell adhesion and angiogenesis properties were more likely to be coexpressed in the normal group when compared to the CML group. Conclusion. Our findings may be helpful in exploring the underlying mechanisms of CML and provide useful information in cancer treatment.

  4. In-silico gene co-expression network analysis in Paracoccidioides brasiliensis with reference to haloacid dehalogenase superfamily hydrolase gene

    Directory of Open Access Journals (Sweden)

    Raghunath Satpathy

    2015-01-01

    Full Text Available Context: Paracoccidioides brasiliensis, a dimorphic fungus is the causative agent of paracoccidioidomycosis, a disease globally affecting millions of people. The haloacid dehalogenase (HAD superfamily hydrolases enzyme in the fungi, in particular, is known to be responsible in the pathogenesis by adhering to the tissue. Hence, identification of novel drug targets is essential. Aims: In-silico based identification of co-expressed genes along with HAD superfamily hydrolase in P. brasiliensis during the morphogenesis from mycelium to yeast to identify possible genes as drug targets. Materials and Methods: In total, four datasets were retrieved from the NCBI-gene expression omnibus (GEO database, each containing 4340 genes, followed by gene filtration expression of the data set. Further co-expression (CE study was performed individually and then a combination these genes were visualized in the Cytoscape 2. 8.3. Statistical Analysis Used: Mean and standard deviation value of the HAD superfamily hydrolase gene was obtained from the expression data and this value was subsequently used for the CE calculation purpose by selecting specific correlation power and filtering threshold. Results: The 23 genes that were thus obtained are common with respect to the HAD superfamily hydrolase gene. A significant network was selected from the Cytoscape network visualization that contains total 7 genes out of which 5 genes, which do not have significant protein hits, obtained from gene annotation of the expressed sequence tags by BLAST X. For all the protein PSI-BLAST was performed against human genome to find the homology. Conclusions: The gene co-expression network was obtained with respect to HAD superfamily dehalogenase gene in P. Brasiliensis.

  5. A stochastic model for identifying differential gene pair co-expression patterns in prostate cancer progression

    Directory of Open Access Journals (Sweden)

    Mao Yu

    2009-07-01

    Full Text Available Abstract Background The identification of gene differential co-expression patterns between cancer stages is a newly developing method to reveal the underlying molecular mechanisms of carcinogenesis. Most researches of this subject lack an algorithm useful for performing a statistical significance assessment involving cancer progression. Lacking this specific algorithm is apparently absent in identifying precise gene pairs correlating to cancer progression. Results In this investigation we studied gene pair co-expression change by using a stochastic process model for approximating the underlying dynamic procedure of the co-expression change during cancer progression. Also, we presented a novel analytical method named 'Stochastic process model for Identifying differentially co-expressed Gene pair' (SIG method. This method has been applied to two well known prostate cancer data sets: hormone sensitive versus hormone resistant, and healthy versus cancerous. From these data sets, 428,582 gene pairs and 303,992 gene pairs were identified respectively. Afterwards, we used two different current statistical methods to the same data sets, which were developed to identify gene pair differential co-expression and did not consider cancer progression in algorithm. We then compared these results from three different perspectives: progression analysis, gene pair identification effectiveness analysis, and pathway enrichment analysis. Statistical methods were used to quantify the quality and performance of these different perspectives. They included: Re-identification Scale (RS and Progression Score (PS in progression analysis, True Positive Rate (TPR in gene pair analysis, and Pathway Enrichment Score (PES in pathway analysis. Our results show small values of RS and large values of PS, TPR, and PES; thus, suggesting that gene pairs identified by the SIG method are highly correlated with cancer progression, and highly enriched in disease-specific pathways. From

  6. Elucidating gene function and function evolution through comparison of co-expression networks in plants

    Directory of Open Access Journals (Sweden)

    Marek eMutwil

    2014-08-01

    Full Text Available The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23. In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We show that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that, in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.

  7. Global similarity and local divergence in human and mouse gene co-expression networks

    Directory of Open Access Journals (Sweden)

    Koonin Eugene V

    2006-09-01

    Full Text Available Abstract Background A genome-wide comparative analysis of human and mouse gene expression patterns was performed in order to evaluate the evolutionary divergence of mammalian gene expression. Tissue-specific expression profiles were analyzed for 9,105 human-mouse orthologous gene pairs across 28 tissues. Expression profiles were resolved into species-specific coexpression networks, and the topological properties of the networks were compared between species. Results At the global level, the topological properties of the human and mouse gene coexpression networks are, essentially, identical. For instance, both networks have topologies with small-world and scale-free properties as well as closely similar average node degrees, clustering coefficients, and path lengths. However, the human and mouse coexpression networks are highly divergent at the local level: only a small fraction ( Conclusion The dissonance between global versus local network divergence suggests that the interspecies similarity of the global network properties is of limited biological significance, at best, and that the biologically relevant aspects of the architectures of gene coexpression are specific and particular, rather than universal. Nevertheless, there is substantial evolutionary conservation of the local network structure which is compatible with the notion that gene coexpression networks are subject to purifying selection.

  8. Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks

    Directory of Open Access Journals (Sweden)

    Kohane Isaac S

    2005-09-01

    Full Text Available Abstract Background Biological processes are carried out by coordinated modules of interacting molecules. As clustering methods demonstrate that genes with similar expression display increased likelihood of being associated with a common functional module, networks of coexpressed genes provide one framework for assigning gene function. This has informed the guilt-by-association (GBA heuristic, widely invoked in functional genomics. Yet although the idea of GBA is accepted, the breadth of GBA applicability is uncertain. Results We developed methods to systematically explore the breadth of GBA across a large and varied corpus of expression data to answer the following question: To what extent is the GBA heuristic broadly applicable to the transcriptome and conversely how broadly is GBA captured by a priori knowledge represented in the Gene Ontology (GO? Our study provides an investigation of the functional organization of five coexpression networks using data from three mammalian organisms. Our method calculates a probabilistic score between each gene and each Gene Ontology category that reflects coexpression enrichment of a GO module. For each GO category we use Receiver Operating Curves to assess whether these probabilistic scores reflect GBA. This methodology applied to five different coexpression networks demonstrates that the signature of guilt-by-association is ubiquitous and reproducible and that the GBA heuristic is broadly applicable across the population of nine hundred Gene Ontology categories. We also demonstrate the existence of highly reproducible patterns of coexpression between some pairs of GO categories. Conclusion We conclude that GBA has universal value and that transcriptional control may be more modular than previously realized. Our analyses also suggest that methodologies combining coexpression measurements across multiple genes in a biologically-defined module can aid in characterizing gene function or in characterizing

  9. VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

    Science.gov (United States)

    Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

    2013-12-16

    Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis

  10. [Weighted gene co-expression network analysis in biomedicine research].

    Science.gov (United States)

    Liu, Wei; Li, Li; Ye, Hua; Tu, Wei

    2017-11-25

    High-throughput biological technologies are now widely applied in biology and medicine, allowing scientists to monitor thousands of parameters simultaneously in a specific sample. However, it is still an enormous challenge to mine useful information from high-throughput data. The emergence of network biology provides deeper insights into complex bio-system and reveals the modularity in tissue/cellular networks. Correlation networks are increasingly used in bioinformatics applications. Weighted gene co-expression network analysis (WGCNA) tool can detect clusters of highly correlated genes. Therefore, we systematically reviewed the application of WGCNA in the study of disease diagnosis, pathogenesis and other related fields. First, we introduced principle, workflow, advantages and disadvantages of WGCNA. Second, we presented the application of WGCNA in disease, physiology, drug, evolution and genome annotation. Then, we indicated the application of WGCNA in newly developed high-throughput methods. We hope this review will help to promote the application of WGCNA in biomedicine research.

  11. Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

    Directory of Open Access Journals (Sweden)

    Paules Richard S

    2007-11-01

    Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying

  12. Meta-analysis of inter-species liver co-expression networks elucidates traits associated with common human diseases.

    Directory of Open Access Journals (Sweden)

    Kai Wang

    2009-12-01

    Full Text Available Co-expression networks are routinely used to study human diseases like obesity and diabetes. Systematic comparison of these networks between species has the potential to elucidate common mechanisms that are conserved between human and rodent species, as well as those that are species-specific characterizing evolutionary plasticity. We developed a semi-parametric meta-analysis approach for combining gene-gene co-expression relationships across expression profile datasets from multiple species. The simulation results showed that the semi-parametric method is robust against noise. When applied to human, mouse, and rat liver co-expression networks, our method out-performed existing methods in identifying gene pairs with coherent biological functions. We identified a network conserved across species that highlighted cell-cell signaling, cell-adhesion and sterol biosynthesis as main biological processes represented in genome-wide association study candidate gene sets for blood lipid levels. We further developed a heterogeneity statistic to test for network differences among multiple datasets, and demonstrated that genes with species-specific interactions tend to be under positive selection throughout evolution. Finally, we identified a human-specific sub-network regulated by RXRG, which has been validated to play a different role in hyperlipidemia and Type 2 diabetes between human and mouse. Taken together, our approach represents a novel step forward in integrating gene co-expression networks from multiple large scale datasets to leverage not only common information but also differences that are dataset-specific.

  13. Local coexpression domains of two to four genes in the genome of Arabidopsis

    NARCIS (Netherlands)

    Ren, X.Y.; Fiers, M.W.E.J.; Stiekema, W.J.; Nap, J.P.H.

    2005-01-01

    Expression of genes in eukaryotic genomes is known to cluster, but cluster size is generally loosely defined and highly variable. We have here taken a very strict definition of cluster as sets of physically adjacent genes that are highly coexpressed and form so-called local coexpression domains. The

  14. Evaluation of gene association methods for coexpression network construction and biological knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Sapna Kumari

    Full Text Available BACKGROUND: Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly evaluated. Since different methods lead to structurally different coexpression networks and provide different information, selecting the optimal gene association method is critical. METHODS AND RESULTS: In this study, we compared eight gene association methods - Spearman rank correlation, Weighted Rank Correlation, Kendall, Hoeffding's D measure, Theil-Sen, Rank Theil-Sen, Distance Covariance, and Pearson - and focused on their true knowledge discovery rates in associating pathway genes and construction coordination networks of regulatory genes. We also examined the behaviors of different methods to microarray data with different properties, and whether the biological processes affect the efficiency of different methods. CONCLUSIONS: We found that the Spearman, Hoeffding and Kendall methods are effective in identifying coexpressed pathway genes, whereas the Theil-sen, Rank Theil-Sen, Spearman, and Weighted Rank methods perform well in identifying coordinated transcription factors that control the same biological processes and traits. Surprisingly, the widely used Pearson method is generally less efficient, and so is the Distance Covariance method that can find gene pairs of multiple relationships. Some analyses we did clearly show Pearson and Distance Covariance methods have distinct behaviors as compared to all other six methods. The efficiencies of different methods vary with the data properties to some degree and are largely contingent upon the biological processes, which necessitates the pre-analysis to identify the best performing method for gene association and coexpression network construction.

  15. G-NEST: A gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Science.gov (United States)

    In previous studies, gene neighborhoods--spatial clusters of co-expressed genes in the genome--have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Sc...

  16. FastGCN: a GPU accelerated tool for fast gene co-expression networks.

    Directory of Open Access Journals (Sweden)

    Meimei Liang

    Full Text Available Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.

  17. Matrix factorization reveals aging-specific co-expression gene modules in the fat and muscle tissues in nonhuman primates

    Science.gov (United States)

    Wang, Yongcui; Zhao, Weiling; Zhou, Xiaobo

    2016-10-01

    Accurate identification of coherent transcriptional modules (subnetworks) in adipose and muscle tissues is important for revealing the related mechanisms and co-regulated pathways involved in the development of aging-related diseases. Here, we proposed a systematically computational approach, called ICEGM, to Identify the Co-Expression Gene Modules through a novel mathematical framework of Higher-Order Generalized Singular Value Decomposition (HO-GSVD). ICEGM was applied on the adipose, and heart and skeletal muscle tissues in old and young female African green vervet monkeys. The genes associated with the development of inflammation, cardiovascular and skeletal disorder diseases, and cancer were revealed by the ICEGM. Meanwhile, genes in the ICEGM modules were also enriched in the adipocytes, smooth muscle cells, cardiac myocytes, and immune cells. Comprehensive disease annotation and canonical pathway analysis indicated that immune cells, adipocytes, cardiomyocytes, and smooth muscle cells played a synergistic role in cardiac and physical functions in the aged monkeys by regulation of the biological processes associated with metabolism, inflammation, and atherosclerosis. In conclusion, the ICEGM provides an efficiently systematic framework for decoding the co-expression gene modules in multiple tissues. Analysis of genes in the ICEGM module yielded important insights on the cooperative role of multiple tissues in the development of diseases.

  18. Building gene co-expression networks using transcriptomics data for systems biology investigations

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Watson-Haigh, Nathan S.

    2012-01-01

    Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four......) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT...... (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended....

  19. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean) Cattle.

    Science.gov (United States)

    Lim, Dajeong; Lee, Seung-Hwan; Kim, Nam-Kuk; Cho, Yong-Min; Chai, Han-Ha; Seong, Hwan-Hoo; Kim, Heebal

    2013-01-01

    Marbling (intramuscular fat) is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the 'marbling score' trait and systemically analyzed the network topology in Hanwoo (Korean cattle). As a result, we determined 3 modules (gene groups) that showed statistically significant results for marbling score. In particular, one module (denoted as red) has a statistically significant result for marbling score (p = 0.008) and intramuscular fat (p = 0.02) and water capacity (p = 0.006). From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA) have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  20. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean Cattle

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2013-01-01

    Full Text Available Marbling (intramuscular fat is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the ‘marbling score’ trait and systemically analyzed the network topology in Hanwoo (Korean cattle. As a result, we determined 3 modules (gene groups that showed statistically significant results for marbling score. In particular, one module (denoted as red has a statistically significant result for marbling score (p = 0.008 and intramuscular fat (p = 0.02 and water capacity (p = 0.006. From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  1. Disease gene characterization through large-scale co-expression analysis.

    Directory of Open Access Journals (Sweden)

    Allen Day

    2009-12-01

    Full Text Available In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET.Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2 and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders.UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.

  2. A powerful nonparametric method for detecting differentially co-expressed genes: distance correlation screening and edge-count test.

    Science.gov (United States)

    Zhang, Qingyang

    2018-05-16

    Differential co-expression analysis, as a complement of differential expression analysis, offers significant insights into the changes in molecular mechanism of different phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression which is common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consisted of two main steps, a screening step and a testing step. The screening step is to reduce the search space by filtering out all the independent gene pairs using distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes by a recently developed edge-count test. Both steps are distribution-free and targeting nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear type of differential co-expressions. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.

  3. Using the 2A Protein Coexpression System: Multicistronic 2A Vectors Expressing Gene(s) of Interest and Reporter Proteins.

    Science.gov (United States)

    Luke, Garry A; Ryan, Martin D

    2018-01-01

    To date, a huge range of different proteins-many with cotranslational and posttranslational subcellular localization signals-have been coexpressed together with various reporter proteins in vitro and in vivo using 2A peptides. The pros and cons of 2A co-expression technology are considered below, followed by a simple example of a "how to" protocol to concatenate multiple genes of interest, together with a reporter gene, into a single gene linked via 2As for easy identification or selection of transduced cells.

  4. Network statistics of genetically-driven gene co-expression modules in mouse crosses

    Directory of Open Access Journals (Sweden)

    Marie-Pier eScott-Boyer

    2013-12-01

    Full Text Available In biology, networks are used in different contexts as ways to represent relationships between entities, such as for instance interactions between genes, proteins or metabolites. Despite progress in the analysis of such networks and their potential to better understand the collective impact of genes on complex traits, one remaining challenge is to establish the biologic validity of gene co-expression networks and to determine what governs their organization. We used WGCNA to construct and analyze seven gene expression datasets from several tissues of mouse recombinant inbred strains (RIS. For six out of the 7 networks, we found that linkage to module QTLs (mQTLs could be established for 29.3% of gene co-expression modules detected in the several mouse RIS. For about 74.6% of such genetically-linked modules, the mQTL was on the same chromosome as the one contributing most genes to the module, with genes originating from that chromosome showing higher connectivity than other genes in the modules. Such modules (that we considered as genetically-driven had network statistic properties (density, centralization and heterogeneity that set them apart from other modules in the network. Altogether, a sizeable portion of gene co-expression modules detected in mouse RIS panels had genetic determinants as their main organizing principle. In addition to providing a biologic interpretation validation for these modules, these genetic determinants imparted on them particular properties that set them apart from other modules in the network, to the point that they can be predicted to a large extent on the basis of their network statistics.

  5. The Arabidopsis co-expression tool (act): a WWW-based tool and database for microarray-based gene expression analysis

    DEFF Research Database (Denmark)

    Jen, C. H.; Manfield, I. W.; Michalopoulos, D. W.

    2006-01-01

    be examined using the novel clique finder tool to determine the sets of genes most likely to be regulated in a similar manner. In combination, these tools offer three levels of analysis: creation of correlation lists of co-expressed genes, refinement of these lists using two-dimensional scatter plots......We present a new WWW-based tool for plant gene analysis, the Arabidopsis Co-Expression Tool (act) , based on a large Arabidopsis thaliana microarray data set obtained from the Nottingham Arabidopsis Stock Centre. The co-expression analysis tool allows users to identify genes whose expression...

  6. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    Science.gov (United States)

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.

  7. Uncovering co-expression gene network modules regulating fruit acidity in diverse apples.

    Science.gov (United States)

    Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Zhong, Gan-Yuan; Xu, Kenong

    2015-08-16

    Acidity is a major contributor to fruit quality. Several organic acids are present in apple fruit, but malic acid is predominant and determines fruit acidity. The trait is largely controlled by the Malic acid (Ma) locus, underpinning which Ma1 that putatively encodes a vacuolar aluminum-activated malate transporter1 (ALMT1)-like protein is a strong candidate gene. We hypothesize that fruit acidity is governed by a gene network in which Ma1 is key member. The goal of this study is to identify the gene network and the potential mechanisms through which the network operates. Guided by Ma1, we analyzed the transcriptomes of mature fruit of contrasting acidity from six apple accessions of genotype Ma_ (MaMa or Mama) and four of mama using RNA-seq and identified 1301 fruit acidity associated genes, among which 18 were most significant acidity genes (MSAGs). Network inferring using weighted gene co-expression network analysis (WGCNA) revealed five co-expression gene network modules of significant (P acidity. Overall, this study provides important insight into the Ma1-mediated gene network controlling acidity in mature apple fruit of diverse genetic background.

  8. Exploring Plant Co-Expression and Gene-Gene Interactions with CORNET 3.0.

    Science.gov (United States)

    Van Bel, Michiel; Coppens, Frederik

    2017-01-01

    Selecting and filtering a reference expression and interaction dataset when studying specific pathways and regulatory interactions can be a very time-consuming and error-prone task. In order to reduce the duplicated efforts required to amass such datasets, we have created the CORNET (CORrelation NETworks) platform which allows for easy access to a wide variety of data types: coexpression data, protein-protein interactions, regulatory interactions, and functional annotations. The CORNET platform outputs its results in either text format or through the Cytoscape framework, which is automatically launched by the CORNET website.CORNET 3.0 is the third iteration of the web platform designed for the user exploration of the coexpression space of plant genomes, with a focus on the model species Arabidopsis thaliana. Here we describe the platform: the tools, data, and best practices when using the platform. We indicate how the platform can be used to infer networks from a set of input genes, such as upregulated genes from an expression experiment. By exploring the network, new target and regulator genes can be discovered, allowing for follow-up experiments and more in-depth study. We also indicate how to avoid common pitfalls when evaluating the networks and how to avoid over interpretation of the results.All CORNET versions are available at http://bioinformatics.psb.ugent.be/cornet/ .

  9. Gene Coexpression Analysis Reveals Complex Metabolism of the Monoterpene Alcohol Linalool in Arabidopsis FlowersW

    NARCIS (Netherlands)

    Ginglinger, J.F.; Boachon, B.; Hofer, R.; Paetz, C.; Kollner, T.G.; Miesch, L.; Lugan, R.; Baltenweck, R.; Mutterer, J.; Ullman, P.; Verstappen, F.W.A.; Bouwmeester, H.J.

    2013-01-01

    The cytochrome P450 family encompasses the largest family of enzymes in plant metabolism, and the functions of many of its members in Arabidopsis thaliana are still unknown. Gene coexpression analysis pointed to two P450s that were coexpressed with two monoterpene synthases in flowers and were thus

  10. Characterization of Genes for Beef Marbling Based on Applying Gene Coexpression Network

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2014-01-01

    Full Text Available Marbling is an important trait in characterization beef quality and a major factor for determining the price of beef in the Korean beef market. In particular, marbling is a complex trait and needs a system-level approach for identifying candidate genes related to the trait. To find the candidate gene associated with marbling, we used a weighted gene coexpression network analysis from the expression value of bovine genes. Hub genes were identified; they were topologically centered with large degree and BC values in the global network. We performed gene expression analysis to detect candidate genes in M. longissimus with divergent marbling phenotype (marbling scores 2 to 7 using qRT-PCR. The results demonstrate that transmembrane protein 60 (TMEM60 and dihydropyrimidine dehydrogenase (DPYD are associated with increasing marbling fat. We suggest that the network-based approach in livestock may be an important method for analyzing the complex effects of candidate genes associated with complex traits like marbling or tenderness.

  11. Effects of threshold on the topology of gene co-expression networks.

    Science.gov (United States)

    Couto, Cynthia Martins Villar; Comin, César Henrique; Costa, Luciano da Fontoura

    2017-09-26

    Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.

  12. Two novel antimicrobial defensins from rice identified by gene coexpression network analyses.

    Science.gov (United States)

    Tantong, Supaluk; Pringsulaka, Onanong; Weerawanich, Kamonwan; Meeprasert, Arthitaya; Rungrotmongkol, Thanyada; Sarnthima, Rakrudee; Roytrakul, Sittiruk; Sirikantaramas, Supaart

    2016-10-01

    Defensins form an antimicrobial peptides (AMP) family, and have been widely studied in various plants because of their considerable inhibitory functions. However, their roles in rice (Oryza sativa L.) have not been characterized, even though rice is one of the most important staple crops that is susceptible to damaging infections. Additionally, a previous study identified 598 rice genes encoding cysteine-rich peptides, suggesting there are several uncharacterized AMPs in rice. We performed in silico gene expression and coexpression network analyses of all genes encoding defensin and defensin-like peptides, and determined that OsDEF7 and OsDEF8 are coexpressed with pathogen-responsive genes. Recombinant OsDEF7 and OsDEF8 could form homodimers. They inhibited the growth of the bacteria Xanthomonas oryzae pv. oryzae, X. oryzae pv. oryzicola, and Erwinia carotovora subsp. atroseptica with minimum inhibitory concentration (MIC) ranging from 0.6 to 63μg/mL. However, these OsDEFs are weakly active against the phytopathogenic fungi Helminthosporium oryzae and Fusarium oxysporum f.sp. cubense. This study describes a useful method for identifying potential plant AMPs with biological activities. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Gene co-expression analysis identifies gene clusters associated with isotropic and polarized growth in Aspergillus fumigatus conidia.

    Science.gov (United States)

    Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G

    2018-04-26

    Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules) correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcripts levels of genes encoding for secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding for growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  14. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Cirera Salicio, Susanna; Zhernakova, Daria V.

    2014-01-01

    interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model...... (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. Results WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P ... the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using...

  15. An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

    Science.gov (United States)

    Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

    2017-04-12

    Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.

  16. Functional Module Analysis for Gene Coexpression Networks with Network Integration.

    Science.gov (United States)

    Zhang, Shuqin; Zhao, Hongyu; Ng, Michael K

    2015-01-01

    Network has been a general tool for studying the complex interactions between different genes, proteins, and other small molecules. Module as a fundamental property of many biological networks has been widely studied and many computational methods have been proposed to identify the modules in an individual network. However, in many cases, a single network is insufficient for module analysis due to the noise in the data or the tuning of parameters when building the biological network. The availability of a large amount of biological networks makes network integration study possible. By integrating such networks, more informative modules for some specific disease can be derived from the networks constructed from different tissues, and consistent factors for different diseases can be inferred. In this paper, we have developed an effective method for module identification from multiple networks under different conditions. The problem is formulated as an optimization model, which combines the module identification in each individual network and alignment of the modules from different networks together. An approximation algorithm based on eigenvector computation is proposed. Our method outperforms the existing methods, especially when the underlying modules in multiple networks are different in simulation studies. We also applied our method to two groups of gene coexpression networks for humans, which include one for three different cancers, and one for three tissues from the morbidly obese patients. We identified 13 modules with three complete subgraphs, and 11 modules with two complete subgraphs, respectively. The modules were validated through Gene Ontology enrichment and KEGG pathway enrichment analysis. We also showed that the main functions of most modules for the corresponding disease have been addressed by other researchers, which may provide the theoretical basis for further studying the modules experimentally.

  17. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes.

    Directory of Open Access Journals (Sweden)

    Simone de Jong

    Full Text Available Despite large-scale genome-wide association studies (GWAS, the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1, is located in, and regulated by the major histocompatibility (MHC complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network.

  18. A methodology for the analysis of differential coexpression across the human lifespan.

    Science.gov (United States)

    Gillis, Jesse; Pavlidis, Paul

    2009-09-22

    Differential coexpression is a change in coexpression between genes that may reflect 'rewiring' of transcriptional networks. It has previously been hypothesized that such changes might be occurring over time in the lifespan of an organism. While both coexpression and differential expression of genes have been previously studied in life stage change or aging, differential coexpression has not. Generalizing differential coexpression analysis to many time points presents a methodological challenge. Here we introduce a method for analyzing changes in coexpression across multiple ordered groups (e.g., over time) and extensively test its validity and usefulness. Our method is based on the use of the Haar basis set to efficiently represent changes in coexpression at multiple time scales, and thus represents a principled and generalizable extension of the idea of differential coexpression to life stage data. We used published microarray studies categorized by age to test the methodology. We validated the methodology by testing our ability to reconstruct Gene Ontology (GO) categories using our measure of differential coexpression and compared this result to using coexpression alone. Our method allows significant improvement in characterizing these groups of genes. Further, we examine the statistical properties of our measure of differential coexpression and establish that the results are significant both statistically and by an improvement in semantic similarity. In addition, we found that our method finds more significant changes in gene relationships compared to several other methods of expressing temporal relationships between genes, such as coexpression over time. Differential coexpression over age generates significant and biologically relevant information about the genes producing it. Our Haar basis methodology for determining age-related differential coexpression performs better than other tested methods. The Haar basis set also lends itself to ready interpretation

  19. Chronic ethanol exposure produces time- and brain region-dependent changes in gene coexpression networks.

    Directory of Open Access Journals (Sweden)

    Elizabeth A Osterndorff-Kahanek

    Full Text Available Repeated ethanol exposure and withdrawal in mice increases voluntary drinking and represents an animal model of physical dependence. We examined time- and brain region-dependent changes in gene coexpression networks in amygdala (AMY, nucleus accumbens (NAC, prefrontal cortex (PFC, and liver after four weekly cycles of chronic intermittent ethanol (CIE vapor exposure in C57BL/6J mice. Microarrays were used to compare gene expression profiles at 0-, 8-, and 120-hours following the last ethanol exposure. Each brain region exhibited a large number of differentially expressed genes (2,000-3,000 at the 0- and 8-hour time points, but fewer changes were detected at the 120-hour time point (400-600. Within each region, there was little gene overlap across time (~20%. All brain regions were significantly enriched with differentially expressed immune-related genes at the 8-hour time point. Weighted gene correlation network analysis identified modules that were highly enriched with differentially expressed genes at the 0- and 8-hour time points with virtually no enrichment at 120 hours. Modules enriched for both ethanol-responsive and cell-specific genes were identified in each brain region. These results indicate that chronic alcohol exposure causes global 'rewiring' of coexpression systems involving glial and immune signaling as well as neuronal genes.

  20. Coexpression of nuclear receptors and histone methylation modifying genes in the testis: implications for endocrine disruptor modes of action.

    Directory of Open Access Journals (Sweden)

    Alison M Anderson

    Full Text Available BACKGROUND: Endocrine disruptor chemicals elicit adverse health effects by perturbing nuclear receptor signalling systems. It has been speculated that these compounds may also perturb epigenetic mechanisms and thus contribute to the early origin of adult onset disease. We hypothesised that histone methylation may be a component of the epigenome that is susceptible to perturbation. We used coexpression analysis of publicly available data to investigate the combinatorial actions of nuclear receptors and genes involved in histone methylation in normal testis and when faced with endocrine disruptor compounds. METHODOLOGY/PRINCIPAL FINDINGS: The expression patterns of a set of genes were profiled across testis tissue in human, rat and mouse, plus control and exposed samples from four toxicity experiments in the rat. Our results indicate that histone methylation events are a more general component of nuclear receptor mediated transcriptional regulation in the testis than previously appreciated. Coexpression patterns support the role of a gatekeeper mechanism involving the histone methylation modifiers Kdm1, Prdm2, and Ehmt1 and indicate that this mechanism is a common determinant of transcriptional integrity for genes critical to diverse physiological endpoints relevant to endocrine disruption. Coexpression patterns following exposure to vinclozolin and dibutyl phthalate suggest that coactivity of the demethylase Kdm1 in particular warrants further investigation in relation to endocrine disruptor mode of action. CONCLUSIONS/SIGNIFICANCE: This study provides proof of concept that a bioinformatics approach that profiles genes related to a specific hypothesis across multiple biological settings can provide powerful insight into coregulatory activity that would be difficult to discern at an individual experiment level or by traditional differential expression analysis methods.

  1. THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer's Disease.

    Science.gov (United States)

    Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K

    2016-11-30

    There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer's disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer's disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer's disease brains. The biological pathways associated with Alzheimer's disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature.

  2. A contribution to the study of plant development evolution based on gene co-expression networks

    Directory of Open Access Journals (Sweden)

    Francisco J. Romero-Campero

    2013-08-01

    Full Text Available Phototrophic eukaryotes are among the most successful organisms on Earth due to their unparalleled efficiency at capturing light energy and fixing carbon dioxide to produce organic molecules. A conserved and efficient network of light-dependent regulatory modules could be at the bases of this success. This regulatory system conferred early advantages to phototrophic eukaryotes that allowed for specialization, complex developmental processes and modern plant characteristics. We have studied light-dependent gene regulatory modules from algae to plants employing integrative-omics approaches based on gene co-expression networks. Our study reveals some remarkably conserved ways in which eukaryotic phototrophs deal with day length and light signaling. Here we describe how a family of Arabidopsis transcription factors involved in photoperiod response has evolved from a single algal gene according to the innovation, amplification and divergence theory of gene evolution by duplication. These modifications of the gene co-expression networks from the ancient unicellular green algae Chlamydomonas reinhardtii to the modern brassica Arabidopsis thaliana may hint on the evolution and specialization of plants and other organisms.

  3. A Predictive Coexpression Network Identifies Novel Genes Controlling the Seed-to-Seedling Phase Transition in Arabidopsis thaliana.

    Science.gov (United States)

    Silva, Anderson Tadeu; Ribone, Pamela A; Chan, Raquel L; Ligterink, Wilco; Hilhorst, Henk W M

    2016-04-01

    The transition from a quiescent dry seed to an actively growing photoautotrophic seedling is a complex and crucial trait for plant propagation. This study provides a detailed description of global gene expression in seven successive developmental stages of seedling establishment in Arabidopsis (Arabidopsis thaliana). Using the transcriptome signature from these developmental stages, we obtained a coexpression gene network that highlights interactions between known regulators of the seed-to-seedling transition and predicts the functions of uncharacterized genes in seedling establishment. The coexpressed gene data sets together with the transcriptional module indicate biological functions related to seedling establishment. Characterization of the homeodomain leucine zipper I transcription factor AtHB13, which is expressed during the seed-to-seedling transition, demonstrated that this gene regulates some of the network nodes and affects late seedling establishment. Knockout mutants for athb13 showed increased primary root length as compared with wild-type (Columbia-0) seedlings, suggesting that this transcription factor is a negative regulator of early root growth, possibly repressing cell division and/or cell elongation or the length of time that cells elongate. The signal transduction pathways present during the early phases of the seed-to-seedling transition anticipate the control of important events for a vigorous seedling, such as root growth. This study demonstrates that a gene coexpression network together with transcriptional modules can provide insights that are not derived from comparative transcript profiling alone. © 2016 American Society of Plant Biologists. All Rights Reserved.

  4. Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

    Science.gov (United States)

    Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

    2018-05-09

    Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.

  5. Genes and co-expression modules common to drought and bacterial stress responses in Arabidopsis and rice.

    Directory of Open Access Journals (Sweden)

    Rafi Shaik

    Full Text Available Plants are simultaneously exposed to multiple stresses resulting in enormous changes in the molecular landscape within the cell. Identification and characterization of the synergistic and antagonistic components of stress response mechanisms contributing to the cross talk between stresses is of high priority to explore and enhance multiple stress responses. To this end, we performed meta-analysis of drought (abiotic, bacterial (biotic stress response in rice and Arabidopsis by analyzing a total of 386 microarray samples belonging to 20 microarray studies and identified approximately 3100 and 900 DEGs in rice and Arabidopsis, respectively. About 38.5% (1214 and 28.7% (272 DEGs were common to drought and bacterial stresses in rice and Arabidopsis, respectively. A majority of these common DEGs showed conserved expression status in both stresses. Gene ontology enrichment analysis clearly demarcated the response and regulation of various plant hormones and related biological processes. Fatty acid metabolism and biosynthesis of alkaloids were upregulated and, nitrogen metabolism and photosynthesis was downregulated in both stress conditions. WRKY transcription family genes were highly enriched in all upregulated gene sets while 'CO-like' TF family showed inverse relationship of expression between drought and bacterial stresses. Weighted gene co-expression network analysis divided DEG sets into multiple modules that show high co-expression and identified stress specific hub genes with high connectivity. Detection of consensus modules based on DEGs common to drought and bacterial stress revealed 9 and 4 modules in rice and Arabidopsis, respectively, with conserved and reversed co-expression patterns.

  6. Gene co-expression networks and profiles reveal potential biomarkers of boar taint in pigs

    DEFF Research Database (Denmark)

    Drag, Markus; Skinkyté-Juskiené, R.; Do, Duy Ngoc

    synthesis. In testis, >80 DE genes were functionally classified by the PANTHER tool to “Gonadotropin releasing hormone receptor” and “Wnt signaling” pathways which play a role in reproductive maturation and proliferation of spermatogonia, respectively. WGCNA was used to build co-expression modules...

  7. Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

    Science.gov (United States)

    Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

    2017-11-13

    The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly

  8. Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex

    Science.gov (United States)

    Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel

    2015-01-01

    The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262

  9. RNA Sequencing and Coexpression Analysis Reveal Key Genes Involved in α-Linolenic Acid Biosynthesis in Perilla frutescens Seed

    Directory of Open Access Journals (Sweden)

    Tianyuan Zhang

    2017-11-01

    Full Text Available Perilla frutescen is used as traditional food and medicine in East Asia. Its seeds contain high levels of α-linolenic acid (ALA, which is important for health, but is scarce in our daily meals. Previous reports on RNA-seq of perilla seed had identified fatty acid (FA and triacylglycerol (TAG synthesis genes, but the underlying mechanism of ALA biosynthesis and its regulation still need to be further explored. So we conducted Illumina RNA-sequencing in seven temporal developmental stages of perilla seeds. Sequencing generated a total of 127 million clean reads, containing 15.88 Gb of valid data. The de novo assembly of sequence reads yielded 64,156 unigenes with an average length of 777 bp. A total of 39,760 unigenes were annotated and 11,693 unigenes were found to be differentially expressed in all samples. According to Kyoto Encyclopedia of Genes and Genomes (KEGG pathway analysis, 486 unigenes were annotated in the “lipid metabolism” pathway. Of these, 150 unigenes were found to be involved in fatty acid (FA biosynthesis and triacylglycerol (TAG assembly in perilla seeds. A coexpression analysis showed that a total of 104 genes were highly coexpressed (r > 0.95. The coexpression network could be divided into two main subnetworks showing over expression in the medium or earlier and late phases, respectively. In order to identify the putative regulatory genes, a transcription factor (TF analysis was performed. This led to the identification of 45 gene families, mainly including the AP2-EREBP, bHLH, MYB, and NAC families, etc. After coexpression analysis of TFs with highly expression of FAD2 and FAD3 genes, 162 TFs were found to be significantly associated with two FAD genes (r > 0.95. Those TFs were predicted to be the key regulatory factors in ALA biosynthesis in perilla seed. The qRT-PCR analysis also verified the relevance of expression pattern between two FAD genes and partial candidate TFs. Although it has been reported that some TFs

  10. MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters.

    Science.gov (United States)

    Gonzalez-Dominguez, Jorge; Martin, Maria J

    2017-10-10

    In this work we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. Source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.

  11. Application of Weighted Gene Co-expression Network Analysis for Data from Paired Design.

    Science.gov (United States)

    Li, Jianqiang; Zhou, Doudou; Qiu, Weiliang; Shi, Yuliang; Yang, Ji-Jiang; Chen, Shi; Wang, Qing; Pan, Hui

    2018-01-12

    Investigating how genes jointly affect complex human diseases is important, yet challenging. The network approach (e.g., weighted gene co-expression network analysis (WGCNA)) is a powerful tool. However, genomic data usually contain substantial batch effects, which could mask true genomic signals. Paired design is a powerful tool that can reduce batch effects. However, it is currently unclear how to appropriately apply WGCNA to genomic data from paired design. In this paper, we modified the current WGCNA pipeline to analyse high-throughput genomic data from paired design. We illustrated the modified WGCNA pipeline by analysing the miRNA dataset provided by Shiah et al. (2014), which contains forty oral squamous cell carcinoma (OSCC) specimens and their matched non-tumourous epithelial counterparts. OSCC is the sixth most common cancer worldwide. The modified WGCNA pipeline identified two sets of novel miRNAs associated with OSCC, in addition to the existing miRNAs reported by Shiah et al. (2014). Thus, this work will be of great interest to readers of various scientific disciplines, in particular, genetic and genomic scientists as well as medical scientists working on cancer.

  12. Analysis of the relationship between coexpression domains and chromatin 3D organization.

    Directory of Open Access Journals (Sweden)

    María E Soler-Oliva

    2017-09-01

    Full Text Available Gene order is not random in eukaryotic chromosomes, and co-regulated genes tend to be clustered. The mechanisms that determine co-regulation of large regions of the genome and its connection with chromatin three-dimensional (3D organization are still unclear however. Here we have adapted a recently described method for identifying chromatin topologically associating domains (TADs to identify coexpression domains (which we term "CODs". Using human normal breast and breast cancer RNA-seq data, we have identified approximately 500 CODs. CODs in the normal and breast cancer genomes share similar characteristics but differ in their gene composition. COD genes have a greater tendency to be coexpressed with genes that reside in other CODs than with non-COD genes. Such inter-COD coexpression is maintained over large chromosomal distances in the normal genome but is partially lost in the cancer genome. Analyzing the relationship between CODs and chromatin 3D organization using Hi-C contact data, we find that CODs do not correspond to TADs. In fact, intra-TAD gene coexpression is the same as random for most chromosomes. However, the contact profile is similar between gene pairs that reside either in the same COD or in coexpressed CODs. These data indicate that co-regulated genes in the genome present similar patterns of contacts irrespective of the frequency of physical chromatin contacts between them.

  13. Co-expression modules construction by WGCNA and identify potential prognostic markers of uveal melanoma.

    Science.gov (United States)

    Wan, Qi; Tang, Jing; Han, Yu; Wang, Dan

    2018-01-01

    Uveal melanoma is an aggressive cancer which has a high percentage recurrence and with a worse prognosis. Identify the potential prognostic markers of uveal melanoma may provide information for early detection of recurrence and treatment. RNA sequence data of uveal melanoma and patient clinic traits were obtained from The Cancer Genome Atlas (TCGA) database. Co-expression modules were built by weighted gene co -expression network analysis (WGCNA) and applied to investigate the relationship underlying modules and clinic traits. Besides, functional enrichment analysis was performed on these co-expression genes from interested modules. First, using WGCNA, identified 21 co-expression modules were constructed by the 10975 genes from the 80 human uveal melanoma samples. The number of genes in these modules ranged from 42 to 5091. Found four co -expression modules significantly correlated with three clinic traits (status, recurrence and recurrence Time). Module red, and purple positively correlated with patient's life status and recurrence Time. Module green positively correlates with recurrence. The result of functional enrichment analysis showed that the module magenta was mainly enriched genetic material assemble processes, the purple module was mainly enriched in tissue homeostasis and melanosome membrane and the module red was mainly enriched metastasis of cell, suggesting its critical role in the recurrence and development of the disease. Additionally, identified the hug gene (top connectivity with other genes) in each module. The hub gene SLC17A7, NTRK2, ABTB1 and ADPRHL1 might play a vital role in recurrence of uveal melanoma. Our findings provided the framework of co-expression gene modules of uveal melanoma and identified some prognostic markers might be detection of recurrence and treatment for uveal melanoma. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Weighted Gene Co-expression Network Analysis of the Dioscin Rich Medicinal Plant Dioscorea nipponica

    Directory of Open Access Journals (Sweden)

    Wei Sun

    2017-06-01

    Full Text Available Dioscorea contains critically important species which can be used as staple foods or sources of bioactive substances, including Dioscorea nipponica, which has been used to develop highly successful drugs to treat cardiovascular disease. Its major active ingredients are thought to be sterol compounds such as diosgenin, which has been called “medicinal gold” because of its valuable properties. However, reliance on naturally growing plants as a production system limits the potential use of D. nipponica, raising interest in engineering metabolic pathways to enhance the production of secondary metabolites. However, the biosynthetic pathway of diosgenin is still poorly understood, and D. nipponica is poorly characterized at a molecular level, hindering in-depth investigation. In the present work, the RNAs from five organs and seven methyl jasmonate treated D. nipponica rhizomes were sequenced using the Illumina high-throughput sequencing platform, yielding 52 gigabases of data, which were pooled and assembled into a reference transcriptome. Four hundred and eighty two genes were found to be highly expressed in the rhizomes, and these genes are mainly involved in stress response and transcriptional regulation. Based on their expression patterns, 36 genes were selected for further investigation as candidate genes involved in dioscin biosynthesis. Constructing co-expression networks based on significant changes in gene expression revealed 15 gene modules. Of these, four modules with properties correlating to dioscin regulation and biosynthesis, consisting of 4,665 genes in total, were selected for further functional investigation. These results improve our understanding of dioscin biosynthesis in this important medicinal plant and will help guide more intensive investigations.

  15. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    Science.gov (United States)

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  16. G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Directory of Open Access Journals (Sweden)

    Lemay Danielle G

    2012-09-01

    Full Text Available Abstract Background In previous studies, gene neighborhoods—spatial clusters of co-expressed genes in the genome—have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously. Results Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods. Conclusions Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The

  17. Bioinformatics Data Mining Approach Suggests Coexpression of AGTPBP1 with an ALS-linked Gene C9orf72

    Directory of Open Access Journals (Sweden)

    Shouta Kitano

    2015-01-01

    Full Text Available Background Expanded GGGGCC hexanucleotide repeats located in the noncoding region of the chromosome 9 open reading frame 72 ( C9orf72 gene represent the most common genetic abnormality for familial and sporadic amyotrophic lateral sclerosis (ALS and frontotemporal dementia (FTD. Formation of nuclear RNA foci, accumulation of repeat-associated non-ATG-translated dipeptide-repeat proteins, and haploinsufficiency of C9orf72 are proposed for pathological mechanisms of C9ALS/FTD. However, at present, the physiological function of C9orf72 remains largely unknown. Methods By searching on a bioinformatics database named COXPRESdb composed of the comprehensive gene coexpression data, we studied potential C9orf72 interactors. Results We identified the ATP/GTP binding protein 1 ( AGTPBP1 gene alternatively named NNA1 encoding a cytosolic carboxypeptidase whose mutation is causative of the degeneration of Purkinje cells and motor neurons as the most significant gene coexpressed with C9orf72. We verified coexpression and interaction of AGTPBP1 and C9orf72 in transfected cells by immunoprecipitation and in neurons of the human brain by double-labeling immunohistochemistry. Furthermore, we found a positive correlation between AGTPBP1 and C9orf72 mRNA expression levels in the set of 21 human brains examined. Conclusions These results suggest that AGTPBP1 serves as a C9orf72 interacting partner that plays a role in the regulation of neuronal function in a coordinated manner within the central nervous system.

  18. Estimation of the proteomic cancer co-expression sub networks by using association estimators.

    Directory of Open Access Journals (Sweden)

    Cihat Erdoğan

    Full Text Available In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA. Correlation and mutual information (MI based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators' performance, a multi-layer data integration platform on gene-disease associations (DisGeNET and the Molecular Signatures Database (MSigDB was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink and 64% for Schurmann-Grassberger (SG association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists.

  19. Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model.

    Science.gov (United States)

    Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O; Eijlander, Robyn T; Kuipers, Oscar P

    2018-02-09

    Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.

  20. Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses.

    Science.gov (United States)

    Luo, Jie; Xu, Pei; Cao, Peijian; Wan, Hongjian; Lv, Xiaonan; Xu, Shengchun; Wang, Gangjun; Cook, Melloni N; Jones, Byron C; Lu, Lu; Wang, Xusheng

    2018-01-01

    Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE) but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1), down-regulation in NOE but rescue in RSE (pattern 2), up-regulation in both restraint stress followed by a saline injection (RSS) and NOE, and further amplification in RSE (pattern 3), and up-regulation in RSS but reduction in both NOE and RSE (pattern 4). We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs) to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA) signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses.

  1. Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses

    Directory of Open Access Journals (Sweden)

    Jie Luo

    2018-04-01

    Full Text Available Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1, down-regulation in NOE but rescue in RSE (pattern 2, up-regulation in both restraint stress followed by a saline injection (RSS and NOE, and further amplification in RSE (pattern 3, and up-regulation in RSS but reduction in both NOE and RSE (pattern 4. We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses.

  2. Identifying gene coexpression networks underlying the dynamic regulation of wood-forming tissues in Populus under diverse environmental conditions.

    Science.gov (United States)

    Zinkgraf, Matthew; Liu, Lijun; Groover, Andrew; Filkov, Vladimir

    2017-06-01

    Trees modify wood formation through integration of environmental and developmental signals in complex but poorly defined transcriptional networks, allowing trees to produce woody tissues appropriate to diverse environmental conditions. In order to identify relationships among genes expressed during wood formation, we integrated data from new and publically available datasets in Populus. These datasets were generated from woody tissue and include transcriptome profiling, transcription factor binding, DNA accessibility and genome-wide association mapping experiments. Coexpression modules were calculated, each of which contains genes showing similar expression patterns across experimental conditions, genotypes and treatments. Conserved gene coexpression modules (four modules totaling 8398 genes) were identified that were highly preserved across diverse environmental conditions and genetic backgrounds. Functional annotations as well as correlations with specific experimental treatments associated individual conserved modules with distinct biological processes underlying wood formation, such as cell-wall biosynthesis, meristem development and epigenetic pathways. Module genes were also enriched for DNase I hypersensitivity footprints and binding from four transcription factors associated with wood formation. The conserved modules are excellent candidates for modeling core developmental pathways common to wood formation in diverse environments and genotypes, and serve as testbeds for hypothesis generation and testing for future studies. No claim to original US government works. New Phytologist © 2017 New Phytologist Trust.

  3. Step-by-Step Construction of Gene Co-expression Networks from High-Throughput Arabidopsis RNA Sequencing Data.

    Science.gov (United States)

    Contreras-López, Orlando; Moyano, Tomás C; Soto, Daniela C; Gutiérrez, Rodrigo A

    2018-01-01

    The rapid increase in the availability of transcriptomics data generated by RNA sequencing represents both a challenge and an opportunity for biologists without bioinformatics training. The challenge is handling, integrating, and interpreting these data sets. The opportunity is to use this information to generate testable hypothesis to understand molecular mechanisms controlling gene expression and biological processes (Fig. 1). A successful strategy to generate tractable hypotheses from transcriptomics data has been to build undirected network graphs based on patterns of gene co-expression. Many examples of new hypothesis derived from network analyses can be found in the literature, spanning different organisms including plants and specific fields such as root developmental biology.In order to make the process of constructing a gene co-expression network more accessible to biologists, here we provide step-by-step instructions using published RNA-seq experimental data obtained from a public database. Similar strategies have been used in previous studies to advance root developmental biology. This guide includes basic instructions for the operation of widely used open source platforms such as Bio-Linux, R, and Cytoscape. Even though the data we used in this example was obtained from Arabidopsis thaliana, the workflow developed in this guide can be easily adapted to work with RNA-seq data from any organism.

  4. ESR1 Is Co-Expressed with Closely Adjacent Uncharacterised Genes Spanning a Breast Cancer Susceptibility Locus at 6q25.1

    Science.gov (United States)

    Dunbier, Anita K.; Anderson, Helen; Ghazoui, Zara; Lopez-Knowles, Elena; Pancholi, Sunil; Ribas, Ricardo; Drury, Suzanne; Sidhu, Kally; Leary, Alexandra; Martin, Lesley-Ann; Dowsett, Mitch

    2011-01-01

    Approximately 80% of human breast carcinomas present as oestrogen receptor α-positive (ER+ve) disease, and ER status is a critical factor in treatment decision-making. Recently, single nucleotide polymorphisms (SNPs) in the region immediately upstream of the ER gene (ESR1) on 6q25.1 have been associated with breast cancer risk. Our investigation of factors associated with the level of expression of ESR1 in ER+ve tumours has revealed unexpected associations between genes in this region and ESR1 expression that are important to consider in studies of the genetic causes of breast cancer risk. RNA from tumour biopsies taken from 104 postmenopausal women before and after 2 weeks treatment with an aromatase (oestrogen synthase) inhibitor was analyzed on Illumina 48K microarrays. Multiple-testing corrected Spearman correlation revealed that three previously uncharacterized open reading frames (ORFs) located immediately upstream of ESR1, C6ORF96, C6ORF97, and C6ORF211 were highly correlated with ESR1 (Rs = 0.67, 0.64, and 0.55 respectively, FDRaccount for the correlations. The correlations were maintained in cultured cells. An ERα antagonist did not affect the ORFs' expression or their correlation with ESR1, suggesting their transcriptional co-activation is not directly mediated by ERα. siRNA inhibition of C6ORF211 suppressed proliferation in MCF7 cells, and C6ORF211 positively correlated with a proliferation metagene in tumours. In contrast, C6ORF97 expression correlated negatively with the metagene and predicted for improved disease-free survival in a tamoxifen-treated published dataset, independently of ESR1. Our observations suggest that some of the biological effects previously attributed to ER could be mediated and/or modified by these co-expressed genes. The co-expression and function of these genes may be important influences on the recently identified relationship between SNPs in this region and breast cancer risk. PMID:21552322

  5. Weighted gene co-expression network analysis reveals potential genes involved in early metamorphosis process in sea cucumber Apostichopus japonicus.

    Science.gov (United States)

    Li, Yongxin; Kikuchi, Mani; Li, Xueyan; Gao, Qionghua; Xiong, Zijun; Ren, Yandong; Zhao, Ruoping; Mao, Bingyu; Kondo, Mariko; Irie, Naoki; Wang, Wen

    2018-01-01

    Sea cucumbers, one main class of Echinoderms, have a very fast and drastic metamorphosis process during their development. However, the molecular basis under this process remains largely unknown. Here we systematically examined the gene expression profiles of Japanese common sea cucumber (Apostichopus japonicus) for the first time by RNA sequencing across 16 developmental time points from fertilized egg to juvenile stage. Based on the weighted gene co-expression network analysis (WGCNA), we identified 21 modules. Among them, MEdarkmagenta was highly expressed and correlated with the early metamorphosis process from late auricularia to doliolaria larva. Furthermore, gene enrichment and differentially expressed gene analysis identified several genes in the module that may play key roles in the metamorphosis process. Our results not only provide a molecular basis for experimentally studying the development and morphological complexity of sea cucumber, but also lay a foundation for improving its emergence rate. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon.

    Science.gov (United States)

    Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek

    2017-08-01

    While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  7. Gene Expression Correlated with Severe Asthma Characteristics Reveals Heterogeneous Mechanisms of Severe Disease.

    Science.gov (United States)

    Modena, Brian D; Bleecker, Eugene R; Busse, William W; Erzurum, Serpil C; Gaston, Benjamin M; Jarjour, Nizar N; Meyers, Deborah A; Milosevic, Jadranka; Tedrow, John R; Wu, Wei; Kaminski, Naftali; Wenzel, Sally E

    2017-06-01

    Severe asthma (SA) is a heterogeneous disease with multiple molecular mechanisms. Gene expression studies of bronchial epithelial cells in individuals with asthma have provided biological insight and underscored possible mechanistic differences between individuals. Identify networks of genes reflective of underlying biological processes that define SA. Airway epithelial cell gene expression from 155 subjects with asthma and healthy control subjects in the Severe Asthma Research Program was analyzed by weighted gene coexpression network analysis to identify gene networks and profiles associated with SA and its specific characteristics (i.e., pulmonary function tests, quality of life scores, urgent healthcare use, and steroid use), which potentially identified underlying biological processes. A linear model analysis confirmed these findings while adjusting for potential confounders. Weighted gene coexpression network analysis constructed 64 gene network modules, including modules corresponding to T1 and T2 inflammation, neuronal function, cilia, epithelial growth, and repair mechanisms. Although no network selectively identified SA, genes in modules linked to epithelial growth and repair and neuronal function were markedly decreased in SA. Several hub genes of the epithelial growth and repair module were found located at the 17q12-21 locus, near a well-known asthma susceptibility locus. T2 genes increased with severity in those treated with corticosteroids but were also elevated in untreated, mild-to-moderate disease compared with healthy control subjects. T1 inflammation, especially when associated with increased T2 gene expression, was elevated in a subgroup of younger patients with SA. In this hypothesis-generating analysis, gene expression networks in relation to asthma severity provided potentially new insight into biological mechanisms associated with the development of SA and its phenotypes.

  8. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis

    Directory of Open Access Journals (Sweden)

    Saurav Mallik

    2017-12-01

    Full Text Available For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures—weighted rank-based Jaccard and Cosine measures—and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm—RANWAR—was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.

  9. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis.

    Science.gov (United States)

    Mallik, Saurav; Zhao, Zhongming

    2017-12-28

    For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures-weighted rank-based Jaccard and Cosine measures-and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s) through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm-RANWAR-was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.

  10. VSNL1 Co-expression networks in aging include calcium signaling, synaptic plasticity, and Alzheimer’s disease pathways

    Directory of Open Access Journals (Sweden)

    C W Lin

    2015-03-01

    Full Text Available The Visinin-like 1 (VSNL1 gene encodes Visinin-like protein 1, a peripheral biomarker for Alzheimer disease (AD. Little is known, however, about normal VSNL1 expression in brain and the biologic networks in which it participates. Frontal cortex gray matter from 209 subjects without neurodegenerative or psychiatric illness, ranging in age from 16–91, were processed on Affymetrix GeneChip 1.1 ST and Human SNP Array 6.0. VSNL1 expression was unaffected by age and sex, and not significantly associated with SNPs in cis or trans. VSNL1 was significantly co-expressed with genes in pathways for Calcium Signaling, AD, Long Term Potentiation, Long Term Depression, and Trafficking of AMPA Receptors. The association with AD was driven, in part, by correlation with amyloid precursor protein (APP expression. These findings provide an unbiased link between VSNL1 and molecular mechanisms of AD, including pathways implicated in synaptic pathology in AD. Whether APP may drive increased VSNL1 expression, VSNL1 drives increased APP expression, or both are downstream of common pathogenic regulators will need to be evaluated in model systems.

  11. Transient Co-Expression of Post-Transcriptional Gene Silencing Suppressors for Increased in Planta Expression of a Recombinant Anthrax Receptor Fusion Protein

    Directory of Open Access Journals (Sweden)

    Kittipong Rattanaporn

    2011-08-01

    Full Text Available Potential epidemics of infectious diseases and the constant threat of bioterrorism demand rapid, scalable, and cost-efficient manufacturing of therapeutic proteins. Molecular farming of tobacco plants provides an alternative for the recombinant production of therapeutics. We have developed a transient production platform that uses Agrobacterium infiltration of Nicotiana benthamiana plants to express a novel anthrax receptor decoy protein (immunoadhesin, CMG2-Fc. This chimeric fusion protein, designed to protect against the deadly anthrax toxins, is composed of the von Willebrand factor A (VWA domain of human capillary morphogenesis 2 (CMG2, an effective anthrax toxin receptor, and the Fc region of human immunoglobulin G (IgG. We evaluated, in N. benthamiana intact plants and detached leaves, the expression of CMG2-Fc under the control of the constitutive CaMV 35S promoter, and the co-expression of CMG2-Fc with nine different viral suppressors of post-transcriptional gene silencing (PTGS: p1, p10, p19, p21, p24, p25, p38, 2b, and HCPro. Overall, transient CMG2-Fc expression was higher on intact plants than detached leaves. Maximum expression was observed with p1 co-expression at 3.5 days post-infiltration (DPI, with a level of 0.56 g CMG2-Fc per kg of leaf fresh weight and 1.5% of the total soluble protein, a ten-fold increase in expression when compared to absence of suppression. Co-expression with the p25 PTGS suppressor also significantly increased the CMG2-Fc expression level after just 3.5 DPI.

  12. Transient co-expression of post-transcriptional gene silencing suppressors for increased in planta expression of a recombinant anthrax receptor fusion protein.

    Science.gov (United States)

    Arzola, Lucas; Chen, Junxing; Rattanaporn, Kittipong; Maclean, James M; McDonald, Karen A

    2011-01-01

    Potential epidemics of infectious diseases and the constant threat of bioterrorism demand rapid, scalable, and cost-efficient manufacturing of therapeutic proteins. Molecular farming of tobacco plants provides an alternative for the recombinant production of therapeutics. We have developed a transient production platform that uses Agrobacterium infiltration of Nicotiana benthamiana plants to express a novel anthrax receptor decoy protein (immunoadhesin), CMG2-Fc. This chimeric fusion protein, designed to protect against the deadly anthrax toxins, is composed of the von Willebrand factor A (VWA) domain of human capillary morphogenesis 2 (CMG2), an effective anthrax toxin receptor, and the Fc region of human immunoglobulin G (IgG). We evaluated, in N. benthamiana intact plants and detached leaves, the expression of CMG2-Fc under the control of the constitutive CaMV 35S promoter, and the co-expression of CMG2-Fc with nine different viral suppressors of post-transcriptional gene silencing (PTGS): p1, p10, p19, p21, p24, p25, p38, 2b, and HCPro. Overall, transient CMG2-Fc expression was higher on intact plants than detached leaves. Maximum expression was observed with p1 co-expression at 3.5 days post-infiltration (DPI), with a level of 0.56 g CMG2-Fc per kg of leaf fresh weight and 1.5% of the total soluble protein, a ten-fold increase in expression when compared to absence of suppression. Co-expression with the p25 PTGS suppressor also significantly increased the CMG2-Fc expression level after just 3.5 DPI.

  13. ESR1 is co-expressed with closely adjacent uncharacterised genes spanning a breast cancer susceptibility locus at 6q25.1.

    Directory of Open Access Journals (Sweden)

    Anita K Dunbier

    2011-04-01

    Full Text Available Approximately 80% of human breast carcinomas present as oestrogen receptor α-positive (ER+ve disease, and ER status is a critical factor in treatment decision-making. Recently, single nucleotide polymorphisms (SNPs in the region immediately upstream of the ER gene (ESR1 on 6q25.1 have been associated with breast cancer risk. Our investigation of factors associated with the level of expression of ESR1 in ER+ve tumours has revealed unexpected associations between genes in this region and ESR1 expression that are important to consider in studies of the genetic causes of breast cancer risk. RNA from tumour biopsies taken from 104 postmenopausal women before and after 2 weeks treatment with an aromatase (oestrogen synthase inhibitor was analyzed on Illumina 48K microarrays. Multiple-testing corrected Spearman correlation revealed that three previously uncharacterized open reading frames (ORFs located immediately upstream of ESR1, C6ORF96, C6ORF97, and C6ORF211 were highly correlated with ESR1 (Rs =  0.67, 0.64, and 0.55 respectively, FDR<1 × 10(-7. Publicly available datasets confirmed this relationship in other groups of ER+ve tumours. DNA copy number changes did not account for the correlations. The correlations were maintained in cultured cells. An ERα antagonist did not affect the ORFs' expression or their correlation with ESR1, suggesting their transcriptional co-activation is not directly mediated by ERα. siRNA inhibition of C6ORF211 suppressed proliferation in MCF7 cells, and C6ORF211 positively correlated with a proliferation metagene in tumours. In contrast, C6ORF97 expression correlated negatively with the metagene and predicted for improved disease-free survival in a tamoxifen-treated published dataset, independently of ESR1. Our observations suggest that some of the biological effects previously attributed to ER could be mediated and/or modified by these co-expressed genes. The co-expression and function of these genes may be

  14. Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.

    Directory of Open Access Journals (Sweden)

    Xiaolin Xiao

    2014-01-01

    Full Text Available Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states. Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based and humans (mRNA-sequencing-based and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi

  15. Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.

    Science.gov (United States)

    Xiao, Xiaolin; Moreno-Moral, Aida; Rotival, Maxime; Bottolo, Leonardo; Petretto, Enrico

    2014-01-01

    Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states). Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks) that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted) networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based) and humans (mRNA-sequencing-based) and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi-tissue subnetwork of co-expressed

  16. Comparison of co-expression measures: mutual information, correlation, and model based indices.

    Science.gov (United States)

    Song, Lin; Langfelder, Peter; Horvath, Steve

    2012-12-09

    Co-expression measures are often used to define networks among genes. Mutual information (MI) is often used as a generalized correlation measure. It is not clear how much MI adds beyond standard (robust) correlation measures or regression model based association measures. Further, it is important to assess what transformations of these and other co-expression measures lead to biologically meaningful modules (clusters of genes). We provide a comprehensive comparison between mutual information and several correlation measures in 8 empirical data sets and in simulations. We also study different approaches for transforming an adjacency matrix, e.g. using the topological overlap measure. Overall, we confirm close relationships between MI and correlation in all data sets which reflects the fact that most gene pairs satisfy linear or monotonic relationships. We discuss rare situations when the two measures disagree. We also compare correlation and MI based approaches when it comes to defining co-expression network modules. We show that a robust measure of correlation (the biweight midcorrelation transformed via the topological overlap transformation) leads to modules that are superior to MI based modules and maximal information coefficient (MIC) based modules in terms of gene ontology enrichment. We present a function that relates correlation to mutual information which can be used to approximate the mutual information from the corresponding correlation coefficient. We propose the use of polynomial or spline regression models as an alternative to MI for capturing non-linear relationships between quantitative variables. The biweight midcorrelation outperforms MI in terms of elucidating gene pairwise relationships. Coupled with the topological overlap matrix transformation, it often leads to more significantly enriched co-expression modules. Spline and polynomial networks form attractive alternatives to MI in case of non-linear relationships. Our results indicate that MI

  17. The arabidopsis wall associated kinase-like 10 gene encodes a functional guanylyl cyclase and is co-expressed with pathogen defense related genes

    KAUST Repository

    Meier, Stuart; Ruzvidzo, Oziniel; Morse, Monique; Donaldson, Lara; Kwezi, Lusisizwe; Gehring, Christoph A

    2010-01-01

    Background: Second messengers have a key role in linking environmental stimuli to physiological responses. One such messenger, guanosine 3?,5?-cyclic monophosphate (cGMP), has long been known to be an essential signaling molecule in many different physiological processes in higher plants, including biotic stress responses. To date, however, the guanylyl cyclase (GC) enzymes that catalyze the formation of cGMP from GTP have largely remained elusive in higher plants. Principal Findings: We have identified an Arabidopsis receptor type wall associated kinase-like molecule (AtWAKL10) as a candidate GC and provide experimental evidence to show that the intracellular domain of AtWAKL10431-700 can generate cGMP in vitro. Further, we also demonstrate that the molecule has kinase activity indicating that AtWAKL10 is a twin-domain catalytic protein. A co-expression and stimulus-specific expression analysis revealed that AtWAKL10 is consistently coexpressed with well characterized pathogen defense related genes and along with these genes is induced early and sharply in response to a range of pathogens and their elicitors. Conclusions: We demonstrate that AtWAKL10 is a twin-domain, kinase-GC signaling molecule that may function in biotic stress responses that are critically dependent on the second messenger cGMP. © 2010 Meier et al.

  18. The arabidopsis wall associated kinase-like 10 gene encodes a functional guanylyl cyclase and is co-expressed with pathogen defense related genes

    KAUST Repository

    Meier, Stuart

    2010-01-26

    Background: Second messengers have a key role in linking environmental stimuli to physiological responses. One such messenger, guanosine 3?,5?-cyclic monophosphate (cGMP), has long been known to be an essential signaling molecule in many different physiological processes in higher plants, including biotic stress responses. To date, however, the guanylyl cyclase (GC) enzymes that catalyze the formation of cGMP from GTP have largely remained elusive in higher plants. Principal Findings: We have identified an Arabidopsis receptor type wall associated kinase-like molecule (AtWAKL10) as a candidate GC and provide experimental evidence to show that the intracellular domain of AtWAKL10431-700 can generate cGMP in vitro. Further, we also demonstrate that the molecule has kinase activity indicating that AtWAKL10 is a twin-domain catalytic protein. A co-expression and stimulus-specific expression analysis revealed that AtWAKL10 is consistently coexpressed with well characterized pathogen defense related genes and along with these genes is induced early and sharply in response to a range of pathogens and their elicitors. Conclusions: We demonstrate that AtWAKL10 is a twin-domain, kinase-GC signaling molecule that may function in biotic stress responses that are critically dependent on the second messenger cGMP. © 2010 Meier et al.

  19. Co-expression Network Approach to Studying the Effects of Botulinum Neurotoxin-A.

    Science.gov (United States)

    Mukund, Kavitha; Ward, Samuel R; Lieber, Richard L; Subramaniam, Shankar

    2017-10-16

    Botulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules, further hierarchically clustered into 5 groups. Quantifying average expression and co-expression patterns across these groups revealed temporal aspects of muscle's response to BoNT-A. Functional analysis revealed enrichment of group 1 with metabolism; group 5 with contradictory functions of atrophy and cellular recovery; and groups 2 and 3 with extracellular matrix (ECM) and non-fast fiber isoforms. Topological positioning of two highly ranked, significantly expressed genes- Dclk1 and Ostalpha within group 5 suggested possible mechanistic roles in recovery from BoNT-A induced atrophy. Phenotypic correlations of groups with titin and myosin protein content further emphasized the effect of BoNT-A on the sarcomeric contraction machinery in early phase of chemodenervation. In summary, our approach revealed a hierarchical functional response to BoNT-A induced paralysis with early metabolic and later ECM responses and identified putative biomarkers associated with chemodenervation. Additionally, our results provide an unbiased validation of the response documented in our previous workBotulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules

  20. Coexpression of bile salt hydrolase gene and catalase gene remarkably improves oxidative stress and bile salt resistance in Lactobacillus casei.

    Science.gov (United States)

    Wang, Guohong; Yin, Sheng; An, Haoran; Chen, Shangwu; Hao, Yanling

    2011-08-01

    Lactic acid bacteria (LAB) encounter various types of stress during industrial processes and gastrointestinal transit. Catalase (CAT) and bile salt hydrolase (BSH) can protect bacteria from oxidative stress or damage caused by bile salts by decomposing hydrogen peroxide (H(2)O(2)) or deconjugating the bile salts, respectively. Lactobacillus casei is a valuable probiotic strain and is often deficient in both CAT and BSH. In order to improve the resistance of L. casei to both oxidative and bile salts stress, the catalase gene katA from L. sakei and the bile salt hydrolase gene bsh1 from L. plantarum were coexpressed in L. casei HX01. The enzyme activities of CAT and BSH were 2.41 μmol H(2)O(2)/min/10(8) colony-forming units (CFU) and 2.11 μmol glycine/min/ml in the recombinant L. casei CB, respectively. After incubation with 8 mM H(2)O(2), survival ratio of L. casei CB was 40-fold higher than that of L. casei CK. Treatment of L. casei CB with various concentrations of sodium glycodeoxycholate (GDCA) showed that ~10(5) CFU/ml cells survived after incubation with 0.5% GDCA, whereas almost all the L. casei CK cells were killed when treaded with 0.4% GDCA. These results indicate that the coexpression of CAT and BSH confers high-level resistance to both oxidative and bile salts stress conditions in L. casei HX01.

  1. Preservation Analysis of Macrophage Gene Coexpression Between Human and Mouse Identifies PARK2 as a Genetically Controlled Master Regulator of Oxidative Phosphorylation in Humans

    Directory of Open Access Journals (Sweden)

    Veronica Codoni

    2016-10-01

    Full Text Available Macrophages are key players involved in numerous pathophysiological pathways and an in-depth characterization of their gene regulatory networks can help in better understanding how their dysfunction may impact on human diseases. We here conducted a cross-species network analysis of macrophage gene expression data between human and mouse to identify conserved networks across both species, and assessed whether such networks could reveal new disease-associated regulatory mechanisms. From a sample of 684 individuals processed for genome-wide macrophage gene expression profiling, we identified 27 groups of coexpressed genes (modules. Six modules were found preserved (P < 10−4 in macrophages from 86 mice of the Hybrid Mouse Diversity Panel. One of these modules was significantly [false discovery rate (FDR = 8.9 × 10−11] enriched for genes belonging to the oxidative phosphorylation (OXPHOS pathway. This pathway was also found significantly (FDR < 10−4 enriched in susceptibility genes for Alzheimer, Parkinson, and Huntington diseases. We further conducted an expression quantitative trait loci analysis to identify SNP that could regulate macrophage OXPHOS gene expression in humans. This analysis identified the PARK2 rs192804963 as a trans-acting variant influencing (minimal P-value = 4.3 × 10−8 the expression of most OXPHOS genes in humans. Further experimental work demonstrated that PARK2 knockdown expression was associated with increased OXPHOS gene expression in THP1 human macrophages. This work provided strong new evidence that PARK2 participates to the regulatory networks associated with oxidative phosphorylation and suggested that PARK2 genetic variations could act as a trans regulator of OXPHOS gene macrophage expression in humans.

  2. Gene Coexpression Analysis Reveals Complex Metabolism of the Monoterpene Alcohol Linalool in Arabidopsis Flowers[W][OPEN

    Science.gov (United States)

    Ginglinger, Jean-François; Boachon, Benoit; Höfer, René; Paetz, Christian; Köllner, Tobias G.; Miesch, Laurence; Lugan, Raphael; Baltenweck, Raymonde; Mutterer, Jérôme; Ullmann, Pascaline; Beran, Franziska; Claudel, Patricia; Verstappen, Francel; Fischer, Marc J.C.; Karst, Francis; Bouwmeester, Harro; Miesch, Michel; Schneider, Bernd; Gershenzon, Jonathan; Ehlting, Jürgen; Werck-Reichhart, Danièle

    2013-01-01

    The cytochrome P450 family encompasses the largest family of enzymes in plant metabolism, and the functions of many of its members in Arabidopsis thaliana are still unknown. Gene coexpression analysis pointed to two P450s that were coexpressed with two monoterpene synthases in flowers and were thus predicted to be involved in monoterpenoid metabolism. We show that all four selected genes, the two terpene synthases (TPS10 and TPS14) and the two cytochrome P450s (CYP71B31 and CYP76C3), are simultaneously expressed at anthesis, mainly in upper anther filaments and in petals. Upon transient expression in Nicotiana benthamiana, the TPS enzymes colocalize in vesicular structures associated with the plastid surface, whereas the P450 proteins were detected in the endoplasmic reticulum. Whether they were expressed in Saccharomyces cerevisiae or in N. benthamiana, the TPS enzymes formed two different enantiomers of linalool: (−)-(R)-linalool for TPS10 and (+)-(S)-linalool for TPS14. Both P450 enzymes metabolize the two linalool enantiomers to form different but overlapping sets of hydroxylated or epoxidized products. These oxygenated products are not emitted into the floral headspace, but accumulate in floral tissues as further converted or conjugated metabolites. This work reveals complex linalool metabolism in Arabidopsis flowers, the ecological role of which remains to be determined. PMID:24285789

  3. Predicting targeted drug combinations based on Pareto optimal patterns of coexpression network connectivity.

    Science.gov (United States)

    Penrod, Nadia M; Greene, Casey S; Moore, Jason H

    2014-01-01

    Molecularly targeted drugs promise a safer and more effective treatment modality than conventional chemotherapy for cancer patients. However, tumors are dynamic systems that readily adapt to these agents activating alternative survival pathways as they evolve resistant phenotypes. Combination therapies can overcome resistance but finding the optimal combinations efficiently presents a formidable challenge. Here we introduce a new paradigm for the design of combination therapy treatment strategies that exploits the tumor adaptive process to identify context-dependent essential genes as druggable targets. We have developed a framework to mine high-throughput transcriptomic data, based on differential coexpression and Pareto optimization, to investigate drug-induced tumor adaptation. We use this approach to identify tumor-essential genes as druggable candidates. We apply our method to a set of ER(+) breast tumor samples, collected before (n = 58) and after (n = 60) neoadjuvant treatment with the aromatase inhibitor letrozole, to prioritize genes as targets for combination therapy with letrozole treatment. We validate letrozole-induced tumor adaptation through coexpression and pathway analyses in an independent data set (n = 18). We find pervasive differential coexpression between the untreated and letrozole-treated tumor samples as evidence of letrozole-induced tumor adaptation. Based on patterns of coexpression, we identify ten genes as potential candidates for combination therapy with letrozole including EPCAM, a letrozole-induced essential gene and a target to which drugs have already been developed as cancer therapeutics. Through replication, we validate six letrozole-induced coexpression relationships and confirm the epithelial-to-mesenchymal transition as a process that is upregulated in the residual tumor samples following letrozole treatment. To derive the greatest benefit from molecularly targeted drugs it is critical to design combination

  4. Age gene expression and coexpression progressive signatures in peripheral blood leukocytes.

    Science.gov (United States)

    Irizar, Haritz; Goñi, Joaquín; Alzualde, Ainhoa; Castillo-Triviño, Tamara; Olascoaga, Javier; Lopez de Munain, Adolfo; Otaegui, David

    2015-12-01

    Both cellular senescence and organismic aging are known to be dynamic processes that start early in life and progress constantly during the whole life of the individual. In this work, with the objective of identifying signatures of age-related progressive change at the transcriptomic level, we have performed a whole-genome gene expression analysis of peripheral blood leukocytes in a group of healthy individuals with ages ranging from 14 to 93 years. A set of genes with progressively changing gene expression (either increase or decrease with age) has been identified and contextualized in a coexpression network. A modularity analysis has been performed on this network and biological-term and pathway enrichment analyses have been used for biological interpretation of each module. In summary, the results of the present work reveal the existence of a transcriptomic component that shows progressive expression changes associated to age in peripheral blood leukocytes, highlighting both the dynamic nature of the process and the need to complement young vs. elder studies with longitudinal studies that include middle aged individuals. From the transcriptional point of view, immunosenescence seems to be occurring from a relatively early age, at least from the late 20s/early 30s, and the 49-56 year old age-range appears to be critical. In general, the genes that, according to our results, show progressive expression changes with aging are involved in pathogenic/cellular processes that have classically been linked to aging in humans: cancer, immune processes and cellular growth vs. maintenance. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. Guidance for RNA-seq co-expression network construction and analysis: safety in numbers.

    Science.gov (United States)

    Ballouz, S; Verleyen, W; Gillis, J

    2015-07-01

    RNA-seq co-expression analysis is in its infancy and reasonable practices remain poorly defined. We assessed a variety of RNA-seq expression data to determine factors affecting functional connectivity and topology in co-expression networks. We examine RNA-seq co-expression data generated from 1970 RNA-seq samples using a Guilt-By-Association framework, in which genes are assessed for the tendency of co-expression to reflect shared function. Minimal experimental criteria to obtain performance on par with microarrays were >20 samples with read depth >10 M per sample. While the aggregate network constructed shows good performance (area under the receiver operator characteristic curve ∼0.71), the dependency on number of experiments used is nearly identical to that present in microarrays, suggesting thousands of samples are required to obtain 'gold-standard' co-expression. We find a major topological difference between RNA-seq and microarray co-expression in the form of low overlaps between hub-like genes from each network due to changes in the correlation of expression noise within each technology. jgillis@cshl.edu or sballouz@cshl.edu Networks are available at: http://gillislab.labsites.cshl.edu/supplements/rna-seq-networks/ and supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  6. Construction and identification of double-gene co-expression vector with radiation-inducible human TRAIL and endostatin

    International Nuclear Information System (INIS)

    Li Yanbo; Guo Caixia; Gong Pingsheng; Liu Yang; Liangshuo; Wang Hongfang; Wang Jianfeng; Gong Shouliang

    2010-01-01

    Objective: To construct a recombinant plasmid pshuttle-Egr1-shTRAIL-shES containing tumor necrosis factor-related apoptosis-inducing ligand (TRAIL) and endostatin double genes. Methods: The secretary endostatin gene (shES) fragment was amplified from the pMD19T-endostatin vector by PCR. The shES gene was ligated to pMD19Tand sequenced. Finally, using the gene recombinant technique, the recombinant plasmid pshuttle-Egr1- shTRAIL-shES with radiation-inducible Egr1 promoter, secretary TRAIL and endostatin double-gene was constructed. Results: The sequence of the shES gene was in concordance with that anticipated indicating shES gene was acquired successfully.Moreover, the results acquired by PCR and restrictive digestion identification of the recombinant plasmid pshuttle-Egr1-shTRAIL-shES and all the vectors refered to its construction confirmed that pshuttle-Egr1-shTRAIL-shES was constructed correctly. Conclusion: The radiation-inducible double-gene co-expression vector pshuttle-Egr1-shTRAIL-shES is constructed successfully, which would set the experimental foundation for further study on the anti-tumor effect of TRAIL and endostatin double-gene-radiotherapy and its related mechanisms. (authors)

  7. Co-expression of interleukin 12 enhances antitumor effects of a novel chimeric promoter-mediated suicide gene therapy in an immunocompetent mouse model

    International Nuclear Information System (INIS)

    Xu, Yu; Liu, Zhengchun; Kong, Haiyan; Sun, Wenjie; Liao, Zhengkai; Zhou, Fuxiang; Xie, Conghua

    2011-01-01

    Highlights: → A novel chimeric promoter consisting of CArG element and hTERT promoter was developed. → The promoter was characterized with radiation-inducibility and tumor-specificity. → Suicide gene system driven by the promoter showed remarkable cytotoxicity in vitro. → Co-expression of IL12 enhanced the promoter mediated suicide gene therapy in vivo. -- Abstract: The human telomerase reverse transcriptase (hTERT) promoter has been widely used in target gene therapy of cancer. However, low transcriptional activity limited its clinical application. Here, we designed a novel dual radiation-inducible and tumor-specific promoter system consisting of CArG elements and the hTERT promoter, resulting in increased expression of reporter genes after gamma-irradiation. Therapeutic and side effects of adenovirus-mediated horseradish peroxidase (HRP)/indole-3-acetic (IAA) system downstream of the chimeric promoter were evaluated in mice bearing Lewis lung carcinoma, combining with or without adenovirus-mediated interleukin 12 (IL12) gene driven by the cytomegalovirus promoter. The combination treatment showed more effective suppression of tumor growth than those with single agent alone, being associated with pronounced intratumoral T-lymphocyte infiltration and minor side effects. Our results suggest that the combination treatment with HRP/IAA system driven by the novel chimeric promoter and the co-expression of IL12 might be an effective and safe target gene therapy strategy of cancer.

  8. Co-expression of interleukin 12 enhances antitumor effects of a novel chimeric promoter-mediated suicide gene therapy in an immunocompetent mouse model

    Energy Technology Data Exchange (ETDEWEB)

    Xu, Yu, E-mail: xuyu1001@gmail.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Liu, Zhengchun, E-mail: l135027@126.com [Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Kong, Haiyan, E-mail: suppleant@163.com [Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Sun, Wenjie, E-mail: wendy11240325@163.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Liao, Zhengkai, E-mail: fastbeta@gmail.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Zhou, Fuxiang, E-mail: happyzhoufx@sina.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Xie, Conghua, E-mail: chxie_65@hotmail.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); and others

    2011-09-09

    Highlights: {yields} A novel chimeric promoter consisting of CArG element and hTERT promoter was developed. {yields} The promoter was characterized with radiation-inducibility and tumor-specificity. {yields} Suicide gene system driven by the promoter showed remarkable cytotoxicity in vitro. {yields} Co-expression of IL12 enhanced the promoter mediated suicide gene therapy in vivo. -- Abstract: The human telomerase reverse transcriptase (hTERT) promoter has been widely used in target gene therapy of cancer. However, low transcriptional activity limited its clinical application. Here, we designed a novel dual radiation-inducible and tumor-specific promoter system consisting of CArG elements and the hTERT promoter, resulting in increased expression of reporter genes after gamma-irradiation. Therapeutic and side effects of adenovirus-mediated horseradish peroxidase (HRP)/indole-3-acetic (IAA) system downstream of the chimeric promoter were evaluated in mice bearing Lewis lung carcinoma, combining with or without adenovirus-mediated interleukin 12 (IL12) gene driven by the cytomegalovirus promoter. The combination treatment showed more effective suppression of tumor growth than those with single agent alone, being associated with pronounced intratumoral T-lymphocyte infiltration and minor side effects. Our results suggest that the combination treatment with HRP/IAA system driven by the novel chimeric promoter and the co-expression of IL12 might be an effective and safe target gene therapy strategy of cancer.

  9. A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways

    OpenAIRE

    Musungu, Bryan M.; Bhatnagar, Deepak; Brown, Robert L.; Payne, Gary A.; OBrian, Greg; Fakhoury, Ahmad M.; Geisler, Matt

    2016-01-01

    A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flav...

  10. Gene Coexpression and Evolutionary Conservation Analysis of the Human Preimplantation Embryos

    Directory of Open Access Journals (Sweden)

    Tiancheng Liu

    2015-01-01

    Full Text Available Evolutionary developmental biology (EVO-DEVO tries to decode evolutionary constraints on the stages of embryonic development. Two models—the “funnel-like” model and the “hourglass” model—have been proposed by investigators to illustrate the fluctuation of selective pressure on these stages. However, selective indices of stages corresponding to mammalian preimplantation embryonic development (PED were undetected in previous studies. Based on single cell RNA sequencing of stages during human PED, we used coexpression method to identify gene modules activated in each of these stages. Through measuring the evolutionary indices of gene modules belonging to each stage, we observed change pattern of selective constraints on PED for the first time. The selective pressure decreases from the zygote stage to the 4-cell stage and increases at the 8-cell stage and then decreases again from 8-cell stage to the late blastocyst stages. Previous EVO-DEVO studies concerning the whole embryo development neglected the fluctuation of selective pressure in these earlier stages, and the fluctuation was potentially correlated with events of earlier stages, such as zygote genome activation (ZGA. Such oscillation in an earlier stage would further affect models of the evolutionary constraints on whole embryo development. Therefore, these earlier stages should be measured intensively in future EVO-DEVO studies.

  11. Inferring the transcriptional landscape of bovine skeletal muscle by integrating co-expression networks.

    Directory of Open Access Journals (Sweden)

    Nicholas J Hudson

    Full Text Available BACKGROUND: Despite modern technologies and novel computational approaches, decoding causal transcriptional regulation remains challenging. This is particularly true for less well studied organisms and when only gene expression data is available. In muscle a small number of well characterised transcription factors are proposed to regulate development. Therefore, muscle appears to be a tractable system for proposing new computational approaches. METHODOLOGY/PRINCIPAL FINDINGS: Here we report a simple algorithm that asks "which transcriptional regulator has the highest average absolute co-expression correlation to the genes in a co-expression module?" It correctly infers a number of known causal regulators of fundamental biological processes, including cell cycle activity (E2F1, glycolysis (HLF, mitochondrial transcription (TFB2M, adipogenesis (PIAS1, neuronal development (TLX3, immune function (IRF1 and vasculogenesis (SOX17, within a skeletal muscle context. However, none of the canonical pro-myogenic transcription factors (MYOD1, MYOG, MYF5, MYF6 and MEF2C were linked to muscle structural gene expression modules. Co-expression values were computed using developing bovine muscle from 60 days post conception (early foetal to 30 months post natal (adulthood for two breeds of cattle, in addition to a nutritional comparison with a third breed. A number of transcriptional landscapes were constructed and integrated into an always correlated landscape. One notable feature was a 'metabolic axis' formed from glycolysis genes at one end, nuclear-encoded mitochondrial protein genes at the other, and centrally tethered by mitochondrially-encoded mitochondrial protein genes. CONCLUSIONS/SIGNIFICANCE: The new module-to-regulator algorithm complements our recently described Regulatory Impact Factor analysis. Together with a simple examination of a co-expression module's contents, these three gene expression approaches are starting to illuminate the in vivo

  12. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.

  13. Construction and comparison of gene co-expression networks shows complex plant immune responses

    Directory of Open Access Journals (Sweden)

    Luis Guillermo Leal

    2014-10-01

    Full Text Available Gene co-expression networks (GCNs are graphic representations that depict the coordinated transcription of genes in response to certain stimuli. GCNs provide functional annotations of genes whose function is unknown and are further used in studies of translational functional genomics among species. In this work, a methodology for the reconstruction and comparison of GCNs is presented. This approach was applied using gene expression data that were obtained from immunity experiments in Arabidopsis thaliana, rice, soybean, tomato and cassava. After the evaluation of diverse similarity metrics for the GCN reconstruction, we recommended the mutual information coefficient measurement and a clustering coefficient-based method for similarity threshold selection. To compare GCNs, we proposed a multivariate approach based on the Principal Component Analysis (PCA. Branches of plant immunity that were exemplified by each experiment were analyzed in conjunction with the PCA results, suggesting both the robustness and the dynamic nature of the cellular responses. The dynamic of molecular plant responses produced networks with different characteristics that are differentiable using our methodology. The comparison of GCNs from plant pathosystems, showed that in response to similar pathogens plants could activate conserved signaling pathways. The results confirmed that the closeness of GCNs projected on the principal component space is an indicative of similarity among GCNs. This also can be used to understand global patterns of events triggered during plant immune responses.

  14. Identifying miRNA and gene modules of colon cancer associated with pathological stage by weighted gene co-expression network analysis

    Directory of Open Access Journals (Sweden)

    Zhou X

    2018-05-01

    Full Text Available Xian-guo Zhou,1,2,* Xiao-liang Huang,1,2,* Si-yuan Liang,1–3 Shao-mei Tang,1,2 Si-kao Wu,1,2 Tong-tong Huang,1,2 Zeng-nan Mo,1,2,4 Qiu-yan Wang1,2,5 1Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 2Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 3Department of Colorectal Surgery, First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 4Department of Urology and Nephrology, The First Affiliated Hospital of Guangxi, Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 5Guangxi Colleges and Universities Key Laboratory of Biological Molecular Medicine Research, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China *These authors contributed equally to this work Introduction: Colorectal cancer (CRC is the fourth most common cause of cancer-related mortality worldwide. The tumor, node, metastasis (TNM stage remains the standard for CRC prognostication. Identification of meaningful microRNA (miRNA and gene modules or representative biomarkers related to the pathological stage of colon cancer helps to predict prognosis and reveal the mechanisms behind cancer progression.Materials and methods: We applied a systems biology approach by combining differential expression analysis and weighted gene co-expression network analysis (WGCNA to detect the pathological stage-related miRNA and gene modules and construct a miRNA–gene network. The Cancer Genome Atlas (TCGA colon adenocarcinoma (CAC RNA-sequencing data and miRNA-sequencing data were subjected to WGCNA analysis, and the GSE29623, GSE35602 and GSE39396 were utilized to validate and

  15. Production of natural fragrance aromatic acids by coexpression of trans-anethole oxygenase and p-anisaldehyde dehydrogenase genes of Pseudomonas putida JYR-1 in Escherichia coli.

    Science.gov (United States)

    Han, Dongfei; Kurusarttra, Somwang; Ryu, Ji-Young; Kanaly, Robert A; Hur, Hor-Gil

    2012-12-05

    A gene encoding p-anisaldehyde dehydrogenase (PAADH), which catalyzes the oxidation of p-anisaldehyde to p-anisic acid, was identified to be clustered with the trans-anethole oxygenase (tao) gene in Pseudomonas putida JYR-1. Heterologously expressed PAADH in Escherichia coli catalyzed the oxidation of vanillin, veratraldehyde, and piperonal to the corresponding aromatic acids vanillic acid, veratric acid, and piperonylic acid, respectively. Coexpression of trans-anethole oxygenase (TAO) and PAADH in E. coli also resulted in the successful transformation of trans-anethole, isoeugenol, O-methyl isoeugenol, and isosafrole to p-anisic acid, vanillic acid, veratric acid, and piperonylic acid, respectively, which are compounds found in plants as secondary metabolites. Because of the relaxed substrate specificity and high transformation rates by coexpressed TAO and PAADH in E. coli , the engineered strain has potential to be applied in the fragrance industry.

  16. Comparative transcriptome and gene co-expression network analysis reveal genes and signaling pathways adaptively responsive to varied adverse stresses in the insect fungal pathogen, Beauveria bassiana.

    Science.gov (United States)

    He, Zhangjiang; Zhao, Xin; Lu, Zhuoyue; Wang, Huifang; Liu, Pengfei; Zeng, Fanqin; Zhang, Yongjun

    2018-01-01

    Sensing, responding, and adapting to the surrounding environment are crucial for all living organisms to survive, proliferate, and differentiate in their biological niches. Beauveria bassiana is an economically important insect-pathogenic fungus which is widely used as a biocontrol agent to control a variety of insect pests. The fungal pathogen unavoidably encounters a variety of adverse environmental stresses and defense response from the host insects during application of the fungal agents. However, few are known about the transcription response of the fungus to respond or adapt varied adverse stresses. Here, we comparatively analyzed the transcriptome of B. bassiana in globe genome under the varied stationary-phase stresses including osmotic agent (0.8 M NaCl), high temperature (32 °C), cell wall-perturbing agent (Congo red), and oxidative agents (H 2 O 2 or menadione). Total of 12,412 reads were obtained, and mapped to the 6767 genes of the B. bassiana. All of these stresses caused transcription responses involved in basal metabolism, cell wall construction, stress response or cell rescue/detoxification, signaling transduction and gene transcription regulation, and likely other cellular processes. An array of genes displayed similar transcription patterns in response to at least two of the five stresses, suggesting a shared transcription response to varied adverse stresses. Gene co-expression network analysis revealed that mTOR signaling pathway, but not HOG1 MAP kinase pathway, played a central role in regulation the varied adverse stress responses, which was verified by RNAi-mediated knockdown of TOR1. Our findings provided an insight of transcription response and gene co-expression network of B. bassiana in adaptation to varied environments. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Co-expression analysis identifies CRC and AP1 the regulator of Arabidopsis fatty acid biosynthesis.

    Science.gov (United States)

    Han, Xinxin; Yin, Linlin; Xue, Hongwei

    2012-07-01

    Fatty acids (FAs) play crucial rules in signal transduction and plant development, however, the regulation of FA metabolism is still poorly understood. To study the relevant regulatory network, fifty-eight FA biosynthesis genes including de novo synthases, desaturases and elongases were selected as "guide genes" to construct the co-expression network. Calculation of the correlation between all Arabidopsis thaliana (L.) genes with each guide gene by Arabidopsis co-expression dating mining tools (ACT) identifies 797 candidate FA-correlated genes. Gene ontology (GO) analysis of these co-expressed genes showed they are tightly correlated to photosynthesis and carbohydrate metabolism, and function in many processes. Interestingly, 63 transcription factors (TFs) were identified as candidate FA biosynthesis regulators and 8 TF families are enriched. Two TF genes, CRC and AP1, both correlating with 8 FA guide genes, were further characterized. Analyses of the ap1 and crc mutant showed the altered total FA composition of mature seeds. The contents of palmitoleic acid, stearic acid, arachidic acid and eicosadienoic acid are decreased, whereas that of oleic acid is increased in ap1 and crc seeds, which is consistent with the qRT-PCR analysis revealing the suppressed expression of the corresponding guide genes. In addition, yeast one-hybrid analysis and electrophoretic mobility shift assay (EMSA) revealed that CRC can bind to the promoter regions of KCS7 and KCS15, indicating that CRC may directly regulate FA biosynthesis. © 2012 Institute of Botany, Chinese Academy of Sciences.

  18. Comparison of gene co-networks reveals the molecular mechanisms of the rice (Oryza sativa L.) response to Rhizoctonia solani AG1 IA infection.

    Science.gov (United States)

    Zhang, Jinfeng; Zhao, Wenjuan; Fu, Rong; Fu, Chenglin; Wang, Lingxia; Liu, Huainian; Li, Shuangcheng; Deng, Qiming; Wang, Shiquan; Zhu, Jun; Liang, Yueyang; Li, Ping; Zheng, Aiping

    2018-05-05

    Rhizoctonia solani causes rice sheath blight, an important disease affecting the growth of rice (Oryza sativa L.). Attempts to control the disease have met with little success. Based on transcriptional profiling, we previously identified more than 11,947 common differentially expressed genes (TPM > 10) between the rice genotypes TeQing and Lemont. In the current study, we extended these findings by focusing on an analysis of gene co-expression in response to R. solani AG1 IA and identified gene modules within the networks through weighted gene co-expression network analysis (WGCNA). We compared the different genes assigned to each module and the biological interpretations of gene co-expression networks at early and later modules in the two rice genotypes to reveal differential responses to AG1 IA. Our results show that different changes occurred in the two rice genotypes and that the modules in the two groups contain a number of candidate genes possibly involved in pathogenesis, such as the VQ protein. Furthermore, these gene co-expression networks provide comprehensive transcriptional information regarding gene expression in rice in response to AG1 IA. The co-expression networks derived from our data offer ideas for follow-up experimentation that will help advance our understanding of the translational regulation of rice gene expression changes in response to AG1 IA.

  19. Partial antiviral activities detection of chicken Mx jointing with neuraminidase gene (NA against Newcastle disease virus.

    Directory of Open Access Journals (Sweden)

    Yani Zhang

    Full Text Available As an attempt to increase the resistance to Newcastle Disease Virus (NDV and so further reduction of its risk on the poultry industry. This work aimed to build the eukaryotic gene co-expression plasmid of neuraminidase (NA gene and myxo-virus resistance (Mx and detect the gene expression in transfected mouse fibroblasts (NIH-3T3 cells, it is most important to investigate the influence of the recombinant plasmid on the chicken embryonic fibroblasts (CEF cells. cDNA fragment of NA and mutant Mx gene were derived from pcDNA3.0-NA and pcDNA3.0-Mx plasmid via PCR, respectively, then NA and Mx cDNA fragment were inserted into the multiple cloning sites of pVITRO2 to generate the eukaryotic co-expression plasmid pVITRO2-Mx-NA. The recombinant plasmid was confirmed by restriction endonuclease treatment and sequencing, and it was transfected into the mouse fibroblasts (NIH-3T3 cells. The expression of genes in pVITRO2-Mx-NA were measured by RT-PCR and indirect immunofluorescence assay (IFA. The recombinant plasmid was transfected into CEF cells then RT-PCR and the micro-cell inhibition tests were used to test the antiviral activity for NDV. Our results showed that co-expression vector pVITRO2-Mx-NA was constructed successfully; the expression of Mx and NA could be detected in both NIH-3T3 and CEF cells. The recombinant proteins of Mx and NA protect CEF cells from NDV infection until after 72 h of incubation but the individually mutagenic Mx protein or NA protein protects CEF cells from NDV infection till 48 h post-infection, and co-transfection group decreased significantly NDV infection compared with single-gene transfection group (P<0. 05, indicating that Mx-NA jointing contributed to delaying the infection of NDV in single-cell level and the co-transfection of the jointed genes was more powerful than single one due to their synergistic effects.

  20. Assessing pathogenicity of MLH1 variants by co-expression of human MLH1 and PMS2 genes in yeast

    Energy Technology Data Exchange (ETDEWEB)

    Vogelsang, Matjaz; Comino, Aleksandra; Zupanec, Neja [Department for Biosynthesis and Biotransformation, National Institute of Chemistry, Hajdrihova 19, SI-1001 Ljubljana (Slovenia); Hudler, Petra [Medical Center for Molecular Biology, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1000 Ljubljana (Slovenia); Komel, Radovan [Department for Biosynthesis and Biotransformation, National Institute of Chemistry, Hajdrihova 19, SI-1001 Ljubljana (Slovenia); Medical Center for Molecular Biology, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1000 Ljubljana (Slovenia)

    2009-10-28

    Loss of DNA mismatch repair (MMR) in humans, mainly due to mutations in the hMLH1 gene, is linked to hereditary nonpolyposis colorectal cancer (HNPCC). Because not all MLH1 alterations result in loss of MMR function, accurate characterization of variants and their classification in terms of their effect on MMR function is essential for reliable genetic testing and effective treatment. To date, in vivo assays for functional characterization of MLH1 mutations performed in various model systems have used episomal expression of the modified MMR genes. We describe here a novel approach to determine accurately the functional significance of hMLH1 mutations in vivo, based on co-expression of human MLH1 and PMS2 in yeast cells. Yeast MLH1 and PMS1 genes, whose protein products form the MutLα complex, were replaced by human orthologs directly on yeast chromosomes by homologous recombination, and the resulting MMR activity was tested. The yeast strain co-expressing hMLH1 and hPMS2 exhibited the same mutation rate as the wild-type. Eight cancer-related MLH1 variants were introduced, using the same approach, into the prepared yeast model, and their effect on MMR function was determined. Five variants (A92P, S93G, I219V, K618R and K618T) were classified as non-pathogenic, whereas variants T117M, Y646C and R659Q were characterized as pathogenic. Results of our in vivo yeast-based approach correlate well with clinical data in five out of seven hMLH1 variants and the described model was thus shown to be useful for functional characterization of MLH1 variants in cancer patients found throughout the entire coding region of the gene.

  1. Assessing pathogenicity of MLH1 variants by co-expression of human MLH1 and PMS2 genes in yeast

    Directory of Open Access Journals (Sweden)

    Hudler Petra

    2009-10-01

    Full Text Available Abstract Background Loss of DNA mismatch repair (MMR in humans, mainly due to mutations in the hMLH1 gene, is linked to hereditary nonpolyposis colorectal cancer (HNPCC. Because not all MLH1 alterations result in loss of MMR function, accurate characterization of variants and their classification in terms of their effect on MMR function is essential for reliable genetic testing and effective treatment. To date, in vivo assays for functional characterization of MLH1 mutations performed in various model systems have used episomal expression of the modified MMR genes. We describe here a novel approach to determine accurately the functional significance of hMLH1 mutations in vivo, based on co-expression of human MLH1 and PMS2 in yeast cells. Methods Yeast MLH1 and PMS1 genes, whose protein products form the MutLα complex, were replaced by human orthologs directly on yeast chromosomes by homologous recombination, and the resulting MMR activity was tested. Results The yeast strain co-expressing hMLH1 and hPMS2 exhibited the same mutation rate as the wild-type. Eight cancer-related MLH1 variants were introduced, using the same approach, into the prepared yeast model, and their effect on MMR function was determined. Five variants (A92P, S93G, I219V, K618R and K618T were classified as non-pathogenic, whereas variants T117M, Y646C and R659Q were characterized as pathogenic. Conclusion Results of our in vivo yeast-based approach correlate well with clinical data in five out of seven hMLH1 variants and the described model was thus shown to be useful for functional characterization of MLH1 variants in cancer patients found throughout the entire coding region of the gene.

  2. Assessing pathogenicity of MLH1 variants by co-expression of human MLH1 and PMS2 genes in yeast

    International Nuclear Information System (INIS)

    Vogelsang, Matjaz; Comino, Aleksandra; Zupanec, Neja; Hudler, Petra; Komel, Radovan

    2009-01-01

    Loss of DNA mismatch repair (MMR) in humans, mainly due to mutations in the hMLH1 gene, is linked to hereditary nonpolyposis colorectal cancer (HNPCC). Because not all MLH1 alterations result in loss of MMR function, accurate characterization of variants and their classification in terms of their effect on MMR function is essential for reliable genetic testing and effective treatment. To date, in vivo assays for functional characterization of MLH1 mutations performed in various model systems have used episomal expression of the modified MMR genes. We describe here a novel approach to determine accurately the functional significance of hMLH1 mutations in vivo, based on co-expression of human MLH1 and PMS2 in yeast cells. Yeast MLH1 and PMS1 genes, whose protein products form the MutLα complex, were replaced by human orthologs directly on yeast chromosomes by homologous recombination, and the resulting MMR activity was tested. The yeast strain co-expressing hMLH1 and hPMS2 exhibited the same mutation rate as the wild-type. Eight cancer-related MLH1 variants were introduced, using the same approach, into the prepared yeast model, and their effect on MMR function was determined. Five variants (A92P, S93G, I219V, K618R and K618T) were classified as non-pathogenic, whereas variants T117M, Y646C and R659Q were characterized as pathogenic. Results of our in vivo yeast-based approach correlate well with clinical data in five out of seven hMLH1 variants and the described model was thus shown to be useful for functional characterization of MLH1 variants in cancer patients found throughout the entire coding region of the gene

  3. Dynamic functional modules in co-expressed protein interaction networks of dilated cardiomyopathy

    Directory of Open Access Journals (Sweden)

    Oyang Yen-Jen

    2010-10-01

    Full Text Available Abstract Background Molecular networks represent the backbone of molecular activity within cells and provide opportunities for understanding the mechanism of diseases. While protein-protein interaction data constitute static network maps, integration of condition-specific co-expression information provides clues to the dynamic features of these networks. Dilated cardiomyopathy is a leading cause of heart failure. Although previous studies have identified putative biomarkers or therapeutic targets for heart failure, the underlying molecular mechanism of dilated cardiomyopathy remains unclear. Results We developed a network-based comparative analysis approach that integrates protein-protein interactions with gene expression profiles and biological function annotations to reveal dynamic functional modules under different biological states. We found that hub proteins in condition-specific co-expressed protein interaction networks tended to be differentially expressed between biological states. Applying this method to a cohort of heart failure patients, we identified two functional modules that significantly emerged from the interaction networks. The dynamics of these modules between normal and disease states further suggest a potential molecular model of dilated cardiomyopathy. Conclusions We propose a novel framework to analyze the interaction networks in different biological states. It successfully reveals network modules closely related to heart failure; more importantly, these network dynamics provide new insights into the cause of dilated cardiomyopathy. The revealed molecular modules might be used as potential drug targets and provide new directions for heart failure therapy.

  4. An extensive (co-expression analysis tool for the cytochrome P450 superfamily in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Provart Nicholas J

    2008-04-01

    Full Text Available Abstract Background Sequencing of the first plant genomes has revealed that cytochromes P450 have evolved to become the largest family of enzymes in secondary metabolism. The proportion of P450 enzymes with characterized biochemical function(s is however very small. If P450 diversification mirrors evolution of chemical diversity, this points to an unexpectedly poor understanding of plant metabolism. We assumed that extensive analysis of gene expression might guide towards the function of P450 enzymes, and highlight overlooked aspects of plant metabolism. Results We have created a comprehensive database, 'CYPedia', describing P450 gene expression in four data sets: organs and tissues, stress response, hormone response, and mutants of Arabidopsis thaliana, based on public Affymetrix ATH1 microarray expression data. P450 expression was then combined with the expression of 4,130 re-annotated genes, predicted to act in plant metabolism, for co-expression analyses. Based on the annotation of co-expressed genes from diverse pathway annotation databases, co-expressed pathways were identified. Predictions were validated for most P450s with known functions. As examples, co-expression results for P450s related to plastidial functions/photosynthesis, and to phenylpropanoid, triterpenoid and jasmonate metabolism are highlighted here. Conclusion The large scale hypothesis generation tools presented here provide leads to new pathways, unexpected functions, and regulatory networks for many P450s in plant metabolism. These can now be exploited by the community to validate the proposed functions experimentally using reverse genetics, biochemistry, and metabolic profiling.

  5. An MHC class I immune evasion gene of Marek׳s disease virus.

    Science.gov (United States)

    Hearn, Cari; Preeyanon, Likit; Hunt, Henry D; York, Ian A

    2015-01-15

    Marek׳s disease virus (MDV) is a widespread α-herpesvirus of chickens that causes T cell tumors. Acute, but not latent, MDV infection has previously been shown to lead to downregulation of cell-surface MHC class I (Virology 282:198-205 (2001)), but the gene(s) involved have not been identified. Here we demonstrate that an MDV gene, MDV012, is capable of reducing surface expression of MHC class I on chicken cells. Co-expression of an MHC class I-binding peptide targeted to the endoplasmic reticulum (bypassing the requirement for the TAP peptide transporter) partially rescued MHC class I expression in the presence of MDV012, suggesting that MDV012 is a TAP-blocking MHC class I immune evasion protein. This is the first unique non-mammalian MHC class I immune evasion gene identified, and suggests that α-herpesviruses have conserved this function for at least 100 million years. Copyright © 2014 Elsevier Inc. All rights reserved.

  6. Co-expression of G2-EPSPS and glyphosate acetyltransferase GAT genes conferring high tolerance to glyphosate in soybean

    Directory of Open Access Journals (Sweden)

    Bingfu eGuo

    2015-10-01

    Full Text Available Glyphosate is a widely used non-selective herbicide with broad spectrum of weed control around the world. At present, most of the commercial glyphosate tolerant soybeans utilize glyphosate tolerant gene CP4-EPSPS or glyphosate acetyltransferase gene GAT separately. In this study, both glyphosate tolerant gene G2-EPSPS and glyphosate degraded gene GAT were co-transferred into soybean and transgenic plants showed high tolerance to glyphosate. Molecular analysis including PCR, Sothern blot, qRT-PCR and Western blot revealed that target genes have been integrated into genome and expressed effectively at both mRNA and protein levels. Furthermore, the glyphosate tolerance analysis showed that no typical symptom was observed when compared with a glyphosate tolerant line HJ06-698 derived from GR1 transgenic soybean even at four-fold labeled rate of Roundup. Chlorophyll and shikimic acid content analysis of transgenic plant also revealed that these two indexes were not significantly altered after glyphosate application. These results indicated that co-expression of G2-EPSPS and GAT conferred high tolerance to the herbicide glyphosate in soybean. Therefore, combination of tolerant and degraded genes provides a new strategy for developing glyphosate tolerant transgenic crops.

  7. Neutralization of Bacterial YoeBSpn Toxicity and Enhanced Plant Growth in Arabidopsis thaliana via Co-Expression of the Toxin-Antitoxin Genes

    Science.gov (United States)

    Abu Bakar, Fauziah; Yeo, Chew Chieng; Harikrishna, Jennifer Ann

    2016-01-01

    Bacterial toxin-antitoxin (TA) systems have various cellular functions, including as part of the general stress response. The genome of the Gram-positive human pathogen Streptococcus pneumoniae harbors several putative TA systems, including yefM-yoeBSpn, which is one of four systems that had been demonstrated to be biologically functional. Overexpression of the yoeBSpn toxin gene resulted in cell stasis and eventually cell death in its native host, as well as in Escherichia coli. Our previous work showed that induced expression of a yoeBSpn toxin-Green Fluorescent Protein (GFP) fusion gene apparently triggered apoptosis and was lethal in the model plant, Arabidopsis thaliana. In this study, we investigated the effects of co-expression of the yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic A. thaliana. When co-expressed in Arabidopsis, the YefMSpn antitoxin was found to neutralize the toxicity of YoeBSpn-GFP. Interestingly, the inducible expression of both yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic hybrid Arabidopsis resulted in larger rosette leaves and taller plants with a higher number of inflorescence stems and increased silique production. To our knowledge, this is the first demonstration of a prokaryotic antitoxin neutralizing its cognate toxin in plant cells. PMID:27104531

  8. Neutralization of Bacterial YoeBSpn Toxicity and Enhanced Plant Growth in Arabidopsis thaliana via Co-Expression of the Toxin-Antitoxin Genes

    Directory of Open Access Journals (Sweden)

    Fauziah Abu Bakar

    2016-04-01

    Full Text Available Bacterial toxin-antitoxin (TA systems have various cellular functions, including as part of the general stress response. The genome of the Gram-positive human pathogen Streptococcus pneumoniae harbors several putative TA systems, including yefM-yoeBSpn, which is one of four systems that had been demonstrated to be biologically functional. Overexpression of the yoeBSpn toxin gene resulted in cell stasis and eventually cell death in its native host, as well as in Escherichia coli. Our previous work showed that induced expression of a yoeBSpn toxin-Green Fluorescent Protein (GFP fusion gene apparently triggered apoptosis and was lethal in the model plant, Arabidopsis thaliana. In this study, we investigated the effects of co-expression of the yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic A. thaliana. When co-expressed in Arabidopsis, the YefMSpn antitoxin was found to neutralize the toxicity of YoeBSpn-GFP. Interestingly, the inducible expression of both yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic hybrid Arabidopsis resulted in larger rosette leaves and taller plants with a higher number of inflorescence stems and increased silique production. To our knowledge, this is the first demonstration of a prokaryotic antitoxin neutralizing its cognate toxin in plant cells.

  9. Bioinformatics analysis of RNA-seq data revealed critical genes in colon adenocarcinoma.

    Science.gov (United States)

    Xi, W-D; Liu, Y-J; Sun, X-B; Shan, J; Yi, L; Zhang, T-T

    2017-07-01

    RNA-seq data of colon adenocarcinoma (COAD) were analyzed with bioinformatics tools to discover critical genes in the disease. Relevant small molecule drugs, transcription factors (TFs) and microRNAs (miRNAs) were also investigated. RNA-seq data of COAD were downloaded from The Cancer Genome Atlas (TCGA). Differential analysis was performed with package edgeR. False positive discovery (FDR) 1 were set as the cut-offs to screen out differentially expressed genes (DEGs). Gene coexpression network was constructed with package Ebcoexpress. GO enrichment analysis was performed for the DEGs in the gene coexpression network with DAVID. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was also performed for the genes with KOBASS 2.0. Modules were identified with MCODE of Cytoscape. Relevant small molecules drugs were predicted by Connectivity map. Relevant miRNAs and TFs were searched by WebGestalt. A total of 457 DEGs, including 255 up-regulated and 202 down-regulated genes, were identified from 437 COAD and 39 control samples. A gene coexpression network was constructed containing 40 DEGs and 101 edges. The genes were mainly associated with collagen fibril organization, extracellular matrix organization and translation. Two modules were identified from the gene coexpression network, which were implicated in muscle contraction and extracellular matrix organization, respectively. Several critical genes were disclosed, such as MYH11, COL5A2 and ribosomal proteins. Nine relevant small molecule drugs were identified, such as scriptaid and STOCK1N-35874. Accordingly, a total of 17 TFs and 10 miRNAs related to COAD were acquired, such as ETS2, NFAT, AP4, miR-124A, MiR-9, miR-96 and let-7. Several critical genes and relevant drugs, TFs and miRNAs were revealed in COAD. These findings could advance the understanding of the disease and benefit therapy development.

  10. A co-expression gene network associated with developmental regulation of apple fruit acidity.

    Science.gov (United States)

    Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Xu, Kenong

    2015-08-01

    Apple fruit acidity, which affects the fruit's overall taste and flavor to a large extent, is primarily determined by the concentration of malic acid. Previous studies demonstrated that the major QTL malic acid (Ma) on chromosome 16 is largely responsible for fruit acidity variations in apple. Recent advances suggested that a natural mutation that gives rise to a premature stop codon in one of the two aluminum-activated malate transporter (ALMT)-like genes (called Ma1) is the genetic causal element underlying Ma. However, the natural mutation does not explain the developmental changes of fruit malate levels in a given genotype. Using RNA-seq data from the fruit of 'Golden Delicious' taken at 14 developmental stages from 1 week after full-bloom (WAF01) to harvest (WAF20), we characterized their transcriptomes in groups of high (12.2 ± 1.6 mg/g fw, WAF03-WAF08), mid (7.4 ± 0.5 mg/g fw, WAF01-WAF02 and WAF10-WAF14) and low (5.4 ± 0.4 mg/g fw, WAF16-WAF20) malate concentrations. Detailed analyses showed that a set of 3,066 genes (including Ma1) were expressed not only differentially (P FDR < 0.05) between the high and low malate groups (or between the early and late developmental stages) but also in significant (P < 0.05) correlation with malate concentrations. The 3,066 genes fell in 648 MapMan (sub-) bins or functional classes, and 19 of them were significantly (P FDR < 0.05) co-enriched or co-suppressed in a malate dependent manner. Network inferring using the 363 genes encompassed in the 19 (sub-) bins, identified a major co-expression network of 239 genes. Since the 239 genes were also differentially expressed between the early (WAF03-WAF08) and late (WAF16-WAF20) developmental stages, the major network was considered to be associated with developmental regulation of apple fruit acidity in 'Golden Delicious'.

  11. Integrative Analysis of Hippocampus Gene Expression Profiles Identifies Network Alterations in Aging and Alzheimer’s Disease

    Directory of Open Access Journals (Sweden)

    Vinay Lanke

    2018-05-01

    Full Text Available Alzheimer’s disease (AD is a neurodegenerative disorder contributing to rapid decline in cognitive function and ultimately dementia. Most cases of AD occur in elderly and later years. There is a growing need for understanding the relationship between aging and AD to identify shared and unique hallmarks associated with the disease in a region and cell-type specific manner. Although genomic studies on AD have been performed extensively, the molecular mechanism of disease progression is still not clear. The major objective of our study is to obtain a higher-order network-level understanding of aging and AD, and their relationship using the hippocampal gene expression profiles of young (20–50 years, aging (70–99 years, and AD (70–99 years. The hippocampus is vulnerable to damage at early stages of AD and altered neurogenesis in the hippocampus is linked to the onset of AD. We combined the weighted gene co-expression network and weighted protein–protein interaction network-level approaches to study the transition from young to aging to AD. The network analysis revealed the organization of co-expression network into functional modules that are cell-type specific in aging and AD. We found that modules associated with astrocytes, endothelial cells and microglial cells are upregulated and significantly correlate with both aging and AD. The modules associated with neurons, mitochondria and endoplasmic reticulum are downregulated and significantly correlate with AD than aging. The oligodendrocytes module does not show significant correlation with neither aging nor disease. Further, we identified aging- and AD-specific interactions/subnetworks by integrating the gene expression with a human protein–protein interaction network. We found dysregulation of genes encoding protein kinases (FYN, SYK, SRC, PKC, MAPK1, ephrin receptors and transcription factors (FOS, STAT3, CEBPB, MYC, NFKβ, and EGR1 in AD. Further, we found genes that encode proteins

  12. Identification of PEG-induced water stress responsive transcripts using co-expression network in Eucalyptus grandis.

    Science.gov (United States)

    Ghosh Dasgupta, Modhumita; Dharanishanthi, Veeramuthu

    2017-09-05

    Ecophysiological studies in Eucalyptus have shown that water is the principal factor limiting stem growth. Effect of water deficit conditions on physiological and biochemical parameters has been extensively reported in Eucalyptus. The present study was conducted to identify major polyethylene glycol induced water stress responsive transcripts in Eucalyptus grandis using gene co-expression network. A customized array representing 3359 water stress responsive genes was designed to document their expression in leaves of E. grandis cuttings subjected to -0.225MPa of PEG treatment. The differentially expressed transcripts were documented and significantly co-expressed transcripts were used for construction of network. The co-expression network was constructed with 915 nodes and 3454 edges with degree ranging from 2 to 45. Ninety four GO categories and 117 functional pathways were identified in the network. MCODE analysis generated 27 modules and module 6 with 479 nodes and 1005 edges was identified as the biologically relevant network. The major water responsive transcripts represented in the module included dehydrin, osmotin, LEA protein, expansin, arabinogalactans, heat shock proteins, major facilitator proteins, ARM repeat proteins, raffinose synthase, tonoplast intrinsic protein and transcription factors like DREB2A, ARF9, AGL24, UNE12, WLIM1 and MYB66, MYB70, MYB 55, MYB 16 and MYB 103. The coordinated analysis of gene expression patterns and coexpression networks developed in this study identified an array of transcripts that may regulate PEG induced water stress responses in E. grandis. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Network-based identification of biomarkers coexpressed with multiple pathways.

    Science.gov (United States)

    Guo, Nancy Lan; Wan, Ying-Wooi

    2014-01-01

    Unraveling complex molecular interactions and networks and incorporating clinical information in modeling will present a paradigm shift in molecular medicine. Embedding biological relevance via modeling molecular networks and pathways has become increasingly important for biomarker identification in cancer susceptibility and metastasis studies. Here, we give a comprehensive overview of computational methods used for biomarker identification, and provide a performance comparison of several network models used in studies of cancer susceptibility, disease progression, and prognostication. Specifically, we evaluated implication networks, Boolean networks, Bayesian networks, and Pearson's correlation networks in constructing gene coexpression networks for identifying lung cancer diagnostic and prognostic biomarkers. The results show that implication networks, implemented in Genet package, identified sets of biomarkers that generated an accurate prediction of lung cancer risk and metastases; meanwhile, implication networks revealed more biologically relevant molecular interactions than Boolean networks, Bayesian networks, and Pearson's correlation networks when evaluated with MSigDB database.

  14. Enhancement of γ-aminobutyric acid production in recombinant Corynebacterium glutamicum by co-expressing two glutamate decarboxylase genes from Lactobacillus brevis.

    Science.gov (United States)

    Shi, Feng; Jiang, Junjun; Li, Yongfu; Li, Youxin; Xie, Yilong

    2013-11-01

    γ-Aminobutyric acid (GABA), a non-protein amino acid, is a bioactive component in the food, feed and pharmaceutical fields. To establish an effective single-step production system for GABA, a recombinant Corynebacterium glutamicum strain co-expressing two glutamate decarboxylase (GAD) genes (gadB1 and gadB2) derived from Lactobacillus brevis Lb85 was constructed. Compared with the GABA production of the gadB1 or gadB2 single-expressing strains, GABA production by the gadB1-gadB2 co-expressing strain increased more than twofold. By optimising urea supplementation, the total production of L-glutamate and GABA increased from 22.57 ± 1.24 to 30.18 ± 1.33 g L⁻¹, and GABA production increased from 4.02 ± 0.95 to 18.66 ± 2.11 g L⁻¹ after 84-h cultivation. Under optimal urea supplementation, L-glutamate continued to be consumed, GABA continued to accumulate after 36 h of fermentation, and the pH level fluctuated. GABA production increased to a maximum level of 27.13 ± 0.54 g L⁻¹ after 120-h flask cultivation and 26.32 g L⁻¹ after 60-h fed-batch fermentation. The conversion ratio of L-glutamate to GABA reached 0.60-0.74 mol mol⁻¹. By co-expressing gadB1 and gadB2 and optimising the urea addition method, C. glutamicum was genetically improved for de novo biosynthesis of GABA from its own accumulated L-glutamate.

  15. Systematic comparison of co-expression of multiple recombinant thermophilic enzymes in Escherichia coli BL21(DE3).

    Science.gov (United States)

    Chen, Hui; Huang, Rui; Zhang, Y-H Percival

    2017-06-01

    The precise control of multiple heterologous enzyme expression levels in one Escherichia coli strain is important for cascade biocatalysis, metabolic engineering, synthetic biology, natural product synthesis, and studies of complexed proteins. We systematically investigated the co-expression of up to four thermophilic enzymes (i.e., α-glucan phosphorylase (αGP), phosphoglucomutase (PGM), glucose 6-phosphate dehydrogenase (G6PDH), and 6-phosphogluconate dehydrogenase (6PGDH)) in E. coli BL21(DE3) by adding T7 promoter or T7 terminator of each gene for multiple genes in tandem, changing gene alignment, and comparing one or two plasmid systems. It was found that the addition of T7 terminator after each gene was useful to decrease the influence of the upstream gene. The co-expression of the four enzymes in E. coli BL21(DE3) was demonstrated to generate two NADPH molecules from one glucose unit of maltodextrin, where NADPH was oxidized to convert xylose to xylitol. The best four-gene co-expression system was based on two plasmids (pET and pACYC) which harbored two genes. As a result, apparent enzymatic activities of the four enzymes were regulated to be at similar levels and the overall four-enzyme activity was the highest based on the formation of xylitol. This study provides useful information for the precise control of multi-enzyme-coordinated expression in E. coli BL21(DE3).

  16. Gene coexpression network analysis of fruit transcriptomes uncovers a possible mechanistically distinct class of sugar/acid ratio-associated genes in sweet orange.

    Science.gov (United States)

    Qiao, Liang; Cao, Minghao; Zheng, Jian; Zhao, Yihong; Zheng, Zhi-Liang

    2017-10-30

    The ratio of sugars to organic acids, two of the major metabolites in fleshy fruits, has been considered the most important contributor to fruit sweetness. Although accumulation of sugars and acids have been extensively studied, whether plants evolve a mechanism to maintain, sense or respond to the fruit sugar/acid ratio remains a mystery. In a prior study, we used an integrated systems biology tool to identify a group of 39 acid-associated genes from the fruit transcriptomes in four sweet orange varieties (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). We reanalyzed the prior sweet orange fruit transcriptome data, leading to the identification of 72 genes highly correlated with the fruit sugar/acid ratio. The majority of these sugar/acid ratio-related genes are predicted to be involved in regulatory functions such as transport, signaling and transcription or encode enzymes involved in metabolism. Surprisingly, only three of these sugar/acid ratio-correlated genes are weakly correlated with sugar level and none of them overlaps with the acid-associated genes. Weighted Gene Coexpression Network Analysis (WGCNA) has revealed that these genes belong to four modules, Blue, Grey, Brown and Turquoise, with the former two modules being unique to the sugar/acid ratio control. Our results indicate that orange fruits contain a possible mechanistically distinct class of genes that may potentially be involved in maintaining fruit sugar/acid ratios and/or responding to the cellular sugar/acid ratio status. Therefore, our analysis of orange transcriptomes provides an intriguing insight into the potentially novel genetic or molecular mechanisms controlling the sugar/acid ratio in fruits.

  17. Co-expression of Exo-inulinase and Endo-inulinase Genes in the Oleaginous Yeast Yarrowia lipolytica for Efficient Single Cell Oil Production from Inulin.

    Science.gov (United States)

    Shi, Nianci; Mao, Weian; He, Xiaoxia; Chi, Zhe; Chi, Zhenming; Liu, Guanglei

    2018-05-01

    Yarrowia lipolytica is a promising platform for the single cell oil (SCO) production. In this study, a transformant X+N8 in which exo- and endo-inulinase genes were co-expressed could produce an inulinase activity of 124.33 U/mL within 72 h. However, the inulinase activity of a transformant X2 carrying a single exo-inulinase gene was only 47.33 U/mL within 72 h. Moreover, the transformant X+N8 could accumulate 48.13% (w/w) SCO from inulin and the cell dry weight reached 13.63 g/L within 78 h, which were significantly higher than those of the transformant X2 (41.87% (w/w) and 11.23 g/L) under the same conditions. In addition, inulin hydrolysis and utilization of the transformant X+N8 were also more efficient than those of the transformant X2 during the fermentation process. These results demonstrated that the co-expression of the exo- and endo-inulinase genes significantly enhanced the SCO production from inulin due to the improvement of the inulinase activity and the synergistic action of exo- and endo-inulinase. Besides, over 95.01% of the fatty acids from the transformant X+N8 were C16-C18, especially C18:1 (53.10%), suggesting that the fatty acids could be used as feedstock for biodiesel production.

  18. Whole brain and brain regional coexpression network interactions associated with predisposition to alcohol consumption.

    Directory of Open Access Journals (Sweden)

    Lauren A Vanderlinden

    Full Text Available To identify brain transcriptional networks that may predispose an animal to consume alcohol, we used weighted gene coexpression network analysis (WGCNA. Candidate coexpression modules are those with an eigengene expression level that correlates significantly with the level of alcohol consumption across a panel of BXD recombinant inbred mouse strains, and that share a genomic region that regulates the module transcript expression levels (mQTL with a genomic region that regulates alcohol consumption (bQTL. To address a controversy regarding utility of gene expression profiles from whole brain, vs specific brain regions, as indicators of the relationship of gene expression to phenotype, we compared candidate coexpression modules from whole brain gene expression data (gathered with Affymetrix 430 v2 arrays in the Colorado laboratories and from gene expression data from 6 brain regions (nucleus accumbens (NA; prefrontal cortex (PFC; ventral tegmental area (VTA; striatum (ST; hippocampus (HP; cerebellum (CB available from GeneNetwork. The candidate modules were used to construct candidate eigengene networks across brain regions, resulting in three "meta-modules", composed of candidate modules from two or more brain regions (NA, PFC, ST, VTA and whole brain. To mitigate the potential influence of chromosomal location of transcripts and cis-eQTLs in linkage disequilibrium, we calculated a semi-partial correlation of the transcripts in the meta-modules with alcohol consumption conditional on the transcripts' cis-eQTLs. The function of transcripts that retained the correlation with the phenotype after correction for the strong genetic influence, implicates processes of protein metabolism in the ER and Golgi as influencing susceptibility to variation in alcohol consumption. Integration of these data with human GWAS provides further information on the function of polymorphisms associated with alcohol-related traits.

  19. Cis-Natural Antisense Transcripts Are Mainly Co-expressed with Their Sense Transcripts and Primarily Related to Energy Metabolic Pathways during Muscle Development.

    Science.gov (United States)

    Zhao, Yunxia; Hou, Ye; Zhao, Changzhi; Liu, Fei; Luan, Yu; Jing, Lu; Li, Xinyun; Zhu, Mengjin; Zhao, Shuhong

    2016-01-01

    Cis-natural antisense transcripts (cis-NATs) are a new class of RNAs identified in various species. However, the biological functions of cis-NATs are largely unknown. In this study, we investigated the transcriptional characteristics and functions of cis-NATs in the muscle tissue of lean Landrace and indigenous fatty Lantang pigs. In total, 3,306 cis-NATs of 2,469 annotated genes were identified in the muscle tissue of pigs. More than 1,300 cis-NATs correlated with their sense genes at the transcriptional level, and approximately 80% of them were co-expressed in the two breeds. Furthermore, over 1,200 differentially expressed cis-NATs were identified during muscle development. Function annotation showed that the cis-NATs participated in muscle development mainly by co-expressing with genes involved in energy metabolic pathways, including citrate cycle (TCA cycle), glycolysis or gluconeogenesis, mitochondrial activation and so on. Moreover, these cis-NATs and their sense genes abruptly increased at the transition from the late fetal stages to the early postnatal stages and then decreased along with muscle development. In conclusion, the cis-NATs in the muscle tissue of pigs were identified and determined to be mainly co-expressed with their sense genes. The co-expressed cis-NATs and their sense gene were primarily related to energy metabolic pathways during muscle development in pigs. Our results offered novel evidence on the roles of cis-NATs during the muscle development of pigs.

  20. Integration of liver gene co-expression networks and eGWAs analyses highlighted candidate regulators implicated in lipid metabolism in pigs.

    Science.gov (United States)

    Ballester, Maria; Ramayo-Caldas, Yuliaxis; Revilla, Manuel; Corominas, Jordi; Castelló, Anna; Estellé, Jordi; Fernández, Ana I; Folch, Josep M

    2017-04-19

    In the present study, liver co-expression networks and expression Genome Wide Association Study (eGWAS) were performed to identify DNA variants and molecular pathways implicated in the functional regulatory mechanisms of meat quality traits in pigs. With this purpose, the liver mRNA expression of 44 candidates genes related with lipid metabolism was analysed in 111 Iberian x Landrace backcross animals. The eGWAS identified 92 eSNPs located in seven chromosomal regions and associated with eight genes: CROT, CYP2U1, DGAT1, EGF, FABP1, FABP5, PLA2G12A, and PPARA. Remarkably, cis-eSNPs associated with FABP1 gene expression which may be determining the C18:2(n-6)/C18:3(n-3) ratio in backfat through the multiple interaction of DNA variants and genes were identified. Furthermore, a hotspot on SSC8 associated with the gene expression of eight genes was identified and the TBCK gene was pointed out as candidate gene regulating it. Our results also suggested that the PI3K-Akt-mTOR pathway plays an important role in the control of the analysed genes highlighting nuclear receptors as the NR3C1 or PPARA. Finally, sex-dimorphism associated with hepatic lipid metabolism was identified with over-representation of female-biased genes. These results increase our knowledge of the genetic architecture underlying fat composition traits.

  1. Dysregulated Pathway Identification of Alzheimer's Disease Based on Internal Correlation Analysis of Genes and Pathways.

    Science.gov (United States)

    Kong, Wei; Mou, Xiaoyang; Di, Benteng; Deng, Jin; Zhong, Ruxing; Wang, Shuaiqun

    2017-11-20

    Dysregulated pathway identification is an important task which can gain insight into the underlying biological processes of disease. Current pathway-identification methods focus on a set of co-expression genes and single pathways and ignore the correlation between genes and pathways. The method proposed in this study, takes into account the internal correlations not only between genes but also pathways to identifying dysregulated pathways related to Alzheimer's disease (AD), the most common form of dementia. In order to find the significantly differential genes for AD, mutual information (MI) is used to measure interdependencies between genes other than expression valves. Then, by integrating the topology information from KEGG, the significant pathways involved in the feature genes are identified. Next, the distance correlation (DC) is applied to measure the pairwise pathway crosstalks since DC has the advantage of detecting nonlinear correlations when compared to Pearson correlation. Finally, the pathway pairs with significantly different correlations between normal and AD samples are known as dysregulated pathways. The molecular biology analysis demonstrated that many dysregulated pathways related to AD pathogenesis have been discovered successfully by the internal correlation detection. Furthermore, the insights of the dysregulated pathways in the development and deterioration of AD will help to find new effective target genes and provide important theoretical guidance for drug design. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  2. Module discovery by exhaustive search for densely connected, co-expressed regions in biomolecular interaction networks.

    Directory of Open Access Journals (Sweden)

    Recep Colak

    2010-10-01

    Full Text Available Computational prediction of functionally related groups of genes (functional modules from large-scale data is an important issue in computational biology. Gene expression experiments and interaction networks are well studied large-scale data sources, available for many not yet exhaustively annotated organisms. It has been well established, when analyzing these two data sources jointly, modules are often reflected by highly interconnected (dense regions in the interaction networks whose participating genes are co-expressed. However, the tractability of the problem had remained unclear and methods by which to exhaustively search for such constellations had not been presented.We provide an algorithmic framework, referred to as Densely Connected Biclustering (DECOB, by which the aforementioned search problem becomes tractable. To benchmark the predictive power inherent to the approach, we computed all co-expressed, dense regions in physical protein and genetic interaction networks from human and yeast. An automatized filtering procedure reduces our output which results in smaller collections of modules, comparable to state-of-the-art approaches. Our results performed favorably in a fair benchmarking competition which adheres to standard criteria. We demonstrate the usefulness of an exhaustive module search, by using the unreduced output to more quickly perform GO term related function prediction tasks. We point out the advantages of our exhaustive output by predicting functional relationships using two examples.We demonstrate that the computation of all densely connected and co-expressed regions in interaction networks is an approach to module discovery of considerable value. Beyond confirming the well settled hypothesis that such co-expressed, densely connected interaction network regions reflect functional modules, we open up novel computational ways to comprehensively analyze the modular organization of an organism based on prevalent and largely

  3. Module discovery by exhaustive search for densely connected, co-expressed regions in biomolecular interaction networks.

    Science.gov (United States)

    Colak, Recep; Moser, Flavia; Chu, Jeffrey Shih-Chieh; Schönhuth, Alexander; Chen, Nansheng; Ester, Martin

    2010-10-25

    Computational prediction of functionally related groups of genes (functional modules) from large-scale data is an important issue in computational biology. Gene expression experiments and interaction networks are well studied large-scale data sources, available for many not yet exhaustively annotated organisms. It has been well established, when analyzing these two data sources jointly, modules are often reflected by highly interconnected (dense) regions in the interaction networks whose participating genes are co-expressed. However, the tractability of the problem had remained unclear and methods by which to exhaustively search for such constellations had not been presented. We provide an algorithmic framework, referred to as Densely Connected Biclustering (DECOB), by which the aforementioned search problem becomes tractable. To benchmark the predictive power inherent to the approach, we computed all co-expressed, dense regions in physical protein and genetic interaction networks from human and yeast. An automatized filtering procedure reduces our output which results in smaller collections of modules, comparable to state-of-the-art approaches. Our results performed favorably in a fair benchmarking competition which adheres to standard criteria. We demonstrate the usefulness of an exhaustive module search, by using the unreduced output to more quickly perform GO term related function prediction tasks. We point out the advantages of our exhaustive output by predicting functional relationships using two examples. We demonstrate that the computation of all densely connected and co-expressed regions in interaction networks is an approach to module discovery of considerable value. Beyond confirming the well settled hypothesis that such co-expressed, densely connected interaction network regions reflect functional modules, we open up novel computational ways to comprehensively analyze the modular organization of an organism based on prevalent and largely available large

  4. Evolution, functional differentiation, and co-expression of the RLK gene family revealed in Jilin ginseng, Panax ginseng C.A. Meyer.

    Science.gov (United States)

    Lin, Yanping; Wang, Kangyu; Li, Xiangyu; Sun, Chunyu; Yin, Rui; Wang, Yanfang; Wang, Yi; Zhang, Meiping

    2018-02-21

    Most genes in a genome exist in the form of a gene family; therefore, it is necessary to have knowledge of how a gene family functions to comprehensively understand organismal biology. The receptor-like kinase (RLK)-encoding gene family is one of the most important gene families in plants. It plays important roles in biotic and abiotic stress tolerances, and growth and development. However, little is known about the functional differentiation and relationships among the gene members within a gene family in plants. This study has isolated 563 RLK genes (designated as PgRLK genes) expressed in Jilin ginseng (Panax ginseng C.A. Meyer), investigated their evolution, and deciphered their functional diversification and relationships. The PgRLK gene family is highly diverged and formed into eight types. The LRR type is the earliest and most prevalent, while only the Lec type originated after P. ginseng evolved. Furthermore, although the members of the PgRLK gene family all encode receptor-like protein kinases and share conservative domains, they are functionally very diverse, participating in numerous biological processes. The expressions of different members of the PgRLK gene family are extremely variable within a tissue, at a developmental stage and in the same cultivar, but most of the genes tend to express correlatively, forming a co-expression network. These results not only provide a deeper and comprehensive understanding of the evolution, functional differentiation and correlation of a gene family in plants, but also an RLK genic resource useful for enhanced ginseng genetic improvement.

  5. EMMPRIN co-expressed with matrix metalloproteinases predicts poor prognosis in patients with osteosarcoma.

    Science.gov (United States)

    Futamura, Naohisa; Nishida, Yoshihiro; Urakawa, Hiroshi; Kozawa, Eiji; Ikuta, Kunihiro; Hamada, Shunsuke; Ishiguro, Naoki

    2014-06-01

    Several studies have focused on the relationships between the expression of extracellular matrix metalloproteinase inducer (EMMPRIN) and the prognosis of patients with malignant tumors. However, few of these have investigated the expression of EMMPRIN in osteosarcoma. We examined expression levels of EMMPRIN immunohistochemically in 53 cases of high-grade osteosarcoma of the extremities and analyzed the correlation of its expression with patient prognosis. The correlation between matrix metalloproteinases (MMPs) and EMMPRIN expression and the prognostic value of co-expression were also analyzed. Staining positivity for EMMPRIN was negative in 7 cases, low in 17, moderate in 19, and strong in 10. The overall and disease-free survivals (OS and DFS) in patients with higher EMMPRIN expression (strong-moderate) were significantly lower than those in the lower (weak-negative) group (0.037 and 0.024, respectively). In multivariate analysis, age (P=0.004), location (P=0.046), and EMMPRIN expression (P=0.038) were significant prognostic factors for overall survival. EMMPRIN expression (P=0.024) was also a significant prognostic factor for disease-free survival. Co-expression analyses of EMMPRIN and MMPs revealed that strong co-expression of EMMPRIN and membrane-type 1 (MT1)-MMP had a poor prognostic value (P=0.056 for DFS, P=0.006 for OS). EMMPRIN expression and co-expression with MMPs well predict the prognosis of patients with extremity osteosarcoma, making EMMPRIN a possible therapeutic target in these patients.

  6. Co-expression of NCED and ALO improves vitamin C level and tolerance to drought and chilling in transgenic tobacco and stylo plants.

    Science.gov (United States)

    Bao, Gegen; Zhuo, Chunliu; Qian, Chunmei; Xiao, Ting; Guo, Zhenfei; Lu, Shaoyun

    2016-01-01

    Abscisic acid (ABA) regulates plant adaptive responses to various environmental stresses, while L-ascorbic acid (AsA) that is also named vitamin C is an important antioxidant and involves in plant stress tolerance and the immune system in domestic animals. Transgenic tobacco (Nicotiana tabacum L.) and stylo [Stylosanthes guianensis (Aublet) Swartz], a forage legume, plants co-expressing stylo 9-cis-epoxycarotenoid dioxygenase (SgNCED1) and yeast D-arabinono-1,4-lactone oxidase (ALO) genes were generated in this study, and tolerance to drought and chilling was analysed in comparison with transgenic tobacco overexpressing SgNCED1 or ALO and the wild-type plants. Compared to the SgNCED1 or ALO transgenic plants, in which only ABA or AsA levels were increased, both ABA and AsA levels were increased in transgenic tobacco and stylo plants co-expressing SgNCED1 and ALO genes. Compared to the wild type, an enhanced drought tolerance was observed in SgNCED1 transgenic tobacco plants with induced expression of drought-responsive genes, but not in ALO plants, while an enhanced chilling tolerance was observed in ALO transgenic tobaccos with induced expression of cold-responsive genes, but not in SgNCED1 plants. Co-expression of SgNCED1 and ALO genes resulted in elevated tolerance to both drought and chilling in transgenic tobacco and stylo plants with induced expression of both drought and cold-responsive genes. Our result suggests that co-expression of SgNCED1 and ALO genes is an effective way for use in forage plant improvement for increased tolerance to drought and chilling and nutrition quality. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  7. Genetic architecture of wood properties based on association analysis and co-expression networks in white spruce.

    Science.gov (United States)

    Lamara, Mebarek; Raherison, Elie; Lenz, Patrick; Beaulieu, Jean; Bousquet, Jean; MacKay, John

    2016-04-01

    Association studies are widely utilized to analyze complex traits but their ability to disclose genetic architectures is often limited by statistical constraints, and functional insights are usually minimal in nonmodel organisms like forest trees. We developed an approach to integrate association mapping results with co-expression networks. We tested single nucleotide polymorphisms (SNPs) in 2652 candidate genes for statistical associations with wood density, stiffness, microfibril angle and ring width in a population of 1694 white spruce trees (Picea glauca). Associations mapping identified 229-292 genes per wood trait using a statistical significance level of P wood associated genes and several known MYB and NAC regulators were identified as network hubs. The network revealed a link between the gene PgNAC8, wood stiffness and microfibril angle, as well as considerable within-season variation for both genetic control of wood traits and gene expression. Trait associations were distributed throughout the network suggesting complex interactions and pleiotropic effects. Our findings indicate that integration of association mapping and co-expression networks enhances our understanding of complex wood traits. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  8. TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES.

    Science.gov (United States)

    Zhu, Lingxue; Lei, Jing; Devlin, Bernie; Roeder, Kathryn

    2017-09-01

    Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however statistical methods have been limited by the high dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue-Driven (sLED) test for comparing two high-dimensional covariance matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates what we assume to be common, namely sparse and weak signals in gene expression data, and it is closely related with Sparse Principal Component Analysis. We prove that sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures under many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset obtained from post-mortem brain tissue from Schizophrenia patients and controls, we provide a novel list of genes implicated in Schizophrenia and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects. We also illustrate that sLED can be generalized to compare other gene-gene "relationship" matrices that are of practical interest, such as the weighted adjacency matrices.

  9. Co-expression of an Erwinia chrysanthemi pectate lyase-encoding gene (pelE) and an E. carotovora polygalacturonase-encoding gene (peh1) in Saccharomyces cerevisiae.

    Science.gov (United States)

    Laing, E; Pretorius, I S

    1993-05-01

    A pectate lyase (PL)-encoding gene (pelE) from Erwinia chrysanthemi and a polygalacturonase (PG)-encoding gene (peh1) from E. carotovora were each inserted between a novel yeast expression-secretion cassette and a yeast gene terminator, and cloned separately into a yeast-centromeric shuttle vector (YCp50), generating recombinant plasmids pAMS12 and pAMS13. Transcription initiation signals present in the expression-secretion cassette were derived from the yeast alcohol dehydrogenase gene promoter (ADC1P), whereas the transcription termination signals were derived from the yeast tryptophan synthase gene terminator (TRP5T). Secretion of PL and PG was directed by the signal sequence of the yeast mating pheromone alpha-factor (MF alpha 1s). A pectinase cassette comprising ADC1P-MF alpha 1s-pelE-TRP5T and ADC1P-MF alpha 1s-peh1-TRP5T was subcloned into YCp50, generating plasmid pAMS14. Subsequently, the dominant selectable Geneticin G418-resistance (GtR) marker, APH1, inserted between the yeast uridine diphosphoglucose 4-epimerase gene promoter (GAL10P) and yeast orotidine-5'-phosphate carboxylase gene terminator (URA3T), was cloned into pAMS14, resulting in plasmid pAMS15. Plasmids pAMS12, pAMS13 and pAMS14 were transformed into a laboratory strain of Saccharomyces cerevisiae, whereas pAMS15 was stably introduced into two commercial wine yeast strains. DNA-DNA and DNA-RNA hybridization analyses revealed the presence of these plasmids, and the pelE and peh1 transcripts in the yeast transformants, respectively. A polypectate agarose assay indicated the extracellular production of biologically active PL and PG by the S. cerevisiae transformants and confirmed that co-expression of the pelE and peh1 genes synergistically enhanced pectate degradation.

  10. In silico identification of miRNAs and their target genes and analysis of gene co-expression network in saffron (Crocus sativus L.) stigma

    Science.gov (United States)

    Zinati, Zahra; Shamloo-Dashtpagerdi, Roohollah; Behpouri, Ali

    2016-01-01

    As an aromatic and colorful plant of substantive taste, saffron (Crocus sativus L.) owes such properties of matter to growing class of the secondary metabolites derived from the carotenoids, apocarotenoids. Regarding the critical role of microRNAs in secondary metabolic synthesis and the limited number of identified miRNAs in C. sativus, on the other hand, one may see the point how the characterization of miRNAs along with the corresponding target genes in C. sativus might expand our perspectives on the roles of miRNAs in carotenoid/apocarotenoid biosynthetic pathway. A computational analysis was used to identify miRNAs and their targets using EST (Expressed Sequence Tag) library from mature saffron stigmas. Then, a gene co- expression network was constructed to identify genes which are potentially involved in carotenoid/apocarotenoid biosynthetic pathways. EST analysis led to the identification of two putative miRNAs (miR414 and miR837-5p) along with the corresponding stem- looped precursors. To our knowledge, this is the first report on miR414 and miR837-5p in C. sativus. Co-expression network analysis indicated that miR414 and miR837-5p may play roles in C. sativus metabolic pathways and led to identification of candidate genes including six transcription factors and one protein kinase probably involved in carotenoid/apocarotenoid biosynthetic pathway. Presence of transcription factors, miRNAs and protein kinase in the network indicated multiple layers of regulation in saffron stigma. The candidate genes from this study may help unraveling regulatory networks underlying the carotenoid/apocarotenoid biosynthesis in saffron and designing metabolic engineering for enhanced secondary metabolites. PMID:28261627

  11. A subpopulation of dopaminergic neurons co-expresses serotonin in ventral mesencephalic cultures but not after intrastriatal transplantation in a rat model of Parkinsons disease

    DEFF Research Database (Denmark)

    Di Santo, Stefano; Seiler, Stefanie; Ducray, Angélique

    2017-01-01

    Cell replacement therapy is a promising avenue into the investigation and treatment of Parkinson’s disease (PD) and in some cases significant long-term motor improvements have been demonstrated. The main source of donor tissue is the human fetal ventral mesencephalon (VM), which consists...... 30% of the dopaminergic neurons in the donor tissue co-expressed serotonin, no co-localization could be detected in grafts one month after intrastriatal transplantation into hemi-parkinsonian rats. In conclusion, a significant and susceptible sub-population of dopaminergic neurons in fetal VM tissues...... both fetal rat and human dissociated, organotypic and neurosphere VM cultures as well as an animal model of PD were investigated. In dissociated rat VM cultures approximately 30% of the TH positive neurons co-expressed serotonin, while no co-localization with GABA was observed. Interestingly, co...

  12. Different substrate regimes determine transcriptional profiles and gene co-expression in Methanosarcina barkeri (DSM 800)

    Czech Academy of Sciences Publication Activity Database

    Lin, Qiang; Fang, X.; Ho, A.; Li, J.; Yan, X.; Tu, B.; Li, Ch.; Li, J.; Yao, M.; Li, X.

    2017-01-01

    Roč. 101, č. 19 (2017), s. 7303-7316 ISSN 0175-7598 Institutional support: RVO:60077344 Keywords : Methanosarcina barkeri * substrate regimes * diversity * co-expression * ecological strategies Subject RIV: EH - Ecology, Behaviour OBOR OECD: Ecology Impact factor: 3.420, year: 2016

  13. A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways

    Science.gov (United States)

    Musungu, Bryan M.; Bhatnagar, Deepak; Brown, Robert L.; Payne, Gary A.; OBrian, Greg; Fakhoury, Ahmad M.; Geisler, Matt

    2016-01-01

    A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus, a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays, and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays, there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus. Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus. PMID:27917194

  14. A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways.

    Science.gov (United States)

    Musungu, Bryan M; Bhatnagar, Deepak; Brown, Robert L; Payne, Gary A; OBrian, Greg; Fakhoury, Ahmad M; Geisler, Matt

    2016-01-01

    A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus , a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays , and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays , there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus . Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus .

  15. Co-expression analysis and identification of fecundity-related long non-coding RNAs in sheep ovaries.

    Science.gov (United States)

    Miao, Xiangyang; Luo, Qingmiao; Zhao, Huijing; Qin, Xiaoyu

    2016-12-16

    Small Tail Han sheep, including the FecB B FecB B (Han BB) and FecB + FecB + (Han++) genotypes, and Dorset sheep exhibit different fecundities. To identify novel long non-coding RNAs (lncRNAs) associated with sheep fecundity to better understand their molecular mechanisms, a genome-wide analysis of mRNAs and lncRNAs from Han BB, Han++ and Dorset sheep was performed. After the identification of differentially expressed mRNAs and lncRNAs, 16 significant modules were explored by using weighted gene coexpression network analysis (WGCNA) followed by functional enrichment analysis of the genes and lncRNAs in significant modules. Among these selected modules, the yellow and brown modules were significantly related to sheep fecundity. lncRNAs (e.g., NR0B1, XLOC_041882, and MYH15) in the yellow module were mainly involved in the TGF-β signalling pathway, and NYAP1 and BCORL1 were significantly associated with the oxytocin signalling pathway, which regulates several genes in the coexpression network of the brown module. Overall, we identified several gene modules associated with sheep fecundity, as well as networks consisting of hub genes and lncRNAs that may contribute to sheep prolificacy by regulating the target mRNAs related to the TGF-β and oxytocin signalling pathways. This study provides an alternative strategy for the identification of potential candidate regulatory lncRNAs.

  16. Genome-wide expression of transcriptomes and their co-expression pattern in subtropical maize (Zea mays L. under waterlogging stress.

    Directory of Open Access Journals (Sweden)

    Nepolean Thirunavukkarasu

    Full Text Available Waterlogging causes extensive damage to maize crops in tropical and subtropical regions. The identification of tolerance genes and their interactions at the molecular level will be helpful to engineer tolerant genotypes. A whole-genome transcriptome assay revealed the specific role of genes in response to waterlogging stress in susceptible and tolerant genotypes. Genes involved in the synthesis of ethylene and auxin, cell wall metabolism, activation of G-proteins and formation of aerenchyma and adventitious roots, were upregulated in the tolerant genotype. Many transcription factors, particularly ERFs, MYB, HSPs, MAPK, and LOB-domain protein were involved in regulation of these traits. Genes responsible for scavenging of ROS generated under stress were expressed along with those involved in carbohydrate metabolism. The physical locations of 21 genes expressed in the tolerant genotype were found to correspond with the marker intervals of known QTLs responsible for development of adaptive traits. Among the candidate genes, most showed synteny with genes of sorghum and foxtail millet. Co-expression analysis of 528 microarray samples including 16 samples from the present study generated seven functional modules each in the two genotypes, with differing characteristics. In the tolerant genotype, stress genes were co-expressed along with peroxidase and fermentation pathway genes.

  17. A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family

    Science.gov (United States)

    Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.

    2015-01-01

    Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two – rowed and 288 six – rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the

  18. The functional landscape of mouse gene expression

    Directory of Open Access Journals (Sweden)

    Zhang Wen

    2004-12-01

    Full Text Available Abstract Background Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has not been objectively tested nor compared with the related but distinct strategy of correlating gene co-expression as a means to predict gene function. Results We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes are in which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis. Conclusions We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.

  19. Coexpression of EGFR and CXCR4 predicts poor prognosis in resected pancreatic ductal adenocarcinoma.

    Directory of Open Access Journals (Sweden)

    Huanwen Wu

    Full Text Available Epidermal growth factor receptor (EGFR is highly expressed in pancreatic ductal adenocarcinoma (PDAC and is involved in tumorigenesis and development. However, EGFR expression alone has limited clinical and prognostic significance. Recently, the cross-talk between EGFR and G-protein-coupled chemokine receptor CXCR4 has become increasingly recognized.In the present study, immunohistochemical staining of EGFR and CXCR4 was performed on paraffin-embedded specimens from 131 patients with surgically resected PDAC. Subsequently, the associations between EGFR expression, CXCR4 expression, EGFR/CXCR4 coexpression and clinicopathologic factors were assessed, and survival analyses were performed.In total, 64 (48.9% patients expressed EGFR, 68 (51.9% expressed CXCR4, and 33 (25.2% coexpressed EGFR and CXCR4. No significant association between EGFR and CXCR4 expression was observed (P = 0.938. EGFR expression significantly correlated with tumor differentiation (P = 0.031, whereas CXCR4 expression significantly correlated with lymph node metastasis (P = 0.001. EGFR/CXCR4 coexpression was significantly associated with lymph node metastasis (P = 0.026, TNM stage (P = 0.048, and poor tumor differentiation (P = 0.004. By univariate survival analysis, both CXCR4 expression and EGFR/CXCR4 coexpression were significant prognostic factors for poor disease-free survival (DFS and overall survival (OS. Moreover, EGFR/CXCR4 coexpression significantly increased the hazard ratio for both recurrence and death compared with EGFR or CXCR4 protein expression alone. Multivariate survival analysis demonstrated that EGFR/CXCR4 coexpression was an independent prognostic factor for DFS (HR = 2.33, P<0.001 and OS (HR = 2.48, P = 0.001.In conclusion, our data indicate that although EGFR expression alone has limited clinical and prognostic significance, EGFR/CXCR4 coexpression identified a subset of PDAC patients with more aggressive tumor characteristics and a significantly worse

  20. Co-expression of the Thermotoga neapolitana aglB gene with an upstream 3'-coding fragment of the malG gene improves enzymatic characteristics of recombinant AglB cyclomaltodextrinase.

    Science.gov (United States)

    Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A

    2007-07-01

    A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.

  1. Effector genomics accelerates discovery and functional profiling of potato disease resistance and phytophthora infestans avirulence genes.

    Directory of Open Access Journals (Sweden)

    Vivianne G A A Vleeshouwers

    Full Text Available Potato is the world's fourth largest food crop yet it continues to endure late blight, a devastating disease caused by the Irish famine pathogen Phytophthora infestans. Breeding broad-spectrum disease resistance (R genes into potato (Solanum tuberosum is the best strategy for genetically managing late blight but current approaches are slow and inefficient. We used a repertoire of effector genes predicted computationally from the P. infestans genome to accelerate the identification, functional characterization, and cloning of potentially broad-spectrum R genes. An initial set of 54 effectors containing a signal peptide and a RXLR motif was profiled for activation of innate immunity (avirulence or Avr activity on wild Solanum species and tentative Avr candidates were identified. The RXLR effector family IpiO induced hypersensitive responses (HR in S. stoloniferum, S. papita and the more distantly related S. bulbocastanum, the source of the R gene Rpi-blb1. Genetic studies with S. stoloniferum showed cosegregation of resistance to P. infestans and response to IpiO. Transient co-expression of IpiO with Rpi-blb1 in a heterologous Nicotiana benthamiana system identified IpiO as Avr-blb1. A candidate gene approach led to the rapid cloning of S. stoloniferum Rpi-sto1 and S. papita Rpi-pta1, which are functionally equivalent to Rpi-blb1. Our findings indicate that effector genomics enables discovery and functional profiling of late blight R genes and Avr genes at an unprecedented rate and promises to accelerate the engineering of late blight resistant potato varieties.

  2. Genetic architecture of gene expression in the chicken

    Directory of Open Access Journals (Sweden)

    Stanley Dragana

    2013-01-01

    Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.

  3. Differential co-expression and regulation analyses reveal different mechanisms underlying major depressive disorder and subsyndromal symptomatic depression.

    Science.gov (United States)

    Xu, Fan; Yang, Jing; Chen, Jin; Wu, Qingyuan; Gong, Wei; Zhang, Jianguo; Shao, Weihua; Mu, Jun; Yang, Deyu; Yang, Yongtao; Li, Zhiwei; Xie, Peng

    2015-04-03

    Recent depression research has revealed a growing awareness of how to best classify depression into depressive subtypes. Appropriately subtyping depression can lead to identification of subtypes that are more responsive to current pharmacological treatment and aid in separating out depressed patients in which current antidepressants are not particularly effective. Differential co-expression analysis (DCEA) and differential regulation analysis (DRA) were applied to compare the transcriptomic profiles of peripheral blood lymphocytes from patients with two depressive subtypes: major depressive disorder (MDD) and subsyndromal symptomatic depression (SSD). Six differentially regulated genes (DRGs) (FOSL1, SRF, JUN, TFAP4, SOX9, and HLF) and 16 transcription factor-to-target differentially co-expressed gene links or pairs (TF2target DCLs) appear to be the key differential factors in MDD; in contrast, one DRG (PATZ1) and eight TF2target DCLs appear to be the key differential factors in SSD. There was no overlap between the MDD target genes and SSD target genes. Venlafaxine (Efexor™, Effexor™) appears to have a significant effect on the gene expression profile of MDD patients but no significant effect on the gene expression profile of SSD patients. DCEA and DRA revealed no apparent similarities between the differential regulatory processes underlying MDD and SSD. This bioinformatic analysis may provide novel insights that can support future antidepressant R&D efforts.

  4. Gene-Transformation-Induced Changes in Chemical Functional Group Features and Molecular Structure Conformation in Alfalfa Plants Co-Expressing Lc-bHLH and C1-MYB Transcriptive Flavanoid Regulatory Genes: Effects of Single-Gene and Two-Gene Insertion.

    Science.gov (United States)

    Heendeniya, Ravindra G; Yu, Peiqiang

    2017-03-20

    Alfalfa ( Medicago sativa L.) genotypes transformed with Lc-bHLH and Lc transcription genes were developed with the intention of stimulating proanthocyanidin synthesis in the aerial parts of the plant. To our knowledge, there are no studies on the effect of single-gene and two-gene transformation on chemical functional groups and molecular structure changes in these plants. The objective of this study was to use advanced molecular spectroscopy with multivariate chemometrics to determine chemical functional group intensity and molecular structure changes in alfalfa plants when co-expressing Lc-bHLH and C1-MYB transcriptive flavanoid regulatory genes in comparison with non-transgenic (NT) and AC Grazeland (ACGL) genotypes. The results showed that compared to NT genotype, the presence of double genes ( Lc and C1 ) increased ratios of both the area and peak height of protein structural Amide I/II and the height ratio of α-helix to β-sheet. In carbohydrate-related spectral analysis, the double gene-transformed alfalfa genotypes exhibited lower peak heights at 1370, 1240, 1153, and 1020 cm -1 compared to the NT genotype. Furthermore, the effect of double gene transformation on carbohydrate molecular structure was clearly revealed in the principal component analysis of the spectra. In conclusion, single or double transformation of Lc and C1 genes resulted in changing functional groups and molecular structure related to proteins and carbohydrates compared to the NT alfalfa genotype. The current study provided molecular structural information on the transgenic alfalfa plants and provided an insight into the impact of transgenes on protein and carbohydrate properties and their molecular structure's changes.

  5. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    Energy Technology Data Exchange (ETDEWEB)

    SacconePhD, Scott F [Washington University, St. Louis; Chesler, Elissa J [ORNL; Bierut, Laura J [Washington University, St. Louis; Kalivas, Peter J [Medical College of South Carolina, Charleston; Lerman, Caryn [University of Pennsylvania; Saccone, Nancy L [Washington University, St. Louis; Uhl, George R [Johns Hopkins University; Li, Chuan-Yun [Peking University; Philip, Vivek M [ORNL; Edenberg, Howard [Indiana University; Sherry, Steven [National Center for Biotechnology Information; Feolo, Michael [National Center for Biotechnology Information; Moyzis, Robert K [Johns Hopkins University; Rutter, Joni L [National Institute of Drug Abuse

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.

  6. Characterization of differentially expressed genes involved in pathways associated with gastric cancer.

    Directory of Open Access Journals (Sweden)

    Hao Li

    Full Text Available To explore the patterns of gene expression in gastric cancer, a total of 26 paired gastric cancer and noncancerous tissues from patients were enrolled for gene expression microarray analyses. Limma methods were applied to analyze the data, and genes were considered to be significantly differentially expressed if the False Discovery Rate (FDR value was 2. Subsequently, Gene Ontology (GO categories were used to analyze the main functions of the differentially expressed genes. According to the Kyoto Encyclopedia of Genes and Genomes (KEGG database, we found pathways significantly associated with the differential genes. Gene-Act network and co-expression network were built respectively based on the relationships among the genes, proteins and compounds in the database. 2371 mRNAs and 350 lncRNAs considered as significantly differentially expressed genes were selected for the further analysis. The GO categories, pathway analyses and the Gene-Act network showed a consistent result that up-regulated genes were responsible for tumorigenesis, migration, angiogenesis and microenvironment formation, while down-regulated genes were involved in metabolism. These results of this study provide some novel findings on coding RNAs, lncRNAs, pathways and the co-expression network in gastric cancer which will be useful to guide further investigation and target therapy for this disease.

  7. Domestication rewired gene expression and nucleotide diversity patterns in tomato.

    Science.gov (United States)

    Sauvage, Christopher; Rau, Andrea; Aichholz, Charlotte; Chadoeuf, Joël; Sarah, Gautier; Ruiz, Manuel; Santoni, Sylvain; Causse, Mathilde; David, Jacques; Glémin, Sylvain

    2017-08-01

    Plant domestication has led to considerable phenotypic modifications from wild species to modern varieties. However, although changes in key traits have been well documented, less is known about the underlying molecular mechanisms, such as the reduction of molecular diversity or global gene co-expression patterns. In this study, we used a combination of gene expression and population genetics in wild and crop tomato to decipher the footprints of domestication. We found a set of 1729 differentially expressed genes (DEG) between the two genetic groups, belonging to 17 clusters of co-expressed DEG, suggesting that domestication affected not only individual genes but also regulatory networks. Five co-expression clusters were enriched in functional terms involving carbohydrate metabolism or epigenetic regulation of gene expression. We detected differences in nucleotide diversity between the crop and wild groups specific to DEG. Our study provides an extensive profiling of the rewiring of gene co-expression induced by the domestication syndrome in one of the main crop species. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  8. Shared activity patterns arising at genetic susceptibility loci reveal underlying genomic and cellular architecture of human disease.

    Science.gov (United States)

    Baillie, J Kenneth; Bretherick, Andrew; Haley, Christopher S; Clohisey, Sara; Gray, Alan; Neyton, Lucile P A; Barrett, Jeffrey; Stahl, Eli A; Tenesa, Albert; Andersson, Robin; Brown, J Ben; Faulkner, Geoffrey J; Lizio, Marina; Schaefer, Ulf; Daub, Carsten; Itoh, Masayoshi; Kondo, Naoto; Lassmann, Timo; Kawai, Jun; Mole, Damian; Bajic, Vladimir B; Heutink, Peter; Rehli, Michael; Kawaji, Hideya; Sandelin, Albin; Suzuki, Harukazu; Satsangi, Jack; Wells, Christine A; Hacohen, Nir; Freeman, Thomas C; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R; Hume, David A

    2018-03-01

    Genetic variants underlying complex traits, including disease susceptibility, are enriched within the transcriptional regulatory elements, promoters and enhancers. There is emerging evidence that regulatory elements associated with particular traits or diseases share similar patterns of transcriptional activity. Accordingly, shared transcriptional activity (coexpression) may help prioritise loci associated with a given trait, and help to identify underlying biological processes. Using cap analysis of gene expression (CAGE) profiles of promoter- and enhancer-derived RNAs across 1824 human samples, we have analysed coexpression of RNAs originating from trait-associated regulatory regions using a novel quantitative method (network density analysis; NDA). For most traits studied, phenotype-associated variants in regulatory regions were linked to tightly-coexpressed networks that are likely to share important functional characteristics. Coexpression provides a new signal, independent of phenotype association, to enable fine mapping of causative variants. The NDA coexpression approach identifies new genetic variants associated with specific traits, including an association between the regulation of the OCT1 cation transporter and genetic variants underlying circulating cholesterol levels. NDA strongly implicates particular cell types and tissues in disease pathogenesis. For example, distinct groupings of disease-associated regulatory regions implicate two distinct biological processes in the pathogenesis of ulcerative colitis; a further two separate processes are implicated in Crohn's disease. Thus, our functional analysis of genetic predisposition to disease defines new distinct disease endotypes. We predict that patients with a preponderance of susceptibility variants in each group are likely to respond differently to pharmacological therapy. Together, these findings enable a deeper biological understanding of the causal basis of complex traits.

  9. Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

    Science.gov (United States)

    Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

    2017-10-03

    Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.

  10. A Systems Biology Framework Identifies Molecular Underpinnings of Coronary Heart Disease

    Science.gov (United States)

    Huan, Tianxiao; Zhang, Bin; Wang, Zhi; Joehanes, Roby; Zhu, Jun; Johnson, Andrew D.; Ying, Saixia; Munson, Peter J.; Raghavachari, Nalini; Wang, Richard; Liu, Poching; Courchesne, Paul; Hwang, Shih-Jen; Assimes, Themistocles L.; McPherson, Ruth; Samani, Nilesh J.; Schunkert, Heribert; Meng, Qingying; Suver, Christine; O'Donnell, Christopher J.; Derry, Jonathan; Yang, Xia; Levy, Daniel

    2013-01-01

    Objective Genetic approaches have identified numerous loci associated with coronary heart disease (CHD). The molecular mechanisms underlying CHD gene-disease associations, however, remain unclear. We hypothesized that genetic variants with both strong and subtle effects drive gene subnetworks that in turn affect CHD. Approach and Results We surveyed CHD-associated molecular interactions by constructing coexpression networks using whole blood gene expression profiles from 188 CHD cases and 188 age- and sex-matched controls. 24 coexpression modules were identified including one case-specific and one control-specific differential module (DM). The DMs were enriched for genes involved in B-cell activation, immune response, and ion transport. By integrating the DMs with altered gene expression associated SNPs (eSNPs) and with results of GWAS of CHD and its risk factors, the control-specific DM was implicated as CHD-causal based on its significant enrichment for both CHD and lipid eSNPs. This causal DM was further integrated with tissue-specific Bayesian networks and protein-protein interaction networks to identify regulatory key driver (KD) genes. Multi-tissue KDs (SPIB and TNFRSF13C) and tissue-specific KDs (e.g. EBF1) were identified. Conclusions Our network-driven integrative analysis not only identified CHD-related genes, but also defined network structure that sheds light on the molecular interactions of genes associated with CHD risk. PMID:23539213

  11. Genomewide Expression and Functional Interactions of Genes under Drought Stress in Maize

    Directory of Open Access Journals (Sweden)

    Nepolean Thirunavukkarasu

    2017-01-01

    Full Text Available A genomewide transcriptome assay of two subtropical genotypes of maize was used to observe the expression of genes at seedling stage of drought stress. The number of genes expressed differentially was greater in HKI1532 (a drought tolerant genotype than in PC3 (a drought sensitive genotype, indicating primary differences at the transcriptional level in stress tolerance. The global coexpression networks of the two genotypes differed significantly with respect to the number of modules and the coexpression pattern within the modules. A total of 174 drought-responsive genes were selected from HKI1532, and their coexpression network revealed key correlations between different adaptive pathways, each cluster of the network representing a specific biological function. Transcription factors related to ABA-dependent stomatal closure, signalling, and phosphoprotein cascades work in concert to compensate for reduced photosynthesis. Under stress, water balance was maintained by coexpression of the genes involved in osmotic adjustments and transporter proteins. Metabolism was maintained by the coexpression of genes involved in cell wall modification and protein and lipid metabolism. The interaction of genes involved in crucial biological functions during stress was identified and the results will be useful in targeting important gene interactions to understand drought tolerance in greater detail.

  12. Genes2FANs: connecting genes through functional association networks

    Science.gov (United States)

    2012-01-01

    Background Protein-protein, cell signaling, metabolic, and transcriptional interaction networks are useful for identifying connections between lists of experimentally identified genes/proteins. However, besides physical or co-expression interactions there are many ways in which pairs of genes, or their protein products, can be associated. By systematically incorporating knowledge on shared properties of genes from diverse sources to build functional association networks (FANs), researchers may be able to identify additional functional interactions between groups of genes that are not readily apparent. Results Genes2FANs is a web based tool and a database that utilizes 14 carefully constructed FANs and a large-scale protein-protein interaction (PPI) network to build subnetworks that connect lists of human and mouse genes. The FANs are created from mammalian gene set libraries where mouse genes are converted to their human orthologs. The tool takes as input a list of human or mouse Entrez gene symbols to produce a subnetwork and a ranked list of intermediate genes that are used to connect the query input list. In addition, users can enter any PubMed search term and then the system automatically converts the returned results to gene lists using GeneRIF. This gene list is then used as input to generate a subnetwork from the user’s PubMed query. As a case study, we applied Genes2FANs to connect disease genes from 90 well-studied disorders. We find an inverse correlation between the counts of links connecting disease genes through PPI and links connecting diseases genes through FANs, separating diseases into two categories. Conclusions Genes2FANs is a useful tool for interpreting the relationships between gene/protein lists in the context of their various functions and networks. Combining functional association interactions with physical PPIs can be useful for revealing new biology and help form hypotheses for further experimentation. Our finding that disease genes in

  13. Successful recombinant production of Allochromatium vinosum cytochrome c' requires coexpression of cmm genes in heme-rich Escherichia coli JCB712

    International Nuclear Information System (INIS)

    Evers, Toon H.; Merkx, Maarten

    2005-01-01

    Cytochrome c' from the purple photosynthetic bacterium Allochromatium vinosum (CCP) displays a unique, reversible dimer-to-monomer transition upon binding of NO, CO, and CN - . This small, four helix bundle protein represents an attractive model for the study of other heme protein biosensors, provided a recombinant expression system is available. Here we report the development of an efficient expression system for CCP that makes use of a maltose binding protein fusion strategy to enhance periplasmic expression and allow easy purification by affinity chromatography. Coexpression of cytochrome c maturase genes and the use of a heme-rich Escherichia coli strain were found to be necessary to obtain reasonable yields of cytochrome c'. Characterization using circular dichroism, UV-vis spectroscopy, and size-exclusion chromatography confirms the native-like properties of the recombinant protein, including its ligand-induced monomerization

  14. Genetic Network Inference: From Co-Expression Clustering to Reverse Engineering

    Science.gov (United States)

    Dhaeseleer, Patrik; Liang, Shoudan; Somogyi, Roland

    2000-01-01

    Advances in molecular biological, analytical, and computational technologies are enabling us to systematically investigate the complex molecular processes underlying biological systems. In particular, using high-throughput gene expression assays, we are able to measure the output of the gene regulatory network. We aim here to review datamining and modeling approaches for conceptualizing and unraveling the functional relationships implicit in these datasets. Clustering of co-expression profiles allows us to infer shared regulatory inputs and functional pathways. We discuss various aspects of clustering, ranging from distance measures to clustering algorithms and multiple-duster memberships. More advanced analysis aims to infer causal connections between genes directly, i.e., who is regulating whom and how. We discuss several approaches to the problem of reverse engineering of genetic networks, from discrete Boolean networks, to continuous linear and non-linear models. We conclude that the combination of predictive modeling with systematic experimental verification will be required to gain a deeper insight into living organisms, therapeutic targeting, and bioengineering.

  15. MicroRNAs Clustered within the 14q32 Locus Are Associated with Endothelial Damage and Microparticle Secretion in Bicuspid Aortic Valve Disease

    Directory of Open Access Journals (Sweden)

    Neus Martínez-Micaelo

    2017-09-01

    Full Text Available Background: We previously described that PECAM+ circulating endothelial microparticles (EMPs are elevated in bicuspid aortic valve (BAV disease as a manifestation of endothelial damage. In this study, we hypothesized that this endothelial damage, is functionally related to the secretion of a specific pattern of EMP-associated miRNAs.Methods: We used a bioinformatics approach to correlate the PECAM+ EMP levels with the miRNA expression profile in plasma in healthy individuals and BAV patients (n = 36. In addition, using the miRNAs that were significantly associated with PECAM+ EMP levels, we inferred a miRNA co-expression network using a Gaussian graphical modeling approach to identify highly co-expressed miRNAs or miRNA clusters whose expression could functionally regulate endothelial damage.Results: We identified a co-expression network composed of 131 miRNAs whose circulating expression was significantly associated with PECAM+ EMP levels. Using a topological analysis, we found that miR-494 was the most important hub within the co-expression network. Furthermore, through positional gene enrichment analysis, we identified a cluster of 19 highly co-expressed miRNAs, including miR-494, that was located in the 14q32 locus on chromosome 14 (p = 1.9 × 10−7. We evaluated the putative biological role of this miRNA cluster by determining the biological significance of the genes targeted by the cluster using functional enrichment analysis. We found that this cluster was involved in the regulation of genes with various functions, specifically the “cellular nitrogen compound metabolic process” (p = 2.34 × 10−145, “immune system process” (p = 2.57 × 10−6, and “extracellular matrix organization” (p = 8.14 × 10−5 gene ontology terms and the “TGF-β signaling pathway” KEGG term (p = 2.59 × 10−8.Conclusions: Using an integrative bioinformatics approach, we identified the circulating miRNA expression profile associated with

  16. MicroRNAs Clustered within the 14q32 Locus Are Associated with Endothelial Damage and Microparticle Secretion in Bicuspid Aortic Valve Disease

    Science.gov (United States)

    Martínez-Micaelo, Neus; Beltrán-Debón, Raúl; Aragonés, Gerard; Faiges, Marta; Alegret, Josep M.

    2017-01-01

    Background: We previously described that PECAM+ circulating endothelial microparticles (EMPs) are elevated in bicuspid aortic valve (BAV) disease as a manifestation of endothelial damage. In this study, we hypothesized that this endothelial damage, is functionally related to the secretion of a specific pattern of EMP-associated miRNAs. Methods: We used a bioinformatics approach to correlate the PECAM+ EMP levels with the miRNA expression profile in plasma in healthy individuals and BAV patients (n = 36). In addition, using the miRNAs that were significantly associated with PECAM+ EMP levels, we inferred a miRNA co-expression network using a Gaussian graphical modeling approach to identify highly co-expressed miRNAs or miRNA clusters whose expression could functionally regulate endothelial damage. Results: We identified a co-expression network composed of 131 miRNAs whose circulating expression was significantly associated with PECAM+ EMP levels. Using a topological analysis, we found that miR-494 was the most important hub within the co-expression network. Furthermore, through positional gene enrichment analysis, we identified a cluster of 19 highly co-expressed miRNAs, including miR-494, that was located in the 14q32 locus on chromosome 14 (p = 1.9 × 10−7). We evaluated the putative biological role of this miRNA cluster by determining the biological significance of the genes targeted by the cluster using functional enrichment analysis. We found that this cluster was involved in the regulation of genes with various functions, specifically the “cellular nitrogen compound metabolic process” (p = 2.34 × 10−145), “immune system process” (p = 2.57 × 10−6), and “extracellular matrix organization” (p = 8.14 × 10−5) gene ontology terms and the “TGF-β signaling pathway” KEGG term (p = 2.59 × 10−8). Conclusions: Using an integrative bioinformatics approach, we identified the circulating miRNA expression profile associated with secreted PECAM

  17. Revealing Pathway Dynamics in Heart Diseases by Analyzing Multiple Differential Networks.

    Directory of Open Access Journals (Sweden)

    Xiaoke Ma

    2015-06-01

    Full Text Available Development of heart diseases is driven by dynamic changes in both the activity and connectivity of gene pathways. Understanding these dynamic events is critical for understanding pathogenic mechanisms and development of effective treatment. Currently, there is a lack of computational methods that enable analysis of multiple gene networks, each of which exhibits differential activity compared to the network of the baseline/healthy condition. We describe the iMDM algorithm to identify both unique and shared gene modules across multiple differential co-expression networks, termed M-DMs (multiple differential modules. We applied iMDM to a time-course RNA-Seq dataset generated using a murine heart failure model generated on two genotypes. We showed that iMDM achieves higher accuracy in inferring gene modules compared to using single or multiple co-expression networks. We found that condition-specific M-DMs exhibit differential activities, mediate different biological processes, and are enriched for genes with known cardiovascular phenotypes. By analyzing M-DMs that are present in multiple conditions, we revealed dynamic changes in pathway activity and connectivity across heart failure conditions. We further showed that module dynamics were correlated with the dynamics of disease phenotypes during the development of heart failure. Thus, pathway dynamics is a powerful measure for understanding pathogenesis. iMDM provides a principled way to dissect the dynamics of gene pathways and its relationship to the dynamics of disease phenotype. With the exponential growth of omics data, our method can aid in generating systems-level insights into disease progression.

  18. Recombinant amyloid beta-peptide production by coexpression with an affibody ligand

    Science.gov (United States)

    Macao, Bertil; Hoyer, Wolfgang; Sandberg, Anders; Brorsson, Ann-Christin; Dobson, Christopher M; Härd, Torleif

    2008-01-01

    Background Oligomeric and fibrillar aggregates of the amyloid β-peptide (Aβ) have been implicated in the pathogenesis of Alzheimer's disease (AD). The characterization of Aβ assemblies is essential for the elucidation of the mechanisms of Aβ neurotoxicity, but requires large quantities of pure peptide. Here we describe a novel approach to the recombinant production of Aβ. The method is based on the coexpression of the affibody protein ZAβ3, a selected affinity ligand derived from the Z domain three-helix bundle scaffold. ZAβ3 binds to the amyloidogenic central and C-terminal part of Aβ with nanomolar affinity and consequently inhibits aggregation. Results Coexpression of ZAβ3 affords the overexpression of both major Aβ isoforms, Aβ(1–40) and Aβ(1–42), yielding 4 or 3 mg, respectively, of pure 15N-labeled peptide per liter of culture. The method does not rely on a protein-fusion or -tag and thus does not require a cleavage reaction. The purified peptides were characterized by NMR, circular dichroism, SDS-PAGE and size exclusion chromatography, and their aggregation propensities were assessed by thioflavin T fluorescence and electron microscopy. The data coincide with those reported previously for monomeric, largely unstructured Aβ. ZAβ3 coexpression moreover permits the recombinant production of Aβ(1–42) carrying the Arctic (E22G) mutation, which causes early onset familial AD. Aβ(1–42)E22G is obtained in predominantly monomeric form and suitable, e.g., for NMR studies. Conclusion The coexpression of an engineered aggregation-inhibiting binding protein offers a novel route to the recombinant production of amyloidogenic Aβ peptides that can be advantageously employed to study the molecular basis of AD. The presented expression system is the first for which expression and purification of the aggregation-prone Arctic variant (E22G) of Aβ(1–42) is reported. PMID:18973685

  19. Recombinant amyloid beta-peptide production by coexpression with an affibody ligand

    Directory of Open Access Journals (Sweden)

    Dobson Christopher M

    2008-10-01

    Full Text Available Abstract Background Oligomeric and fibrillar aggregates of the amyloid β-peptide (Aβ have been implicated in the pathogenesis of Alzheimer's disease (AD. The characterization of Aβ assemblies is essential for the elucidation of the mechanisms of Aβ neurotoxicity, but requires large quantities of pure peptide. Here we describe a novel approach to the recombinant production of Aβ. The method is based on the coexpression of the affibody protein ZAβ3, a selected affinity ligand derived from the Z domain three-helix bundle scaffold. ZAβ3 binds to the amyloidogenic central and C-terminal part of Aβ with nanomolar affinity and consequently inhibits aggregation. Results Coexpression of ZAβ3 affords the overexpression of both major Aβ isoforms, Aβ(1–40 and Aβ(1–42, yielding 4 or 3 mg, respectively, of pure 15N-labeled peptide per liter of culture. The method does not rely on a protein-fusion or -tag and thus does not require a cleavage reaction. The purified peptides were characterized by NMR, circular dichroism, SDS-PAGE and size exclusion chromatography, and their aggregation propensities were assessed by thioflavin T fluorescence and electron microscopy. The data coincide with those reported previously for monomeric, largely unstructured Aβ. ZAβ3 coexpression moreover permits the recombinant production of Aβ(1–42 carrying the Arctic (E22G mutation, which causes early onset familial AD. Aβ(1–42E22G is obtained in predominantly monomeric form and suitable, e.g., for NMR studies. Conclusion The coexpression of an engineered aggregation-inhibiting binding protein offers a novel route to the recombinant production of amyloidogenic Aβ peptides that can be advantageously employed to study the molecular basis of AD. The presented expression system is the first for which expression and purification of the aggregation-prone Arctic variant (E22G of Aβ(1–42 is reported.

  20. S187. SEARCHING FOR BRAIN CO-EXPRESSION MODULES THAT CONTRIBUTE DISPROPORTIONATELY TO THE COMMON POLYGENIC RISK FOR SCHIZOPHRENIA

    Science.gov (United States)

    Costas, Javier; Paramo, Mario; Arrojo, Manuel

    2018-01-01

    Abstract Background Genomic research has revealed that schizophrenia is a highly polygenic disease. Recent estimates indicate that at least 71% of genomic segments of 1 Mb include one or more risk loci for schizophrenia (Loh et al., Nature Genet 2015). This extremely high polygenicity represents a challenge to decipher the biological basis of schizophrenia, as it is expected that any set of SNPs with enough size will be associated with the disorder. Among the different gene sets available for study (such as those from Gene Ontology, KEGG pathway, Reactome pathways or protein protein interaction datasets), those based on brain co-expression networks represent putative functional relationships in the relevant tissue. The aim of this work was to identify brain co-expression networks that contribute disproportionately to the common polygenic risk for schizophrenia to get more insight on schizophrenia etiopathology. Methods We analyzed a case -control dataset consisting of 582 schizophrenia patients from Galicia, NW Spain, and 591 ancestrally matched controls, genotyped with the Illumina PsychArray. Using as discovery sample the summary results from the largest GWAS of schizophrenia to date (Psychiatric Genomics Consortium, SCZ2), we generated polygenic risk scores (PRS) in our sample based on SNPs located at genes belonging to brain co-expression modules determined by the CommonMind Consortium (Fromer et al., Nature Neurosci 2016). PRS were generated using the clumping procedure of PLINK, considering several different thresholds to select SNPs from the discovery sample. In order to test if any specific module increased risk to schizophrenia more than expected by their size, we generated up to 10,000 random permutations of the same number of SNPs, matched by frequency, distance to nearest gene, number of SNPs in LD and gene density, using SNPsnap. Results As expected, most modules with enough number of independent SNPs belonging to them showed a significant increase in

  1. Bcl-2 and N-Myc Coexpression Increases IGF-IR and Features of Malignant Growth in Neuroblastoma Cell Lines

    Directory of Open Access Journals (Sweden)

    Rama Jasty

    2001-01-01

    Full Text Available The bcl-2 and c-myc oncogenes cooperate to transform multiple cell types. In the pediatric malignancy NB2, Bcl2 is highly expressed. In tumors with a poor prognosis, N-Myc, a protein homologous to c-Myc, is overexpressed as a result of gene amplification. The present study was designed to determine whether Bcl-2 cooperates with N-Myc to bestow a tumorigenic phenotype to neuroblastoma (NB cells. NB cell lines that at baseline express neither Bcl-2 nor N-Myc were stably transfected to express these gene products. In this model, we found Bcl-2 rescues N-Myc-expressing cells from apoptosis induced by serum withdrawal. Coexpression of Bcl-2 and N-Myc supports growth in low serum conditions and anchorage-independent growth in soft agar. Similarly, in vivo tumorigenic and angiogenic activity was dependent on coexpression. Our data further suggests that the mechanism underlying these changes involves the receptor for insulin growth factor type I (IGF-IR.

  2. Intra- and interspecies gene expression models for predicting drug response in canine osteosarcoma.

    Science.gov (United States)

    Fowles, Jared S; Brown, Kristen C; Hess, Ann M; Duval, Dawn L; Gustafson, Daniel L

    2016-02-19

    Genomics-based predictors of drug response have the potential to improve outcomes associated with cancer therapy. Osteosarcoma (OS), the most common primary bone cancer in dogs, is commonly treated with adjuvant doxorubicin or carboplatin following amputation of the affected limb. We evaluated the use of gene-expression based models built in an intra- or interspecies manner to predict chemosensitivity and treatment outcome in canine OS. Models were built and evaluated using microarray gene expression and drug sensitivity data from human and canine cancer cell lines, and canine OS tumor datasets. The "COXEN" method was utilized to filter gene signatures between human and dog datasets based on strong co-expression patterns. Models were built using linear discriminant analysis via the misclassification penalized posterior algorithm. The best doxorubicin model involved genes identified in human lines that were co-expressed and trained on canine OS tumor data, which accurately predicted clinical outcome in 73 % of dogs (p = 0.0262, binomial). The best carboplatin model utilized canine lines for gene identification and model training, with canine OS tumor data for co-expression. Dogs whose treatment matched our predictions had significantly better clinical outcomes than those that didn't (p = 0.0006, Log Rank), and this predictor significantly associated with longer disease free intervals in a Cox multivariate analysis (hazard ratio = 0.3102, p = 0.0124). Our data show that intra- and interspecies gene expression models can successfully predict response in canine OS, which may improve outcome in dogs and serve as pre-clinical validation for similar methods in human cancer research.

  3. Mining tissue specificity, gene connectivity and disease association to reveal a set of genes that modify the action of disease causing genes

    Directory of Open Access Journals (Sweden)

    Reverter Antonio

    2008-09-01

    Full Text Available Abstract Background The tissue specificity of gene expression has been linked to a number of significant outcomes including level of expression, and differential rates of polymorphism, evolution and disease association. Recent studies have also shown the importance of exploring differential gene connectivity and sequence conservation in the identification of disease-associated genes. However, no study relates gene interactions with tissue specificity and disease association. Methods We adopted an a priori approach making as few assumptions as possible to analyse the interplay among gene-gene interactions with tissue specificity and its subsequent likelihood of association with disease. We mined three large datasets comprising expression data drawn from massively parallel signature sequencing across 32 tissues, describing a set of 55,606 true positive interactions for 7,197 genes, and microarray expression results generated during the profiling of systemic inflammation, from which 126,543 interactions among 7,090 genes were reported. Results Amongst the myriad of complex relationships identified between expression, disease, connectivity and tissue specificity, some interesting patterns emerged. These include elevated rates of expression and network connectivity in housekeeping and disease-associated tissue-specific genes. We found that disease-associated genes are more likely to show tissue specific expression and most frequently interact with other disease genes. Using the thresholds defined in these observations, we develop a guilt-by-association algorithm and discover a group of 112 non-disease annotated genes that predominantly interact with disease-associated genes, impacting on disease outcomes. Conclusion We conclude that parameters such as tissue specificity and network connectivity can be used in combination to identify a group of genes, not previously confirmed as disease causing, that are involved in interactions with disease causing

  4. Enforcing Co-expression Within a Brain-Imaging Genomics Regression Framework.

    Science.gov (United States)

    Zille, Pascal; Calhoun, Vince D; Wang, Yu-Ping

    2017-06-28

    Among the challenges arising in brain imaging genetic studies, estimating the potential links between neurological and genetic variability within a population is key. In this work, we propose a multivariate, multimodal formulation for variable selection that leverages co-expression patterns across various data modalities. Our approach is based on an intuitive combination of two widely used statistical models: sparse regression and canonical correlation analysis (CCA). While the former seeks multivariate linear relationships between a given phenotype and associated observations, the latter searches to extract co-expression patterns between sets of variables belonging to different modalities. In the following, we propose to rely on a 'CCA-type' formulation in order to regularize the classical multimodal sparse regression problem (essentially incorporating both CCA and regression models within a unified formulation). The underlying motivation is to extract discriminative variables that are also co-expressed across modalities. We first show that the simplest formulation of such model can be expressed as a special case of collaborative learning methods. After discussing its limitation, we propose an extended, more flexible formulation, and introduce a simple and efficient alternating minimization algorithm to solve the associated optimization problem.We explore the parameter space and provide some guidelines regarding parameter selection. Both the original and extended versions are then compared on a simple toy dataset and a more advanced simulated imaging genomics dataset in order to illustrate the benefits of the latter. Finally, we validate the proposed formulation using single nucleotide polymorphisms (SNP) data and functional magnetic resonance imaging (fMRI) data from a population of adolescents (n = 362 subjects, age 16.9 ± 1.9 years from the Philadelphia Neurodevelopmental Cohort) for the study of learning ability. Furthermore, we carry out a significance

  5. Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes

    KAUST Repository

    Alshahrani, Mona

    2018-04-30

    In the past years, several methods have been developed to incorporate information about phenotypes into computational disease gene prioritization methods. These methods commonly compute the similarity between a disease\\'s (or patient\\'s) phenotypes and a database of gene-to-phenotype associations to find the phenotypically most similar match. A key limitation of these methods is their reliance on knowledge about phenotypes associated with particular genes which is highly incomplete in humans as well as in many model organisms such as the mouse. Results: We developed SmuDGE, a method that uses feature learning to generate vector-based representations of phenotypes associated with an entity. SmuDGE can be used as a trainable semantic similarity measure to compare two sets of phenotypes (such as between a disease and gene, or a disease and patient). More importantly, SmuDGE can generate phenotype representations for entities that are only indirectly associated with phenotypes through an interaction network; for this purpose, SmuDGE exploits background knowledge in interaction networks comprising of multiple types of interactions. We demonstrate that SmuDGE can match or outperform semantic similarity in phenotype-based disease gene prioritization, and furthermore significantly extends the coverage of phenotype-based methods to all genes in a connected interaction network.

  6. Identifying modules of coexpressed transcript units and their organization of Saccharopolyspora erythraea from time series gene expression profiles.

    Directory of Open Access Journals (Sweden)

    Xiao Chang

    Full Text Available BACKGROUND: The Saccharopolyspora erythraea genome sequence was released in 2007. In order to look at the gene regulations at whole transcriptome level, an expression microarray was specifically designed on the S. erythraea strain NRRL 2338 genome sequence. Based on these data, we set out to investigate the potential transcriptional regulatory networks and their organization. METHODOLOGY/PRINCIPAL FINDINGS: In view of the hierarchical structure of bacterial transcriptional regulation, we constructed a hierarchical coexpression network at whole transcriptome level. A total of 27 modules were identified from 1255 differentially expressed transcript units (TUs across time course, which were further classified in to four groups. Functional enrichment analysis indicated the biological significance of our hierarchical network. It was indicated that primary metabolism is activated in the first rapid growth phase (phase A, and secondary metabolism is induced when the growth is slowed down (phase B. Among the 27 modules, two are highly correlated to erythromycin production. One contains all genes in the erythromycin-biosynthetic (ery gene cluster and the other seems to be associated with erythromycin production by sharing common intermediate metabolites. Non-concomitant correlation between production and expression regulation was observed. Especially, by calculating the partial correlation coefficients and building the network based on Gaussian graphical model, intrinsic associations between modules were found, and the association between those two erythromycin production-correlated modules was included as expected. CONCLUSIONS: This work created a hierarchical model clustering transcriptome data into coordinated modules, and modules into groups across the time course, giving insight into the concerted transcriptional regulations especially the regulation corresponding to erythromycin production of S. erythraea. This strategy may be extendable to studies

  7. Identifying modules of coexpressed transcript units and their organization of Saccharopolyspora erythraea from time series gene expression profiles.

    Science.gov (United States)

    Chang, Xiao; Liu, Shuai; Yu, Yong-Tao; Li, Yi-Xue; Li, Yuan-Yuan

    2010-08-12

    The Saccharopolyspora erythraea genome sequence was released in 2007. In order to look at the gene regulations at whole transcriptome level, an expression microarray was specifically designed on the S. erythraea strain NRRL 2338 genome sequence. Based on these data, we set out to investigate the potential transcriptional regulatory networks and their organization. In view of the hierarchical structure of bacterial transcriptional regulation, we constructed a hierarchical coexpression network at whole transcriptome level. A total of 27 modules were identified from 1255 differentially expressed transcript units (TUs) across time course, which were further classified in to four groups. Functional enrichment analysis indicated the biological significance of our hierarchical network. It was indicated that primary metabolism is activated in the first rapid growth phase (phase A), and secondary metabolism is induced when the growth is slowed down (phase B). Among the 27 modules, two are highly correlated to erythromycin production. One contains all genes in the erythromycin-biosynthetic (ery) gene cluster and the other seems to be associated with erythromycin production by sharing common intermediate metabolites. Non-concomitant correlation between production and expression regulation was observed. Especially, by calculating the partial correlation coefficients and building the network based on Gaussian graphical model, intrinsic associations between modules were found, and the association between those two erythromycin production-correlated modules was included as expected. This work created a hierarchical model clustering transcriptome data into coordinated modules, and modules into groups across the time course, giving insight into the concerted transcriptional regulations especially the regulation corresponding to erythromycin production of S. erythraea. This strategy may be extendable to studies on other prokaryotic microorganisms.

  8. Protein Coexpression Using FMDV 2A: Effect of “Linker” Residues

    Directory of Open Access Journals (Sweden)

    Ekaterina Minskaia

    2013-01-01

    Full Text Available Many biomedical applications absolutely require, or are substantially enhanced by, coexpression of multiple proteins from a single vector. Foot-and-mouth disease virus 2A (F2A and “2A-like” sequences (e.g., Thosea asigna virus 2A; T2A are used widely for this purpose since multiple proteins can be coexpressed by linking open reading frames (ORFs to form a single cistron. The activity of F2A “cleavage” may, however, be compromised by both the use of shorter versions of F2A and the sequences (derived from multiple-purpose cloning sites used to link F2A to the upstream protein. To characterise these effects, different lengths of F2A and T2A were inserted between green and cherry fluorescent proteins. Mutations were introduced in the linker region immediately upstream of both F2A- and T2A-based constructs and activities determined using both cell-free translation systems and transfected cells. In shorter versions of F2A, activity may be affected by both the C-terminal sequence of the protein upstream and, equally strikingly, the residues immediately upstream introduced during cloning. Mutations significantly improved activity for shorter versions of F2A but could decrease activity in the case of T2A. These data will aid the design of cloning strategies for the co-expression of multiple proteins in biomedical/biotechnological applications.

  9. Iron homeostasis in Arabidopsis thaliana: transcriptomic analyses reveal novel FIT-regulated genes, iron deficiency marker genes and functional gene networks.

    Science.gov (United States)

    Mai, Hans-Jörg; Pateyron, Stéphanie; Bauer, Petra

    2016-10-03

    FIT (FER-LIKE IRON DEFICIENCY-INDUCED TRANSCRIPTION FACTOR) is the central regulator of iron uptake in Arabidopsis thaliana roots. We performed transcriptome analyses of six day-old seedlings and roots of six week-old plants using wild type, a fit knock-out mutant and a FIT over-expression line grown under iron-sufficient or iron-deficient conditions. We compared genes regulated in a FIT-dependent manner depending on the developmental stage of the plants. We assembled a high likelihood dataset which we used to perform co-expression and functional analysis of the most stably iron deficiency-induced genes. 448 genes were found FIT-regulated. Out of these, 34 genes were robustly FIT-regulated in root and seedling samples and included 13 novel FIT-dependent genes. Three hundred thirty-one genes showed differential regulation in response to the presence and absence of FIT only in the root samples, while this was the case for 83 genes in the seedling samples. We assembled a virtual dataset of iron-regulated genes based on a total of 14 transcriptomic analyses of iron-deficient and iron-sufficient wild-type plants to pinpoint the best marker genes for iron deficiency and analyzed this dataset in depth. Co-expression analysis of this dataset revealed 13 distinct regulons part of which predominantly contained functionally related genes. We could enlarge the list of FIT-dependent genes and discriminate between genes that are robustly FIT-regulated in roots and seedlings or only in one of those. FIT-regulated genes were mostly induced, few of them were repressed by FIT. With the analysis of a virtual dataset we could filter out and pinpoint new candidates among the most reliable marker genes for iron deficiency. Moreover, co-expression and functional analysis of this virtual dataset revealed iron deficiency-induced and functionally distinct regulons.

  10. Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

    Science.gov (United States)

    Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

    2016-05-01

    The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration.

  11. Norrie disease gene is distinct from the monoamine oxidase genes

    OpenAIRE

    Sims, Katherine B.; Ozelius, Laurie; Corey, Timothy; Rinehart, William B.; Liberfarb, Ruth; Haines, Jonathan; Chen, Wei Jane; Norio, Reijo; Sankila, Eeva; de la Chapelle, Albert; Murphy, Dennis L.; Gusella, James; Breakefield, Xandra O.

    1989-01-01

    The genes for MAO-A and MAO-B appear to be very close to the Norrie disease gene, on the basis of loss and /or disruption of the MAO genes and activities in atypical Norrie disease patients deleted for the DXS7 locus; linkage among the MAO genes, the Norrie disease gene, and the DXS7 locus; and mapping of all these loci to the chromosomal region Xp11. The present study provides evidence that the MAO genes are not disrupted in “classic” Norrie disease patients. Genomic DNA from these “nondelet...

  12. Evolutionary dynamics of human autoimmune disease genes and malfunctioned immunological genes

    Directory of Open Access Journals (Sweden)

    Podder Soumita

    2012-01-01

    Full Text Available Abstract Background One of the main issues of molecular evolution is to divulge the principles in dictating the evolutionary rate differences among various gene classes. Immunological genes have received considerable attention in evolutionary biology as candidates for local adaptation and for studying functionally important polymorphisms. The normal structure and function of immunological genes will be distorted when they experience mutations leading to immunological dysfunctions. Results Here, we examined the fundamental differences between the genes which on mutation give rise to autoimmune or other immune system related diseases and the immunological genes that do not cause any disease phenotypes. Although the disease genes examined are analogous to non-disease genes in product, expression, function, and pathway affiliation, a statistically significant decrease in evolutionary rate has been found in autoimmune disease genes relative to all other immune related diseases and non-disease genes. Possible ways of accumulation of mutation in the three steps of the central dogma (DNA-mRNA-Protein have been studied to trace the mutational effects predisposed to disease consequence and acquiring higher selection pressure. Principal Component Analysis and Multivariate Regression Analysis have established the predominant role of single nucleotide polymorphisms in guiding the evolutionary rate of immunological disease and non-disease genes followed by m-RNA abundance, paralogs number, fraction of phosphorylation residue, alternatively spliced exon, protein residue burial and protein disorder. Conclusions Our study provides an empirical insight into the etiology of autoimmune disease genes and other immunological diseases. The immediate utility of our study is to help in disease gene identification and may also help in medicinal improvement of immune related disease.

  13. Gene therapy for ocular diseases.

    Science.gov (United States)

    Liu, Melissa M; Tuo, Jingsheng; Chan, Chi-Chao

    2011-05-01

    The eye is an easily accessible, highly compartmentalised and immune-privileged organ that offers unique advantages as a gene therapy target. Significant advancements have been made in understanding the genetic pathogenesis of ocular diseases, and gene replacement and gene silencing have been implicated as potentially efficacious therapies. Recent improvements have been made in the safety and specificity of vector-based ocular gene transfer methods. Proof-of-concept for vector-based gene therapies has also been established in several experimental models of human ocular diseases. After nearly two decades of ocular gene therapy research, preliminary successes are now being reported in phase 1 clinical trials for the treatment of Leber congenital amaurosis. This review describes current developments and future prospects for ocular gene therapy. Novel methods are being developed to enhance the performance and regulation of recombinant adeno-associated virus- and lentivirus-mediated ocular gene transfer. Gene therapy prospects have advanced for a variety of retinal disorders, including retinitis pigmentosa, retinoschisis, Stargardt disease and age-related macular degeneration. Advances have also been made using experimental models for non-retinal diseases, such as uveitis and glaucoma. These methodological advancements are critical for the implementation of additional gene-based therapies for human ocular diseases in the near future.

  14. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene.

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  15. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting. PMID:29619024

  16. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Directory of Open Access Journals (Sweden)

    Monica Casucci

    2018-03-01

    Full Text Available Chimeric antigen receptor (CAR-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR. We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6. We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  17. Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes

    KAUST Repository

    AlShahrani, Mona; Hoehndorf, Robert

    2018-01-01

    In the past years, several methods have been developed to incorporate information about phenotypes into computational disease gene prioritization methods. These methods commonly compute the similarity between a disease's (or patient's) phenotypes and a database of gene-to-phenotype associations to find the phenotypically most similar match. A key limitation of these methods is their reliance on knowledge about phenotypes associated with particular genes which is highly incomplete in humans as well as in many model organisms such as the mouse. Results: We developed SmuDGE, a method that uses feature learning to generate vector-based representations of phenotypes associated with an entity. SmuDGE can be used as a trainable semantic similarity measure to compare two sets of phenotypes (such as between a disease and gene, or a disease and patient). More importantly, SmuDGE can generate phenotype representations for entities that are only indirectly associated with phenotypes through an interaction network; for this purpose, SmuDGE exploits background knowledge in interaction networks comprising of multiple types of interactions. We demonstrate that SmuDGE can match or outperform semantic similarity in phenotype-based disease gene prioritization, and furthermore significantly extends the coverage of phenotype-based methods to all genes in a connected interaction network.

  18. Microarray profiling and co-expression network analysis of circulating lncRNAs and mRNAs associated with major depressive disorder.

    Directory of Open Access Journals (Sweden)

    Zhifen Liu

    Full Text Available LncRNAs, which represent one of the most highly expressed classes of ncRNAs in the brain, are becoming increasingly interesting with regard to brain functions and disorders. However, changes in the expression of regulatory lncRNAs in Major Depressive Disorder (MDD have not yet been reported. Using microarrays, we profiled the expression of 34834 lncRNAs and 39224 mRNAs in peripheral blood sampled from MDD patients as well as demographically-matched controls. Among these, we found that 2007 lncRNAs and 1667 mRNAs were differentially expressed, 17 of which were documented as depression-related gene in previous studies. Gene Ontology (GO and pathway analyses indicated that the biological functions of differentially expressed mRNAs were related to fundamental metabolic processes and neurodevelopment diseases. To investigate the potential regulatory roles of the differentially expressed lncRNAs on the mRNAs, we also constructed co-expression networks composed of the lncRNAs and mRNAs, which shows significant correlated patterns of expression. In the MDD-derived network, there were a greater number of nodes and connections than that in the control-derived network. The lncRNAs located at chr10:874695-874794, chr10:75873456-75873642, and chr3:47048304-47048512 may be important factors regulating the expression of mRNAs as they have previously been reported associations with MDD. This study is the first to explore genome-wide lncRNA expression and co-expression with mRNA patterns in MDD using microarray technology. We identified circulating lncRNAs that are aberrantly expressed in MDD and the results suggest that lncRNAs may contribute to the molecular pathogenesis of MDD.

  19. Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

    Science.gov (United States)

    2013-01-01

    Background Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. It is warrant a method that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions In this paper, we develop nDGE to prioritize

  20. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations.

    Directory of Open Access Journals (Sweden)

    Sahra Uygun

    2016-12-01

    Full Text Available Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.

  1. A moth pheromone brewery: production of (Z)-11-hexadecenol by heterologous co-expression of two biosynthetic genes from a noctuid moth in a yeast cell factory.

    Science.gov (United States)

    Hagström, Åsa K; Wang, Hong-Lei; Liénard, Marjorie A; Lassance, Jean-Marc; Johansson, Tomas; Löfstedt, Christer

    2013-12-13

    Moths (Lepidoptera) are highly dependent on chemical communication to find a mate. Compared to conventional unselective insecticides, synthetic pheromones have successfully served to lure male moths as a specific and environmentally friendly way to control important pest species. However, the chemical synthesis and purification of the sex pheromone components in large amounts is a difficult and costly task. The repertoire of enzymes involved in moth pheromone biosynthesis in insecta can be seen as a library of specific catalysts that can be used to facilitate the synthesis of a particular chemical component. In this study, we present a novel approach to effectively aid in the preparation of semi-synthetic pheromone components using an engineered vector co-expressing two key biosynthetic enzymes in a simple yeast cell factory. We first identified and functionally characterized a ∆11 Fatty-Acyl Desaturase and a Fatty-Acyl Reductase from the Turnip moth, Agrotis segetum. The ∆11-desaturase produced predominantly Z11-16:acyl, a common pheromone component precursor, from the abundant yeast palmitic acid and the FAR transformed a series of saturated and unsaturated fatty acids into their corresponding alcohols which may serve as pheromone components in many moth species. Secondly, when we co-expressed the genes in the Brewer's yeast Saccharomyces cerevisiae, a set of long-chain fatty acids and alcohols that are not naturally occurring in yeast were produced from inherent yeast fatty acids, and the presence of (Z)-11-hexadecenol (Z11-16:OH), demonstrated that both heterologous enzymes were active in concert. A 100 ml batch yeast culture produced on average 19.5 μg Z11-16:OH. Finally, we demonstrated that oxidized extracts from the yeast cells containing (Z)-11-hexadecenal and other aldehyde pheromone compounds elicited specific electrophysiological activity from male antennae of the Tobacco budworm, Heliothis virescens, supporting the idea that genes from different

  2. Systems-level organization of non-alcoholic fatty liver disease progression network

    Directory of Open Access Journals (Sweden)

    K. Shubham

    2017-10-01

    Full Text Available Non-Alcoholic Fatty Liver Disease (NAFLD is a hepatic metabolic disorder that is commonly associated with sedentary lifestyle and high fat diets. NAFLD is prevalent in individuals with obesity, insulin resistance and Type 2 Diabetes (T2D. The clinical spectrum of NAFLD ranges from simple steatosis to Non-Alcoholic Steatohepatitis (NASH with fibrosis, which can progress to cirrhosis and hepatocellular carcinoma.The pathogenesis of NAFLD is complex, involving crosstalk between multiple organs, cell-types, and environmental and genetic factors. Dysfunction of White Adipose Tissue (WAT plays a central role in the development of NAFLD and other metabolic disorders. WAT is an active endocrine organ that regulates whole-body energy homeostasis, lipid metabolism, insulin sensitivity and food intake by secreting biologically active molecules (lipokines, adipokines and cytokines. WAT dynamically reacts to nutrient excess or deprivation by remodelling the number (called hyperplasia and/or size (called hypertrophy of adipocytes to store fat or supply nutrients to other tissues by lipolysis, respectively. Adipose tissue remodelling is also accompanied by changes in the composition or function of stromal vascular cells and ECM. The major objective of our study was to identify and characterize the metabolic and signaling modules associated with the progression of NAFLD in the VAT. We performed Weighted Gene Co-expression Network Analysis (WGCNA to organize microarray data obtained from the VAT of patients at different stages of NAFLD into functional modules. In order to obtain insights into the metabolism and its regulation at the genome scale, a co-expression network of metabolic genes in the Human Metabolic Network (HMR2 was constructed and compared with the co-expression network constructed based on all the varying genes. We also used the prior network information on adipocyte metabolism (GEM to verify and extract reporter metabolites. Our analysis revealed

  3. Norrie disease gene is distinct from the monoamine oxidase genes.

    Science.gov (United States)

    Sims, K B; Ozelius, L; Corey, T; Rinehart, W B; Liberfarb, R; Haines, J; Chen, W J; Norio, R; Sankila, E; de la Chapelle, A

    1989-09-01

    The genes for MAO-A and MAO-B appear to be very close to the Norrie disease gene, on the basis of loss and/or disruption of the MAO genes and activities in atypical Norrie disease patients deleted for the DXS7 locus; linkage among the MAO genes, the Norrie disease gene, and the DXS7 locus; and mapping of all these loci to the chromosomal region Xp11. The present study provides evidence that the MAO genes are not disrupted in "classic" Norrie disease patients. Genomic DNA from these "nondeletion" Norrie disease patients did not show rearrangements at the MAOA or DXS7 loci. Normal levels of MAO-A activities, as well as normal amounts and size of the MAO-A mRNA, were observed in cultured skin fibroblasts from these patients, and MAO-B activity in their platelets was normal. Catecholamine metabolites evaluated in plasma and urine were in the control range. Thus, although some atypical Norrie disease patients lack both MAO-A and MAO-B activities, MAO does not appear to be an etiologic factor in classic Norrie disease.

  4. In-Silico Integration Approach to Identify a Key miRNA Regulating a Gene Network in Aggressive Prostate Cancer

    Science.gov (United States)

    Colaprico, Antonio; Bontempi, Gianluca; Castiglioni, Isabella

    2018-01-01

    Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that also microRNAs have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. The identification of co-expressed genes, that are functionally related, can identify a core network of genes associated with PC with a better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC. PMID:29562723

  5. Co-Expression and Co-Localization of Cartilage Glycoproteins CHI3L1 and Lubricin in Osteoarthritic Cartilage: Morphological, Immunohistochemical and Gene Expression Profiles

    Directory of Open Access Journals (Sweden)

    Marta Anna Szychlinska

    2016-03-01

    Full Text Available Osteoarthritis is the most common human arthritis characterized by degeneration of articular cartilage. Several studies reported that levels of human cartilage glycoprotein chitinase 3-like-1 (CHI3L1 are known as a potential marker for the activation of chondrocytes and the progression of Osteoarthritis (OA, whereas lubricin appears to be chondroprotective. The aim of this study was to investigate the co-expression and co-localization of CHI3L1 and lubricin in normal and osteoarthritic rat articular cartilage to correlate their modified expression to a specific grade of OA. Samples of normal and osteoarthritic rat articular cartilage were analyzed by the Kellgren–Lawrence OA severity scores, the Kraus’ modified Mankin score and the Histopathology Osteoarthritis Research Society International (OARSI system for histomorphometric evaluations, and through CHI3L1 and lubricin gene expression, immunohistochemistry and double immuno-staining analysis. The immunoexpression and the mRNA levels of lubricin increased in normal cartilage and decreased in OA cartilage (normal vs. OA, p < 0.01. By contrast, the immunoexpression and the mRNA levels of CHI3L1 increased in OA cartilage and decreased in normal cartilage (normal vs. OA, p < 0.01. Our findings are consistent with reports suggesting that these two glycoproteins are functionally associated with the development of OA and in particular with grade 2/3 of OA, suggesting that in the future they could be helpful to stage the severity and progression of the disease.

  6. Profiling and Co-expression Network Analysis of Learned Helplessness Regulated mRNAs and lncRNAs in the Mouse Hippocampus

    Directory of Open Access Journals (Sweden)

    Chaoqun Li

    2018-01-01

    Full Text Available Although studies provide insights into the neurobiology of stress and depression, the exact molecular mechanisms underlying their pathologies remain largely unknown. Long non-coding RNA (lncRNA has been implicated in brain functions and behavior. A potential link between lncRNA and psychiatric disorders has been proposed. However, it remains undetermined whether IncRNA regulation, in the brain, contributes to stress or depression pathologies. In this study, we used a valid animal model of depression-like symptoms; namely learned helplessness, RNA-seq, Gene Ontology and co-expression network analyses to profile the expression pattern of lncRNA and mRNA in the hippocampus of mice. We identified 6346 differentially expressed transcripts. Among them, 340 lncRNAs and 3559 protein coding mRNAs were differentially expressed in helpless mice in comparison with control and/or non-helpless mice (inescapable stress resilient mice. Gene Ontology and pathway enrichment analyses indicated that induction of helplessness altered expression of mRNAs enriched in fundamental biological functions implicated in stress/depression neurobiology such as synaptic, metabolic, cell survival and proliferation, developmental and chromatin modification functions. To explore the possible regulatory roles of the altered lncRNAs, we constructed co-expression networks composed of the lncRNAs and mRNAs. Among our differentially expressed lncRNAs, 17% showed significant correlation with genes. Functional co-expression analysis linked the identified lncRNAs to several cellular mechanisms implicated in stress/depression neurobiology. Importantly, 57% of the identified regulatory lncRNAs significantly correlated with 18 different synapse-related functions. Thus, the current study identifies for the first time distinct groups of lncRNAs regulated by induction of learned helplessness in the mouse brain. Our results suggest that lncRNA-directed regulatory mechanisms might contribute to

  7. Profiling and Co-expression Network Analysis of Learned Helplessness Regulated mRNAs and lncRNAs in the Mouse Hippocampus.

    Science.gov (United States)

    Li, Chaoqun; Cao, Feifei; Li, Shengli; Huang, Shenglin; Li, Wei; Abumaria, Nashat

    2017-01-01

    Although studies provide insights into the neurobiology of stress and depression, the exact molecular mechanisms underlying their pathologies remain largely unknown. Long non-coding RNA (lncRNA) has been implicated in brain functions and behavior. A potential link between lncRNA and psychiatric disorders has been proposed. However, it remains undetermined whether IncRNA regulation, in the brain, contributes to stress or depression pathologies. In this study, we used a valid animal model of depression-like symptoms; namely learned helplessness, RNA-seq, Gene Ontology and co-expression network analyses to profile the expression pattern of lncRNA and mRNA in the hippocampus of mice. We identified 6346 differentially expressed transcripts. Among them, 340 lncRNAs and 3559 protein coding mRNAs were differentially expressed in helpless mice in comparison with control and/or non-helpless mice (inescapable stress resilient mice). Gene Ontology and pathway enrichment analyses indicated that induction of helplessness altered expression of mRNAs enriched in fundamental biological functions implicated in stress/depression neurobiology such as synaptic, metabolic, cell survival and proliferation, developmental and chromatin modification functions. To explore the possible regulatory roles of the altered lncRNAs, we constructed co-expression networks composed of the lncRNAs and mRNAs. Among our differentially expressed lncRNAs, 17% showed significant correlation with genes. Functional co-expression analysis linked the identified lncRNAs to several cellular mechanisms implicated in stress/depression neurobiology. Importantly, 57% of the identified regulatory lncRNAs significantly correlated with 18 different synapse-related functions. Thus, the current study identifies for the first time distinct groups of lncRNAs regulated by induction of learned helplessness in the mouse brain. Our results suggest that lncRNA-directed regulatory mechanisms might contribute to stress

  8. The Influence of the Coexpression of CD4 and CD8 in Cutaneous Lesions on Prognosis of Mycosis Fungoides: A Preliminary Study

    Directory of Open Access Journals (Sweden)

    Sergio Umberto De Marchi

    2014-01-01

    Full Text Available Background. Although techniques of immunophenotyping have been successful in characterizing the cells in the cutaneous infiltrates of mycosis fungoides little evidence suggests that variations in the phenotypic characterization correlate with prognosis. Objectives. In a preliminary prospective, single-centre, study we correlated the T-cell phenotype in cutaneous biopsies with the progression of the disease to determine whether the coexpression of CD4 and CD8 has an impact on prognosis. Methods. Skin biopsy specimens from 30 newly diagnosed patients were stained with immunoperoxidase techniques to determine their phenotypic characteristics. After a median followup of 42 months patients were divided into two groups with stable and progressive disease. Results. Eighteen patients had the conventional CD4+CD8− T-cell phenotype. Ten patients showed the coexpression of CD4 and CD8 and had a slightly lower rate of progressive disease. Conclusions. The coexpression of CD4 and CD8 in cutaneous lesions is not rare and is associated with a slightly lower rate of progressive disease. Since double positive CD4/CD8 phenotype is rarely reported in mycosis fungoides the presence on conventional immunophenotyping of both CD may be due to a “mixture” of neoplastic cells and inflammatory CD8+ tumor infiltrating lymphocytes. Immunohistochemical study combined with confocal microscopy could clarify this issue.

  9. Gene Therapy for Parkinson's Disease

    Directory of Open Access Journals (Sweden)

    Rachel Denyer

    2012-01-01

    Full Text Available Current pharmacological and surgical treatments for Parkinson's disease offer symptomatic improvements to those suffering from this incurable degenerative neurological disorder, but none of these has convincingly shown effects on disease progression. Novel approaches based on gene therapy have several potential advantages over conventional treatment modalities. These could be used to provide more consistent dopamine supplementation, potentially providing superior symptomatic relief with fewer side effects. More radically, gene therapy could be used to correct the imbalances in basal ganglia circuitry associated with the symptoms of Parkinson's disease, or to preserve or restore dopaminergic neurons lost during the disease process itself. The latter neuroprotective approach is the most exciting, as it could theoretically be disease modifying rather than simply symptom alleviating. Gene therapy agents using these approaches are currently making the transition from the laboratory to the bedside. This paper summarises the theoretical approaches to gene therapy for Parkinson's disease and the findings of clinical trials in this rapidly changing field.

  10. Correlation-maximizing surrogate gene space for visual mining of gene expression patterns in developing barley endosperm tissue

    Directory of Open Access Journals (Sweden)

    Usadel Björn

    2007-05-01

    Full Text Available Abstract Background Micro- and macroarray technologies help acquire thousands of gene expression patterns covering important biological processes during plant ontogeny. Particularly, faithful visualization methods are beneficial for revealing interesting gene expression patterns and functional relationships of coexpressed genes. Such screening helps to gain deeper insights into regulatory behavior and cellular responses, as will be discussed for expression data of developing barley endosperm tissue. For that purpose, high-throughput multidimensional scaling (HiT-MDS, a recent method for similarity-preserving data embedding, is substantially refined and used for (a assessing the quality and reliability of centroid gene expression patterns, and for (b derivation of functional relationships of coexpressed genes of endosperm tissue during barley grain development (0–26 days after flowering. Results Temporal expression profiles of 4824 genes at 14 time points are faithfully embedded into two-dimensional displays. Thereby, similar shapes of coexpressed genes get closely grouped by a correlation-based similarity measure. As a main result, by using power transformation of correlation terms, a characteristic cloud of points with bipolar sandglass shape is obtained that is inherently connected to expression patterns of pre-storage, intermediate and storage phase of endosperm development. Conclusion The new HiT-MDS-2 method helps to create global views of expression patterns and to validate centroids obtained from clustering programs. Furthermore, functional gene annotation for developing endosperm barley tissue is successfully mapped to the visualization, making easy localization of major centroids of enriched functional categories possible.

  11. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    Science.gov (United States)

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Heterologous co-expression of accA, fabD, and thioesterase genes for improving long-chain fatty acid production in Pseudomonas aeruginosa and Escherichia coli.

    Science.gov (United States)

    Lee, Sunhee; Jeon, Eunyoung; Jung, Yeontae; Lee, Jinwon

    2012-05-01

    The goal of the present study was to increase the content of intracellular long-chain fatty acids in two bacterial strains, Pseudomonas aeruginosa PA14 and Escherichia coli K-12 MG1655, by co-overexpressing essential enzymes that are involved in the fatty acid synthesis metabolic pathway. Recently, microbial fatty acids and their derivatives have been receiving increasing attention as an alternative source of fuel. By introducing two genes (accA and fabD) of P. aeruginosa into the two bacterial strains and by co-expressing with them the fatty acyl-acyl carrier protein thioesterase gene of Streptococcus pyogenes (strain MGAS10270), we have engineered recombinant strains that are efficient producers of long-chain fatty acids (C16 and C18). The recombinant strains exhibit a 1.3-1.7-fold increase in the production of long-chain fatty acids over the wild-type strains. To enhance the production of total long-chain fatty acids, we researched the carbon sources for optimized culture conditions and results were used for post-culture incubation period. E. coli SGJS17 (containing the accA, fabD, and thioesterase genes) produced the highest content of intracellular total fatty acids; in particular, the unsaturated fatty acid content was about 20-fold higher than that in the wild-type E. coli.

  13. Inequalities and Duality in Gene Coexpression Networks of HIV-1 Infection Revealed by the Combination of the Double-Connectivity Approach and the Gini's Method

    Directory of Open Access Journals (Sweden)

    Chuang Ma

    2011-01-01

    Full Text Available The symbiosis (Sym and pathogenesis (Pat is a duality problem of microbial infection, including HIV/AIDS. Statistical analysis of inequalities and duality in gene coexpression networks (GCNs of HIV-1 infection may gain novel insights into AIDS. In this study, we focused on analysis of GCNs of uninfected subjects and HIV-1-infected patients at three different stages of viral infection based on data deposited in the GEO database of NCBI. The inequalities and duality in these GCNs were analyzed by the combination of the double-connectivity (DC approach and the Gini's method. DC analysis reveals that there are significant differences between positive and negative connectivity in HIV-1 stage-specific GCNs. The inequality measures of negative connectivity and edge weight are changed more significantly than those of positive connectivity and edge weight in GCNs from the HIV-1 uninfected to the AIDS stages. With the permutation test method, we identified a set of genes with significant changes in the inequality and duality measure of edge weight. Functional analysis shows that these genes are highly enriched for the immune system, which plays an essential role in the Sym-Pat duality (SPD of microbial infections. Understanding of the SPD problems of HIV-1 infection may provide novel intervention strategies for AIDS.

  14. Comprehensive analysis of differential co-expression patterns reveal transcriptional dysregulation mechanism and identify novel prognostic lncRNAs in esophageal squamous cell carcinoma

    Directory of Open Access Journals (Sweden)

    Li Z

    2017-06-01

    Full Text Available Zhen Li,1 Qianlan Yao,1 Songjian Zhao,1 Yin Wang,2,3 Yixue Li,1,4 Zhen Wang4 1School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 2Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 3Collaborative Innovation Center for Genetics and Development, Fudan University, 4Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China Abstract: Esophageal squamous cell carcinoma (ESCC is one of the most common malignancies worldwide and occurs at a relatively high frequency in People’s Republic of China. However, the molecular mechanism underlying ESCC is still unclear. In this study, the mRNA and long non-coding RNA (lncRNA expression profiles of ESCC were downloaded from the Gene Expression Omnibus database, and then differential co-expression analysis was used to reveal the altered co-expression relationship of gene pairs in ESCC tumors. A total of 3,709 mRNAs and 923 lncRNAs were differentially co-expressed between normal and tumor tissues, and we found that most of the gene pairs lost associations in the tumor tissues. The differential regulatory networking approach deciphered that transcriptional dysregulation was ubiquitous in ESCC, and most of the differentially regulated links were modulated by 37 TFs. Our study also found that two novel lncRNAs (ADAMTS9-AS1 and AP000696.2 might be essential in the development of ectoderm and epithelial cells, which could significantly stratify ESCC patients into high-risk and low-risk groups, and were much better than traditional clinical tumor markers. Further inspection of two risk groups showed that the changes in TF-target regulation in the high-risk patients were significantly higher than those in the low-risk patients. In addition, four signal transduction-related DCmRNAs (ERBB3, ENSA, KCNK7, MFSD5

  15. Gene therapy for CNS diseases – Krabbe disease

    Directory of Open Access Journals (Sweden)

    Mohammad A. Rafi

    2016-06-01

    Full Text Available This is a brief report of the 19th Annual Meeting of the American Society of Gene and Cell Therapy that took place from May 4th through May 7th, 2016 in Washington, DC, USA. While the meeting provided many symposiums, lectures, and scientific sessions this report mainly focuses on one of the sessions on the "Gene Therapy for central nervous system (CNS Diseases" and specifically on the "Gene Therapy for the globoid cell leukodystrophy or Krabbe disease. Two presentations focused on this subject utilizing two animal models of this disease: mice and dog models. Different serotypes of adeno-associate viral vectors (AAV alone or in combination with bone marrow transplantations were used in these research projects. The Meeting of the ASGCT reflected continuous growth in the fields of gene and cell therapy and brighter forecast for efficient treatment options for variety of human diseases.

  16. A recombinant pseudorabies virus co-expressing capsid proteins precursor P1-2A of FMDV and VP2 protein of porcine parvovirus: a trivalent vaccine candidate.

    Science.gov (United States)

    Hong, Qi; Qian, Ping; Li, Xiang-Min; Yu, Xiao-Lan; Chen, Huan-Chun

    2007-11-01

    Pseudorabies (PR), foot-and-mouth disease (FMD), and porcine parvovirus disease are three important infectious diseases in swine worldwide. The gene-deleted pseudorabies virus (PRV) has been used as a live-viral vector to develop multivalent genetic engineering vaccine. In this study, a recombinant PRV, which could co-express protein precursor P1-2A of FMDV and VP2 protein of PPV, was constructed using PRV TK(-)/gE(-)/LacZ(+) mutant as the vector. After homologous recombination and plaque purification, recombinant virus PRV TK(-)/gE(-)/P1-2A-VP2 was acquired and identified. Immunogenicity, safety of the recombinant PRV and its protection against PRV were confirmed in a mouse model by indirect ELISA and serum neutralization test. The results show that the recombinant PRV is a candidate vaccine strain to develop a novel trivalent vaccine against PRV, FMDV and PPV in swine.

  17. Gene transfer therapy in vascular diseases.

    Science.gov (United States)

    McKay, M J; Gaballa, M A

    2001-01-01

    Somatic gene therapy of vascular diseases is a promising new field in modern medicine. Recent advancements in gene transfer technology have greatly evolved our understanding of the pathophysiologic role of candidate disease genes. With this knowledge, the expression of selective gene products provides the means to test the therapeutic use of gene therapy in a multitude of medical conditions. In addition, with the completion of genome sequencing programs, gene transfer can be used also to study the biologic function of novel genes in vivo. Novel genes are delivered to targeted tissue via several different vehicles. These vectors include adenoviruses, retroviruses, plasmids, plasmid/liposomes, and oligonucleotides. However, each one of these vectors has inherent limitations. Further investigations into developing delivery systems that not only allow for efficient, targeted gene transfer, but also are stable and nonimmunogenic, will optimize the clinical application of gene therapy in vascular diseases. This review further discusses the available mode of gene delivery and examines six major areas in vascular gene therapy, namely prevention of restenosis, thrombosis, hypertension, atherosclerosis, peripheral vascular disease in congestive heart failure, and ischemia. Although we highlight some of the recent advances in the use of gene therapy in treating vascular disease discovered primarily during the past two years, many excellent studies published during that period are not included in this review due to space limitations. The following is a selective review of practical uses of gene transfer therapy in vascular diseases. This review primarily covers work performed in the last 2 years. For earlier work, the reader may refer to several excellent review articles. For instance, Belalcazer et al. (6) reviewed general aspects of somatic gene therapy and the different vehicles used for the delivery of therapeutic genes. Gene therapy in restenosis and stimulation of

  18. Identification of estrogen receptor dimer selective ligands reveals growth-inhibitory effects on cells that co-express ERα and ERβ.

    Directory of Open Access Journals (Sweden)

    Emily Powell

    Full Text Available Estrogens play essential roles in the progression of mammary and prostatic diseases. The transcriptional effects of estrogens are transduced by two estrogen receptors, ERα and ERβ, which elicit opposing roles in regulating proliferation: ERα is proliferative while ERβ is anti-proliferative. Exogenous expression of ERβ in ERα-positive cancer cell lines inhibits cell proliferation in response to estrogen and reduces xenografted tumor growth in vivo, suggesting that ERβ might oppose ERα's proliferative effects via formation of ERα/β heterodimers. Despite biochemical and cellular evidence of ERα/β heterodimer formation in cells co-expressing both receptors, the biological roles of the ERα/β heterodimer remain to be elucidated. Here we report the identification of two phytoestrogens that selectively activate ERα/β heterodimers at specific concentrations using a cell-based, two-step high throughput small molecule screen for ER transcriptional activity and ER dimer selectivity. Using ERα/β heterodimer-selective ligands at defined concentrations, we demonstrate that ERα/β heterodimers are growth inhibitory in breast and prostate cells which co-express the two ER isoforms. Furthermore, using Automated Quantitative Analysis (AQUA to examine nuclear expression of ERα and ERβ in human breast tissue microarrays, we demonstrate that ERα and ERβ are co-expressed in the same cells in breast tumors. The co-expression of ERα and ERβ in the same cells supports the possibility of ERα/β heterodimer formation at physio- and pathological conditions, further suggesting that targeting ERα/β heterodimers might be a novel therapeutic approach to the treatment of cancers which co-express ERα and ERβ.

  19. Constructing an integrated gene similarity network for the identification of disease genes.

    Science.gov (United States)

    Tian, Zhen; Guo, Maozu; Wang, Chunyu; Xing, LinLin; Wang, Lei; Zhang, Yin

    2017-09-20

    Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .

  20. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

    Science.gov (United States)

    Uddin, Raihan; Singh, Shiva M

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they

  1. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    Science.gov (United States)

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  2. Coexpression and Secretion of Endoglucanase and Phytase Genes in Lactobacillus reuteri

    Science.gov (United States)

    Wang, Lei; Yang, Yuxin; Cai, Bei; Cao, Pinghua; Yang, Mingming; Chen, Yulin

    2014-01-01

    A multifunctional transgenic Lactobacillus with probiotic characteristics and an ability to degrade β-glucan and phytic acid (phytate) was engineered to improve nutrient utilization, increase production performance and decrease digestive diseases in broiler chickens. The Bacillus subtilis WL001 endoglucanase gene (celW) and Aspergillus fumigatus WL002 phytase gene (phyW) mature peptide (phyWM) were cloned into an expression vector with the lactate dehydrogenase promoter of Lactobacillus casei and the secretion signal peptide of the Lactococcus lactis usp45 gene. This construct was then transformed into Lactobacillus reuteri XC1 that had been isolated from the gastrointestinal tract of broilers. Heterologous enzyme production and feed effectiveness of this genetically modified L. reuteri strain were investigated and evaluated. Sodium dodecyl sulfate polyacrylamide gel electrophoresis analysis showed that the molecular mass of phyWM and celW was approximately 48.2 and 55 kDa, respectively, consistent with their predicted molecular weights. Endoglucanase and phytase activities in the extracellular fraction of the transformed L. reuteri culture were 0.68 and 0.42 U/mL, respectively. Transformed L. reuteri improved the feed conversion ratio of broilers from 21 to 42 days of age and over the whole feeding period. However, there was no effect on body weight gain and feed intake of chicks. Transformed L. reuteri supplementation improved levels of ash, calcium and phosphorus in tibiae at day 21 and of phosphorus at day 42. In addition, populations of Escherichia coli, Veillonella spp. and Bacteroides vulgatus were decreased, while populations of Bifidobacterium genus and Lactobacillus spp. were increased in the cecum at day 21. PMID:25050780

  3. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  4. Republished review: Gene therapy for ocular diseases.

    Science.gov (United States)

    Liu, Melissa M; Tuo, Jingsheng; Chan, Chi-Chao

    2011-07-01

    The eye is an easily accessible, highly compartmentalised and immune-privileged organ that offers unique advantages as a gene therapy target. Significant advancements have been made in understanding the genetic pathogenesis of ocular diseases, and gene replacement and gene silencing have been implicated as potentially efficacious therapies. Recent improvements have been made in the safety and specificity of vector-based ocular gene transfer methods. Proof-of-concept for vector-based gene therapies has also been established in several experimental models of human ocular diseases. After nearly two decades of ocular gene therapy research, preliminary successes are now being reported in phase 1 clinical trials for the treatment of Leber congenital amaurosis. This review describes current developments and future prospects for ocular gene therapy. Novel methods are being developed to enhance the performance and regulation of recombinant adeno-associated virus- and lentivirus-mediated ocular gene transfer. Gene therapy prospects have advanced for a variety of retinal disorders, including retinitis pigmentosa, retinoschisis, Stargardt disease and age-related macular degeneration. Advances have also been made using experimental models for non-retinal diseases, such as uveitis and glaucoma. These methodological advancements are critical for the implementation of additional gene-based therapies for human ocular diseases in the near future.

  5. Evolutionary signatures amongst disease genes permit novel methods for gene prioritization and construction of informative gene-based networks.

    Directory of Open Access Journals (Sweden)

    Nolan Priedigkeit

    2015-02-01

    Full Text Available Genes involved in the same function tend to have similar evolutionary histories, in that their rates of evolution covary over time. This coevolutionary signature, termed Evolutionary Rate Covariation (ERC, is calculated using only gene sequences from a set of closely related species and has demonstrated potential as a computational tool for inferring functional relationships between genes. To further define applications of ERC, we first established that roughly 55% of genetic diseases posses an ERC signature between their contributing genes. At a false discovery rate of 5% we report 40 such diseases including cancers, developmental disorders and mitochondrial diseases. Given these coevolutionary signatures between disease genes, we then assessed ERC's ability to prioritize known disease genes out of a list of unrelated candidates. We found that in the presence of an ERC signature, the true disease gene is effectively prioritized to the top 6% of candidates on average. We then apply this strategy to a melanoma-associated region on chromosome 1 and identify MCL1 as a potential causative gene. Furthermore, to gain global insight into disease mechanisms, we used ERC to predict molecular connections between 310 nominally distinct diseases. The resulting "disease map" network associates several diseases with related pathogenic mechanisms and unveils many novel relationships between clinically distinct diseases, such as between Hirschsprung's disease and melanoma. Taken together, these results demonstrate the utility of molecular evolution as a gene discovery platform and show that evolutionary signatures can be used to build informative gene-based networks.

  6. Expression of the Bs2 pepper gene confers resistance to bacterial spot disease in tomato.

    Science.gov (United States)

    Tai, T H; Dahlbeck, D; Clark, E T; Gajiwala, P; Pasion, R; Whalen, M C; Stall, R E; Staskawicz, B J

    1999-11-23

    The Bs2 resistance gene of pepper specifically recognizes and confers resistance to strains of Xanthomonas campestris pv. vesicatoria that contain the corresponding bacterial avirulence gene, avrBs2. The involvement of avrBs2 in pathogen fitness and its prevalence in many X. campestris pathovars suggests that the Bs2 gene may be durable in the field and provide resistance when introduced into other plant species. Employing a positional cloning strategy, the Bs2 locus was isolated and the gene was identified by coexpression with avrBs2 in an Agrobacterium-mediated transient assay. A single candidate gene, predicted to encode motifs characteristic of the nucleotide binding site-leucine-rich repeat class of resistance genes, was identified. This gene specifically controlled the hypersensitive response when transiently expressed in susceptible pepper and tomato lines and in a nonhost species, Nicotiana benthamiana, and was designated as Bs2. Functional expression of Bs2 in stable transgenic tomatoes supports its use as a source of resistance in other Solanaceous plant species.

  7. The order of expression is a key factor in the production of active transglutaminase in Escherichia coli by co-expression with its pro-peptide

    Directory of Open Access Journals (Sweden)

    Liu Song

    2011-12-01

    Full Text Available Abstract Background Streptomyces transglutaminase (TGase is naturally synthesized as zymogen (pro-TGase, which is then processed to produce active enzyme by the removal of its N-terminal pro-peptide. This pro-peptide is found to be essential for overexpression of soluble TGase in E. coli. However, expression of pro-TGase by E. coli requires protease-mediated activation in vitro. In this study, we developed a novel co- expression method for the direct production of active TGase in E. coli. Results A TGase from S. hygroscopicus was expressed in E. coli only after fusing with the pelB signal peptide, but fusion with the signal peptide induced insoluble enzyme. Therefore, alternative protocol was designed by co-expressing the TGase and its pro-peptide as independent polypeptides under a single T7 promoter using vector pET-22b(+. Although the pro-peptide was co-expressed, the TGase fused without the signal peptide was undetectable in both soluble and insoluble fractions of the recombinant cells. Similarly, when both genes were expressed in the order of the TGase and the pro-peptide, the solubility of TGase fused with the signal peptide was not improved by the co-expression with its pro-peptide. Interestingly, active TGase was only produced by the cells in which the pro-peptide and the TGase were fused with the signal peptide and sequentially expressed. The purified recombinant and native TGase shared the similar catalytic properties. Conclusions Our results indicated that the pro-peptide can assist correct folding of the TGase inter-molecularly in E. coli, and expression of pro-peptide prior to that of TGase was essential for the production of active TGase. The co-expression strategy based on optimizing the order of gene expression could be useful for the expression of other functional proteins that are synthesized as a precursor.

  8. The Bioinformatic Analysis of the Dysregulated Genes and MicroRNAs in Entorhinal Cortex, Hippocampus, and Blood for Alzheimer’s Disease

    Directory of Open Access Journals (Sweden)

    Xiaocong Pang

    2017-01-01

    Full Text Available Aim. The incidence of Alzheimer’s disease (AD has been increasing in recent years, but there exists no cure and the pathological mechanisms are not fully understood. This study aimed to find out the pathogenesis of learning and memory impairment, new biomarkers, potential therapeutic targets, and drugs for AD. Methods. We downloaded the microarray data of entorhinal cortex (EC and hippocampus (HIP of AD and controls from Gene Expression Omnibus (GEO database, and then the differentially expressed genes (DEGs in EC and HIP regions were analyzed for functional and pathway enrichment. Furthermore, we utilized the DEGs to construct coexpression networks to identify hub genes and discover the small molecules which were capable of reversing the gene expression profile of AD. Finally, we also analyzed microarray and RNA-seq dataset of blood samples to find the biomarkers related to gene expression in brain. Results. We found some functional hub genes, such as ErbB2, ErbB4, OCT3, MIF, CDK13, and GPI. According to GO and KEGG pathway enrichment, several pathways were significantly dysregulated in EC and HIP. CTSD and VCAM1 were dysregulated significantly in blood, EC, and HIP, which were potential biomarkers for AD. Target genes of four microRNAs had similar GO_terms distribution with DEGs in EC and HIP. In addtion, small molecules were screened out for AD treatment. Conclusion. These biological pathways and DEGs or hub genes will be useful to elucidate AD pathogenesis and identify novel biomarkers or drug targets for developing improved diagnostics and therapeutics against AD.

  9. Inductive matrix completion for predicting gene-disease associations.

    Science.gov (United States)

    Natarajan, Nagarajan; Dhillon, Inderjit S

    2014-06-15

    Most existing methods for predicting causal disease genes rely on specific type of evidence, and are therefore limited in terms of applicability. More often than not, the type of evidence available for diseases varies-for example, we may know linked genes, keywords associated with the disease obtained by mining text, or co-occurrence of disease symptoms in patients. Similarly, the type of evidence available for genes varies-for example, specific microarray probes convey information only for certain sets of genes. In this article, we apply a novel matrix-completion method called Inductive Matrix Completion to the problem of predicting gene-disease associations; it combines multiple types of evidence (features) for diseases and genes to learn latent factors that explain the observed gene-disease associations. We construct features from different biological sources such as microarray expression data and disease-related textual data. A crucial advantage of the method is that it is inductive; it can be applied to diseases not seen at training time, unlike traditional matrix-completion approaches and network-based inference methods that are transductive. Comparison with state-of-the-art methods on diseases from the Online Mendelian Inheritance in Man (OMIM) database shows that the proposed approach is substantially better-it has close to one-in-four chance of recovering a true association in the top 100 predictions, compared to the recently proposed Catapult method (second best) that has bigdata.ices.utexas.edu/project/gene-disease. © The Author 2014. Published by Oxford University Press.

  10. GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response.

    Science.gov (United States)

    Zaag, Rim; Tamby, Jean Philippe; Guichard, Cécile; Tariq, Zakia; Rigaill, Guillem; Delannoy, Etienne; Renou, Jean-Pierre; Balzergue, Sandrine; Mary-Huard, Tristan; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique

    2015-01-01

    CATdb (http://urgv.evry.inra.fr/CATdb) is a database providing a public access to a large collection of transcriptomic data, mainly for Arabidopsis but also for other plants. This resource has the rare advantage to contain several thousands of microarray experiments obtained with the same technical protocol and analyzed by the same statistical pipelines. In this paper, we present GEM2Net, a new module of CATdb that takes advantage of this homogeneous dataset to mine co-expression units and decipher Arabidopsis gene functions. GEM2Net explores 387 stress conditions organized into 18 biotic and abiotic stress categories. For each one, a model-based clustering is applied on expression differences to identify clusters of co-expressed genes. To characterize functions associated with these clusters, various resources are analyzed and integrated: Gene Ontology, subcellular localization of proteins, Hormone Families, Transcription Factor Families and a refined stress-related gene list associated to publications. Exploiting protein-protein interactions and transcription factors-targets interactions enables to display gene networks. GEM2Net presents the analysis of the 18 stress categories, in which 17,264 genes are involved and organized within 681 co-expression clusters. The meta-data analyses were stored and organized to compose a dynamic Web resource. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Co-expression of the transcription factors CEH-14 and TTX-1 regulates AFD neuron-specific genes gcy-8 and gcy-18 in C. elegans.

    Science.gov (United States)

    Kagoshima, Hiroshi; Kohara, Yuji

    2015-03-15

    A wide variety of cells are generated by the expression of characteristic sets of genes, primarily those regulated by cell-specific transcription. To elucidate the mechanism regulating cell-specific gene expression in a highly specialized cell, AFD thermosensory neuron in Caenorhabditis elegans, we analyzed the promoter sequences of guanylyl cyclase genes, gcy-8 and gcy-18, exclusively expressed in AFD. In this study, we showed that AFD-specific expression of gcy-8 and gcy-18 requires the co-expression of homeodomain proteins, CEH-14/LHX3 and TTX-1/OTX1. We observed that mutation of ttx-1 or ceh-14 caused a reduction in the expression of gcy-8 and gcy-18 and that the expression was completely lost in double mutants. This synergy effect was also observed with other AFD marker genes, such as ntc-1, nlp-21and cng-3. Electrophoretic mobility shift assays revealed direct interaction of CEH-14 and TTX-1 proteins with gcy-8 and gcy-18 promoters in vitro. The binding sites of CEH-14 and TTX-1 proteins were confirmed to be essential for AFD-specific expression of gcy-8 and gcy-18 in vivo. We also demonstrated that forced expression of CEH-14 and TTX-1 in AWB chemosensory neurons induced ectopic expression of gcy-8 and gcy-18 reporters in this neuron. Finally, we showed that the regulation of gcy-8 and gcy-18 expression by ceh-14 and ttx-1 is evolutionally conserved in five Caenorhabditis species. Taken together, ceh-14 and ttx-1 expression determines the fate of AFD as terminal selector genes at the final step of cell specification. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. Gene expression profiling in autoimmune diseases

    DEFF Research Database (Denmark)

    Bovin, Lone Frier; Brynskov, Jørn; Hegedüs, Laszlo

    2007-01-01

    A central issue in autoimmune disease is whether the underlying inflammation is a repeated stereotypical process or whether disease specific gene expression is involved. To shed light on this, we analysed whether genes previously found to be differentially regulated in rheumatoid arthritis (RA...

  13. Genes and Disease: Prader-Willi Syndrome

    Science.gov (United States)

    ... MD): National Center for Biotechnology Information (US); 1998-. Genes and Disease [Internet]. Show details National Center for ... 45K) PDF version of this title (3.8M) Gene sequence Genome view see gene locations Entrez Gene ...

  14. Coexpression of multidrug resistance involve proteins: a flow cytometric analysis.

    Science.gov (United States)

    Boutonnat, J; Bonnefoix, T; Mousseau, M; Seigneurin, D; Ronot, X

    1998-01-01

    Cross resistance to multiple natural cytotoxic products represents a major obstacle in myeloblastic acute leukaemia (AML). Multidrug resistance (MDR) often involves overexpression of plasma membrane drug transporter P-glycoprotein (PGP) or the resistance associated protein (MRP). Recently, a protein overexpressed in a non-PGP MDR lung cancer cell line and termed lung resistance related protein (LRP) was identified. These proteins are known to be associated with a bad prognosis in AML. We have developed a triple indirect labelling analysed by flow cytometry to detect the coexpression of these proteins. Since no cell line expressing all three antigens is known, we mixed K562 cells (resistant to Adriblastine, PGP+, MRP-, LRP-) with GLC4 cells (resistant to Adriblastine, PGP-, MRP+, LRP+) to create a model system to test the method. The antibodies used were UIC2 for PGP, MRPm6 for MRP and LRP56 for LRP. They were revealed by Fab'2 coupled with Fluoresceine-isothiocyanate, Phycoerythrin or Tricolor with isotype specificity. Cells were fixed and permeabilized after PGP labelling because MRPm6 and LRP56 recognize intracellular epitopes. PGP and LRP were easily detected. MRP is expressed at relatively low levels and was more difficult to detect because in the triple labelling the non specific staining was higher than in a single labelling. Despite the increased background in the triple labelling we were able to detect coexpression of PGP, MRP, LRP by flow cytometry. This method appears to be very useful to detect coexpression of markers in AML. Such coexpression could modify the therapeutic approach with revertants.

  15. Analysis of the robustness of network-based disease-gene prioritization methods reveals redundancy in the human interactome and functional diversity of disease-genes.

    Directory of Open Access Journals (Sweden)

    Emre Guney

    Full Text Available Complex biological systems usually pose a trade-off between robustness and fragility where a small number of perturbations can substantially disrupt the system. Although biological systems are robust against changes in many external and internal conditions, even a single mutation can perturb the system substantially, giving rise to a pathophenotype. Recent advances in identifying and analyzing the sequential variations beneath human disorders help to comprehend a systemic view of the mechanisms underlying various disease phenotypes. Network-based disease-gene prioritization methods rank the relevance of genes in a disease under the hypothesis that genes whose proteins interact with each other tend to exhibit similar phenotypes. In this study, we have tested the robustness of several network-based disease-gene prioritization methods with respect to the perturbations of the system using various disease phenotypes from the Online Mendelian Inheritance in Man database. These perturbations have been introduced either in the protein-protein interaction network or in the set of known disease-gene associations. As the network-based disease-gene prioritization methods are based on the connectivity between known disease-gene associations, we have further used these methods to categorize the pathophenotypes with respect to the recoverability of hidden disease-genes. Our results have suggested that, in general, disease-genes are connected through multiple paths in the human interactome. Moreover, even when these paths are disturbed, network-based prioritization can reveal hidden disease-gene associations in some pathophenotypes such as breast cancer, cardiomyopathy, diabetes, leukemia, parkinson disease and obesity to a greater extend compared to the rest of the pathophenotypes tested in this study. Gene Ontology (GO analysis highlighted the role of functional diversity for such diseases.

  16. Ethylene-Related Gene Expression Networks in Wood Formation

    Directory of Open Access Journals (Sweden)

    Carolin Seyfferth

    2018-03-01

    Full Text Available Thickening of tree stems is the result of secondary growth, accomplished by the meristematic activity of the vascular cambium. Secondary growth of the stem entails developmental cascades resulting in the formation of secondary phloem outwards and secondary xylem (i.e., wood inwards of the stem. Signaling and transcriptional reprogramming by the phytohormone ethylene modifies cambial growth and cell differentiation, but the molecular link between ethylene and secondary growth remains unknown. We addressed this shortcoming by analyzing expression profiles and co-expression networks of ethylene pathway genes using the AspWood transcriptome database which covers all stages of secondary growth in aspen (Populus tremula stems. ACC synthase expression suggests that the ethylene precursor 1-aminocyclopropane-1-carboxylic acid (ACC is synthesized during xylem expansion and xylem cell maturation. Ethylene-mediated transcriptional reprogramming occurs during all stages of secondary growth, as deduced from AspWood expression profiles of ethylene-responsive genes. A network centrality analysis of the AspWood dataset identified EIN3D and 11 ERFs as hubs. No overlap was found between the co-expressed genes of the EIN3 and ERF hubs, suggesting target diversification and hence independent roles for these transcription factor families during normal wood formation. The EIN3D hub was part of a large co-expression gene module, which contained 16 transcription factors, among them several new candidates that have not been earlier connected to wood formation and a VND-INTERACTING 2 (VNI2 homolog. We experimentally demonstrated Populus EIN3D function in ethylene signaling in Arabidopsis thaliana. The ERF hubs ERF118 and ERF119 were connected on the basis of their expression pattern and gene co-expression module composition to xylem cell expansion and secondary cell wall formation, respectively. We hereby establish data resources for ethylene-responsive genes and

  17. Identification of Key Pathways and Genes in the Dynamic Progression of HCC Based on WGCNA.

    Science.gov (United States)

    Yin, Li; Cai, Zhihui; Zhu, Baoan; Xu, Cunshuan

    2018-02-14

    Hepatocellular carcinoma (HCC) is a devastating disease worldwide. Though many efforts have been made to elucidate the process of HCC, its molecular mechanisms of development remain elusive due to its complexity. To explore the stepwise carcinogenic process from pre-neoplastic lesions to the end stage of HCC, we employed weighted gene co-expression network analysis (WGCNA) which has been proved to be an effective method in many diseases to detect co-expressed modules and hub genes using eight pathological stages including normal, cirrhosis without HCC, cirrhosis, low-grade dysplastic, high-grade dysplastic, very early and early, advanced HCC and very advanced HCC. Among the eight consecutive pathological stages, five representative modules are selected to perform canonical pathway enrichment and upstream regulator analysis by using ingenuity pathway analysis (IPA) software. We found that cell cycle related biological processes were activated at four neoplastic stages, and the degree of activation of the cell cycle corresponded to the deterioration degree of HCC. The orange and yellow modules enriched in energy metabolism, especially oxidative metabolism, and the expression value of the genes decreased only at four neoplastic stages. The brown module, enriched in protein ubiquitination and ephrin receptor signaling pathways, correlated mainly with the very early stage of HCC. The darkred module, enriched in hepatic fibrosis/hepatic stellate cell activation, correlated with the cirrhotic stage only. The high degree hub genes were identified based on the protein-protein interaction (PPI) network and were verified by Kaplan-Meier survival analysis. The novel five high degree hub genes signature that was identified in our study may shed light on future prognostic and therapeutic approaches. Our study brings a new perspective to the understanding of the key pathways and genes in the dynamic changes of HCC progression. These findings shed light on further investigations.

  18. Tumor SHB gene expression affects disease characteristics in human acute myeloid leukemia.

    Science.gov (United States)

    Jamalpour, Maria; Li, Xiujuan; Cavelier, Lucia; Gustafsson, Karin; Mostoslavsky, Gustavo; Höglund, Martin; Welsh, Michael

    2017-10-01

    The mouse Shb gene coding for the Src Homology 2-domain containing adapter protein B has recently been placed in context of BCRABL1-induced myeloid leukemia in mice and the current study was performed in order to relate SHB to human acute myeloid leukemia (AML). Publicly available AML databases were mined for SHB gene expression and patient survival. SHB gene expression was determined in the Uppsala cohort of AML patients by qPCR. Cell proliferation was determined after SHB gene knockdown in leukemic cell lines. Despite a low frequency of SHB gene mutations, many tumors overexpressed SHB mRNA compared with normal myeloid blood cells. AML patients with tumors expressing low SHB mRNA displayed longer survival times. A subgroup of AML exhibiting a favorable prognosis, acute promyelocytic leukemia (APL) with a PMLRARA translocation, expressed less SHB mRNA than AML tumors in general. When examining genes co-expressed with SHB in AML tumors, four other genes ( PAX5, HDAC7, BCORL1, TET1) related to leukemia were identified. A network consisting of these genes plus SHB was identified that relates to certain phenotypic characteristics, such as immune cell, vascular and apoptotic features. SHB knockdown in the APL PMLRARA cell line NB4 and the monocyte/macrophage cell line MM6 adversely affected proliferation, linking SHB gene expression to tumor cell expansion and consequently to patient survival. It is concluded that tumor SHB gene expression relates to AML survival and its subgroup APL. Moreover, this gene is included in a network of genes that plays a role for an AML phenotype exhibiting certain immune cell, vascular and apoptotic characteristics.

  19. Neural Inductive Matrix Completion for Predicting Disease-Gene Associations

    KAUST Repository

    Hou, Siqing

    2018-05-21

    In silico prioritization of undiscovered associations can help find causal genes of newly discovered diseases. Some existing methods are based on known associations, and side information of diseases and genes. We exploit the possibility of using a neural network model, Neural inductive matrix completion (NIMC), in disease-gene prediction. Comparing to the state-of-the-art inductive matrix completion method, using neural networks allows us to learn latent features from non-linear functions of input features. Previous methods use disease features only from mining text. Comparing to text mining, disease ontology is a more informative way of discovering correlation of dis- eases, from which we can calculate the similarities between diseases and help increase the performance of predicting disease-gene associations. We compare the proposed method with other state-of-the-art methods for pre- dicting associated genes for diseases from the Online Mendelian Inheritance in Man (OMIM) database. Results show that both new features and the proposed NIMC model can improve the chance of recovering an unknown associated gene in the top 100 predicted genes. Best results are obtained by using both the new features and the new model. Results also show the proposed method does better in predicting associated genes for newly discovered diseases.

  20. DISEASES: text mining and data integration of disease-gene associations.

    Science.gov (United States)

    Pletscher-Frankild, Sune; Pallejà, Albert; Tsafou, Kalliopi; Binder, Janos X; Jensen, Lars Juhl

    2015-03-01

    Text mining is a flexible technology that can be applied to numerous different tasks in biology and medicine. We present a system for extracting disease-gene associations from biomedical abstracts. The system consists of a highly efficient dictionary-based tagger for named entity recognition of human genes and diseases, which we combine with a scoring scheme that takes into account co-occurrences both within and between sentences. We show that this approach is able to extract half of all manually curated associations with a false positive rate of only 0.16%. Nonetheless, text mining should not stand alone, but be combined with other types of evidence. For this reason, we have developed the DISEASES resource, which integrates the results from text mining with manually curated disease-gene associations, cancer mutation data, and genome-wide association studies from existing databases. The DISEASES resource is accessible through a web interface at http://diseases.jensenlab.org/, where the text-mining software and all associations are also freely available for download. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  1. Gene-wide analysis detects two new susceptibility genes for Alzheimer's disease.

    Science.gov (United States)

    Escott-Price, Valentina; Bellenguez, Céline; Wang, Li-San; Choi, Seung-Hoan; Harold, Denise; Jones, Lesley; Holmans, Peter; Gerrish, Amy; Vedernikov, Alexey; Richards, Alexander; DeStefano, Anita L; Lambert, Jean-Charles; Ibrahim-Verbaas, Carla A; Naj, Adam C; Sims, Rebecca; Jun, Gyungah; Bis, Joshua C; Beecham, Gary W; Grenier-Boley, Benjamin; Russo, Giancarlo; Thornton-Wells, Tricia A; Denning, Nicola; Smith, Albert V; Chouraki, Vincent; Thomas, Charlene; Ikram, M Arfan; Zelenika, Diana; Vardarajan, Badri N; Kamatani, Yoichiro; Lin, Chiao-Feng; Schmidt, Helena; Kunkle, Brian; Dunstan, Melanie L; Vronskaya, Maria; Johnson, Andrew D; Ruiz, Agustin; Bihoreau, Marie-Thérèse; Reitz, Christiane; Pasquier, Florence; Hollingworth, Paul; Hanon, Olivier; Fitzpatrick, Annette L; Buxbaum, Joseph D; Campion, Dominique; Crane, Paul K; Baldwin, Clinton; Becker, Tim; Gudnason, Vilmundur; Cruchaga, Carlos; Craig, David; Amin, Najaf; Berr, Claudine; Lopez, Oscar L; De Jager, Philip L; Deramecourt, Vincent; Johnston, Janet A; Evans, Denis; Lovestone, Simon; Letenneur, Luc; Hernández, Isabel; Rubinsztein, David C; Eiriksdottir, Gudny; Sleegers, Kristel; Goate, Alison M; Fiévet, Nathalie; Huentelman, Matthew J; Gill, Michael; Brown, Kristelle; Kamboh, M Ilyas; Keller, Lina; Barberger-Gateau, Pascale; McGuinness, Bernadette; Larson, Eric B; Myers, Amanda J; Dufouil, Carole; Todd, Stephen; Wallon, David; Love, Seth; Rogaeva, Ekaterina; Gallacher, John; George-Hyslop, Peter St; Clarimon, Jordi; Lleo, Alberto; Bayer, Anthony; Tsuang, Debby W; Yu, Lei; Tsolaki, Magda; Bossù, Paola; Spalletta, Gianfranco; Proitsi, Petra; Collinge, John; Sorbi, Sandro; Garcia, Florentino Sanchez; Fox, Nick C; Hardy, John; Naranjo, Maria Candida Deniz; Bosco, Paolo; Clarke, Robert; Brayne, Carol; Galimberti, Daniela; Scarpini, Elio; Bonuccelli, Ubaldo; Mancuso, Michelangelo; Siciliano, Gabriele; Moebus, Susanne; Mecocci, Patrizia; Zompo, Maria Del; Maier, Wolfgang; Hampel, Harald; Pilotto, Alberto; Frank-García, Ana; Panza, Francesco; Solfrizzi, Vincenzo; Caffarra, Paolo; Nacmias, Benedetta; Perry, William; Mayhaus, Manuel; Lannfelt, Lars; Hakonarson, Hakon; Pichler, Sabrina; Carrasquillo, Minerva M; Ingelsson, Martin; Beekly, Duane; Alvarez, Victoria; Zou, Fanggeng; Valladares, Otto; Younkin, Steven G; Coto, Eliecer; Hamilton-Nelson, Kara L; Gu, Wei; Razquin, Cristina; Pastor, Pau; Mateo, Ignacio; Owen, Michael J; Faber, Kelley M; Jonsson, Palmi V; Combarros, Onofre; O'Donovan, Michael C; Cantwell, Laura B; Soininen, Hilkka; Blacker, Deborah; Mead, Simon; Mosley, Thomas H; Bennett, David A; Harris, Tamara B; Fratiglioni, Laura; Holmes, Clive; de Bruijn, Renee F A G; Passmore, Peter; Montine, Thomas J; Bettens, Karolien; Rotter, Jerome I; Brice, Alexis; Morgan, Kevin; Foroud, Tatiana M; Kukull, Walter A; Hannequin, Didier; Powell, John F; Nalls, Michael A; Ritchie, Karen; Lunetta, Kathryn L; Kauwe, John S K; Boerwinkle, Eric; Riemenschneider, Matthias; Boada, Mercè; Hiltunen, Mikko; Martin, Eden R; Schmidt, Reinhold; Rujescu, Dan; Dartigues, Jean-François; Mayeux, Richard; Tzourio, Christophe; Hofman, Albert; Nöthen, Markus M; Graff, Caroline; Psaty, Bruce M; Haines, Jonathan L; Lathrop, Mark; Pericak-Vance, Margaret A; Launer, Lenore J; Van Broeckhoven, Christine; Farrer, Lindsay A; van Duijn, Cornelia M; Ramirez, Alfredo; Seshadri, Sudha; Schellenberg, Gerard D; Amouyel, Philippe; Williams, Julie

    2014-01-01

    Alzheimer's disease is a common debilitating dementia with known heritability, for which 20 late onset susceptibility loci have been identified, but more remain to be discovered. This study sought to identify new susceptibility genes, using an alternative gene-wide analytical approach which tests for patterns of association within genes, in the powerful genome-wide association dataset of the International Genomics of Alzheimer's Project Consortium, comprising over 7 m genotypes from 25,580 Alzheimer's cases and 48,466 controls. In addition to earlier reported genes, we detected genome-wide significant loci on chromosomes 8 (TP53INP1, p = 1.4×10-6) and 14 (IGHV1-67 p = 7.9×10-8) which indexed novel susceptibility loci. The additional genes identified in this study, have an array of functions previously implicated in Alzheimer's disease, including aspects of energy metabolism, protein degradation and the immune system and add further weight to these pathways as potential therapeutic targets in Alzheimer's disease.

  2. Gene-wide analysis detects two new susceptibility genes for Alzheimer's disease.

    Directory of Open Access Journals (Sweden)

    Valentina Escott-Price

    Full Text Available Alzheimer's disease is a common debilitating dementia with known heritability, for which 20 late onset susceptibility loci have been identified, but more remain to be discovered. This study sought to identify new susceptibility genes, using an alternative gene-wide analytical approach which tests for patterns of association within genes, in the powerful genome-wide association dataset of the International Genomics of Alzheimer's Project Consortium, comprising over 7 m genotypes from 25,580 Alzheimer's cases and 48,466 controls.In addition to earlier reported genes, we detected genome-wide significant loci on chromosomes 8 (TP53INP1, p = 1.4×10-6 and 14 (IGHV1-67 p = 7.9×10-8 which indexed novel susceptibility loci.The additional genes identified in this study, have an array of functions previously implicated in Alzheimer's disease, including aspects of energy metabolism, protein degradation and the immune system and add further weight to these pathways as potential therapeutic targets in Alzheimer's disease.

  3. Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles.

    Science.gov (United States)

    Zhu, Jie; Qin, Yufang; Liu, Taigang; Wang, Jun; Zheng, Xiaoqi

    2013-01-01

    Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. Programs and data are available upon request.

  4. Exploring the potential relevance of human-specific genes to complex disease

    Directory of Open Access Journals (Sweden)

    Cooper David N

    2011-01-01

    Full Text Available Abstract Although human disease genes generally tend to be evolutionarily more ancient than non-disease genes, complex disease genes appear to be represented more frequently than Mendelian disease genes among genes of more recent evolutionary origin. It is therefore proposed that the analysis of human-specific genes might provide new insights into the genetics of complex disease. Cross-comparison with the Human Gene Mutation Database (http://www.hgmd.org revealed a number of examples of disease-causing and disease-associated mutations in putatively human-specific genes. A sizeable proportion of these were missense polymorphisms associated with complex disease. Since both human-specific genes and genes associated with complex disease have often experienced particularly rapid rates of evolutionary change, either due to weaker purifying selection or positive selection, it is proposed that a significant number of human-specific genes may play a role in complex disease.

  5. Interference, heterogeneity and disease gene mapping

    Energy Technology Data Exchange (ETDEWEB)

    Keats, B. [Louisiana State Univ. Medical Center, New Orleans, LA (United States)

    1996-12-31

    The Human Genome Project has had a major impact on genetic research over the past five years. The number of mapped genes is now over 3,000 compared with approximately 1,600 in 1989 and only about 260 ten years before that. The realization that extensive variation could be detected in anonymous DNA segments greatly enhanced the potential for mapping by linkage analysis. Previously, linkage studies had depended on polymorphisms that could be detected in red blood cell antigens, proteins (revealed by electrophoresis and isoelectric focusing), and cytogenetic heteromorphisms. The identification of thousands of polymorphic DNA markers throughout the human genome has led to the construction of high density genetic linkage maps. These maps provide the data necessary to test hypotheses concerning differences in recombination rates and levels of interference. They are also important for disease gene mapping because the existence of these genes must be inferred from the phenotype. Showing linkage of a disease gene to a DNA marker is the first step towards isolating the disease gene, determining its protein product, and developing effective therapies. However, interpretation of results is not always straightforward. Factors such as etiological heterogeneity and undetected irregular segregation can lead to confusing linkage results and incorrect conclusions about the locations of disease genes. This paper will discuss these phenomena and present examples that illustrate the problems, as well as approaches to dealing with them. 23 refs., 3 figs., 3 tabs.

  6. Identification of conserved drought stress responsive gene-network across tissues and developmental stages in rice.

    Science.gov (United States)

    Smita, Shuchi; Katiyar, Amit; Pandey, Dev Mani; Chinnusamy, Viswanathan; Archak, Sunil; Bansal, Kailash Chander

    2013-01-01

    Identification of genes that are coexpressed across various tissues and environmental stresses is biologically interesting, since they may play coordinated role in similar biological processes. Genes with correlated expression patterns can be best identified by using coexpression network analysis of transcriptome data. In the present study, we analyzed the temporal-spatial coordination of gene expression in root, leaf and panicle of rice under drought stress and constructed network using WGCNA and Cytoscape. Total of 2199 differentially expressed genes (DEGs) were identified in at least three or more tissues, wherein 88 genes have coordinated expression profile among all the six tissues under drought stress. These 88 highly coordinated genes were further subjected to module identification in the coexpression network. Based on chief topological properties we identified 18 hub genes such as ABC transporter, ATP-binding protein, dehydrin, protein phosphatase 2C, LTPL153 - Protease inhibitor, phosphatidylethanolaminebinding protein, lactose permease-related, NADP-dependent malic enzyme, etc. Motif enrichment analysis showed the presence of ABRE cis-elements in the promoters of > 62% of the coordinately expressed genes. Our results suggest that drought stress mediated upregulated gene expression was coordinated through an ABA-dependent signaling pathway across tissues, at least for the subset of genes identified in this study, while down regulation appears to be regulated by tissue specific pathways in rice.

  7. Identification of Gene Biomarkers for Distinguishing Small-Cell Lung Cancer from Non-Small-Cell Lung Cancer Using a Network-Based Approach

    Directory of Open Access Journals (Sweden)

    Fei Long

    2015-01-01

    Full Text Available Lung cancer consists of two main subtypes: small-cell lung cancer (SCLC and non-small-cell lung cancer (NSCLC that are classified according to their physiological phenotypes. In this study, we have developed a network-based approach to identify molecular biomarkers that can distinguish SCLC from NSCLC. By identifying positive and negative coexpression gene pairs in normal lung tissues, SCLC, or NSCLC samples and using functional association information from the STRING network, we first construct a lung cancer-specific gene association network. From the network, we obtain gene modules in which genes are highly functionally associated with each other and are either positively or negatively coexpressed in the three conditions. Then, we identify gene modules that not only are differentially expressed between cancer and normal samples, but also show distinctive expression patterns between SCLC and NSCLC. Finally, we select genes inside those modules with discriminating coexpression patterns between the two lung cancer subtypes and predict them as candidate biomarkers that are of diagnostic use.

  8. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature

    Directory of Open Access Journals (Sweden)

    Ning Ye

    2015-01-01

    Full Text Available The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  9. Uncovering robust patterns of microRNA co-expression across cancers using Bayesian Relevance Networks.

    Directory of Open Access Journals (Sweden)

    Parameswaran Ramachandran

    Full Text Available Co-expression networks have long been used as a tool for investigating the molecular circuitry governing biological systems. However, most algorithms for constructing co-expression networks were developed in the microarray era, before high-throughput sequencing-with its unique statistical properties-became the norm for expression measurement. Here we develop Bayesian Relevance Networks, an algorithm that uses Bayesian reasoning about expression levels to account for the differing levels of uncertainty in expression measurements between highly- and lowly-expressed entities, and between samples with different sequencing depths. It combines data from groups of samples (e.g., replicates to estimate group expression levels and confidence ranges. It then computes uncertainty-moderated estimates of cross-group correlations between entities, and uses permutation testing to assess their statistical significance. Using large scale miRNA data from The Cancer Genome Atlas, we show that our Bayesian update of the classical Relevance Networks algorithm provides improved reproducibility in co-expression estimates and lower false discovery rates in the resulting co-expression networks. Software is available at www.perkinslab.ca.

  10. Co-expression networks reveal the tissue-specific regulation of transcription and splicing.

    Science.gov (United States)

    Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D H; Jo, Brian; Gao, Chuan; McDowell, Ian C; Engelhardt, Barbara E; Battle, Alexis

    2017-11-01

    Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. © 2017 Saha et al.; Published by Cold Spring Harbor Laboratory Press.

  11. Divergent and convergent modes of interaction between wheat and Puccinia graminis f. sp. tritici isolates revealed by the comparative gene co-expression network and genome analyses.

    Science.gov (United States)

    Rutter, William B; Salcedo, Andres; Akhunova, Alina; He, Fei; Wang, Shichen; Liang, Hanquan; Bowden, Robert L; Akhunov, Eduard

    2017-04-12

    Two opposing evolutionary constraints exert pressure on plant pathogens: one to diversify virulence factors in order to evade plant defenses, and the other to retain virulence factors critical for maintaining a compatible interaction with the plant host. To better understand how the diversified arsenals of fungal genes promote interaction with the same compatible wheat line, we performed a comparative genomic analysis of two North American isolates of Puccinia graminis f. sp. tritici (Pgt). The patterns of inter-isolate divergence in the secreted candidate effector genes were compared with the levels of conservation and divergence of plant-pathogen gene co-expression networks (GCN) developed for each isolate. Comprative genomic analyses revealed substantial level of interisolate divergence in effector gene complement and sequence divergence. Gene Ontology (GO) analyses of the conserved and unique parts of the isolate-specific GCNs identified a number of conserved host pathways targeted by both isolates. Interestingly, the degree of inter-isolate sub-network conservation varied widely for the different host pathways and was positively associated with the proportion of conserved effector candidates associated with each sub-network. While different Pgt isolates tended to exploit similar wheat pathways for infection, the mode of plant-pathogen interaction varied for different pathways with some pathways being associated with the conserved set of effectors and others being linked with the diverged or isolate-specific effectors. Our data suggest that at the intra-species level pathogen populations likely maintain divergent sets of effectors capable of targeting the same plant host pathways. This functional redundancy may play an important role in the dynamic of the "arms-race" between host and pathogen serving as the basis for diverse virulence strategies and creating conditions where mutations in certain effector groups will not have a major effect on the pathogen

  12. A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer

    Directory of Open Access Journals (Sweden)

    Mary Qu Yang

    Full Text Available Clear cell renal cell carcinoma (ccRCC is the most common and most aggressive form of renal cell cancer (RCC. The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1, as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways. Keywords: ccRCC, Causative mutation, Pathways, Protein-protein interaction, Gene module, eQTL

  13. Biomedical Information Extraction: Mining Disease Associated Genes from Literature

    Science.gov (United States)

    Huang, Zhong

    2014-01-01

    Disease associated gene discovery is a critical step to realize the future of personalized medicine. However empirical and clinical validation of disease associated genes are time consuming and expensive. In silico discovery of disease associated genes from literature is therefore becoming the first essential step for biomarker discovery to…

  14. A systems approach identifies networks and genes linking sleep and stress: implications for neuropsychiatric disorders.

    Science.gov (United States)

    Jiang, Peng; Scarpa, Joseph R; Fitzpatrick, Karrie; Losic, Bojan; Gao, Vance D; Hao, Ke; Summa, Keith C; Yang, He S; Zhang, Bin; Allada, Ravi; Vitaterna, Martha H; Turek, Fred W; Kasarskis, Andrew

    2015-05-05

    Sleep dysfunction and stress susceptibility are comorbid complex traits that often precede and predispose patients to a variety of neuropsychiatric diseases. Here, we demonstrate multilevel organizations of genetic landscape, candidate genes, and molecular networks associated with 328 stress and sleep traits in a chronically stressed population of 338 (C57BL/6J × A/J) F2 mice. We constructed striatal gene co-expression networks, revealing functionally and cell-type-specific gene co-regulations important for stress and sleep. Using a composite ranking system, we identified network modules most relevant for 15 independent phenotypic categories, highlighting a mitochondria/synaptic module that links sleep and stress. The key network regulators of this module are overrepresented with genes implicated in neuropsychiatric diseases. Our work suggests that the interplay among sleep, stress, and neuropathology emerges from genetic influences on gene expression and their collective organization through complex molecular networks, providing a framework for interrogating the mechanisms underlying sleep, stress susceptibility, and related neuropsychiatric disorders. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Hydroxylation of recombinant human collagen type I alpha 1 in transgenic maize co-expressed with a recombinant human prolyl 4-hydroxylase

    Directory of Open Access Journals (Sweden)

    Pappu Kameshwari M

    2011-06-01

    Full Text Available Abstract Background Collagens require the hydroxylation of proline (Pro residues in their triple-helical domain repeating sequence Xaa-Pro-Gly to function properly as a main structural component of the extracellular matrix in animals at physiologically relevant conditions. The regioselective proline hydroxylation is catalyzed by a specific prolyl 4-hydroxylase (P4H as a posttranslational processing step. Results A recombinant human collagen type I α-1 (rCIα1 with high percentage of hydroxylated prolines (Hyp was produced in transgenic maize seeds when co-expressed with both the α- and β- subunits of a recombinant human P4H (rP4H. Germ-specific expression of rCIα1 using maize globulin-1 gene promoter resulted in an average yield of 12 mg/kg seed for the full-length rCIα1 in seeds without co-expression of rP4H and 4 mg/kg seed for the rCIα1 (rCIα1-OH in seeds with co-expression of rP4H. High-resolution mass spectrometry (HRMS analysis revealed that nearly half of the collagenous repeating triplets in rCIα1 isolated from rP4H co-expressing maize line had the Pro residues changed to Hyp residues. The HRMS analysis determined the Hyp content of maize-derived rCIα1-OH as 18.11%, which is comparable to the Hyp level of yeast-derived rCIα1-OH (17.47% and the native human CIa1 (14.59%, respectively. The increased Hyp percentage was correlated with a markedly enhanced thermal stability of maize-derived rCIα1-OH when compared to the non-hydroxylated rCIα1. Conclusions This work shows that maize has potential to produce adequately modified exogenous proteins with mammalian-like post-translational modifications that may be require for their use as pharmaceutical and industrial products.

  16. Gene therapy for Stargardt disease associated with ABCA4 gene.

    Science.gov (United States)

    Han, Zongchao; Conley, Shannon M; Naash, Muna I

    2014-01-01

    Mutations in the photoreceptor-specific flippase ABCA4 lead to accumulation of the toxic bisretinoid A2E, resulting in atrophy of the retinal pigment epithelium (RPE) and death of the photoreceptor cells. Many blinding diseases are associated with these mutations including Stargardt's disease (STGD1), cone-rod dystrophy, retinitis pigmentosa (RP), and increased susceptibility to age-related macular degeneration. There are no curative treatments for any of these dsystrophies. While the monogenic nature of many of these conditions makes them amenable to treatment with gene therapy, the ABCA4 cDNA is 6.8 kb and is thus too large for the AAV vectors which have been most successful for other ocular genes. Here we review approaches to ABCA4 gene therapy including treatment with novel AAV vectors, lentiviral vectors, and non-viral compacted DNA nanoparticles. Lentiviral and compacted DNA nanoparticles in particular have a large capacity and have been successful in improving disease phenotypes in the Abca4 (-/-) murine model. Excitingly, two Phase I/IIa clinical trials are underway to treat patients with ABCA4-associated Startgardt's disease (STGD1). As a result of the development of these novel technologies, effective therapies for ABCA4-associated diseases may finally be within reach.

  17. Current status of gene therapy for motor neuron disease

    Institute of Scientific and Technical Information of China (English)

    Xingkai An; Rong Peng; Shanshan Zhao

    2006-01-01

    OBJECTIVE: Although the etiology and pathogenesis of motor neuron disease is still unknown, there are many hypotheses on motor neuron mitochondrion, cytoskeleton structure and functional injuries. Thus, gene therapy of motor neuron disease has become a hot topic to apply in viral vector, gene delivery and basic gene techniques.DATA SOURCES: The related articles published between January 2000 and October 2006 were searched in Medline database and ISl database by computer using the keywords "motor neuron disease, gene therapy", and the language is limited to English. Meanwhile, the related references of review were also searched by handiwork. STUDY SELECTION: Original articles and referred articles in review were chosen after first hearing, then the full text which had new ideas were found, and when refer to the similar study in the recent years were considered first.DATA EXTRACTION: Among the 92 related articles, 40 ones were accepted, and 52 were excluded because of repetitive study or reviews.DATA SYNTHESIS: The viral vectors of gene therapy for motor neuron disease include adenoviral, adeno-associated viral vectors, herpes simplex virus type 1 vectors and lentiviral vectors. The delivery of them can be achieved by direct injection into the brain, or by remote delivery after injection vectors into muscle or peripheral nerves, or by ex vivo gene transfer. The viral vectors of gene therapy for motor neuron disease have been successfully developed, but the gene delivery of them is hampered by some difficulties. The RNA interference and neuroprotection are the main technologies for gene-based therapy in motor neuron disease. CONCLUSION : The RNA interference for motor neuron disease has succeeded in animal models, and the neuroprotection also does. But, there are still a lot of questions for gene therapy in the clinical treatment of motor neuron disease.

  18. HFE gene mutations and Wilson's disease in Sardinia.

    Science.gov (United States)

    Sorbello, Orazio; Sini, Margherita; Civolani, Alberto; Demelia, Luigi

    2010-03-01

    Hypocaeruloplasminaemia can lead to tissue iron storage in Wilson's disease and the possibility of iron overload in long-term overtreated patients should be considered. The HFE gene encodes a protein that is intimately involved in intestinal iron absorption. The aim of this study was to determine the prevalence of the HFE gene mutation, its role in iron metabolism of Wilson's disease patients and the interplay of therapy in copper and iron homeostasis. The records of 32 patients with Wilson's disease were reviewed for iron and copper indices, HFE gene mutations and liver biopsy. Twenty-six patients were negative for HFE gene mutations and did not present significant alterations of iron metabolism. The HFE mutation was significantly associated with increased hepatic iron content (PHFE gene wild-type. The HFE gene mutations may be an addictional factor in iron overload in Wilson's disease. Our results showed that an adjustment of dosage of drugs could prevent further iron overload induced by overtreatment only in patients HFE wild-type. 2009. Published by Elsevier Ltd.

  19. Human Immunodeficiency Virus Type-1 Elite Controllers Maintain Low Co-Expression of Inhibitory Receptors on CD4+ T Cells.

    Science.gov (United States)

    Noyan, Kajsa; Nguyen, Son; Betts, Michael R; Sönnerborg, Anders; Buggert, Marcus

    2018-01-01

    Human immunodeficiency virus type-1 (HIV-1) elite controllers (ELCs) represent a unique population that control viral replication in the absence of antiretroviral therapy (cART). It is well established that expression of multiple inhibitory receptors on CD8+ T cells is associated with HIV-1 disease progression. However, whether reduced co-expression of inhibitory receptors on CD4+ T cells is linked to natural viral control and slow HIV-1 disease progression remains undefined. Here, we report on the expression pattern of numerous measurable inhibitory receptors, associated with T cell exhaustion (programmed cell death-1, CTLA-4, and TIGIT), on different CD4+ T cell memory populations in ELCs and HIV-infected subjects with or without long-term cART. We found that the co-expression pattern of inhibitory receptors was significantly reduced in ELCs compared with HIV-1 cART-treated and viremic subjects, and similar to healthy controls. Markers associated with T cell exhaustion varied among different memory CD4+ T cell subsets and highest levels were found mainly on transitional memory T cells. CD4+ T cells co-expressing all inhibitory markers were positively correlated to T cell activation (CD38+ HLA-DR+) as well as the transcription factors Helios and FoxP3. Finally, clinical parameters such as CD4 count, HIV-1 viral load, and the CD4/CD8 ratio all showed significant associations with CD4+ T cell exhaustion. We demonstrate that ELCs are able to maintain lower levels of CD4+ T cell exhaustion despite years of ongoing viral replication compared with successfully cART-treated subjects. Our findings suggest that ELCs harbor a "healthy" state of inhibitory receptor expression on CD4+ T cells that might play part in maintenance of their control status.

  20. From protein-protein interactions to protein co-expression networks: a new perspective to evaluate large-scale proteomic data.

    Science.gov (United States)

    Vella, Danila; Zoppis, Italo; Mauri, Giancarlo; Mauri, Pierluigi; Di Silvestre, Dario

    2017-12-01

    The reductionist approach of dissecting biological systems into their constituents has been successful in the first stage of the molecular biology to elucidate the chemical basis of several biological processes. This knowledge helped biologists to understand the complexity of the biological systems evidencing that most biological functions do not arise from individual molecules; thus, realizing that the emergent properties of the biological systems cannot be explained or be predicted by investigating individual molecules without taking into consideration their relations. Thanks to the improvement of the current -omics technologies and the increasing understanding of the molecular relationships, even more studies are evaluating the biological systems through approaches based on graph theory. Genomic and proteomic data are often combined with protein-protein interaction (PPI) networks whose structure is routinely analyzed by algorithms and tools to characterize hubs/bottlenecks and topological, functional, and disease modules. On the other hand, co-expression networks represent a complementary procedure that give the opportunity to evaluate at system level including organisms that lack information on PPIs. Based on these premises, we introduce the reader to the PPI and to the co-expression networks, including aspects of reconstruction and analysis. In particular, the new idea to evaluate large-scale proteomic data by means of co-expression networks will be discussed presenting some examples of application. Their use to infer biological knowledge will be shown, and a special attention will be devoted to the topological and module analysis.

  1. Identification of human disease genes from interactome network using graphlet interaction.

    Directory of Open Access Journals (Sweden)

    Xiao-Dong Wang

    Full Text Available Identifying genes related to human diseases, such as cancer and cardiovascular disease, etc., is an important task in biomedical research because of its applications in disease diagnosis and treatment. Interactome networks, especially protein-protein interaction networks, had been used to disease genes identification based on the hypothesis that strong candidate genes tend to closely relate to each other in some kinds of measure on the network. We proposed a new measure to analyze the relationship between network nodes which was called graphlet interaction. The graphlet interaction contained 28 different isomers. The results showed that the numbers of the graphlet interaction isomers between disease genes in interactome networks were significantly larger than random picked genes, while graphlet signatures were not. Then, we designed a new type of score, based on the network properties, to identify disease genes using graphlet interaction. The genes with higher scores were more likely to be disease genes, and all candidate genes were ranked according to their scores. Then the approach was evaluated by leave-one-out cross-validation. The precision of the current approach achieved 90% at about 10% recall, which was apparently higher than the previous three predominant algorithms, random walk, Endeavour and neighborhood based method. Finally, the approach was applied to predict new disease genes related to 4 common diseases, most of which were identified by other independent experimental researches. In conclusion, we demonstrate that the graphlet interaction is an effective tool to analyze the network properties of disease genes, and the scores calculated by graphlet interaction is more precise in identifying disease genes.

  2. Identification of Human Disease Genes from Interactome Network Using Graphlet Interaction

    Science.gov (United States)

    Yang, Lun; Wei, Dong-Qing; Qi, Ying-Xin; Jiang, Zong-Lai

    2014-01-01

    Identifying genes related to human diseases, such as cancer and cardiovascular disease, etc., is an important task in biomedical research because of its applications in disease diagnosis and treatment. Interactome networks, especially protein-protein interaction networks, had been used to disease genes identification based on the hypothesis that strong candidate genes tend to closely relate to each other in some kinds of measure on the network. We proposed a new measure to analyze the relationship between network nodes which was called graphlet interaction. The graphlet interaction contained 28 different isomers. The results showed that the numbers of the graphlet interaction isomers between disease genes in interactome networks were significantly larger than random picked genes, while graphlet signatures were not. Then, we designed a new type of score, based on the network properties, to identify disease genes using graphlet interaction. The genes with higher scores were more likely to be disease genes, and all candidate genes were ranked according to their scores. Then the approach was evaluated by leave-one-out cross-validation. The precision of the current approach achieved 90% at about 10% recall, which was apparently higher than the previous three predominant algorithms, random walk, Endeavour and neighborhood based method. Finally, the approach was applied to predict new disease genes related to 4 common diseases, most of which were identified by other independent experimental researches. In conclusion, we demonstrate that the graphlet interaction is an effective tool to analyze the network properties of disease genes, and the scores calculated by graphlet interaction is more precise in identifying disease genes. PMID:24465923

  3. beta. -Amyloid gene dosage in Alzheimer's disease

    Energy Technology Data Exchange (ETDEWEB)

    Murdoch, G H; Manuelidis, L; Kim, J H; Manuelidis, E E

    1988-01-11

    The 4-5 kd amyloid ..beta..-peptide is a major constituent of the characteristic amyloid plaque of Alzheimer's disease. It has been reported that some cases of sporatic Alzheimer's disease are associated with at least a partial duplication of chromosome 21 containing the gene corresponding to the 695 residue precursor of this peptide. To contribute to an understanding of the frequency to such a duplication event in the overall Alzheimer's population, the authors have determined the gene dosage of the ..beta..-amyloid gene in this collection of cases. All cases had a clinical diagnosis of Alzheimer's confirmed neuropathologically. Each Alzheimer's case had an apparent normal diploid ..beta..-amyloid gene dosage, while control Down's cases had the expected triploid dosage. Thus partial duplication of chromosome 21 may be a rare finding in Alzheimer's disease. Similar conclusions were just reported in several studies of the Harvard Alzheimer collection.

  4. [Progress in research on pathogenic genes and gene therapy for inherited retinal diseases].

    Science.gov (United States)

    Zhu, Ling; Cao, Cong; Sun, Jiji; Gao, Tao; Liang, Xiaoyang; Nie, Zhipeng; Ji, Yanchun; Jiang, Pingping; Guan, Minxin

    2017-02-10

    Inherited retinal diseases (IRDs), including retinitis pigmentosa, Usher syndrome, Cone-Rod degenerations, inherited macular dystrophy, Leber's congenital amaurosis, Leber's hereditary optic neuropathy are the most common and severe types of hereditary ocular diseases. So far more than 200 pathogenic genes have been identified. With the growing knowledge of the genetics and mechanisms of IRDs, a number of gene therapeutic strategies have been developed in the laboratory or even entered clinical trials. Here the progress of IRD research on the pathogenic genes and therapeutic strategies, particularly gene therapy, are reviewed.

  5. Discovering disease-associated genes in weighted protein-protein interaction networks

    Science.gov (United States)

    Cui, Ying; Cai, Meng; Stanley, H. Eugene

    2018-04-01

    Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weight - which quantifies their relative strength - into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.

  6. Evidence of inflammatory immune signaling in chronic fatigue syndrome: A pilot study of gene expression in peripheral blood

    Directory of Open Access Journals (Sweden)

    Vernon Suzanne D

    2008-09-01

    Full Text Available Abstract Background Genomic profiling of peripheral blood reveals altered immunity in chronic fatigue syndrome (CFS however interpretation remains challenging without immune demographic context. The object of this work is to identify modulation of specific immune functional components and restructuring of co-expression networks characteristic of CFS using the quantitative genomics of peripheral blood. Methods Gene sets were constructed a priori for CD4+ T cells, CD8+ T cells, CD19+ B cells, CD14+ monocytes and CD16+ neutrophils from published data. A group of 111 women were classified using empiric case definition (U.S. Centers for Disease Control and Prevention and unsupervised latent cluster analysis (LCA. Microarray profiles of peripheral blood were analyzed for expression of leukocyte-specific gene sets and characteristic changes in co-expression identified from topological evaluation of linear correlation networks. Results Median expression for a set of 6 genes preferentially up-regulated in CD19+ B cells was significantly lower in CFS (p = 0.01 due mainly to PTPRK and TSPAN3 expression. Although no other gene set was differentially expressed at p Conclusion Dissection of blood microarray profiles points to B cell dysfunction with coordinated immune activation supporting persistent inflammation and antibody-mediated NK cell modulation of T cell activity. This has clinical implications as the CD19+ genes identified could provide robust and biologically meaningful basis for the early detection and unambiguous phenotyping of CFS.

  7. Speeding disease gene discovery by sequence based candidate prioritization

    Directory of Open Access Journals (Sweden)

    Porteous David J

    2005-03-01

    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  8. Expression of lycopene biosynthesis genes fused in line with Shine-Dalgarno sequences improves the stress-tolerance of Lactococcus lactis.

    Science.gov (United States)

    Dong, Xiangrong; Wang, Yanping; Yang, Fengyuan; Zhao, Shanshan; Tian, Bing; Li, Tao

    2017-01-01

    Lycopene biosynthetic genes from Deinococcus radiodurans were co-expressed in Lactococcus lactis to produce lycopene and improve its tolerance to stress. Lycopene-related genes from D. radiodurans, DR1395 (crtE), DR0862 (crtB), and DR0861 (crtI), were fused in line with S hine-Dalgarno (SD) sequences and co-expressed in L. lactis. The recombinant strain produced 0.36 mg lycopene g -1  dry cell wt after 48 h fermentation. The survival rate to UV irradiation of the recombinant strain was higher than that of the non-transformed strain. The L. lactis with co-expressed genes responsible for lycopene biosynthesis from D. radiodurans produced lycopene and exhibited increased resistance to UV stress, suggesting that the recombinant strain has important application potential in food industry.

  9. A hybrid network-based method for the detection of disease-related genes

    Science.gov (United States)

    Cui, Ying; Cai, Meng; Dai, Yang; Stanley, H. Eugene

    2018-02-01

    Detecting disease-related genes is crucial in disease diagnosis and drug design. The accepted view is that neighbors of a disease-causing gene in a molecular network tend to cause the same or similar diseases, and network-based methods have been recently developed to identify novel hereditary disease-genes in available biomedical networks. Despite the steady increase in the discovery of disease-associated genes, there is still a large fraction of disease genes that remains under the tip of the iceberg. In this paper we exploit the topological properties of the protein-protein interaction (PPI) network to detect disease-related genes. We compute, analyze, and compare the topological properties of disease genes with non-disease genes in PPI networks. We also design an improved random forest classifier based on these network topological features, and a cross-validation test confirms that our method performs better than previous similar studies.

  10. Keratitis-Ichthyosis-Deafness syndrome-associated Cx26 mutants produce nonfunctional gap junctions but hyperactive hemichannels when co-expressed with wild type Cx43

    Science.gov (United States)

    García, Isaac E.; Maripillán, Jaime; Jara, Oscar; Ceriani, Ricardo; Palacios-Muñoz, Angelina; Ramachandran, Jayalakshimi; Olivero, Pablo; Pérez-Acle, Tomás; González, Carlos; Sáez, Juan C.; Contreras, Jorge E.; Martínez, Agustín D.

    2015-01-01

    Mutations in Cx26 gene are found in most cases of human genetic deafness. Some mutations produce syndromic deafness associated with skin disorders, like Keratitis Ichthyosis Deafness syndrome (KID). Because in the human skin Cx26 is co-expressed with other connexins, like Cx43 and Cx30, and since KID syndrome is inherited as autosomal dominant condition, it is possible that KID mutations change the way Cx26 interacts with other co-expressed connexins. Indeed, some Cx26 syndromic mutations showed gap junction dominant negative effect when co-expressed with wild type connexins, including Cx26 and Cx43. The nature of these interactions and the consequences on hemichannels and gap junction channels functions remain unknown. In this study we demonstrate that syndromic mutations at the N-terminus segment of Cx26, change connexin oligomerization compatibility, allowing aberrant interactions with Cx43. Strikingly, heteromeric oligomer formed by Cx43/Cx26 (syndromic mutants) show exacerbated hemichannel activity, but nonfunctional gap junction channels; this also occurs for those Cx26 KID mutants that do not show functional homomeric hemichannels. Heterologous expression of these hyperactive heteromeric hemichannels increases cell membrane permeability, favoring ATP release and Ca2+ overload. The functional paradox produced by oligomerization of Cx43 and Cx26 KID mutants could underlie the severe syndromic phenotype in human skin. PMID:25625422

  11. Identification of a gene module associated with BMD through the integration of network analysis and genome-wide association data.

    Science.gov (United States)

    Farber, Charles R

    2010-11-01

    Bone mineral density (BMD) is influenced by a complex network of gene interactions; therefore, elucidating the relationships between genes and how those genes, in turn, influence BMD is critical for developing a comprehensive understanding of osteoporosis. To investigate the role of transcriptional networks in the regulation of BMD, we performed a weighted gene coexpression network analysis (WGCNA) using microarray expression data on monocytes from young individuals with low or high BMD. WGCNA groups genes into modules based on patterns of gene coexpression. and our analysis identified 11 gene modules. We observed that the overall expression of one module (referred to as module 9) was significantly higher in the low-BMD group (p = .03). Module 9 was highly enriched for genes belonging to the immune system-related gene ontology (GO) category "response to virus" (p = 7.6 × 10(-11)). Using publically available genome-wide association study data, we independently validated the importance of module 9 by demonstrating that highly connected module 9 hubs were more likely, relative to less highly connected genes, to be genetically associated with BMD. This study highlights the advantages of systems-level analyses to uncover coexpression modules associated with bone mass and suggests that particular monocyte expression patterns may mediate differences in BMD. © 2010 American Society for Bone and Mineral Research.

  12. Discovery of cis-elements between sorghum and rice using co-expression and evolutionary conservation

    Directory of Open Access Journals (Sweden)

    Haberer Georg

    2009-06-01

    Full Text Available Abstract Background The spatiotemporal regulation of gene expression largely depends on the presence and absence of cis-regulatory sites in the promoter. In the economically highly important grass family, our knowledge of transcription factor binding sites and transcriptional networks is still very limited. With the completion of the sorghum genome and the available rice genome sequence, comparative promoter analyses now allow genome-scale detection of conserved cis-elements. Results In this study, we identified thousands of phylogenetic footprints conserved between orthologous rice and sorghum upstream regions that are supported by co-expression information derived from three different rice expression data sets. In a complementary approach, cis-motifs were discovered by their highly conserved co-occurrence in syntenic promoter pairs. Sequence conservation and matches to known plant motifs support our findings. Expression similarities of gene pairs positively correlate with the number of motifs that are shared by gene pairs and corroborate the importance of similar promoter architectures for concerted regulation. This strongly suggests that these motifs function in the regulation of transcript levels in rice and, presumably also in sorghum. Conclusion Our work provides the first large-scale collection of cis-elements for rice and sorghum and can serve as a paradigm for cis-element analysis through comparative genomics in grasses in general.

  13. Parkinson's disease and mitochondrial gene variations

    DEFF Research Database (Denmark)

    Andalib, Sasan; Vafaee, Manouchehr Seyedi; Gjedde, Albert

    2014-01-01

    Parkinson's disease (PD) is a common disorder of the central nervous system in the elderly. The pathogenesis of PD is a complex process, with genetics as an important contributing factor. This factor may stem from mitochondrial gene variations and mutations as well as from nuclear gene variations...

  14. Global developmental gene expression and pathway analysis of normal brain development and mouse models of human neuronal migration defects.

    Directory of Open Access Journals (Sweden)

    Tiziano Pramparo

    2011-03-01

    Full Text Available Heterozygous LIS1 mutations are the most common cause of human lissencephaly, a human neuronal migration defect, and DCX mutations are the most common cause of X-linked lissencephaly. LIS1 is part of a protein complex including NDEL1 and 14-3-3ε that regulates dynein motor function and microtubule dynamics, while DCX stabilizes microtubules and cooperates with LIS1 during neuronal migration and neurogenesis. Targeted gene mutations of Lis1, Dcx, Ywhae (coding for 14-3-3ε, and Ndel1 lead to neuronal migration defects in mouse and provide models of human lissencephaly, as well as aid the study of related neuro-developmental diseases. Here we investigated the developing brain of these four mutants and wild-type mice using expression microarrays, bioinformatic analyses, and in vivo/in vitro experiments to address whether mutations in different members of the LIS1 neuronal migration complex lead to similar and/or distinct global gene expression alterations. Consistent with the overall successful development of the mutant brains, unsupervised clustering and co-expression analysis suggested that cell cycle and synaptogenesis genes are similarly expressed and co-regulated in WT and mutant brains in a time-dependent fashion. By contrast, focused co-expression analysis in the Lis1 and Ndel1 mutants uncovered substantial differences in the correlation among pathways. Differential expression analysis revealed that cell cycle, cell adhesion, and cytoskeleton organization pathways are commonly altered in all mutants, while synaptogenesis, cell morphology, and inflammation/immune response are specifically altered in one or more mutants. We found several commonly dysregulated genes located within pathogenic deletion/duplication regions, which represent novel candidates of human mental retardation and neurocognitive disabilities. Our analysis suggests that gene expression and pathway analysis in mouse models of a similar disorder or within a common pathway can

  15. Generating Gene Ontology-Disease Inferences to Explore Mechanisms of Human Disease at the Comparative Toxicogenomics Database.

    Directory of Open Access Journals (Sweden)

    Allan Peter Davis

    Full Text Available Strategies for discovering common molecular events among disparate diseases hold promise for improving understanding of disease etiology and expanding treatment options. One technique is to leverage curated datasets found in the public domain. The Comparative Toxicogenomics Database (CTD; http://ctdbase.org/ manually curates chemical-gene, chemical-disease, and gene-disease interactions from the scientific literature. The use of official gene symbols in CTD interactions enables this information to be combined with the Gene Ontology (GO file from NCBI Gene. By integrating these GO-gene annotations with CTD's gene-disease dataset, we produce 753,000 inferences between 15,700 GO terms and 4,200 diseases, providing opportunities to explore presumptive molecular underpinnings of diseases and identify biological similarities. Through a variety of applications, we demonstrate the utility of this novel resource. As a proof-of-concept, we first analyze known repositioned drugs (e.g., raloxifene and sildenafil and see that their target diseases have a greater degree of similarity when comparing GO terms vs. genes. Next, a computational analysis predicts seemingly non-intuitive diseases (e.g., stomach ulcers and atherosclerosis as being similar to bipolar disorder, and these are validated in the literature as reported co-diseases. Additionally, we leverage other CTD content to develop testable hypotheses about thalidomide-gene networks to treat seemingly disparate diseases. Finally, we illustrate how CTD tools can rank a series of drugs as potential candidates for repositioning against B-cell chronic lymphocytic leukemia and predict cisplatin and the small molecule inhibitor JQ1 as lead compounds. The CTD dataset is freely available for users to navigate pathologies within the context of extensive biological processes, molecular functions, and cellular components conferred by GO. This inference set should aid researchers, bioinformaticists, and

  16. Lack of association of the Norrie disease gene with retinoschisis phenotype.

    Science.gov (United States)

    Shastry, B S; Hiraoka, M; Trese, M T

    2000-01-01

    It has been reported recently that mice carrying a disrupted Norrie disease gene produced alterations in the murine eye that are similar to congenital retinoschisis. Therefore, it was of interest to determine whether mutations in the Norrie disease gene can account for the disease in families with retinoschisis that do not carry mutations in the retinoschisis gene. The patient set comprised 5 cases of retinoschisis (1 familial and 4 sporadic), all unrelated to each other. Fundus examination of affected individuals showed foveal and peripheral schisis, and the visual acuity range was 20/40-20/60. Peripheral blood specimens were collected from affected and unaffected family members. DNA was extracted and amplified by polymerase chain reaction amplification of exons of the Norrie disease gene. The amplified products were sequenced by the dideoxy chain termination method. The data revealed no disease-specific sequence alterations in the Norrie disease gene. Although we cannot completely exclude the possibility of the Norrie disease gene as a candidate gene, the above results suggest that the structural and functional changes in the Norrie disease gene are not associated with clinically typical retinoschisis families that do not contain mutations in the coding regions and splice sites of the retinoschisis gene.

  17. Gene-Environment Interaction in Parkinson's Disease

    DEFF Research Database (Denmark)

    Chuang, Yu-Hsuan; Lill, Christina M; Lee, Pei-Chen

    2016-01-01

    BACKGROUND AND PURPOSE: Drinking caffeinated coffee has been reported to provide protection against Parkinson's disease (PD). Caffeine is an adenosine A2A receptor (encoded by the gene ADORA2A) antagonist that increases dopaminergic neurotransmission and Cytochrome P450 1A2 (gene: CYP1A2...

  18. Mining biological databases for candidate disease genes

    Science.gov (United States)

    Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

    2001-07-01

    The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).

  19. Transcriptional dynamics of a conserved gene expression network associated with craniofacial divergence in Arctic charr.

    Science.gov (United States)

    Ahi, Ehsan Pashay; Kapralova, Kalina Hristova; Pálsson, Arnar; Maier, Valerie Helene; Gudbrandsson, Jóhannes; Snorrason, Sigurdur S; Jónsson, Zophonías O; Franzdóttir, Sigrídur Rut

    2014-01-01

    Understanding the molecular basis of craniofacial variation can provide insights into key developmental mechanisms of adaptive changes and their role in trophic divergence and speciation. Arctic charr (Salvelinus alpinus) is a polymorphic fish species, and, in Lake Thingvallavatn in Iceland, four sympatric morphs have evolved distinct craniofacial structures. We conducted a gene expression study on candidates from a conserved gene coexpression network, focusing on the development of craniofacial elements in embryos of two contrasting Arctic charr morphotypes (benthic and limnetic). Four Arctic charr morphs were studied: one limnetic and two benthic morphs from Lake Thingvallavatn and a limnetic reference aquaculture morph. The presence of morphological differences at developmental stages before the onset of feeding was verified by morphometric analysis. Following up on our previous findings that Mmp2 and Sparc were differentially expressed between morphotypes, we identified a network of genes with conserved coexpression across diverse vertebrate species. A comparative expression study of candidates from this network in developing heads of the four Arctic charr morphs verified the coexpression relationship of these genes and revealed distinct transcriptional dynamics strongly correlated with contrasting craniofacial morphologies (benthic versus limnetic). A literature review and Gene Ontology analysis indicated that a significant proportion of the network genes play a role in extracellular matrix organization and skeletogenesis, and motif enrichment analysis of conserved noncoding regions of network candidates predicted a handful of transcription factors, including Ap1 and Ets2, as potential regulators of the gene network. The expression of Ets2 itself was also found to associate with network gene expression. Genes linked to glucocorticoid signalling were also studied, as both Mmp2 and Sparc are responsive to this pathway. Among those, several transcriptional

  20. Heat Shock Protein 70 Enhances Mucosal Immunity against Human Norovirus When Coexpressed from a Vesicular Stomatitis Virus Vector

    Science.gov (United States)

    Ma, Yuanmei; Duan, Yue; Wei, Yongwei; Liang, Xueya; Niewiesk, Stefan; Oglesbee, Michael

    2014-01-01

    uncultivable. Thus, a live vector-based vaccine may provide an alternative vaccine strategy. In this study, we developed a vesicular stomatitis virus (VSV)-based human NoV vaccine candidate. We constructed rVSV-HSP70-VP1, coexpressing heat shock protein (HSP70) and capsid (VP1) genes of human NoV, and rVSV-Luc-VP1, coexpressing firefly luciferase (Luc) and VP1 genes. We found that VSVs with a double gene insertion were significantly more attenuated than VSV with a single VP1 insertion (rVSV-VP1). Furthermore, we found that coexpression or coadministration of HSP70 from VSV vector significantly enhanced human NoV-specific mucosal immunity. Collectively, we developed an improved live vectored vaccine candidate for human NoV which will be useful for future clinical studies. PMID:24574391

  1. Once for All: A Novel Robust System for Co-expression of Multiple Chimeric Fluorescent Fusion Proteins in Plants

    Directory of Open Access Journals (Sweden)

    Guitao Zhong

    2017-06-01

    Full Text Available Chimeric fluorescent fusion proteins have been employed as a powerful tool to reveal the subcellular localizations and dynamics of proteins in living cells. Co-expression of a fluorescent fusion protein with well-known organelle markers in the same cell is especially useful in revealing its spatial and temporal functions of the protein in question. However, the conventional methods for co-expressing multiple fluorescent tagged proteins in plants have the drawbacks of low expression efficiency, variations in the expression level and time-consuming genetic crossing. Here, we have developed a novel robust system that allows for high-efficient co-expression of multiple chimeric fluorescent fusion proteins in plants in a time-saving fashion. This system takes advantage of employing a single expression vector which consists of multiple semi-independent expressing cassettes for the protein co-expression thereby overcoming the limitations of using multiple independent expressing plasmids. In addition, it is a highly manipulable DNA assembly system, in which modification and recombination of DNA molecules are easily achieved through an optimized one-step assembly reaction. By employing this effective system, we demonstrated that co-expression of two chimeric fluorescent fusion reporter proteins of vacuolar sorting receptor and secretory carrier membrane protein gave rise to their perspective subcellular localizations in plants via both transient expression and stable transformation. Thus, we believed that this technical advance represents a promising approach for multi-color-protein co-expression in plant cells.

  2. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease.

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. http://rged.wall-eva.net. © The Author(s) 2014. Published by Oxford University Press.

  3. Renal Gene Expression Database (RGED): a relational database of gene expression profiles in kidney disease

    Science.gov (United States)

    Zhang, Qingzhou; Yang, Bo; Chen, Xujiao; Xu, Jing; Mei, Changlin; Mao, Zhiguo

    2014-01-01

    We present a bioinformatics database named Renal Gene Expression Database (RGED), which contains comprehensive gene expression data sets from renal disease research. The web-based interface of RGED allows users to query the gene expression profiles in various kidney-related samples, including renal cell lines, human kidney tissues and murine model kidneys. Researchers can explore certain gene profiles, the relationships between genes of interests and identify biomarkers or even drug targets in kidney diseases. The aim of this work is to provide a user-friendly utility for the renal disease research community to query expression profiles of genes of their own interest without the requirement of advanced computational skills. Availability and implementation: Website is implemented in PHP, R, MySQL and Nginx and freely available from http://rged.wall-eva.net. Database URL: http://rged.wall-eva.net PMID:25252782

  4. MyGeneFriends: A Social Network Linking Genes, Genetic Diseases, and Researchers.

    Science.gov (United States)

    Allot, Alexis; Chennen, Kirsley; Nevers, Yannis; Poidevin, Laetitia; Kress, Arnaud; Ripp, Raymond; Thompson, Julie Dawn; Poch, Olivier; Lecompte, Odile

    2017-06-16

    The constant and massive increase of biological data offers unprecedented opportunities to decipher the function and evolution of genes and their roles in human diseases. However, the multiplicity of sources and flow of data mean that efficient access to useful information and knowledge production has become a major challenge. This challenge can be addressed by taking inspiration from Web 2.0 and particularly social networks, which are at the forefront of big data exploration and human-data interaction. MyGeneFriends is a Web platform inspired by social networks, devoted to genetic disease analysis, and organized around three types of proactive agents: genes, humans, and genetic diseases. The aim of this study was to improve exploration and exploitation of biological, postgenomic era big data. MyGeneFriends leverages conventions popularized by top social networks (Facebook, LinkedIn, etc), such as networks of friends, profile pages, friendship recommendations, affinity scores, news feeds, content recommendation, and data visualization. MyGeneFriends provides simple and intuitive interactions with data through evaluation and visualization of connections (friendships) between genes, humans, and diseases. The platform suggests new friends and publications and allows agents to follow the activity of their friends. It dynamically personalizes information depending on the user's specific interests and provides an efficient way to share information with collaborators. Furthermore, the user's behavior itself generates new information that constitutes an added value integrated in the network, which can be used to discover new connections between biological agents. We have developed MyGeneFriends, a Web platform leveraging conventions from popular social networks to redefine the relationship between humans and biological big data and improve human processing of biomedical data. MyGeneFriends is available at lbgi.fr/mygenefriends. ©Alexis Allot, Kirsley Chennen, Yannis

  5. Inhibition of the Secretory pathway by Foot-and-Mouth disease virus 2BC protein is reproduced by co-expression of 2B with 2C, and the site of inhibition is determined by the subcellular location of 2C

    DEFF Research Database (Denmark)

    Moffat, Katy; Knox, Caroline; Howell, Gareth

    2007-01-01

    immune responses in vivo. Foot-and-mouth disease virus (FMDV), another picornavirus, can cause persistent infection of ruminants, suggesting it too may inhibit immune responses. Endoplasmic reticulum (ER)-to-Golgi apparatus transport of proteins is blocked by the FMDV 2BC protein. The observation that 2...... blocked in FMDV-infected cells. The block could be reconstituted by coexpression of 2B and 2C, showing that processing of 2BC did not compromise the ability of FMDV to slow secretion. Under these conditions, 2C was located to the Golgi apparatus, and the block in transport also occurred in the Golgi...... apparatus. Interestingly, the block in transport could be redirected to the ER when 2B was coexpressed with a 2C protein fused to an ER retention element. Thus, for FMDV a block in secretion is dependent on both 2B and 2C, with the latter determining the site of the block....

  6. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E.; Re, Matteo

    2014-01-01

    Objective In the context of “network medicine”, gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. Materials and methods We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. Results The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different “informativeness” embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Conclusions Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further

  7. Disease candidate gene identification and prioritization using protein interaction networks

    Directory of Open Access Journals (Sweden)

    Aronow Bruce J

    2009-02-01

    Full Text Available Abstract Background Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN analyses. Results For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds", and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. Conclusion Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.

  8. Gene-wide analysis detects two new susceptibility genes for Alzheimer's Disease

    OpenAIRE

    Escott-Price, Valentina; Bellenguez, Céline; Wang, Li-San; Choi, Seung-Hoan; Harold, Denise; Jones, Lesley; Holmans, Peter Alan; Gerrish, Amy; Vedernikov, Alexey; Richards, Alexander; DeStefano, Anita L.; Lambert, Jean-Charles; Ibrahim-Verbaas, Carla A.; Naj, Adam C.; Sims, Rebecca

    2014-01-01

    PUBLISHED BACKGROUND: Alzheimer's disease is a common debilitating dementia with known heritability, for which 20 late onset susceptibility loci have been identified, but more remain to be discovered. This study sought to identify new susceptibility genes, using an alternative gene-wide analytical approach which tests for patterns of association within genes, in the powerful genome-wide association dataset of the International Genomics of Alzheimer's Project Consortium, comprising over...

  9. Tagging of resistance gene(s) to rhizomania disease in sugar beet ...

    African Journals Online (AJOL)

    The rhizomania disease is one of the most important diseases in Iran and some other parts of the world which potentially could play a role in decreasing sugar yield in fields. One approach to combat with this disease is the use of resistance varieties. This varieties have been identified which are having resistance genes to ...

  10. Retinoschisislike alterations in the mouse eye caused by gene targeting of the Norrie disease gene.

    Science.gov (United States)

    Ruether, K; van de Pol, D; Jaissle, G; Berger, W; Tornow, R P; Zrenner, E

    1997-03-01

    To investigate the retinal function and morphology of mice carrying a replacement mutation in exon 2 of the Norrie disease gene. Recently, Norrie disease mutant mice have been generated using gene targeting technology. The mutation removes the 56 N-terminal amino acids of the Norrie gene product. Ganzfeld electroretinograms (ERGs) were obtained in five animals hemizygous or homozygous for the mutant gene and in three female animals heterozygous for the mutant gene. As controls, three males carrying the wild-type gene were examined. Electroretinogram testing included rod a- and b-wave V-log I functions, oscillatory potentials, and cone responses. The fundus morphology has been visualized by scanning laser ophthalmoscopy. Rod and cone ERG responses and fundus morphology were not significantly different among female heterozygotes and wild-type mice. In contrast, the hemizygous mice displayed a severe loss of ERG b-wave, leading to a negatively shaped scotopic ERG and a marked reduction of oscillatory potentials. The a-wave was normal at low intensities, and only with brighter flashes was there a moderate amplitude loss. Cone amplitudes were barely recordable in the gene-targeted males. Ophthalmoscopy revealed snowflakelike vitreal changes, retinoschisis, and pigment epithelium irregularities in hemizygotes and homozygotes, but no changes in female heterozygotes. The negatively shaped scotopic ERG in male mice with a Norrie disease gene mutation probably was caused by retinoschisis. Pigment epithelial changes and degenerations of the outer retina are relatively mild. These findings may be a clue to the embryonal retinoschisislike pathogenesis of Norrie disease in humans or it may indicate a different expression of the Norrie disease gene defect in mice compared to that in humans.

  11. Novel algorithms reveal streptococcal transcriptomes and clues about undefined genes.

    Science.gov (United States)

    Ryan, Patricia A; Kirk, Brian W; Euler, Chad W; Schuch, Raymond; Fischetti, Vincent A

    2007-07-01

    Bacteria-host interactions are dynamic processes, and understanding transcriptional responses that directly or indirectly regulate the expression of genes involved in initial infection stages would illuminate the molecular events that result in host colonization. We used oligonucleotide microarrays to monitor (in vitro) differential gene expression in group A streptococci during pharyngeal cell adherence, the first overt infection stage. We present neighbor clustering, a new computational method for further analyzing bacterial microarray data that combines two informative characteristics of bacterial genes that share common function or regulation: (1) similar gene expression profiles (i.e., co-expression); and (2) physical proximity of genes on the chromosome. This method identifies statistically significant clusters of co-expressed gene neighbors that potentially share common function or regulation by coupling statistically analyzed gene expression profiles with the chromosomal position of genes. We applied this method to our own data and to those of others, and we show that it identified a greater number of differentially expressed genes, facilitating the reconstruction of more multimeric proteins and complete metabolic pathways than would have been possible without its application. We assessed the biological significance of two identified genes by assaying deletion mutants for adherence in vitro and show that neighbor clustering indeed provides biologically relevant data. Neighbor clustering provides a more comprehensive view of the molecular responses of streptococci during pharyngeal cell adherence.

  12. The SPINK gene family and celiac disease susceptibility

    NARCIS (Netherlands)

    Wapenaar, M.C.; Monsuur, A.J.; Poell, J.; Slot, R. van 't; Meijer, J.W.R.; Meijer, G.A.; Mulder, C.J.; Mearin, M.L.; Wijmenga, C.

    2007-01-01

    The gene family of serine protease inhibitors of the Kazal type (SPINK) are functional and positional candidate genes for celiac disease (CD). Our aim was to assess the gut mucosal gene expression and genetic association of SPINK1, -2, -4, and -5 in the Dutch CD population. Gene expression was

  13. The SPINK gene family and celiac disease susceptibility

    NARCIS (Netherlands)

    Wapenaar, Martin C.; Monsuur, Alienke J.; Poell, Jos; Slot, Ruben Van 't; Meijer, Jos W. R.; Meijer, Gerrit A.; Mulder, Chris J.; Mearin, Maria Luisa; Wijmenga, Cisca

    The gene family of serine protease inhibitors of the Kazal type (SPINK) are functional and positional candidate genes for celiac disease (CD). Our aim was to assess the gut mucosal gene expression and genetic association of SPINK1, -2, -4, and -5 in the Dutch CD population. Gene expression was

  14. A bioinformatics prediction approach towards analyzing the glycosylation, co-expression and interaction patterns of epithelial membrane antigen (EMA/MUC1)

    International Nuclear Information System (INIS)

    Kalra, Rajkumar S.; Wadhwa, Renu

    2015-01-01

    Epithelial membrane antigen (EMA or MUC1) is a heavily glycosylated, type I transmembrane glycoprotein commonly expressed by epithelial cells of duct organs. It has been shown to be aberrantly glycosylated in several diseases including cancer. Protein sequence based annotation and analysis of glycosylation profile of glycoproteins by robust computational and comprehensive algorithms provides possible insights to the mechanism(s) of anomalous glycosylation. In present report, by using a number of bioinformatics applications we studied EMA/MUC1 and explored its trans-membrane structural domain sequence that is widely subjected to glycosylation. Exploration of different extracellular motifs led to prediction of N and O-linked glycosylation target sites. Based on the putative O-linked target sites, glycosylated moieties and pathways were envisaged. Furthermore, Protein network analysis demonstrated physical interaction of EMA with a number of proteins and confirmed its functional involvement in cell growth and proliferation pathways. Gene Ontology analysis suggested an involvement of EMA in a number of functions including signal transduction, protein binding, processing and transport along with glycosylation. Thus, present study explored potential of bioinformatics prediction approach in analyzing glycosylation, co-expression and interaction patterns of EMA/MUC1 glycoprotein

  15. A bioinformatics prediction approach towards analyzing the glycosylation, co-expression and interaction patterns of epithelial membrane antigen (EMA/MUC1)

    Energy Technology Data Exchange (ETDEWEB)

    Kalra, Rajkumar S., E-mail: renu-wadhwa@aist.go.jp; Wadhwa, Renu, E-mail: renu-wadhwa@aist.go.jp [Cell Proliferation Research Group and DBT-AIST International Laboratory for Advanced Biomedicine, National Institute of Advanced Industrial Science and Technology (AIST Central 4), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8562 (Japan)

    2015-02-27

    Epithelial membrane antigen (EMA or MUC1) is a heavily glycosylated, type I transmembrane glycoprotein commonly expressed by epithelial cells of duct organs. It has been shown to be aberrantly glycosylated in several diseases including cancer. Protein sequence based annotation and analysis of glycosylation profile of glycoproteins by robust computational and comprehensive algorithms provides possible insights to the mechanism(s) of anomalous glycosylation. In present report, by using a number of bioinformatics applications we studied EMA/MUC1 and explored its trans-membrane structural domain sequence that is widely subjected to glycosylation. Exploration of different extracellular motifs led to prediction of N and O-linked glycosylation target sites. Based on the putative O-linked target sites, glycosylated moieties and pathways were envisaged. Furthermore, Protein network analysis demonstrated physical interaction of EMA with a number of proteins and confirmed its functional involvement in cell growth and proliferation pathways. Gene Ontology analysis suggested an involvement of EMA in a number of functions including signal transduction, protein binding, processing and transport along with glycosylation. Thus, present study explored potential of bioinformatics prediction approach in analyzing glycosylation, co-expression and interaction patterns of EMA/MUC1 glycoprotein.

  16. Coexpression of β-D-galactosidase and L-arabinose isomerase in the production of D-tagatose: a functional sweetener.

    Science.gov (United States)

    Zhan, Yijing; Xu, Zheng; Li, Sha; Liu, Xiaoliu; Xu, Lu; Feng, Xiaohai; Xu, Hong

    2014-03-19

    The functional sweetener, d-tagatose, is commonly transformed from galactose by l-arabinose isomerase. To make use of a much cheaper starting material, lactose, hydrolization, and isomerization are required to take place collaboratively. Therefore, a single-step method involving β-d-galactosidase was explored for d-tagatose production. The two vital genes, β-d-galactosidase gene (lacZ) and l-arabinose isomerase mutant gene (araA') were extracted separately from Escherichia coli strains and incorporated into E. coli simultaneously. This gave us E. coli-ZY, a recombinant producing strain capable of coexpressing the two key enzymes. The resulted cells exhibited maximum d-tagatose producing activity at 34 °C and pH 6.5 and in the presence of borate, 10 mM Fe(2+), and 1 mM Mn(2+). Further monitoring showed that the recombinant cells could hydrolyze more than 95% lactose and convert 43% d-galactose into d-tagatose. This research has verified the feasibility of single-step d-tagatose fermentation, thereby laying down the foundation for industrial usage of lactose.

  17. The Epstein-Barr virus BFRF1 and BFLF2 proteins interact and coexpression alters their cellular localization

    International Nuclear Information System (INIS)

    Lake, Cathleen M.; Hutt-Fletcher, Lindsey M.

    2004-01-01

    The BFRF1 protein of Epstein-Barr virus (EBV) is a recently identified membrane protein that is the homolog of the alphaherpesvirus UL34 gene product. We report here that a yeast two-hybrid screen identified the BFLF2 gene product, a homolog of alphaherpesvirus UL31, as a protein that interacts with BFRF1. Expression of BFLF2 in mammalian cells revealed a protein of approximately 28 kDa that associated with BFRF1 in a noncovalently linked complex. When expressed alone, the BFRF1 protein was found in the cytoplasm and perinuclear region. BFLF2 was found diffusely in the nucleus in the absence of BFRF1, but coexpression of BFRF1 and BFLF2 resulted in colocalization of the two proteins at the nuclear rim. These data recapitulate the behavior of the alphaherpesvirus homologs of BFRF1 and BFLF2 and suggest that functional as well as structural and positional homology may be conserved

  18. Gene editing as a promising approach for respiratory diseases.

    Science.gov (United States)

    Bai, Yichun; Liu, Yang; Su, Zhenlei; Ma, Yana; Ren, Chonghua; Zhao, Runzhen; Ji, Hong-Long

    2018-03-01

    Respiratory diseases, which are leading causes of mortality and morbidity in the world, are dysfunctions of the nasopharynx, the trachea, the bronchus, the lung and the pleural cavity. Symptoms of chronic respiratory diseases, such as cough, sneezing and difficulty breathing, may seriously affect the productivity, sleep quality and physical and mental well-being of patients, and patients with acute respiratory diseases may have difficulty breathing, anoxia and even life-threatening respiratory failure. Respiratory diseases are generally heterogeneous, with multifaceted causes including smoking, ageing, air pollution, infection and gene mutations. Clinically, a single pulmonary disease can exhibit more than one phenotype or coexist with multiple organ disorders. To correct abnormal function or repair injured respiratory tissues, one of the most promising techniques is to correct mutated genes by gene editing, as some gene mutations have been clearly demonstrated to be associated with genetic or heterogeneous respiratory diseases. Zinc finger nucleases (ZFN), transcription activator-like effector nucleases (TALEN) and clustered regulatory interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9) systems are three innovative gene editing technologies developed recently. In this short review, we have summarised the structure and operating principles of the ZFNs, TALENs and CRISPR/Cas9 systems and their preclinical and clinical applications in respiratory diseases. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  19. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods.

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo

    2014-06-01

    In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both

  20. In vivo modification of tyrosine residues in recombinant mussel adhesive protein by tyrosinase co-expression in Escherichia coli

    Directory of Open Access Journals (Sweden)

    Choi Yoo

    2012-10-01

    Full Text Available Abstract Background In nature, mussel adhesive proteins (MAPs show remarkable adhesive properties, biocompatibility, and biodegradability. Thus, they have been considered promising adhesive biomaterials for various biomedical and industrial applications. However, limited production of natural MAPs has hampered their practical applications. Recombinant production in bacterial cells could be one alternative to obtain useable amounts of MAPs, although additional post-translational modification of tyrosine residues into 3,4-dihydroxyphenyl-alanine (Dopa and Dopaquinone is required. The superior properties of MAPs are mainly attributed to the introduction of quinone-derived intermolecular cross-links. To solve this problem, we utilized a co-expression strategy of recombinant MAP and tyrosinase in Escherichia coli to successfully modify tyrosine residues in vivo. Results A recombinant hybrid MAP, fp-151, was used as a target for in vivo modification, and a dual vector system of pET and pACYC-Duet provided co-expression of fp-151 and tyrosinase. As a result, fp-151 was over-expressed and mainly obtained from the soluble fraction in the co-expression system. Without tyrosinase co-expression, fp-151 was over-expressed in an insoluble form in inclusion bodies. The modification of tyrosine residues in the soluble-expressed fp-151 was clearly observed from nitroblue tetrazolium staining and liquid-chromatography-mass/mass spectrometry analyses. The purified, in vivo modified, fp-151 from the co-expression system showed approximately 4-fold higher bulk-scale adhesive strength compared to in vitro tyrosinase-treated fp-151. Conclusion Here, we reported a co-expression system to obtain in vivo modified MAP; additional in vitro tyrosinase modification was not needed to obtain adhesive properties and the in vivo modified MAP showed superior adhesive strength compared to in vitro modified protein. It is expected that this co-expression strategy will accelerate

  1. A maize resistance gene functions against bacterial streak disease in rice.

    Science.gov (United States)

    Zhao, Bingyu; Lin, Xinghua; Poland, Jesse; Trick, Harold; Leach, Jan; Hulbert, Scot

    2005-10-25

    Although cereal crops all belong to the grass family (Poacea), most of their diseases are specific to a particular species. Thus, a given cereal species is typically resistant to diseases of other grasses, and this nonhost resistance is generally stable. To determine the feasibility of transferring nonhost resistance genes (R genes) between distantly related grasses to control specific diseases, we identified a maize R gene that recognizes a rice pathogen, Xanthomonas oryzae pv. oryzicola, which causes bacterial streak disease. Bacterial streak is an important disease of rice in Asia, and no simply inherited sources of resistance have been identified in rice. Although X. o. pv. oryzicola does not cause disease on maize, we identified a maize gene, Rxo1, that conditions a resistance reaction to a diverse collection of pathogen strains. Surprisingly, Rxo1 also controls resistance to the unrelated pathogen Burkholderia andropogonis, which causes bacterial stripe of sorghum and maize. The same gene thus controls resistance reactions to both pathogens and nonpathogens of maize. Rxo1 has a nucleotide-binding site-leucine-rich repeat structure, similar to many previously identified R genes. Most importantly, Rxo1 functions after transfer as a transgene to rice, demonstrating the feasibility of nonhost R gene transfer between cereals and providing a valuable tool for controlling bacterial streak disease.

  2. Network-based association of hypoxia-responsive genes with cardiovascular diseases

    International Nuclear Information System (INIS)

    Wang, Rui-Sheng; Oldham, William M; Loscalzo, Joseph

    2014-01-01

    Molecular oxygen is indispensable for cellular viability and function. Hypoxia is a stress condition in which oxygen demand exceeds supply. Low cellular oxygen content induces a number of molecular changes to activate regulatory pathways responsible for increasing the oxygen supply and optimizing cellular metabolism under limited oxygen conditions. Hypoxia plays critical roles in the pathobiology of many diseases, such as cancer, heart failure, myocardial ischemia, stroke, and chronic lung diseases. Although the complicated associations between hypoxia and cardiovascular (and cerebrovascular) diseases (CVD) have been recognized for some time, there are few studies that investigate their biological link from a systems biology perspective. In this study, we integrate hypoxia genes, CVD genes, and the human protein interactome in order to explore the relationship between hypoxia and cardiovascular diseases at a systems level. We show that hypoxia genes are much closer to CVD genes in the human protein interactome than that expected by chance. We also find that hypoxia genes play significant bridging roles in connecting different cardiovascular diseases. We construct a hypoxia-CVD bipartite network and find several interesting hypoxia-CVD modules with significant gene ontology similarity. Finally, we show that hypoxia genes tend to have more CVD interactors in the human interactome than in random networks of matching topology. Based on these observations, we can predict novel genes that may be associated with CVD. This network-based association study gives us a broad view of the relationships between hypoxia and cardiovascular diseases and provides new insights into the role of hypoxia in cardiovascular biology. (paper)

  3. Understanding gene expression in coronary artery disease through ...

    Indian Academy of Sciences (India)

    Understanding gene expression in coronary artery disease through global profiling, network analysis and independent validation of key candidate genes. Prathima ... Table 2. Differentially expressed genes in CAD compared to age and gender matched controls. .... Regulation of nuclear pre-mRNA domain containing 1A.

  4. Norrie disease and MAO genes: nearest neighbors.

    Science.gov (United States)

    Chen, Z Y; Denney, R M; Breakefield, X O

    1995-01-01

    The Norrie disease and MAO genes are tandemly arranged in the p11.4-p11.3 region of the human X chromosome in the order tel-MAOA-MAOB-NDP-cent. This relationship is conserved in the mouse in the order tel-MAOB-MAOA-NDP-cent. The MAO genes appear to have arisen by tandem duplication of an ancestral MAO gene, but their positional relationship to NDP appears to be random. Distinctive X-linked syndromes have been described for mutations in the MAOA and NDP genes, and in addition, individuals have been identified with contiguous gene syndromes due to chromosomal deletions which encompass two or three of these genes. Loss of function of the NDP gene causes a syndrome of congenital blindness and progressive hearing loss, sometimes accompanied by signs of CNS dysfunction, including variable mental retardation and psychiatric symptoms. Other mutations in the NDP gene have been found to underlie another X-linked eye disease, exudative vitreo-retinopathy. An MAOA deficiency state has been described in one family to date, with features of altered amine and amine metabolite levels, low normal intelligence, apparent difficulty in impulse control and cardiovascular difficulty in affected males. A contiguous gene syndrome in which all three genes are lacking, as well as other as yet unidentified flanking genes, results in severe mental retardation, small stature, seizures and congenital blindness, as well as altered amine and amine metabolites. Issues that remain to be resolved are the function of the NDP gene product, the frequency and phenotype of the MAOA deficiency state, and the possible occurrence and phenotype of an MAOB deficiency state.

  5. Interactions between co-expressed Arabidopsis sucrose transporters in the split-ubiquitin system

    Directory of Open Access Journals (Sweden)

    Lalonde Sylvie

    2003-03-01

    Full Text Available Abstract Background The Arabidopsis genome contains nine sucrose transporter paralogs falling into three clades: SUT1-like, SUT2 and SUT4. The carriers differ in their kinetic properties. Many transport proteins are known to exist as oligomers. The yeast-based split ubiquitin system can be used to analyze the ability of membrane proteins to interact. Results Promoter-GUS fusions were used to analyze the cellular expression of the three transporter genes in transgenic Arabidopsis plants. All three fusion genes are co-expressed in companion cells. Protein-protein interactions between Arabidopsis sucrose transporters were tested using the split ubiquitin system. Three paralogous sucrose transporters are capable of interacting as either homo- or heteromers. The interactions are specific, since a potassium channel and a glucose transporter did not show interaction with sucrose transporters. Also the biosynthetic and metabolizing enzymes, sucrose phosphate phosphatase and sucrose synthase, which were found to be at least in part bound to the plasma membrane, did not specifically interact with sucrose transporters. Conclusions The split-ubiquitin system provides a powerful tool to detect potential interactions between plant membrane proteins by heterologous expression in yeast, and can be used to screen for interactions with membrane proteins as baits. Like other membrane proteins, the Arabidopsis sucrose transporters are able to form oligomers. The biochemical approaches are required to confirm the in planta interaction.

  6. Human gene therapy and imaging in neurological diseases

    International Nuclear Information System (INIS)

    Jacobs, Andreas H.; Winkler, Alexandra; Castro, Maria G.; Lowenstein, Pedro

    2005-01-01

    Molecular imaging aims to assess non-invasively disease-specific biological and molecular processes in animal models and humans in vivo. Apart from precise anatomical localisation and quantification, the most intriguing advantage of such imaging is the opportunity it provides to investigate the time course (dynamics) of disease-specific molecular events in the intact organism. Further, molecular imaging can be used to address basic scientific questions, e.g. transcriptional regulation, signal transduction or protein/protein interaction, and will be essential in developing treatment strategies based on gene therapy. Most importantly, molecular imaging is a key technology in translational research, helping to develop experimental protocols which may later be applied to human patients. Over the past 20 years, imaging based on positron emission tomography (PET) and magnetic resonance imaging (MRI) has been employed for the assessment and ''phenotyping'' of various neurological diseases, including cerebral ischaemia, neurodegeneration and brain gliomas. While in the past neuro-anatomical studies had to be performed post mortem, molecular imaging has ushered in the era of in vivo functional neuro-anatomy by allowing neuroscience to image structure, function, metabolism and molecular processes of the central nervous system in vivo in both health and disease. Recently, PET and MRI have been successfully utilised together in the non-invasive assessment of gene transfer and gene therapy in humans. To assess the efficiency of gene transfer, the same markers are being used in animals and humans, and have been applied for phenotyping human disease. Here, we review the imaging hallmarks of focal and disseminated neurological diseases, such as cerebral ischaemia, neurodegeneration and glioblastoma multiforme, as well as the attempts to translate gene therapy's experimental knowledge into clinical applications and the way in which this process is being promoted through the use of

  7. Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice

    Directory of Open Access Journals (Sweden)

    Shuchi eSmita

    2015-12-01

    Full Text Available MYB transcription factor (TF is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by top down and guide gene approaches. More than 50% of OsMYBs were strongly correlated under fifty experimental conditions with 51 hub genes via top down approach. Further, clusters were identified using Markov Clustering (MCL. To maximize the clustering performance, parameter evaluation of the MCL inflation score (I was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by guide gene approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought

  8. Co-expression of the C-terminal domain of Yersinia enterocolitica ...

    Indian Academy of Sciences (India)

    Home; Journals; Journal of Biosciences; Volume 40; Issue 1. Co-expression of the C-terminal domain of Yersinia enterocolitica invasin enhances the efficacy of classical swine-fever-vectored vaccine based on human adenovirus. Helin Li Pengbo Ning Zhi Lin Wulong Liang Kai Kang Lei He Yanming Zhang. Articles Volume ...

  9. Keratinocyte Growth Factor Gene Electroporation into Skeletal Muscle as a Novel Gene Therapeutic Approach for Elastase-Induced Pulmonary Emphysema in Mice

    International Nuclear Information System (INIS)

    Tobinaga, Shuichi; Matsumoto, Keitaro; Nagayasu, Takeshi; Furukawa, Katsuro; Abo, Takafumi; Yamasaki, Naoya; Tsuchiya, Tomoshi; Miyazaki, Takuro; Koji, Takehiko

    2015-01-01

    Pulmonary emphysema is a progressive disease with airspace destruction and an effective therapy is needed. Keratinocyte growth factor (KGF) promotes pulmonary epithelial proliferation and has the potential to induce lung regeneration. The aim of this study was to determine the possibility of using KGF gene therapy for treatment of a mouse emphysema model induced by porcine pancreatic elastase (PPE). Eight-week-old BALB/c male mice treated with intra-tracheal PPE administration were transfected with 80 μg of a recombinant human KGF (rhKGF)-expressing FLAG-CMV14 plasmid (pKGF-FLAG gene), or with the pFLAG gene expressing plasmid as a control, into the quadriceps muscle by electroporation. In the lung, the expression of proliferating cell nuclear antigen (PCNA) was augmented, and surfactant protein A (SP-A) and KGF receptor (KGFR) were co-expressed in PCNA-positive cells. Moreover, endogenous KGF and KGFR gene expression increased significantly by pKGF-FLAG gene transfection. Arterial blood gas analysis revealed that the PaO 2 level was not significantly reduced on day 14 after PPE instillation with pKGF-FLAG gene transfection compared to that of normal mice. These results indicated that KGF gene therapy with electroporation stimulated lung epithelial proliferation and protected depression of pulmonary function in a mouse emphysema model, suggesting a possible method of treating pulmonary emphysema

  10. Identification and network-enabled characterization of auxin response factor genes in Medicago truncatula

    Directory of Open Access Journals (Sweden)

    David J. Burks

    2016-12-01

    Full Text Available The Auxin Response Factor (ARF family of transcription factors is an important regulator of environmental response and symbiotic nodulation in the legume Medicago truncatula. While previous studies have identified members of this family, a recent spurt in gene expression data coupled with genome update and reannotation calls for a reassessment of the prevalence of ARF genes and their interaction networks in M. truncatula. We performed a comprehensive analysis of the M. truncatula genome and transcriptome that entailed search for novel ARF genes and the co-expression networks. Our investigation revealed 8 novel M. truncatula ARF (MtARF genes, of the total 22 identified, and uncovered novel gene co-expression networks as well. Furthermore, the topological clustering and single enrichment analysis of several network models revealed the roles of individual members of the MtARF family in nitrogen regulation, nodule initiation, and post-embryonic development through a specialized protein packaging and secretory pathway. In summary, this study not just shines new light on an important gene family, but also provides a guideline for identification of new members of gene families and their functional characterization through network analyses.

  11. Retinitis pigmentosa: genes and disease mechanisms.

    Science.gov (United States)

    Ferrari, Stefano; Di Iorio, Enzo; Barbaro, Vanessa; Ponzin, Diego; Sorrentino, Francesco S; Parmeggiani, Francesco

    2011-06-01

    Retinitis pigmentosa (RP) is a group of inherited disorders affecting 1 in 3000-7000 people and characterized by abnormalities of the photoreceptors (rods and cones) or the retinal pigment epithelium of the retina which lead to progressive visual loss. RP can be inherited in an autosomal dominant, autosomal recessive or X-linked manner. While usually limited to the eye, RP may also occur as part of a syndrome as in the Usher syndrome and Bardet-Biedl syndrome. Over 40 genes have been associated with RP so far, with the majority of them expressed in either the photoreceptors or the retinal pigment epithelium. The tremendous heterogeneity of the disease makes the genetics of RP complicated, thus rendering genotype-phenotype correlations not fully applicable yet. In addition to the multiplicity of mutations, in fact, different mutations in the same gene may cause different diseases. We will here review which genes are involved in the genesis of RP and how mutations can lead to retinal degeneration. In the future, a more thorough analysis of genetic and clinical data together with a better understanding of the genotype-phenotype correlation might allow to reveal important information with respect to the likelihood of disease development and choices of therapy.

  12. Genome-Scale Co-Expression Network Comparison across Escherichia coli and Salmonella enterica Serovar Typhimurium Reveals Significant Conservation at the Regulon Level of Local Regulators Despite Their Dissimilar Lifestyles

    Science.gov (United States)

    Zarrineh, Peyman; Sánchez-Rodríguez, Aminael; Hosseinkhan, Nazanin; Narimani, Zahra; Marchal, Kathleen; Masoudi-Nejad, Ali

    2014-01-01

    Availability of genome-wide gene expression datasets provides the opportunity to study gene expression across different organisms under a plethora of experimental conditions. In our previous work, we developed an algorithm called COMODO (COnserved MODules across Organisms) that identifies conserved expression modules between two species. In the present study, we expanded COMODO to detect the co-expression conservation across three organisms by adapting the statistics behind it. We applied COMODO to study expression conservation/divergence between Escherichia coli, Salmonella enterica, and Bacillus subtilis. We observed that some parts of the regulatory interaction networks were conserved between E. coli and S. enterica especially in the regulon of local regulators. However, such conservation was not observed between the regulatory interaction networks of B. subtilis and the two other species. We found co-expression conservation on a number of genes involved in quorum sensing, but almost no conservation for genes involved in pathogenicity across E. coli and S. enterica which could partially explain their different lifestyles. We concluded that despite their different lifestyles, no significant rewiring have occurred at the level of local regulons involved for instance, and notable conservation can be detected in signaling pathways and stress sensing in the phylogenetically close species S. enterica and E. coli. Moreover, conservation of local regulons seems to depend on the evolutionary time of divergence across species disappearing at larger distances as shown by the comparison with B. subtilis. Global regulons follow a different trend and show major rewiring even at the limited evolutionary distance that separates E. coli and S. enterica. PMID:25101984

  13. FocusHeuristics - expression-data-driven network optimization and disease gene prediction.

    Science.gov (United States)

    Ernst, Mathias; Du, Yang; Warsow, Gregor; Hamed, Mohamed; Endlich, Nicole; Endlich, Karlhans; Murua Escobar, Hugo; Sklarz, Lisa-Madeleine; Sender, Sina; Junghanß, Christian; Möller, Steffen; Fuellen, Georg; Struckmann, Stephan

    2017-02-16

    To identify genes contributing to disease phenotypes remains a challenge for bioinformatics. Static knowledge on biological networks is often combined with the dynamics observed in gene expression levels over disease development, to find markers for diagnostics and therapy, and also putative disease-modulatory drug targets and drugs. The basis of current methods ranges from a focus on expression-levels (Limma) to concentrating on network characteristics (PageRank, HITS/Authority Score), and both (DeMAND, Local Radiality). We present an integrative approach (the FocusHeuristics) that is thoroughly evaluated based on public expression data and molecular disease characteristics provided by DisGeNet. The FocusHeuristics combines three scores, i.e. the log fold change and another two, based on the sum and difference of log fold changes of genes/proteins linked in a network. A gene is kept when one of the scores to which it contributes is above a threshold. Our FocusHeuristics is both, a predictor for gene-disease-association and a bioinformatics method to reduce biological networks to their disease-relevant parts, by highlighting the dynamics observed in expression data. The FocusHeuristics is slightly, but significantly better than other methods by its more successful identification of disease-associated genes measured by AUC, and it delivers mechanistic explanations for its choice of genes.

  14. Usage of U7 snRNA in gene therapy of hemoglobin C disorder ...

    African Journals Online (AJOL)

    Here, a bioinformatic analysis was performed to study the effect of co-expression between human Hb C b-globin chain gene and U7.623. The gene ontological results show that full recovery of hemoglobin function and biological process can be derived. This confirms that U7 snRNA can be a good tool for gene therapy in Hb ...

  15. Genetics of sputum gene expression in chronic obstructive pulmonary disease.

    Directory of Open Access Journals (Sweden)

    Weiliang Qiu

    Full Text Available Previous expression quantitative trait loci (eQTL studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs. The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5, the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus.

  16. Genetics of Sputum Gene Expression in Chronic Obstructive Pulmonary Disease

    Science.gov (United States)

    Qiu, Weiliang; Cho, Michael H.; Riley, John H.; Anderson, Wayne H.; Singh, Dave; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A.; Lomas, David A.; Crapo, James D.; Beaty, Terri H.; Celli, Bartolome R.; Rennard, Stephen; Tal-Singer, Ruth; Fox, Steven M.; Silverman, Edwin K.; Hersh, Craig P.

    2011-01-01

    Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus. PMID:21949713

  17. Increased burden of deleterious variants in essential genes in autism spectrum disorder.

    Science.gov (United States)

    Ji, Xiao; Kember, Rachel L; Brown, Christopher D; Bućan, Maja

    2016-12-27

    Autism spectrum disorder (ASD) is a heterogeneous, highly heritable neurodevelopmental syndrome characterized by impaired social interaction, communication, and repetitive behavior. It is estimated that hundreds of genes contribute to ASD. We asked if genes with a strong effect on survival and fitness contribute to ASD risk. Human orthologs of genes with an essential role in pre- and postnatal development in the mouse [essential genes (EGs)] are enriched for disease genes and under strong purifying selection relative to human orthologs of mouse genes with a known nonlethal phenotype [nonessential genes (NEGs)]. This intolerance to deleterious mutations, commonly observed haploinsufficiency, and the importance of EGs in development suggest a possible cumulative effect of deleterious variants in EGs on complex neurodevelopmental disorders. With a comprehensive catalog of 3,915 mammalian EGs, we provide compelling evidence for a stronger contribution of EGs to ASD risk compared with NEGs. By examining the exonic de novo and inherited variants from 1,781 ASD quartet families, we show a significantly higher burden of damaging mutations in EGs in ASD probands compared with their non-ASD siblings. The analysis of EGs in the developing brain identified clusters of coexpressed EGs implicated in ASD. Finally, we suggest a high-priority list of 29 EGs with potential ASD risk as targets for future functional and behavioral studies. Overall, we show that large-scale studies of gene function in model organisms provide a powerful approach for prioritization of genes and pathogenic variants identified by sequencing studies of human disease.

  18. An efficient co-expression and purification system for the complex of Stx4 and C-terminal domain of Synip

    International Nuclear Information System (INIS)

    Tian Wei; Ma Cong; Liu Yingfang; Xu Tao

    2008-01-01

    Synip and Stx4 complex plays a key role in GLUT4 vesicle trafficking and fusion with plasma membrane. The interaction of Synip with Stx4 prevents interaction of VAMP2 located in GLUT4 vesicle with Stx4 in basal state. Insulin induces the dissociation of the Synip and Stx4 complex, and then triggers VAMP2 to interact with Stx4 to form the SNARE complex, thus promoting the vesicle fusion. In this report, we adopt a novel system for co-expression of the Synip and Stx4 by using two common vectors pGEX6p-1 and pET28a(+) to investigate their expression, purification, and interaction. Through this co-expression system, we successfully co-expressed the Synip and Stx4 complex with high yield, and co-purified at an approximate 1:1 molar ratio with high purity (95%). We also demonstrate that the 1-28 residues of Stx4 are dispensable for interaction with Synip using this co-expression system

  19. High Incidence of ACE/PAI-1 in Association to a Spectrum of Other Polymorphic Cardiovascular Genes Involving PBMCs Proinflammatory Cytokines in Hypertensive Hypercholesterolemic Patients: Reversibility with a Combination of ACE Inhibitor and Statin.

    Science.gov (United States)

    AlBacha, Jeanne d'Arc; Khoury, Mira; Mouawad, Charbel; Haddad, Katia; Hamoui, Samar; Azar, Albert; Fajloun, Ziad; Makdissy, Nehman

    2015-01-01

    Cardiovascular diseases (CVDs) are significantly high in the Lebanese population with the two most predominant forms being atherosclerosis and venous thrombosis. The purpose of our study was to assess the association of a spectrum of CVD related genes and combined state of hypertension hypercholesterolemia (HH) in unrelated Lebanese. Twelve polymorphisms were studied by multiplex PCR and reverse hybridization of DNA from 171 healthy individuals and 144 HH subjects. Two genes were significantly associated with HH: ACE (OR: 9.20, PACE activity and PAI-I increased significantly with Del/Del and 4G/5G genotypes. The co-expression of Del/4G(+/+) was detected in 113 out of 171 (66.0%) controls and 125 out of 144 (86.8%) HH subjects. Del/4G(-/-) was detected in only 6 (3.5%) controls and undetected in the HH group. Three venous thrombosis related genes [FV(Leiden), MTHFR(A1298C) and FXIII(V34L)] were significantly related to the prominence of the co-expression of Del/4G(+/+). A range of 2 to 8 combined polymorphisms co-expressed per subject where 5 mutations were the most detected. In Del/4G(+/+) subjects, peripheral blood mononuclear cells (PBMCs) produced significant elevated levels of IFN-γ and TNF-α contrary to IL-10, and no variations occurred for IL-4. ACE inhibitor (ramipril) in combination with statin (atorvastatin) and not alone reversed significantly the situation. This first report from Lebanon sheds light on an additional genetic predisposition of a complex spectrum of genes involved in CVD and suggests that the most requested gene FVL by physicians may not be sufficient to diagnose eventual future problems that can occur in the cardiovascular system. Subjects expressing the double mutations (Del/4G) are at high risk for the onset of CVDs.

  20. Blood Transcriptional Signatures for Disease Progression in a Rat Model of Osteoarthritis

    Directory of Open Access Journals (Sweden)

    Michał Korostyński

    2017-01-01

    Full Text Available Biomarkers of osteoarthritis (OA that can accurately diagnose the disease at the earliest stage would significantly support efforts to develop treatments for prevention and early intervention. We have sought to determine the time course of alterations in peripheral blood gene expression profile associated with the development of OA. Blood samples were collected from a tail vein of individual rats with monosodium iodoacetate- (MIA- induced OA (2, 14, 21, and 28 days after the treatment. We used whole-genome microarrays to reveal OA-related transcriptional alterations of 72 transcripts. Three main groups of coexpressed genes revealed diverse time-dependent profiles of up- and downregulation. Functional links that connect expression of the gradually downregulated genes to the G13 signaling pathway were indicated. The mRNA abundance levels of the identified transcripts were further analyzed in publicly available gene expression dataset obtained from a GARP study cohort of OA patients. We revealed three-gene signature differentially expressed in both rat and human blood (TNK2, KCTD2, and WDR37. The alterations in expression of the selected transcripts in peripheral blood samples of the patients indicate heterogeneity of the OA profiles potentially related to disease progress and severity of clinical symptoms. Our study identifies several potential stage-specific biomarkers of OA progression.

  1. Excessive burden of lysosomal storage disorder gene variants in Parkinson's disease

    NARCIS (Netherlands)

    Robak, L.A.; Jansen, I.E.; Rooij, J van; Uitterlinden, A.G.; Kraaij, R.; Jankovic, J.; Heutink, P.; Shulman, J.M.; Bloem, B.; Post, B.; Scheffer, H.; Warrenburg, B.P.C. van de; et al.,

    2017-01-01

    Mutations in the glucocerebrosidase gene (GBA), which cause Gaucher disease, are also potent risk factors for Parkinson's disease. We examined whether a genetic burden of variants in other lysosomal storage disorder genes is more broadly associated with Parkinson's disease susceptibility. The

  2. Gene prioritization for livestock diseases by data integration

    DEFF Research Database (Denmark)

    Jiang, Li; Sørensen, Peter; Thomsen, Bo Stjerne

    2012-01-01

    in bovine mastitis. Gene-associated phenome profile and transcriptome profile in response to Escherichia coli infection in the mammary gland were integrated to make a global inference of bovine genes involved in mastitis. The top ranked genes were highly enriched for pathways and biological processes...... underlying inflammation and immune responses, which supports the validity of our approach for identifying genes that are relevant to animal health and disease. These gene-associated phenotypes were used for a local prioritization of candidate genes located in a QTL affecting the susceptibility to mastitis...

  3. Pleiotropic Effects of Variants in Dementia Genes in Parkinson Disease

    Directory of Open Access Journals (Sweden)

    Laura Ibanez

    2018-04-01

    Full Text Available Background: The prevalence of dementia in Parkinson disease (PD increases dramatically with advancing age, approaching 80% in patients who survive 20 years with the disease. Increasing evidence suggests clinical, pathological and genetic overlap between Alzheimer disease, dementia with Lewy bodies and frontotemporal dementia with PD. However, the contribution of the dementia-causing genes to PD risk, cognitive impairment and dementia in PD is not fully established.Objective: To assess the contribution of coding variants in Mendelian dementia-causing genes on the risk of developing PD and the effect on cognitive performance of PD patients.Methods: We analyzed the coding regions of the amyloid-beta precursor protein (APP, Presenilin 1 and 2 (PSEN1, PSEN2, and Granulin (GRN genes from 1,374 PD cases and 973 controls using pooled-DNA targeted sequence, human exome-chip and whole-exome sequencing (WES data by single variant and gene base (SKAT-O and burden tests analyses. Global cognitive function was assessed using the Mini-Mental State Examination (MMSE or the Montreal Cognitive Assessment (MoCA. The effect of coding variants in dementia-causing genes on cognitive performance was tested by multiple regression analysis adjusting for gender, disease duration, age at dementia assessment, study site and APOE carrier status.Results: Known AD pathogenic mutations in the PSEN1 (p.A79V and PSEN2 (p.V148I genes were found in 0.3% of all PD patients. There was a significant burden of rare, likely damaging variants in the GRN and PSEN1 genes in PD patients when compared with frequencies in the European population from the ExAC database. Multiple regression analysis revealed that PD patients carrying rare variants in the APP, PSEN1, PSEN2, and GRN genes exhibit lower cognitive tests scores than non-carrier PD patients (p = 2.0 × 10−4, independent of age at PD diagnosis, age at evaluation, APOE status or recruitment site.Conclusions: Pathogenic mutations in

  4. Gene-Environment Interactions in the Development of Complex Disease Phenotypes

    Directory of Open Access Journals (Sweden)

    Kenneth Olden

    2008-03-01

    Full Text Available The lack of knowledge about the earliest events in disease development is due to the multi-factorial nature of disease risk. This information gap is the consequence of the lack of appreciation for the fact that most diseases arise from the complex interactions between genes and the environment as a function of the age or stage of development of the individual. Whether an environmental exposure causes illness or not is dependent on the efficiency of the so-called “environmental response machinery” (i.e., the complex of metabolic pathways that can modulate response to environmental perturbations that one has inherited. Thus, elucidating the causes of most chronic diseases will require an understanding of both the genetic and environmental contribution to their etiology. Unfortunately, the exploration of the relationship between genes and the environment has been hampered in the past by the limited knowledge of the human genome, and by the inclination of scientists to study disease development using experimental models that consider exposure to a single environmental agent. Rarely in the past were interactions between multiple genes or between genes and environmental agents considered in studies of human disease etiology. The most critical issue is how to relate exposure-disease association studies to pathways and mechanisms. To understand how genes and environmental factors interact to perturb biological pathways to cause injury or disease, scientists will need tools with the capacity to monitor the global expression of thousands of genes, proteins and metabolites simultaneously. The generation of such data in multiple species can be used to identify conserved and functionally significant genes and pathways involved in geneenvironment interactions. Ultimately, it is this knowledge that will be used to guide agencies such as the U.S. Department of Health and Human Services in decisions regarding biomedical research funding

  5. NDP gene mutations in 14 French families with Norrie disease.

    Science.gov (United States)

    Royer, Ghislaine; Hanein, Sylvain; Raclin, Valérie; Gigarel, Nadine; Rozet, Jean-Michel; Munnich, Arnold; Steffann, Julie; Dufier, Jean-Louis; Kaplan, Josseline; Bonnefont, Jean-Paul

    2003-12-01

    Norrie disease is a rare X-inked recessive condition characterized by congenital blindness and occasionally deafness and mental retardation in males. This disease has been ascribed to mutations in the NDP gene on chromosome Xp11.1. Previous investigations of the NDP gene have identified largely sixty disease-causing sequence variants. Here, we report on ten different NDP gene allelic variants in fourteen of a series of 21 families fulfilling inclusion criteria. Two alterations were intragenic deletions and eight were nucleotide substitutions or splicing variants, six of them being hitherto unreported, namely c.112C>T (p.Arg38Cys), c.129C>G (p.His43Gln), c.133G>A (p.Val45Met), c.268C>T (p.Arg90Cys), c.382T>C (p.Cys128Arg), c.23479-1G>C (unknown). No NDP gene sequence variant was found in seven of the 21 families. This observation raises the issue of misdiagnosis, phenocopies, or existence of other X-linked or autosomal genes, the mutations of which would mimic the Norrie disease phenotype. Copyright 2003 Wiley-Liss, Inc.

  6. Efficient androst-1,4-diene-3,17-dione production by co-expressing 3-ketosteroid-Δ1 -dehydrogenase and catalase in Bacillus subtilis.

    Science.gov (United States)

    Shao, M; Sha, Z; Zhang, X; Rao, Z; Xu, M; Yang, T; Xu, Z; Yang, S

    2017-01-01

    3-ketosteroid-Δ 1 -dehydrogenase (KSDD), a flavin adenine dinucleotide (FAD)-dependent enzyme involved in sterol metabolism, specifically catalyses the conversion of androst-4-ene-3,17-dione (AD) to androst-1,4-diene-3,17-dione (ADD). However, the low KSDD activity and the toxic effects of hydrogen peroxide (H 2 O 2 ) generated during the biotransformation of AD to ADD with FAD regeneration hinder its application on AD conversion. The aim of this work was to improve KSDD activity and eliminate the toxic effects of the generated H 2 O 2 to enhance ADD production. The ksdd gene obtained from Mycobacterium neoaurum JC-12 was codon-optimized to increase its expression level in Bacillus subtilis, and the KSDD activity reached 12·3 U mg -1 , which was sevenfold of that of codon-unoptimized gene. To improve AD conversion, catalase was co-expressed with KSDD in B. subtilis 168/pMA5-ksdd opt -katA to eliminate the toxic effects of H 2 O 2 generated during AD conversion. Finally, under optimized bioconversion conditions, fed-batch strategy was carried out and the ADD yield improved to 8·76 g l -1 . This work demonstrates the potential to improve enzyme activity by codon-optimization and eliminate the toxic effects of H 2 O 2 by co-expressing catalase. This study showed the highest ADD productivity ever reported and provides a promising strain for efficient ADD production in the pharmaceutical industry. © 2016 The Society for Applied Microbiology.

  7. LGscore: A method to identify disease-related genes using biological literature and Google data.

    Science.gov (United States)

    Kim, Jeongwoo; Kim, Hyunjin; Yoon, Youngmi; Park, Sanghyun

    2015-04-01

    Since the genome project in 1990s, a number of studies associated with genes have been conducted and researchers have confirmed that genes are involved in disease. For this reason, the identification of the relationships between diseases and genes is important in biology. We propose a method called LGscore, which identifies disease-related genes using Google data and literature data. To implement this method, first, we construct a disease-related gene network using text-mining results. We then extract gene-gene interactions based on co-occurrences in abstract data obtained from PubMed, and calculate the weights of edges in the gene network by means of Z-scoring. The weights contain two values: the frequency and the Google search results. The frequency value is extracted from literature data, and the Google search result is obtained using Google. We assign a score to each gene through a network analysis. We assume that genes with a large number of links and numerous Google search results and frequency values are more likely to be involved in disease. For validation, we investigated the top 20 inferred genes for five different diseases using answer sets. The answer sets comprised six databases that contain information on disease-gene relationships. We identified a significant number of disease-related genes as well as candidate genes for Alzheimer's disease, diabetes, colon cancer, lung cancer, and prostate cancer. Our method was up to 40% more accurate than existing methods. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. A maize resistance gene functions against bacterial streak disease in rice

    OpenAIRE

    Zhao, Bingyu; Lin, Xinghua; Poland, Jesse; Trick, Harold; Leach, Jan; Hulbert, Scot

    2005-01-01

    Although cereal crops all belong to the grass family (Poacea), most of their diseases are specific to a particular species. Thus, a given cereal species is typically resistant to diseases of other grasses, and this nonhost resistance is generally stable. To determine the feasibility of transferring nonhost resistance genes (R genes) between distantly related grasses to control specific diseases, we identified a maize R gene that recognizes a rice pathogen, Xanthomonas oryzae pv. oryzicola, wh...

  9. ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

    Science.gov (United States)

    Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2017-01-04

    The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Association between Polymorphisms in Antioxidant Genes and Inflammatory Bowel Disease.

    Directory of Open Access Journals (Sweden)

    Cristiana Costa Pereira

    Full Text Available Inflammation is the driving force in inflammatory bowel disease (IBD and its link to oxidative stress and carcinogenesis has long been accepted. The antioxidant system of the intestinal mucosa in IBD is compromised resulting in increased oxidative injury. This defective antioxidant system may be the result of genetic variants in antioxidant genes, which can represent susceptibility factors for IBD, namely Crohn's disease (CD and ulcerative colitis (UC. Single nucleotide polymorphisms (SNPs in the antioxidant genes SOD2 (rs4880 and GPX1 (rs1050450 were genotyped in a Portuguese population comprising 436 Crohn's disease and 367 ulcerative colitis patients, and 434 healthy controls. We found that the AA genotype in GPX1 is associated with ulcerative colitis (OR = 1.93, adjusted P-value = 0.037. Moreover, we found nominal significant associations between SOD2 and Crohn's disease susceptibility and disease subphenotypes but these did not withstand the correction for multiple testing. These findings indicate a possible link between disease phenotypes and antioxidant genes. These results suggest a potential role for antioxidant genes in IBD pathogenesis and should be considered in future association studies.

  11. Gene expression profiling in autoimmune diseases: chronic inflammation or disease specific patterns?

    DEFF Research Database (Denmark)

    Bovin, Lone Frier; Brynskov, Jørn; Hegedüs, Laszlo

    2007-01-01

    ) patients and healthy individuals were specific for the arthritic process or likewise altered in other chronic inflammatory diseases such as chronic autoimmune thyroiditis (Hashimoto's thyroiditis, HT) and inflammatory bowel disease (IBD). Using qPCR for 18 RA-discriminative genes, there were no significant...

  12. New Genes and New Insights from Old Genes: Update on Alzheimer Disease

    Science.gov (United States)

    Ringman, John M.; Coppola, Giovanni

    2013-01-01

    Purpose of Review: This article discusses the current status of knowledge regarding the genetic basis of Alzheimer disease (AD) with a focus on clinically relevant aspects. Recent Findings: The genetic architecture of AD is complex, as it includes multiple susceptibility genes and likely nongenetic factors. Rare but highly penetrant autosomal dominant mutations explain a small minority of the cases but have allowed tremendous advances in understanding disease pathogenesis. The identification of a strong genetic risk factor, APOE, reshaped the field and introduced the notion of genetic risk for AD. More recently, large-scale genome-wide association studies are adding to the picture a number of common variants with very small effect sizes. Large-scale resequencing studies are expected to identify additional risk factors, including rare susceptibility variants and structural variation. Summary: Genetic assessment is currently of limited utility in clinical practice because of the low frequency (Mendelian mutations) or small effect size (common risk factors) of the currently known susceptibility genes. However, genetic studies are identifying with confidence a number of novel risk genes, and this will further our understanding of disease biology and possibly the identification of therapeutic targets. PMID:23558482

  13. DRUMS: a human disease related unique gene mutation search engine.

    Science.gov (United States)

    Li, Zuofeng; Liu, Xingnan; Wen, Jingran; Xu, Ye; Zhao, Xin; Li, Xuan; Liu, Lei; Zhang, Xiaoyan

    2011-10-01

    With the completion of the human genome project and the development of new methods for gene variant detection, the integration of mutation data and its phenotypic consequences has become more important than ever. Among all available resources, locus-specific databases (LSDBs) curate one or more specific genes' mutation data along with high-quality phenotypes. Although some genotype-phenotype data from LSDB have been integrated into central databases little effort has been made to integrate all these data by a search engine approach. In this work, we have developed disease related unique gene mutation search engine (DRUMS), a search engine for human disease related unique gene mutation as a convenient tool for biologists or physicians to retrieve gene variant and related phenotype information. Gene variant and phenotype information were stored in a gene-centred relational database. Moreover, the relationships between mutations and diseases were indexed by the uniform resource identifier from LSDB, or another central database. By querying DRUMS, users can access the most popular mutation databases under one interface. DRUMS could be treated as a domain specific search engine. By using web crawling, indexing, and searching technologies, it provides a competitively efficient interface for searching and retrieving mutation data and their relationships to diseases. The present system is freely accessible at http://www.scbit.org/glif/new/drums/index.html. © 2011 Wiley-Liss, Inc.

  14. Role of T cell receptor delta gene in susceptibility to celiac disease.

    Science.gov (United States)

    Roschmann, E; Wienker, T F; Volk, B A

    1996-02-01

    There is a strong genetic influence on the susceptibility to celiac disease. Although in the vast majority of patients with celiac disease, the HLA-DQ(alpha1*0501, beta1*0201) heterodimer encoded by the alleles HLA-DQA1*0501 and HLA-DQB1*0201 seems to confer the primary disease susceptibility, it cannot be excluded that other genes contribute to disease susceptibility, as indicated by the difference in concordance rates between monozygotic twins and HLA identical siblings (70% vs. 30%). Obviously other genes involved in the genetic control of T cell mediated immune response could potentially influence susceptibility to celiac disease. The density of T cells using the gammadelta T cell receptor (TCR) is considerably increased in the jejunal epithelium of patients with celiac disease, an abnormality considered to be specific for celiac disease. This suggests an involvement of gammadelta T cells in the pathogenesis of the disease. To ascertain whether the TCR delta (TCRD) gene contributes to celiac disease susceptibility we carried out an association study and genetic linkage analysis using a highly polymorphic microsatellite marker at the TCRD locus on chromosome 14q11.2. The association study demonstrated no significant difference in allele frequencies of the TCRD gene marker between celiac disease patients and controls; accordingly, the relative risk estimates did not reach the level of statistical significance. In the linkage analysis, performed in 23 families, the logarithm of the odds (LOD) scores calculated for celiac disease versus the TCRD gene marker excluded linkage, suggesting that there is no determinant contributing to celiac disease status at or 5 cM distant to the analyzed TCRD gene marker. In conclusion, the results of the present study provide no evidence that the analyzed TCRD gene contributes substantially to celiac disease susceptibility.

  15. Phenotypic characterization of neurotensin messenger RNA-expressing cells in the neuroleptic-treated rat striatum: a detailed cellular co-expression study

    Energy Technology Data Exchange (ETDEWEB)

    Emson, P C; Westmore, K; Augood, S J [MRC Molecular Neuroscience Group, The Department of Neurobiology, The Babraham Institute, Babraham, Cambridge (United Kingdom)

    1996-12-11

    The chemical phenotype of proneurotensin messenger RNA-expressing cells was determined in the acute haloperidol-treated rat striatum using a combination of [{sup 35}S]-labelled and alkaline phosphatase-labelled oligonucleotides. Cellular sites of proneurotensin messenger RNA expression were visualized simultaneously on tissue sections processed to reveal cellular sites of preproenkephalin A messenger RNA or the dopamine and adenylate cyclase phosphoprotein-32, messenger RNA. The cellular co-expression of preproenkepahlin A and preprotachykinin messenger RNA was also examined within forebrain structures. Cellular sites of preproenkephalin A and dopamine and adenylate cyclase phosphoprotein-32 messenger RNAs were visualized using alkaline phosphatase-labelled oligonucleotides whilst sites of preprotachykinin and proneurotensin messenger RNA expression were detected using [{sup 35}S]-labelled oligos. Cellular sites of enkephalin and dopamine and adenylate cyclase phosphoprotein-32 gene expression were identified microscopically by the concentration of purple alkaline phosphatase reaction product within the cell cytoplasm, whereas sites of substance P and proneurotensin gene expression were identified by the dense clustering of silver grains overlying cells.An intense hybridization signal was detected for all three neuropeptide messenger RNAs in the striatum, the nucleus accumbens and septum. Dopamine and adenylate cyclase phosphoprotein-32 messenger RNA was detected within the neostriatum but not within the septum. In all forebrain regions examined, with the exception of the islands of Cajella, the cellular expression of enkephalin messenger RNA and substance P messenger RNA was discordant; the two neuropeptide messenger RNAs were detected essentially in different cells, although in the striatum and nucleus accumbens occasional isolated cells were detected which contained both hybridization signals; dense clusters of silver grains overlay alkaline phosphatase

  16. Phenotypic characterization of neurotensin messenger RNA-expressing cells in the neuroleptic-treated rat striatum: a detailed cellular co-expression study

    International Nuclear Information System (INIS)

    Emson, P.C.; Westmore, K.; Augood, S.J.

    1996-01-01

    The chemical phenotype of proneurotensin messenger RNA-expressing cells was determined in the acute haloperidol-treated rat striatum using a combination of [ 35 S]-labelled and alkaline phosphatase-labelled oligonucleotides. Cellular sites of proneurotensin messenger RNA expression were visualized simultaneously on tissue sections processed to reveal cellular sites of preproenkephalin A messenger RNA or the dopamine and adenylate cyclase phosphoprotein-32, messenger RNA. The cellular co-expression of preproenkepahlin A and preprotachykinin messenger RNA was also examined within forebrain structures. Cellular sites of preproenkephalin A and dopamine and adenylate cyclase phosphoprotein-32 messenger RNAs were visualized using alkaline phosphatase-labelled oligonucleotides whilst sites of preprotachykinin and proneurotensin messenger RNA expression were detected using [ 35 S]-labelled oligos. Cellular sites of enkephalin and dopamine and adenylate cyclase phosphoprotein-32 gene expression were identified microscopically by the concentration of purple alkaline phosphatase reaction product within the cell cytoplasm, whereas sites of substance P and proneurotensin gene expression were identified by the dense clustering of silver grains overlying cells.An intense hybridization signal was detected for all three neuropeptide messenger RNAs in the striatum, the nucleus accumbens and septum. Dopamine and adenylate cyclase phosphoprotein-32 messenger RNA was detected within the neostriatum but not within the septum. In all forebrain regions examined, with the exception of the islands of Cajella, the cellular expression of enkephalin messenger RNA and substance P messenger RNA was discordant; the two neuropeptide messenger RNAs were detected essentially in different cells, although in the striatum and nucleus accumbens occasional isolated cells were detected which contained both hybridization signals; dense clusters of silver grains overlay alkaline phosphatase-positive cells

  17. Radiochemotherapy of hepatocarcinoma via lentivirus-mediated transfer of human sodium iodide symporter gene and herpes simplex virus thymidine kinase gene

    Energy Technology Data Exchange (ETDEWEB)

    Chen Libo, E-mail: libochen888@hotmail.com [Department of Nuclear Medicine, Shanghai Sixth People' s Hospital, Shanghai Jiao Tong University, Shanghai 200233 (China); Guo Guoying [Xinyuan Institute of Medicine and Biotechnology, School of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018 (China); Liu Tianjing; Guo Lihe [Division of Biochemistry and Cell Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Zhu Ruisen [Department of Nuclear Medicine, Shanghai Sixth People' s Hospital, Shanghai Jiao Tong University, Shanghai 200233 (China)

    2011-07-15

    Herpes simplex virus thymidine kinase (HSV-TK) gene/ganciclovir (GCV) system has been widely used as a traditional gene therapy modality, and the sodium/iodide symporter gene (NIS) has been found to be a novel therapeutic gene. Since the therapeutic effects of radioiodine therapy or prodrug chemotherapy on cancers following NIS or HSV-TK gene transfer need to be enhanced, this study was designed to investigate the feasibility of radiochemotherapy for hepatocarcinoma via coexpression of NIS gene and HSV-TK gene. Methods: HepG2 cells were stably transfected with NIS, TK and GFP gene via recombinant lentiviral vector and named HepG2/NTG. Gene expression was examined by reverse transcriptase polymerase chain reaction, fluorescence imaging and iodide uptake. The therapeutic effects were assessed by MTT assay and clonogenic assay. Results: HepG2/NTG cells concentrated {sup 125}I{sup -} up to 76-fold higher than the wild-type cells within 20 min, and the efflux happened with a T{sub 1/2eff} of less than 10 min. The iodide uptake in HepG2/NTG cells was specifically inhibited by sodium perchlorate. Dose-dependent toxicity to HepG2/NTG cells by either GCV or {sup 131}I was revealed by clonogenic assay and MTT assay, respectively. The survival rate of HepG2/NTG cells decreased to 49.7%{+-}2.5%, 43.4%{+-}2.8% and 8.6%{+-}1.2% after exposure to {sup 131}I, GCV and combined therapy, respectively. Conclusion: We demonstrate that radiochemotherapy of hepatocarcinoma via lentiviral-mediated coexpression of NIS gene and HSV-TK gene leads to stronger killing effect than single treatment, and in vivo studies are needed to verify these findings.

  18. Radiochemotherapy of hepatocarcinoma via lentivirus-mediated transfer of human sodium iodide symporter gene and herpes simplex virus thymidine kinase gene

    International Nuclear Information System (INIS)

    Chen Libo; Guo Guoying; Liu Tianjing; Guo Lihe; Zhu Ruisen

    2011-01-01

    Herpes simplex virus thymidine kinase (HSV-TK) gene/ganciclovir (GCV) system has been widely used as a traditional gene therapy modality, and the sodium/iodide symporter gene (NIS) has been found to be a novel therapeutic gene. Since the therapeutic effects of radioiodine therapy or prodrug chemotherapy on cancers following NIS or HSV-TK gene transfer need to be enhanced, this study was designed to investigate the feasibility of radiochemotherapy for hepatocarcinoma via coexpression of NIS gene and HSV-TK gene. Methods: HepG2 cells were stably transfected with NIS, TK and GFP gene via recombinant lentiviral vector and named HepG2/NTG. Gene expression was examined by reverse transcriptase polymerase chain reaction, fluorescence imaging and iodide uptake. The therapeutic effects were assessed by MTT assay and clonogenic assay. Results: HepG2/NTG cells concentrated 125 I - up to 76-fold higher than the wild-type cells within 20 min, and the efflux happened with a T 1/2eff of less than 10 min. The iodide uptake in HepG2/NTG cells was specifically inhibited by sodium perchlorate. Dose-dependent toxicity to HepG2/NTG cells by either GCV or 131 I was revealed by clonogenic assay and MTT assay, respectively. The survival rate of HepG2/NTG cells decreased to 49.7%±2.5%, 43.4%±2.8% and 8.6%±1.2% after exposure to 131 I, GCV and combined therapy, respectively. Conclusion: We demonstrate that radiochemotherapy of hepatocarcinoma via lentiviral-mediated coexpression of NIS gene and HSV-TK gene leads to stronger killing effect than single treatment, and in vivo studies are needed to verify these findings.

  19. Co-expression of putative stemness and epithelial-to-mesenchymal transition markers on single circulating tumour cells from patients with early and metastatic breast cancer.

    Science.gov (United States)

    Papadaki, Maria A; Kallergi, Galatea; Zafeiriou, Zafeiris; Manouras, Lefteris; Theodoropoulos, Panayiotis A; Mavroudis, Dimitris; Georgoulias, Vassilis; Agelaki, Sofia

    2014-09-03

    The detection of circulating tumor cells (CTCs) in peripheral blood (PB) of patients with breast cancer predicts poor clinical outcome. Cancer cells with stemness and epithelial-to-mesenchymal transition (EMT) features display enhanced malignant and metastatic potential. A new methodology was developed in order to investigate the co-expression of a stemness and an EMT marker (ALDH1 and TWIST, respectively) on single CTCs of patients with early and metastatic breast cancer. Triple immunofluorescence using anti-pancytokeratin (A45-B/B3), anti-ALDH1 and anti-TWIST antibodies was performed in cytospins prepared from hepatocellular carcinoma HepG2 cells and SKBR-3, MCF-7 and MDA.MB.231 breast cancer cell lines. Evaluation of ALDH1 expression levels (high, low or absent) and TWIST subcellular localization (nuclear, cytoplasmic or absent) was performed using the ARIOL system. Cytospins prepared from peripheral blood of patients with early (n = 80) and metastatic (n = 50) breast cancer were analyzed for CTC detection (based on pan-cytokeratin expression and cytomorphological criteria) and characterized according to ALDH1 and TWIST. CTCs were detected in 13 (16%) and 25 (50%) patients with early and metastatic disease, respectively. High ALDH1 expression (ALDH1high) and nuclear TWIST localization (TWISTnuc) on CTCs was confirmed in more patients with metastatic than early breast cancer (80% vs. 30.8%, respectively; p = 0.009). In early disease, ALDH1low/neg CTCs (p = 0.006) and TWISTcyt/neg CTCs (p = 0.040) were mainly observed. Regarding co-expression of these markers, ALDH1high/TWISTnuc CTCs were more frequently evident in the metastatic setting (76% vs. 15.4% of patients, p = 0.001; 61.5% vs. 12.9% of total CTCs), whereas in early disease ALDH1low/neg/TWISTcyt/neg CTCs were mainly detected (61.5% vs. 20% of patients, p = 0.078; 41.9% vs. 7.7% of total CTCs). A new assay is provided for the evaluation of ALDH1 and TWIST co-expression at the

  20. Enhanced phytoremediation of mixed heavy metal (mercury)-organic pollutants (trichloroethylene) with transgenic alfalfa co-expressing glutathione S-transferase and human P450 2E1.

    Science.gov (United States)

    Zhang, Yuanyuan; Liu, Junhong; Zhou, Yuanming; Gong, Tingyun; Wang, Jing; Ge, Yinlin

    2013-09-15

    Soil contamination is a global environmental problem and many efforts have been made to find efficient remediation methods over the last decade. Moreover, remediation of mixed contaminated soils are more difficult. In the present study, transgenic alfalfa plants pKHCG co-expressing glutathione S-transferase (GST) and human P450 2E1 (CYP2E1) genes were used for phytoremediation of mixed mercury (Hg)-trichloroethylene (TCE) contaminants. Simultaneous expression of GST and CYP2E1 may produce a significant synergistic effect, and leads to improved resistance and accumulation to heavy metal-organic complex contaminants. Based on the tolerance and accumulation assays, pKHCG transgenic plants were more resistant to Hg/TCE complex pollutants and many folds higher in Hg/TCE-accumulation than the non-transgenic control plants in mixed contaminated soil. It is confirmed that GST and CYP2E1 co-expression may be a useful strategy to help achieve mixed heavy metal-organic pollutants phytoremediation. Copyright © 2013 Elsevier B.V. All rights reserved.

  1. Gene Expression Analysis in Tubule Interstitial Compartments Reveals Candidate Agents for IgA Nephropathy

    Directory of Open Access Journals (Sweden)

    Jinling Wang

    2014-09-01

    Full Text Available Background/Aims: Our aim was to explore the molecular mechanism underlying development of IgA nephropathy and discover candidate agents for IgA nephropathy. Methods: The differentially expressed genes (DEGs between patients with IgA nephropathy and normal controls were identified by the data of GSE35488 downloaded from GEO (Gene Expression Omnibus database. The co-expressed gene pairs among DEGs were screened to construct the gene-gene interaction network. Gene Ontology (GO enrichment analysis was performed to analyze the functions of DEGs. The biologically active small molecules capable of targeting IgA nephropathy were identified using the Connectivity Map (cMap database. Results: A total of 55 genes involved in response to organic substance, transcription factor activity and response to steroid hormone stimulus were identified to be differentially expressed in IgA nephropathy patients compared to healthy individuals. A network with 45 co-expressed gene pairs was constructed. DEGs in the network were significantly enriched in response to organic substance. Additionally, a group of small molecules were identified, such as doxorubicin and thapsigargin. Conclusion: Our work provided a systematic insight in understanding the mechanism of IgA nephropathy. Small molecules such as thapsigargin might be potential candidate agents for the treatment of IgA nephropathy.

  2. Prioritizing chronic obstructive pulmonary disease (COPD) candidate genes in COPD-related networks.

    Science.gov (United States)

    Zhang, Yihua; Li, Wan; Feng, Yuyan; Guo, Shanshan; Zhao, Xilei; Wang, Yahui; He, Yuehan; He, Weiming; Chen, Lina

    2017-11-28

    Chronic obstructive pulmonary disease (COPD) is a multi-factor disease, which could be caused by many factors, including disturbances of metabolism and protein-protein interactions (PPIs). In this paper, a weighted COPD-related metabolic network and a weighted COPD-related PPI network were constructed base on COPD disease genes and functional information. Candidate genes in these weighted COPD-related networks were prioritized by making use of a gene prioritization method, respectively. Literature review and functional enrichment analysis of the top 100 genes in these two networks suggested the correlation of COPD and these genes. The performance of our gene prioritization method was superior to that of ToppGene and ToppNet for genes from the COPD-related metabolic network or the COPD-related PPI network after assessing using leave-one-out cross-validation, literature validation and functional enrichment analysis. The top-ranked genes prioritized from COPD-related metabolic and PPI networks could promote the better understanding about the molecular mechanism of this disease from different perspectives. The top 100 genes in COPD-related metabolic network or COPD-related PPI network might be potential markers for the diagnosis and treatment of COPD.

  3. NorWood: a gene expression resource for evo-devo studies of conifer wood development.

    Science.gov (United States)

    Jokipii-Lukkari, Soile; Sundell, David; Nilsson, Ove; Hvidsten, Torgeir R; Street, Nathaniel R; Tuominen, Hannele

    2017-10-01

    The secondary xylem of conifers is composed mainly of tracheids that differ anatomically and chemically from angiosperm xylem cells. There is currently no high-spatial-resolution data available profiling gene expression during wood formation for any coniferous species, which limits insight into tracheid development. RNA-sequencing data from replicated, high-spatial-resolution section series throughout the cambial and woody tissues of Picea abies were used to generate the NorWood.conGenIE.org web resource, which facilitates exploration of the associated gene expression profiles and co-expression networks. Integration within PlantGenIE.org enabled a comparative regulomics analysis, revealing divergent co-expression networks between P. abies and the two angiosperm species Arabidopsis thaliana and Populus tremula for the secondary cell wall (SCW) master regulator NAC Class IIB transcription factors. The SCW cellulose synthase genes (CesAs) were located in the neighbourhoods of the NAC factors in A. thaliana and P. tremula, but not in P. abies. The NorWood co-expression network enabled identification of potential SCW CesA regulators in P. abies. The NorWood web resource represents a powerful community tool for generating evo-devo insights into the divergence of wood formation between angiosperms and gymnosperms and for advancing understanding of the regulation of wood development in P. abies. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  4. Systems genetics identifies a convergent gene network for cognition and neurodevelopmental disease.

    Science.gov (United States)

    Johnson, Michael R; Shkura, Kirill; Langley, Sarah R; Delahaye-Duriez, Andree; Srivastava, Prashant; Hill, W David; Rackham, Owen J L; Davies, Gail; Harris, Sarah E; Moreno-Moral, Aida; Rotival, Maxime; Speed, Doug; Petrovski, Slavé; Katz, Anaïs; Hayward, Caroline; Porteous, David J; Smith, Blair H; Padmanabhan, Sandosh; Hocking, Lynne J; Starr, John M; Liewald, David C; Visconti, Alessia; Falchi, Mario; Bottolo, Leonardo; Rossetti, Tiziana; Danis, Bénédicte; Mazzuferi, Manuela; Foerch, Patrik; Grote, Alexander; Helmstaedter, Christoph; Becker, Albert J; Kaminski, Rafal M; Deary, Ian J; Petretto, Enrico

    2016-02-01

    Genetic determinants of cognition are poorly characterized, and their relationship to genes that confer risk for neurodevelopmental disease is unclear. Here we performed a systems-level analysis of genome-wide gene expression data to infer gene-regulatory networks conserved across species and brain regions. Two of these networks, M1 and M3, showed replicable enrichment for common genetic variants underlying healthy human cognitive abilities, including memory. Using exome sequence data from 6,871 trios, we found that M3 genes were also enriched for mutations ascertained from patients with neurodevelopmental disease generally, and intellectual disability and epileptic encephalopathy in particular. M3 consists of 150 genes whose expression is tightly developmentally regulated, but which are collectively poorly annotated for known functional pathways. These results illustrate how systems-level analyses can reveal previously unappreciated relationships between neurodevelopmental disease-associated genes in the developed human brain, and provide empirical support for a convergent gene-regulatory network influencing cognition and neurodevelopmental disease.

  5. Coexpression of the high molecular weight glutenin subunit 1Ax1 and puroindoline improves dough mixing properties in durum wheat (Triticum turgidum L. ssp. durum).

    Science.gov (United States)

    Li, Yin; Wang, Qiong; Li, Xiaoyan; Xiao, Xin; Sun, Fusheng; Wang, Cheng; Hu, Wei; Feng, Zhijuan; Chang, Junli; Chen, Mingjie; Wang, Yuesheng; Li, Kexiu; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    Wheat end-use quality mainly derives from two interrelated characteristics: the compositions of gluten proteins and grain hardness. The composition of gluten proteins determines dough rheological properties and thus confers the unique viscoelastic property on dough. One group of gluten proteins, high molecular weight glutenin subunits (HMW-GS), plays an important role in dough functional properties. On the other hand, grain hardness, which influences the milling process of flour, is controlled by Puroindoline a (Pina) and Puroindoline b (Pinb) genes. However, little is known about the combined effects of HMW-GS and PINs on dough functional properties. In this study, we crossed a Pina-expressing transgenic line with a 1Ax1-expressing line of durum wheat and screened out lines coexpressing 1Ax1 and Pina or lines expressing either 1Ax1 or Pina. Dough mixing analysis of these lines demonstrated that expression of 1Ax1 improved both dough strength and over-mixing tolerance, while expression of PINA detrimentally affected the dough resistance to extension. In lines coexpressing 1Ax1 and Pina, faster hydration of flour during mixing was observed possibly due to the lower water absorption and damaged starch caused by PINA expression. In addition, expression of 1Ax1 appeared to compensate the detrimental effect of PINA on dough resistance to extension. Consequently, coexpression of 1Ax1 and PINA in durum wheat had combined effects on dough mixing behaviors with a better dough strength and resistance to extension than those from lines expressing either 1Ax1 or Pina. The results in our study suggest that simultaneous modulation of dough strength and grain hardness in durum wheat could significantly improve its breadmaking quality and may not even impair its pastamaking potential. Therefore, coexpression of 1Ax1 and PINA in durum wheat has useful implications for breeding durum wheat with dual functionality (for pasta and bread) and may improve the economic values of durum

  6. Coexpression of the high molecular weight glutenin subunit 1Ax1 and puroindoline improves dough mixing properties in durum wheat (Triticum turgidum L. ssp. durum.

    Directory of Open Access Journals (Sweden)

    Yin Li

    Full Text Available Wheat end-use quality mainly derives from two interrelated characteristics: the compositions of gluten proteins and grain hardness. The composition of gluten proteins determines dough rheological properties and thus confers the unique viscoelastic property on dough. One group of gluten proteins, high molecular weight glutenin subunits (HMW-GS, plays an important role in dough functional properties. On the other hand, grain hardness, which influences the milling process of flour, is controlled by Puroindoline a (Pina and Puroindoline b (Pinb genes. However, little is known about the combined effects of HMW-GS and PINs on dough functional properties. In this study, we crossed a Pina-expressing transgenic line with a 1Ax1-expressing line of durum wheat and screened out lines coexpressing 1Ax1 and Pina or lines expressing either 1Ax1 or Pina. Dough mixing analysis of these lines demonstrated that expression of 1Ax1 improved both dough strength and over-mixing tolerance, while expression of PINA detrimentally affected the dough resistance to extension. In lines coexpressing 1Ax1 and Pina, faster hydration of flour during mixing was observed possibly due to the lower water absorption and damaged starch caused by PINA expression. In addition, expression of 1Ax1 appeared to compensate the detrimental effect of PINA on dough resistance to extension. Consequently, coexpression of 1Ax1 and PINA in durum wheat had combined effects on dough mixing behaviors with a better dough strength and resistance to extension than those from lines expressing either 1Ax1 or Pina. The results in our study suggest that simultaneous modulation of dough strength and grain hardness in durum wheat could significantly improve its breadmaking quality and may not even impair its pastamaking potential. Therefore, coexpression of 1Ax1 and PINA in durum wheat has useful implications for breeding durum wheat with dual functionality (for pasta and bread and may improve the economic

  7. In Search of 'Birth Month Genes': Using Existing Data Repositories to Locate Genes Underlying Birth Month-Disease Relationships.

    Science.gov (United States)

    Boland, Mary Regina; Tatonetti, Nicholas P

    2016-01-01

    Prenatal and perinatal exposures vary seasonally (e.g., sunlight, allergens) and many diseases are linked with variance in exposure. Epidemiologists often measure these changes using birth month as a proxy for seasonal variance. Likewise, Genome-Wide Association Studies have associated or implicated these same diseases with many genes. Both disparate data types (epidemiological and genetic) can provide key insights into the underlying disease biology. We developed an algorithm that links 1) epidemiological data from birth month studies with 2) genetic data from published gene-disease association studies. Our framework uses existing data repositories - PubMed, DisGeNET and Gene Ontology - to produce a bipartite network that connects enriched seasonally varying biofactorss with birth month dependent diseases (BMDDs) through their overlapping developmental gene sets. As a proof-of-concept, we investigate 7 known BMDDs and highlight three important biological networks revealed by our algorithm and explore some interesting genetic mechanisms potentially responsible for the seasonal contribution to BMDDs.

  8. Polycistronic gene expression in Aspergillus niger.

    Science.gov (United States)

    Schuetze, Tabea; Meyer, Vera

    2017-09-25

    Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at

  9. Green tissue-specific co-expression of chitinase and oxalate oxidase 4 genes in rice for enhanced resistance against sheath blight.

    Science.gov (United States)

    Karmakar, Subhasis; Molla, Kutubuddin Ali; Chanda, Palas K; Sarkar, Sailendra Nath; Datta, Swapan K; Datta, Karabi

    2016-01-01

    Green tissue-specific simultaneous overexpression of two defense-related genes ( OsCHI11 & OsOXO4 ) in rice leads to significant resistance against sheath blight pathogen ( R. solani ) without distressing any agronomically important traits. Overexpressing two defense-related genes (OsOXO4 and OsCHI11) cloned from rice is effective at enhancing resistance against sheath blight caused by Rhizoctonia solani. These genes were expressed under the control of two different green tissue-specific promoters, viz. maize phosphoenolpyruvate carboxylase gene promoter, PEPC, and rice cis-acting 544-bp DNA element, immediately upstream of the D54O translational start site, P D54O-544 . Putative T0 transgenic rice plants were screened by PCR and integration of genes was confirmed by Southern hybridization of progeny (T1) rice plants. Successful expression of OsOXO4 and OsCHI11 in all tested plants was confirmed. Expression of PR genes increased significantly following pathogen infection in overexpressing transgenic plants. Following infection, transgenic plants exhibited elevated hydrogen peroxide levels, significant changes in activity of ROS scavenging enzymes and reduced membrane damage when compared to their wild-type counterpart. In a Rhizoctonia solani toxin assay, a detached leaf inoculation test and an in vivo plant bioassay, transgenic plants showed a significant reduction in disease symptoms in comparison to non-transgenic control plants. This is the first report of overexpression of two different PR genes driven by two green tissue-specific promoters providing enhanced sheath blight resistance in transgenic rice.

  10. Web tools for the prioritization of candidate disease genes.

    NARCIS (Netherlands)

    Oti, M.O.; Ballouz, S.; Wouters, M.A.

    2011-01-01

    Despite increasing sequencing capacity, genetic disease investigation still frequently results in the identification of loci containing multiple candidate disease genes that need to be tested for involvement in the disease. This process can be expedited by prioritizing the candidates prior to

  11. Integrated Analyses of Gene Expression Profiles Digs out Common Markers for Rheumatic Diseases

    Science.gov (United States)

    Wang, Lan; Wu, Long-Fei; Lu, Xin; Mo, Xing-Bo; Tang, Zai-Xiang; Lei, Shu-Feng; Deng, Fei-Yan

    2015-01-01

    Objective Rheumatic diseases have some common symptoms. Extensive gene expression studies, accumulated thus far, have successfully identified signature molecules for each rheumatic disease, individually. However, whether there exist shared factors across rheumatic diseases has yet to be tested. Methods We collected and utilized 6 public microarray datasets covering 4 types of representative rheumatic diseases including rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis, and osteoarthritis. Then we detected overlaps of differentially expressed genes across datasets and performed a meta-analysis aiming at identifying common differentially expressed genes that discriminate between pathological cases and normal controls. To further gain insights into the functions of the identified common differentially expressed genes, we conducted gene ontology enrichment analysis and protein-protein interaction analysis. Results We identified a total of eight differentially expressed genes (TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, PRF1), each associated with at least 3 of the 4 studied rheumatic diseases. Meta-analysis warranted the significance of the eight genes and highlighted the general significance of four genes (CX3CR1, LY96, TLR5, and PRF1). Protein-protein interaction and gene ontology enrichment analyses indicated that the eight genes interact with each other to exert functions related to immune response and immune regulation. Conclusion The findings support that there exist common factors underlying rheumatic diseases. For rheumatoid arthritis, systemic lupus erythematosus, ankylosing spondylitis and osteoarthritis diseases, those common factors include TNFSF10, CX3CR1, LY96, TLR5, TXN, TIA1, PRKCH, and PRF1. In-depth studies on these common factors may provide keys to understanding the pathogenesis and developing intervention strategies for rheumatic diseases. PMID:26352601

  12. Dysregulation of RNA Mediated Gene Expression in Motor Neuron Diseases.

    Science.gov (United States)

    Gonçalves, Inês do Carmo G; Rehorst, Wiebke A; Kye, Min Jeong

    2016-01-01

    Recent findings indicate an important role for RNA-mediated gene expression in motor neuron diseases, including ALS (amyotrophic lateral sclerosis) and SMA (spinal muscular atrophy). ALS, also known as Lou Gehrig's disease, is an adult-onset progressive neurodegenerative disorder, whereby SMA or "children's Lou Gehrig's disease" is considered a pediatric neurodevelopmental disorder. Despite the difference in genetic causes, both ALS and SMA share common phenotypes; dysfunction/loss of motor neurons that eventually leads to muscle weakness and atrophy. With advanced techniques in molecular genetics and cell biology, current data suggest that these two distinct motor neuron diseases share more than phenotypes; ALS and SMA have similar cellular pathological mechanisms including mitochondrial dysfunction, oxidative stress and dysregulation in RNA-mediated gene expression. Here, we will discuss the current findings on these two diseases with specific focus on RNA-mediated gene regulation including miRNA expression, pre-mRNA processing and RNA binding proteins.

  13. Aldosterone synthase gene is not a major susceptibility gene for progression of chronic kidney disease in patients with autosomal dominant polycystic kidney disease

    Directory of Open Access Journals (Sweden)

    Gnanasambandan Ramanathan

    2017-01-01

    Full Text Available Autosomal dominant polycystic kidney disease (ADPKD is the most common heritable kidney disease and is characterized by bilateral renal cysts. Hypertension is a frequent cause of chronic kidney disease (CKD and mortality in patients with ADPKD. The aldosterone synthase gene polymorphisms of the renin-angiotensin-aldosterone system have been extensively studied as hypertension candidate genes. The present study is aimed to investigate the potential modifier effect of CYP11B2 gene on the progression of CKD in ADPKD. One hundred and two ADPKD patients and 106 healthy controls were recruited based on Ravine inclusion and exclusion criteria. The three tag-SNPs within CYP11B2 gene (rs3802230, rs4543, and rs4544 were genotyped using FRET-based KASPar method. Cochran-Armitage trend test was used to assess the potential associations between these polymorphisms and CKD stages. Mantel- Haenszel stratified analysis was used to explore confounding and interaction effects of these polymorphisms. Of the three tag-SNPs genotyped, rs4544 polymorphism was monomorphic and rs3802230 deviated Hardy-Weinberg equilibrium. The CYP11B2 tag-SNPs did not show significant association with ADPKD or CKD. Further, these polymorphisms did not exhibit confounding effect on the relationship between CKD progression and hypertension. Our results suggest that aldosterone synthase gene is not a major susceptibility gene for progression of CKD in South Indian ADPKD patients.

  14. Comparative gene expression between two yeast species

    Directory of Open Access Journals (Sweden)

    Guan Yuanfang

    2013-01-01

    Full Text Available Abstract Background Comparative genomics brings insight into sequence evolution, but even more may be learned by coupling sequence analyses with experimental tests of gene function and regulation. However, the reliability of such comparisons is often limited by biased sampling of expression conditions and incomplete knowledge of gene functions across species. To address these challenges, we previously systematically generated expression profiles in Saccharomyces bayanus to maximize functional coverage as compared to an existing Saccharomyces cerevisiae data repository. Results In this paper, we take advantage of these two data repositories to compare patterns of ortholog expression in a wide variety of conditions. First, we developed a scalable metric for expression divergence that enabled us to detect a significant correlation between sequence and expression conservation on the global level, which previous smaller-scale expression studies failed to detect. Despite this global conservation trend, between-species gene expression neighborhoods were less well-conserved than within-species comparisons across different environmental perturbations, and approximately 4% of orthologs exhibited a significant change in co-expression partners. Furthermore, our analysis of matched perturbations collected in both species (such as diauxic shift and cell cycle synchrony demonstrated that approximately a quarter of orthologs exhibit condition-specific expression pattern differences. Conclusions Taken together, these analyses provide a global view of gene expression patterns between two species, both in terms of the conditions and timing of a gene's expression as well as co-expression partners. Our results provide testable hypotheses that will direct future experiments to determine how these changes may be specified in the genome.

  15. Extracting Gene Networks for Low-Dose Radiation Using Graph Theoretical Algorithms

    Energy Technology Data Exchange (ETDEWEB)

    Voy, Brynn H [ORNL; Scharff, Jon [University of Tennessee, Knoxville (UTK); Perkins, Andy [University of Tennessee, Knoxville (UTK); Saxton, Arnold [University of Tennessee, Knoxville (UTK); Borate, Bhavesh [University of Tennessee, Knoxville (UTK); Chesler, Elissa J [ORNL; Branstetter, Lisa R [ORNL; Langston, Michael A [University of Tennessee, Knoxville (UTK)

    2006-01-01

    Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most relevant gene interactions. We describe a graph theoretical approach to extracting co-expressed sets of genes, based on the computation of cliques. Unlike the results of traditional clustering algorithms, cliques are not disjoint and allow genes to be assigned to multiple sets of interacting partners, consistent with biological reality. A graph is created by thresholding the correlation matrix to include only the correlations most likely to signify functional relationships. Cliques computed from the graph correspond to sets of genes for which significant edges are present between all members of the set, representing potential members of common or interacting pathways. Clique membership can be used to infer function about poorly annotated genes, based on the known functions of better-annotated genes with which they share clique membership (i.e., ''guilt-by-association''). We illustrate our method by applying it to microarray data collected from the spleens of mice exposed to low-dose ionizing radiation. Differential analysis is used to identify sets of genes whose interactions are impacted by radiation exposure. The correlation graph is also queried independently of clique to extract edges that are impacted by radiation. We present several examples of multiple gene interactions that are altered by radiation exposure and thus represent potential molecular pathways that mediate the radiation response.

  16. Extracting gene networks for low-dose radiation using graph theoretical algorithms.

    Directory of Open Access Journals (Sweden)

    Brynn H Voy

    2006-07-01

    Full Text Available Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most relevant gene interactions. We describe a graph theoretical approach to extracting co-expressed sets of genes, based on the computation of cliques. Unlike the results of traditional clustering algorithms, cliques are not disjoint and allow genes to be assigned to multiple sets of interacting partners, consistent with biological reality. A graph is created by thresholding the correlation matrix to include only the correlations most likely to signify functional relationships. Cliques computed from the graph correspond to sets of genes for which significant edges are present between all members of the set, representing potential members of common or interacting pathways. Clique membership can be used to infer function about poorly annotated genes, based on the known functions of better-annotated genes with which they share clique membership (i.e., "guilt-by-association". We illustrate our method by applying it to microarray data collected from the spleens of mice exposed to low-dose ionizing radiation. Differential analysis is used to identify sets of genes whose interactions are impacted by radiation exposure. The correlation graph is also queried independently of clique to extract edges that are impacted by radiation. We present several examples of multiple gene interactions that are altered by radiation exposure and thus represent potential molecular pathways that mediate the radiation response.

  17. Probability-based collaborative filtering model for predicting gene-disease associations.

    Science.gov (United States)

    Zeng, Xiangxiang; Ding, Ningxiang; Rodríguez-Patón, Alfonso; Zou, Quan

    2017-12-28

    Accurately predicting pathogenic human genes has been challenging in recent research. Considering extensive gene-disease data verified by biological experiments, we can apply computational methods to perform accurate predictions with reduced time and expenses. We propose a probability-based collaborative filtering model (PCFM) to predict pathogenic human genes. Several kinds of data sets, containing data of humans and data of other nonhuman species, are integrated in our model. Firstly, on the basis of a typical latent factorization model, we propose model I with an average heterogeneous regularization. Secondly, we develop modified model II with personal heterogeneous regularization to enhance the accuracy of aforementioned models. In this model, vector space similarity or Pearson correlation coefficient metrics and data on related species are also used. We compared the results of PCFM with the results of four state-of-arts approaches. The results show that PCFM performs better than other advanced approaches. PCFM model can be leveraged for predictions of disease genes, especially for new human genes or diseases with no known relationships.

  18. Characterization of claustral neurons by comparative gene expression profiling and dye-injection analyses

    Directory of Open Access Journals (Sweden)

    Akiya eWatakabe

    2014-05-01

    Full Text Available The identity of the claustrum as a part of cerebral cortex, and in particular of the adjacent insular cortex, has been investigated by connectivity features and patterns of gene expression. In the present paper, we mapped the cortical and claustral expression of several cortical genes in rodent and macaque monkey brains (nurr1, latexin, cux2, and netrinG2 to further assess shared features between cortex and claustrum. In mice, these genes were densely expressed in the claustrum, but very sparsely in the cortex and not present in the striatum. To test whether the cortical vs. claustral cell types can be distinguished by co-expression of these genes, we performed a panel of double ISH in mouse and macaque brain. NetrinG2 and nurr1 genes were co-expressed across entire cortex and claustrum, but cux2 and nurr1 were co-expressed only in the insular cortex and claustrum. Latexin was expressed, in the macaque, only in the claustrum. The nurr1+ claustral neurons expressed VGluT1, a marker for cortical glutamatergic cells and send cortical projections. Taken together, our data suggest a partial commonality between claustral neurons and a subtype of cortical neurons in the monkey brain. Moreover, in the embryonic (E110 macaque brain, many nurr1+ neurons were scattered in the white matter between the claustrum and the insular cortex, possibly representing their migratory history. In a second set of experiments, we injected Lucifer Yellow intracellularly in mouse and rat slices to investigate whether dendrites of insular and claustral neurons can cross the border of the two brain regions. Dendrites of claustral neurons did not invade the overlying insular territory. In summary, gene expression profile of the claustrum is similar to that of the neocortex, in both rodent and macaque brains, but with modifications in density of expression and cellular co-localization of specific genes.

  19. A loss of Pdxk model of Parkinson disease in Drosophila can be suppressed by Buffy.

    Science.gov (United States)

    M'Angale, P Githure; Staveley, Brian E

    2017-06-12

    The identification of a DNA variant in pyridoxal kinase (Pdxk) associated with increased risk to Parkinson disease (PD) gene led us to study the inhibition of this gene in the Dopa decarboxylase (Ddc)-expressing neurons of the well-studied model organism Drosophila melanogaster. The multitude of biological functions attributable to the vitamers catalysed by this kinase reveal an overabundance of possible links to PD, that include dopamine synthesis, antioxidant activity and mitochondrial function. Drosophila possesses a single homologue of Pdxk and we used RNA interference to inhibit the activity of this kinase in the Ddc-Gal4-expressing neurons. We further investigated any association between this enhanced disease risk gene with the established PD model induced by expression of α-synuclein in the same neurons. We relied on the pro-survival functions of Buffy, an anti-apoptotic Bcl-2 homologue, to rescue the Pdxk-induced phenotypes. To drive the expression of Pdxk RNA interference in DA neurons of Drosophila, we used Ddc-Gal4 which drives expression in both dopaminergic and serotonergic neurons, to result in decreased longevity and compromised climbing ability, phenotypes that are strongly associated with Drosophila models of PD. The inhibition of Pdxk in the α-synuclein-induced Drosophila model of PD did not alter longevity and climbing ability of these flies. It has been previously shown that deficiency in vitamers lead to mitochondrial dysfunction and neuronal decay, therefore, co-expression of Pdxk-RNAi with the sole pro-survival Bcl-2 homologue Buffy in the Ddc-Gal4-expressing neurons, resulted in increased survival and a restored climbing ability. In a similar manner, when we inhibited Pdxk in the developing eye using GMR-Gal4, we found that there was a decrease in the number of ommatidia and the disruption of the ommatidial array was more pronounced. When Pdxk was inhibited with the α-synuclein-induced developmental eye defects, the eye phenotypes were

  20. Inflammatory bowel disease: the role of inflammatory cytokine gene polymorphisms

    Directory of Open Access Journals (Sweden)

    Joanna Balding

    2004-01-01

    Full Text Available THE mechanisms responsible for development of inflammatory bowel disease (IBD have not been fully elucidated, although the main cause of disease pathology is attributed to up-regulated inflammatory processes. The aim of this study was to investigate frequencies of polymorphisms in genes encoding pro-inflammatory and anti-inflammatory markers in IBD patients and controls. We determined genotypes of patients with IBD (n=172 and healthy controls (n=389 for polymorphisms in genes encoding various cytokines (interleukin (IL-1β, IL-6, tumour necrosis factor (TNF, IL-10, IL-1 receptor antagonist. Association of these genotypes to disease incidence and pathophysiology was investigated. No strong association was found with occurrence of IBD. Variation was observed between the ulcerative colitis study group and the control population for the TNF-α-308 polymorphism (p=0.0135. There was also variation in the frequency of IL-6-174 and TNF-α-308 genotypes in the ulcerative colitis group compared with the Crohn's disease group (p=0.01. We concluded that polymorphisms in inflammatory genes are associated with variations in IBD phenotype and disease susceptibility. Whether the polymorphisms are directly involved in regulating cytokine production, and consequently pathophysiology of IBD, or serve merely as markers in linkage disequilibrium with susceptibility genes remains unclear.

  1. Complex nature of SNP genotype effects on gene expression in primary human leucocytes

    Directory of Open Access Journals (Sweden)

    Dinesen Lotte C

    2009-01-01

    Full Text Available Abstract Background Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown. Methods We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110 from individuals with celiac disease – a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90, and performed a meta-analysis to increase power to detect non-tissue specific effects. Results In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (cis expression quantitative trait loci, eQTLs. 135 of the detected SNP-probe effects (reflecting 51 unique probes were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed cis-eQTLs. Celiac associated risk variants from two regions, containing genes IL18RAP and CCR3, showed significant cis genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected. Conclusion In conclusion, the complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.

  2. KMeyeDB: a graphical database of mutations in genes that cause eye diseases.

    Science.gov (United States)

    Kawamura, Takashi; Ohtsubo, Masafumi; Mitsuyama, Susumu; Ohno-Nakamura, Saho; Shimizu, Nobuyoshi; Minoshima, Shinsei

    2010-06-01

    KMeyeDB (http://mutview.dmb.med.keio.ac.jp/) is a database of human gene mutations that cause eye diseases. We have substantially enriched the amount of data in the database, which now contains information about the mutations of 167 human genes causing eye-related diseases including retinitis pigmentosa, cone-rod dystrophy, night blindness, Oguchi disease, Stargardt disease, macular degeneration, Leber congenital amaurosis, corneal dystrophy, cataract, glaucoma, retinoblastoma, Bardet-Biedl syndrome, and Usher syndrome. KMeyeDB is operated using the database software MutationView, which deals with various characters of mutations, gene structure, protein functional domains, and polymerase chain reaction (PCR) primers, as well as clinical data for each case. Users can access the database using an ordinary Internet browser with smooth user-interface, without user registration. The results are displayed on the graphical windows together with statistical calculations. All mutations and associated data have been collected from published articles. Careful data analysis with KMeyeDB revealed many interesting features regarding the mutations in 167 genes that cause 326 different types of eye diseases. Some genes are involved in multiple types of eye diseases, whereas several eye diseases are caused by different mutations in one gene.

  3. Functional modules by relating protein interaction networks and gene expression.

    Science.gov (United States)

    Tornow, Sabine; Mewes, H W

    2003-11-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.

  4. PTPN22 gene polymorphisms in autoimmune diseases with special reference to systemic lupus erythematosus disease susceptibility

    Directory of Open Access Journals (Sweden)

    Pradhan V

    2010-01-01

    Full Text Available Systemic lupus erythematosus (SLE is a prototype autoimmune disease. SLE is a result of one or more immune mechanisms, like autoantibody production, complement activation, multiple inflammation and immune complex deposition leading to organ tissue damage. SLE affected patients are susceptible to common and opportunistic infections. There are several reports suggesting that Mycobacterium tuberculosis infection precipitates SLE in patients from endemic areas. Genetic factors and environmental factors also play an important role in the overall susceptibility to SLE pathophysiology. Recently, protein tyrosine phosphatase, non-receptor type 22 (PTPN22 gene, has been found to be associated with several autoimmune diseases like SLE, Grave′s disease and Hashimoto thyroiditis. The missense R620W polymorphism, rs 2476601, in PTPN22 gene at the nucleotide 1858 in codon 620 (620Arg > Trp has been associated with autoimmune diseases. The PTPN22 locus is also found to be responsible for development of pulmonary tuberculosis in certain populations. The PTPN22 1858C/T gene locus will be ideal to look for SLE susceptibility to tuberculosis in the Indian population. In this review, we focus on human PTPN22 gene structure and function as well as the association of PTPN22 gene polymorphisms with SLE susceptibility

  5. A brief history of Alzheimer's disease gene discovery.

    Science.gov (United States)

    Tanzi, Rudolph E

    2013-01-01

    The rich and colorful history of gene discovery in Alzheimer's disease (AD) over the past three decades is as complex and heterogeneous as the disease, itself. Twin and family studies indicate that genetic factors are estimated to play a role in at least 80% of AD cases. The inheritance of AD exhibits a dichotomous pattern. On one hand, rare mutations inAPP, PSEN1, and PSEN2 are fully penetrant for early-onset (95%) late-onset AD. These four genes account for 30-50% of the inheritability of AD. Genome-wide association studies have recently led to the identification of additional highly confirmed AD candidate genes. Here, I review the past, present, and future of attempts to elucidate the complex and heterogeneous genetic underpinnings of AD along with some of the unique events that made these discoveries possible.

  6. Uncovering the liver’s role in immunity through RNA co-expression networks

    Czech Academy of Sciences Publication Activity Database

    Harrall, K. K.; Kechris, K. J.; Tabakoff, B.; Hoffman, P.L.; Hines, L. M.; Tsukamoto, H.; Pravenec, Michal; Printz, M.; Saba, L. M.

    2016-01-01

    Roč. 27, 9-10 (2016), s. 469-484 ISSN 0938-8990 R&D Projects: GA ČR(CZ) GAP301/12/0696 Institutional support: RVO:67985823 Keywords : RNA coexpression networks * liver * immunity * rat * recombinant inbred strains Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.509, year: 2016

  7. A proteomic network approach across the ALS-FTD disease spectrum resolves clinical phenotypes and genetic vulnerability in human brain.

    Science.gov (United States)

    Umoh, Mfon E; Dammer, Eric B; Dai, Jingting; Duong, Duc M; Lah, James J; Levey, Allan I; Gearing, Marla; Glass, Jonathan D; Seyfried, Nicholas T

    2018-01-01

    Amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) are neurodegenerative diseases with overlap in clinical presentation, neuropathology, and genetic underpinnings. The molecular basis for the overlap of these disorders is not well established. We performed a comparative unbiased mass spectrometry-based proteomic analysis of frontal cortical tissues from postmortem cases clinically defined as ALS, FTD, ALS and FTD (ALS/FTD), and controls. We also included a subset of patients with the C9orf72 expansion mutation, the most common genetic cause of both ALS and FTD Our systems-level analysis of the brain proteome integrated both differential expression and co-expression approaches to assess the relationship of these differences to clinical and pathological phenotypes. Weighted co-expression network analysis revealed 15 modules of co-expressed proteins, eight of which were significantly different across the ALS-FTD disease spectrum. These included modules associated with RNA binding proteins, synaptic transmission, and inflammation with cell-type specificity that showed correlation with TDP-43 pathology and cognitive dysfunction. Modules were also examined for their overlap with TDP-43 protein-protein interactions, revealing one module enriched with RNA-binding proteins and other causal ALS genes that increased in FTD/ALS and FTD cases. A module enriched with astrocyte and microglia proteins was significantly increased in ALS cases carrying the C9orf72 mutation compared to sporadic ALS cases, suggesting that the genetic expansion is associated with inflammation in the brain even without clinical evidence of dementia. Together, these findings highlight the utility of integrative systems-level proteomic approaches to resolve clinical phenotypes and genetic mechanisms underlying the ALS-FTD disease spectrum in human brain. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.

  8. Integration of heterogeneous molecular networks to unravel gene-regulation in Mycobacterium tuberculosis.

    Science.gov (United States)

    van Dam, Jesse C J; Schaap, Peter J; Martins dos Santos, Vitor A P; Suárez-Diez, María

    2014-09-26

    Different methods have been developed to infer regulatory networks from heterogeneous omics datasets and to construct co-expression networks. Each algorithm produces different networks and efforts have been devoted to automatically integrate them into consensus sets. However each separate set has an intrinsic value that is diluted and partly lost when building a consensus network. Here we present a methodology to generate co-expression networks and, instead of a consensus network, we propose an integration framework where the different networks are kept and analysed with additional tools to efficiently combine the information extracted from each network. We developed a workflow to efficiently analyse information generated by different inference and prediction methods. Our methodology relies on providing the user the means to simultaneously visualise and analyse the coexisting networks generated by different algorithms, heterogeneous datasets, and a suite of analysis tools. As a show case, we have analysed the gene co-expression networks of Mycobacterium tuberculosis generated using over 600 expression experiments. Regarding DNA damage repair, we identified SigC as a key control element, 12 new targets for LexA, an updated LexA binding motif, and a potential mismatch repair system. We expanded the DevR regulon with 27 genes while identifying 9 targets wrongly assigned to this regulon. We discovered 10 new genes linked to zinc uptake and a new regulatory mechanism for ZuR. The use of co-expression networks to perform system level analysis allows the development of custom made methodologies. As show cases we implemented a pipeline to integrate ChIP-seq data and another method to uncover multiple regulatory layers. Our workflow is based on representing the multiple types of information as network representations and presenting these networks in a synchronous framework that allows their simultaneous visualization while keeping specific associations from the different

  9. Time since onset of disease and individual clinical markers associate with transcriptional changes in uncomplicated dengue.

    Directory of Open Access Journals (Sweden)

    Cornelia A M van de Weg

    2015-03-01

    Full Text Available BACKGROUND: Dengue virus (DENV infection causes viral haemorrhagic fever that is characterized by extensive activation of the immune system. The aim of this study is to investigate the kinetics of the transcriptome signature changes during the course of disease and the association of genes in these signatures with clinical parameters. METHODOLOGY/PRINCIPLE FINDINGS: Sequential whole blood samples from DENV infected patients in Jakarta were profiled using affymetrix microarrays, which were analysed using principal component analysis, limma, gene set analysis, and weighted gene co-expression network analysis. We show that time since onset of disease, but not diagnosis, has a large impact on the blood transcriptome of patients with non-severe dengue. Clinical diagnosis (according to the WHO classification does not associate with differential gene expression. Network analysis however, indicated that the clinical markers platelet count, fibrinogen, albumin, IV fluid distributed per day and liver enzymes SGOT and SGPT strongly correlate with gene modules that are enriched for genes involved in the immune response. Overall, we see a shift in the transcriptome from immunity and inflammation to repair and recovery during the course of a DENV infection. CONCLUSIONS/SIGNIFICANCE: Time since onset of disease associates with the shift in transcriptome signatures from immunity and inflammation to cell cycle and repair mechanisms in patients with non-severe dengue. The strong association of time with blood transcriptome changes hampers both the discovery as well as the potential application of biomarkers in dengue. However, we identified gene expression modules that associate with key clinical parameters of dengue that reflect the systemic activity of disease during the course of infection. The expression level of these gene modules may support earlier detection of disease progression as well as clinical management of dengue.

  10. A molecular network of the aging human brain provides insights into the pathology and cognitive decline of Alzheimer's disease.

    Science.gov (United States)

    Mostafavi, Sara; Gaiteri, Chris; Sullivan, Sarah E; White, Charles C; Tasaki, Shinya; Xu, Jishu; Taga, Mariko; Klein, Hans-Ulrich; Patrick, Ellis; Komashko, Vitalina; McCabe, Cristin; Smith, Robert; Bradshaw, Elizabeth M; Root, David E; Regev, Aviv; Yu, Lei; Chibnik, Lori B; Schneider, Julie A; Young-Pearse, Tracy L; Bennett, David A; De Jager, Philip L

    2018-06-01

    There is a need for new therapeutic targets with which to prevent Alzheimer's disease (AD), a major contributor to aging-related cognitive decline. Here we report the construction and validation of a molecular network of the aging human frontal cortex. Using RNA sequence data from 478 individuals, we first build a molecular network using modules of coexpressed genes and then relate these modules to AD and its neuropathologic and cognitive endophenotypes. We confirm these associations in two independent AD datasets. We also illustrate the use of the network in prioritizing amyloid- and cognition-associated genes for in vitro validation in human neurons and astrocytes. These analyses based on unique cohorts enable us to resolve the role of distinct cortical modules that have a direct effect on the accumulation of AD pathology from those that have a direct effect on cognitive decline, exemplifying a network approach to complex diseases.

  11. Radionuclide reporter gene imaging for cardiac gene therapy

    International Nuclear Information System (INIS)

    Inubushi, Masayuki; Tamaki, Nagara

    2007-01-01

    In the field of cardiac gene therapy, angiogenic gene therapy has been most extensively investigated. The first clinical trial of cardiac angiogenic gene therapy was reported in 1998, and at the peak, more than 20 clinical trial protocols were under evaluation. However, most trials have ceased owing to the lack of decisive proof of therapeutic effects and the potential risks of viral vectors. In order to further advance cardiac angiogenic gene therapy, remaining open issues need to be resolved: there needs to be improvement of gene transfer methods, regulation of gene expression, development of much safer vectors and optimisation of therapeutic genes. For these purposes, imaging of gene expression in living organisms is of great importance. In radionuclide reporter gene imaging, ''reporter genes'' transferred into cell nuclei encode for a protein that retains a complementary ''reporter probe'' of a positron or single-photon emitter; thus expression of the reporter genes can be imaged with positron emission tomography or single-photon emission computed tomography. Accordingly, in the setting of gene therapy, the location, magnitude and duration of the therapeutic gene co-expression with the reporter genes can be monitored non-invasively. In the near future, gene therapy may evolve into combination therapy with stem/progenitor cell transplantation, so-called cell-based gene therapy or gene-modified cell therapy. Radionuclide reporter gene imaging is now expected to contribute in providing evidence on the usefulness of this novel therapeutic approach, as well as in investigating the molecular mechanisms underlying neovascularisation and safety issues relevant to further progress in conventional gene therapy. (orig.)

  12. Current Experimental Studies of Gene Therapy in Parkinson's Disease

    Directory of Open Access Journals (Sweden)

    Jing-ya Lin

    2017-05-01

    Full Text Available Parkinson's disease (PD was characterized by late-onset, progressive dopamine neuron loss and movement disorders. The progresses of PD affected the neural function and integrity. To date, most researches had largely addressed the dopamine replacement therapies, but the appearance of L-dopa-induced dyskinesia hampered the use of the drug. And the mechanism of PD is so complicated that it's hard to solve the problem by just add drugs. Researchers began to focus on the genetic underpinnings of Parkinson's disease, searching for new method that may affect the neurodegeneration processes in it. In this paper, we reviewed current delivery methods used in gene therapies for PD, we also summarized the primary target of the gene therapy in the treatment of PD, such like neurotrophic factor (for regeneration, the synthesis of neurotransmitter (for prolong the duration of L-dopa, and the potential proteins that might be a target to modulate via gene therapy. Finally, we discussed RNA interference therapies used in Parkinson's disease, it might act as a new class of drug. We mainly focus on the efficiency and tooling features of different gene therapies in the treatment of PD.

  13. Meta-analysis of peripheral blood gene expression modules for COPD phenotypes.

    Directory of Open Access Journals (Sweden)

    Dominik Reinhold

    Full Text Available Chronic obstructive pulmonary disease (COPD occurs typically in current or former smokers, but only a minority of people with smoking history develops the disease. Besides environmental factors, genetics is an important risk factor for COPD. However, the relationship between genetics, environment and phenotypes is not well understood. Sample sizes for genome-wide expression studies based on lung tissue have been small due to the invasive nature of sample collection. Increasing evidence for the systemic nature of the disease makes blood a good alternative source to study the disease, but there have also been few large-scale blood genomic studies in COPD. Due to the complexity and heterogeneity of COPD, examining groups of interacting genes may have more relevance than identifying individual genes. Therefore, we used Weighted Gene Co-expression Network Analysis to find groups of genes (modules that are highly connected. However, module definitions may vary between individual data sets. To alleviate this problem, we used a consensus module definition based on two cohorts, COPDGene and ECLIPSE. We studied the relationship between the consensus modules and COPD phenotypes airflow obstruction and emphysema. We also used these consensus module definitions on an independent cohort (TESRA and performed a meta analysis involving all data sets. We found several modules that are associated with COPD phenotypes, are enriched in functional categories and are overrepresented for cell-type specific genes. Of the 14 consensus modules, three were strongly associated with airflow obstruction (meta p ≤ 0.0002, and two had some association with emphysema (meta p ≤ 0.06; some associations were stronger in the case-control cohorts, and others in the cases-only subcohorts. Gene Ontology terms that were overrepresented included "immune response" and "defense response." The cell types whose type-specific genes were overrepresented in modules (p < 0.05 included

  14. Disease Modeling and Gene Therapy of Copper Storage Disease in Canine Hepatic Organoids

    Directory of Open Access Journals (Sweden)

    Sathidpak Nantasanti

    2015-11-01

    Full Text Available The recent development of 3D-liver stem cell cultures (hepatic organoids opens up new avenues for gene and/or stem cell therapy to treat liver disease. To test safety and efficacy, a relevant large animal model is essential but not yet established. Because of its shared pathologies and disease pathways, the dog is considered the best model for human liver disease. Here we report the establishment of a long-term canine hepatic organoid culture allowing undifferentiated expansion of progenitor cells that can be differentiated toward functional hepatocytes. We show that cultures can be initiated from fresh and frozen liver tissues using Tru-Cut or fine-needle biopsies. The use of Wnt agonists proved important for canine organoid proliferation and inhibition of differentiation. Finally, we demonstrate that successful gene supplementation in hepatic organoids of COMMD1-deficient dogs restores function and can be an effective means to cure copper storage disease.

  15. G2D: a tool for mining genes associated with disease

    OpenAIRE

    Perez-Iratxeta, Carolina; Wjst, Matthias; Bork, Peer; Andrade, Miguel A

    2005-01-01

    Abstract Background Human inherited diseases can be associated by genetic linkage with one or more genomic regions. The availability of the complete sequence of the human genome allows examining those locations for an associated gene. We previously developed an algorithm to prioritize genes on a chromosomal region according to their possible relation to an inherited disease using a combination of data mining on biomedical databases and gene sequence analysis. Results We have implemented this ...

  16. Prioritization of candidate disease genes by combining topological similarity and semantic similarity.

    Science.gov (United States)

    Liu, Bin; Jin, Min; Zeng, Pan

    2015-10-01

    The identification of gene-phenotype relationships is very important for the treatment of human diseases. Studies have shown that genes causing the same or similar phenotypes tend to interact with each other in a protein-protein interaction (PPI) network. Thus, many identification methods based on the PPI network model have achieved good results. However, in the PPI network, some interactions between the proteins encoded by candidate gene and the proteins encoded by known disease genes are very weak. Therefore, some studies have combined the PPI network with other genomic information and reported good predictive performances. However, we believe that the results could be further improved. In this paper, we propose a new method that uses the semantic similarity between the candidate gene and known disease genes to set the initial probability vector of a random walk with a restart algorithm in a human PPI network. The effectiveness of our method was demonstrated by leave-one-out cross-validation, and the experimental results indicated that our method outperformed other methods. Additionally, our method can predict new causative genes of multifactor diseases, including Parkinson's disease, breast cancer and obesity. The top predictions were good and consistent with the findings in the literature, which further illustrates the effectiveness of our method. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. Allele-specific DNA methylation of disease susceptibility genes in Japanese patients with inflammatory bowel disease.

    Science.gov (United States)

    Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru

    2018-01-01

    Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expressions in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expressions. CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score ([Formula: see text]) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing [Formula: see text] >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite-pyrosequencing was performed. Transcriptome analysis was examined in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expressions were analyzed. We extracted six candidate ASM SNPs around IBD susceptibility genes. The top of [Formula: see text] (0.23) was rs1130368 located on HLA-DQB1. ASM around rs36221701 ([Formula: see text] = 0.14) located near SMAD3 was validated using bisulfite pyrosequencing. The SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016). We confirmed the existence of cis-regulated ASM around

  18. Allele-specific DNA methylation of disease susceptibility genes in Japanese patients with inflammatory bowel disease

    Science.gov (United States)

    Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru

    2018-01-01

    Background Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expressions in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expressions. Methods CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score (ΔRAS¯) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing ΔRAS¯ >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite-pyrosequencing was performed. Transcriptome analysis was examined in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expressions were analyzed. Results We extracted six candidate ASM SNPs around IBD susceptibility genes. The top of ΔRAS¯ (0.23) was rs1130368 located on HLA-DQB1. ASM around rs36221701 (ΔRAS¯ = 0.14) located near SMAD3 was validated using bisulfite pyrosequencing. The SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016). Conclusions We confirmed the existence of cis-regulated ASM around IBD

  19. Cross disease analysis of co-functional microRNA pairs on a reconstructed network of disease-gene-microRNA tripartite.

    Science.gov (United States)

    Peng, Hui; Lan, Chaowang; Zheng, Yi; Hutvagner, Gyorgy; Tao, Dacheng; Li, Jinyan

    2017-03-24

    MicroRNAs always function cooperatively in their regulation of gene expression. Dysfunctions of these co-functional microRNAs can play significant roles in disease development. We are interested in those multi-disease associated co-functional microRNAs that regulate their common dysfunctional target genes cooperatively in the development of multiple diseases. The research is potentially useful for human disease studies at the transcriptional level and for the study of multi-purpose microRNA therapeutics. We designed a computational method to detect multi-disease associated co-functional microRNA pairs and conducted cross disease analysis on a reconstructed disease-gene-microRNA (DGR) tripartite network. The construction of the DGR tripartite network is by the integration of newly predicted disease-microRNA associations with those relationships of diseases, microRNAs and genes maintained by existing databases. The prediction method uses a set of reliable negative samples of disease-microRNA association and a pre-computed kernel matrix instead of kernel functions. From this reconstructed DGR tripartite network, multi-disease associated co-functional microRNA pairs are detected together with their common dysfunctional target genes and ranked by a novel scoring method. We also conducted proof-of-concept case studies on cancer-related co-functional microRNA pairs as well as on non-cancer disease-related microRNA pairs. With the prioritization of the co-functional microRNAs that relate to a series of diseases, we found that the co-function phenomenon is not unusual. We also confirmed that the regulation of the microRNAs for the development of cancers is more complex and have more unique properties than those of non-cancer diseases.

  20. The dynamic landscape of gene regulation during Bombyx mori oogenesis.

    Science.gov (United States)

    Zhang, Qiang; Sun, Wei; Sun, Bang-Yong; Xiao, Yang; Zhang, Ze

    2017-09-11

    Oogenesis in the domestic silkworm (Bombyx mori) is a complex process involving previtellogenesis, vitellogenesis and choriogenesis. During this process, follicles show drastic morphological and physiological changes. However, the genome-wide regulatory profiles of gene expression during oogenesis remain to be determined. In this study, we obtained time-series transcriptome data and used these data to reveal the dynamic landscape of gene regulation during oogenesis. A total of 1932 genes were identified to be differentially expressed among different stages, most of which occurred during the transition from late vitellogenesis to early choriogenesis. Using weighted gene co-expression network analysis, we identified six stage-specific gene modules that correspond to multiple regulatory pathways. Strikingly, the biosynthesis pathway of the molting hormone 20-hydroxyecdysone (20E) was enriched in one of the modules. Further analysis showed that the ecdysteroid 20-hydroxylase gene (CYP314A1) of steroidgenesis genes was mainly expressed in previtellogenesis and early vitellogenesis. However, the 20E-inactivated genes, particularly the ecdysteroid 26-hydroxylase encoding gene (Cyp18a1), were highly expressed in late vitellogenesis. These distinct expression patterns between 20E synthesis and catabolism-related genes might ensure the rapid decline of the hormone titer at the transition point from vitellogenesis to choriogenesis. In addition, we compared landscapes of gene regulation between silkworm (Lepidoptera) and fruit fly (Diptera) oogeneses. Our results show that there is some consensus in the modules of gene co-expression during oogenesis in these insects. The data presented in this study provide new insights into the regulatory mechanisms underlying oogenesis in insects with polytrophic meroistic ovaries. The results also provide clues for further investigating the roles of epigenetic reconfiguration and circadian rhythm in insect oogenesis.

  1. Gene targeting approaches to complex genetic diseases: atherosclerosis and essential hypertension.

    OpenAIRE

    Smithies, O; Maeda, N

    1995-01-01

    Gene targeting allows precise, predetermined changes to be made in a chosen gene in the mouse genome. To date, targeting has been used most often for generation of animals completely lacking the product of a gene of interest. The resulting "knockout" mice have confirmed some hypotheses, have upset others, but have rarely been uninformative. Models of several human genetic diseases have been produced by targeting--including Gaucher disease, cystic fibrosis, and the fragile X syndrome. These di...

  2. Role of genes in oro-dental diseases

    Directory of Open Access Journals (Sweden)

    Kavitha B

    2010-01-01

    Full Text Available In oral cavity, the spectrum of diseases due to genetic alterations ranges from developmental disturbances of teeth to the pre-cancerous and cancerous lesions. Of late, significant progress has been made in the molecular analysis of tumors. With molecular genetic testing emerging as diagnostic, prognostic, and therapeutic approach, a review of genetic alterations ranging from the development of oro-facial structures to the tumors in the head and neck region are addressed in this article. The functional regulatory aspect of genes in relation to oro-facial structures are discussed separately, i.e., in relation to tooth genesis, tooth agenesis (non-syndromic, syndromic, tooth structural alterations, syndromic oro-facial defects, bone diseases, skin diseases (genodermatoses, and malignant tumors. In this literature, various genes involved in the development of the oro-facial structures and tooth in particular are discussed. The genetic basis of disorders in the tooth development (agenesis, hypodontia, tooth structural defects like amelogenesis imperfecta (AI, dentinogenesis imperfecta (DI, and oro-facial structural alterations (various syndromes are explained.

  3. Polymorphisms in miRNA genes and their involvement in autoimmune diseases susceptibility.

    Science.gov (United States)

    Latini, Andrea; Ciccacci, Cinzia; Novelli, Giuseppe; Borgiani, Paola

    2017-08-01

    MicroRNAs (miRNAs) are small non-coding RNA molecules that negatively regulate the expression of multiple protein-encoding genes at the post-transcriptional level. MicroRNAs are involved in different pathways, such as cellular proliferation and differentiation, signal transduction and inflammation, and play crucial roles in the development of several diseases, such as cancer, diabetes, and cardiovascular diseases. They have recently been recognized to play a role also in the pathogenesis of autoimmune diseases. Although the majority of studies are focused on miRNA expression profiles investigation, a growing number of studies have been investigating the role of polymorphisms in miRNA genes in the autoimmune diseases development. Indeed, polymorphisms affecting the miRNA genes can modify the set of targets they regulate or the maturation efficiency. This review is aimed to give an overview about the available studies that have investigated the association of miRNA gene polymorphisms with the susceptibility to various autoimmune diseases and to their clinical phenotypes.

  4. Disease Modeling and Gene Therapy of Copper Storage Disease in Canine Hepatic Organoids

    NARCIS (Netherlands)

    Nantasanti, Sathidpak; Spee, Bart; Kruitwagen, Hedwig S.; Chen, Chen; Geijsen, Niels; Oosterhoff, Loes A.; van Wolferen, Monique E.; Pelaez, Nicolas; Fieten, Hille; Wubbolts, Richard W.; Grinwis, Guy C.; Chan, Jefferson; Huch, Meritxell; Vries, Robert R. G.; Clevers, Hans; de Bruin, Alain; Rothuizen, Jan; Penning, Louis C.; Schotanus, Baukje A.

    2015-01-01

    The recent development of 3D-liver stem cell cultures (hepatic organoids) opens up new avenues for gene and/or stem cell therapy to treat liver disease. To test safety and efficacy, a relevant large animal model is essential but not yet established. Because of its shared pathologies and disease

  5. Two-Way Gene Interaction From Microarray Data Based on Correlation Methods.

    Science.gov (United States)

    Alavi Majd, Hamid; Talebi, Atefeh; Gilany, Kambiz; Khayyer, Nasibeh

    2016-06-01

    Gene networks have generated a massive explosion in the development of high-throughput techniques for monitoring various aspects of gene activity. Networks offer a natural way to model interactions between genes, and extracting gene network information from high-throughput genomic data is an important and difficult task. The purpose of this study is to construct a two-way gene network based on parametric and nonparametric correlation coefficients. The first step in constructing a Gene Co-expression Network is to score all pairs of gene vectors. The second step is to select a score threshold and connect all gene pairs whose scores exceed this value. In the foundation-application study, we constructed two-way gene networks using nonparametric methods, such as Spearman's rank correlation coefficient and Blomqvist's measure, and compared them with Pearson's correlation coefficient. We surveyed six genes of venous thrombosis disease, made a matrix entry representing the score for the corresponding gene pair, and obtained two-way interactions using Pearson's correlation, Spearman's rank correlation, and Blomqvist's coefficient. Finally, these methods were compared with Cytoscape, based on BIND, and Gene Ontology, based on molecular function visual methods; R software version 3.2 and Bioconductor were used to perform these methods. Based on the Pearson and Spearman correlations, the results were the same and were confirmed by Cytoscape and GO visual methods; however, Blomqvist's coefficient was not confirmed by visual methods. Some results of the correlation coefficients are not the same with visualization. The reason may be due to the small number of data.

  6. Life-threatening infectious diseases of childhood: single-gene inborn errors of immunity?

    Science.gov (United States)

    Alcaïs, Alexandre; Quintana-Murci, Lluis; Thaler, David S; Schurr, Erwin; Abel, Laurent; Casanova, Jean-Laurent

    2010-12-01

    The hypothesis that inborn errors of immunity underlie infectious diseases is gaining experimental support. However, the apparent modes of inheritance of predisposition or resistance differ considerably among diseases and among studies. A coherent genetic architecture of infectious diseases is lacking. We suggest here that life-threatening infectious diseases in childhood, occurring in the course of primary infection, result mostly from individually rare but collectively diverse single-gene variations of variable clinical penetrance, whereas the genetic component of predisposition to secondary or reactivation infections in adults is more complex. This model is consistent with (i) the high incidence of most infectious diseases in early childhood, followed by a steady decline; (ii) theoretical modeling of the impact of monogenic or polygenic predisposition on the incidence distribution of infectious diseases before reproductive age; (iii) available molecular evidence from both monogenic and complex genetics of infectious diseases in children and adults; (iv) current knowledge of immunity to primary and secondary or latent infections; (v) the state of the art in the clinical genetics of noninfectious pediatric and adult diseases; and (vi) evolutionary data for the genes underlying single-gene and complex disease risk. With the recent advent of new-generation deep resequencing, this model of single-gene variations underlying severe pediatric infectious diseases is experimentally testable. © 2010 New York Academy of Sciences.

  7. Systematic identification of latent disease-gene associations from PubMed articles.

    Science.gov (United States)

    Zhang, Yuji; Shen, Feichen; Mojarad, Majid Rastegar; Li, Dingcheng; Liu, Sijia; Tao, Cui; Yu, Yue; Liu, Hongfang

    2018-01-01

    Recent scientific advances have accumulated a tremendous amount of biomedical knowledge providing novel insights into the relationship between molecular and cellular processes and diseases. Literature mining is one of the commonly used methods to retrieve and extract information from scientific publications for understanding these associations. However, due to large data volume and complicated associations with noises, the interpretability of such association data for semantic knowledge discovery is challenging. In this study, we describe an integrative computational framework aiming to expedite the discovery of latent disease mechanisms by dissecting 146,245 disease-gene associations from over 25 million of PubMed indexed articles. We take advantage of both Latent Dirichlet Allocation (LDA) modeling and network-based analysis for their capabilities of detecting latent associations and reducing noises for large volume data respectively. Our results demonstrate that (1) the LDA-based modeling is able to group similar diseases into disease topics; (2) the disease-specific association networks follow the scale-free network property; (3) certain subnetwork patterns were enriched in the disease-specific association networks; and (4) genes were enriched in topic-specific biological processes. Our approach offers promising opportunities for latent disease-gene knowledge discovery in biomedical research.

  8. Chronic obstructive pulmonary disease candidate gene prioritization based on metabolic networks and functional information.

    Directory of Open Access Journals (Sweden)

    Xinyan Wang

    Full Text Available Chronic obstructive pulmonary disease (COPD is a multi-factor disease, in which metabolic disturbances played important roles. In this paper, functional information was integrated into a COPD-related metabolic network to assess similarity between genes. Then a gene prioritization method was applied to the COPD-related metabolic network to prioritize COPD candidate genes. The gene prioritization method was superior to ToppGene and ToppNet in both literature validation and functional enrichment analysis. Top-ranked genes prioritized from the metabolic perspective with functional information could promote the better understanding about the molecular mechanism of this disease. Top 100 genes might be potential markers for diagnostic and effective therapies.

  9. Network analysis of differential expression for the identification of disease-causing genes.

    Directory of Open Access Journals (Sweden)

    Daniela Nitsch

    Full Text Available Genetic studies (in particular linkage and association studies identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed-up for biological validation. Recently, computational methods to identify (prioritize the most promising candidates within a region have been proposed, but they are usually not applicable to cases where little is known about the phenotype (no or few confirmed disease genes, fragmentary understanding of the biological cascades involved. We seek to overcome this limitation by replacing knowledge about the biological process by experimental data on differential gene expression between affected and healthy individuals. Considering the problem from the perspective of a gene/protein network, we assess a candidate gene by considering the level of differential expression in its neighborhood under the assumption that strong candidates will tend to be surrounded by differentially expressed neighbors. We define a notion of soft neighborhood where each gene is given a contributing weight, which decreases with the distance from the candidate gene on the protein network. To account for multiple paths between genes, we define the distance using the Laplacian exponential diffusion kernel. We score candidates by aggregating the differential expression of neighbors weighted as a function of distance. Through a randomization procedure, we rank candidates by p-values. We illustrate our approach on four monogenic diseases and successfully prioritize the known disease causing genes.

  10. Coexpression network analysis in abdominal and gluteal adipose tissue reveals regulatory genetic loci for metabolic syndrome and related phenotypes

    DEFF Research Database (Denmark)

    Min, Josine L; Nicholson, George; Halgrimsdottir, Ingileif

    2012-01-01

    Metabolic Syndrome (MetS) is highly prevalent and has considerable public health impact, but its underlying genetic factors remain elusive. To identify gene networks involved in MetS, we conducted whole-genome expression and genotype profiling on abdominal (ABD) and gluteal (GLU) adipose tissue...... and 51 (0.6%) in GLU only. Differential eigengene network analysis of 8,256 shared probesets detected 22 shared modules with high preservation across adipose depots (D(ABD-GLU) = 0.89), seven of which were associated with MetS (FDR P100,000 individuals; rs10282458, affecting expression of RARRES2...... and their interactions influence complex traits such as MetS, integrated analysis of genotypes and coexpression networks across multiple tissues relevant to clinical traits is an efficient strategy to identify novel associations....

  11. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  12. A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes

    DEFF Research Database (Denmark)

    Hansen, Kasper Lage; Hansen, Niclas Tue; Karlberg, Erik, Olof, Linnart

    2008-01-01

    to be overexpressed in the normal tissues where defects cause pathology. In contrast, cancer genes and complexes were not overexpressed in the tissues from which the tumors emanate. We specifically identified a complex involved in XY sex reversal that is testis-specific and down-regulated in ovaries. We also......Heritable diseases are caused by germ-line mutations that, despite tissuewide presence, often lead to tissue-specific pathology. Here, we make a systematic analysis of the link between tissue-specific gene expression and pathological manifestations in many human diseases and cancers. Diseases were...

  13. Lignin, mitochondrial family and photorespiratory transporter classification as case studies in using co-expression, co-response and protein locations to aid in identifying transport functions

    Directory of Open Access Journals (Sweden)

    Takayuki eTohge

    2014-03-01

    Full Text Available Whole genome sequencing and the relative ease of transcript profiling have facilitated the collection and data warehousing of immense quantities of expression data. However, a substantial proportion of genes are not yet functionally annotated a problem which is particularly acute for transport proteins. In Arabidopsis, for example, only a minor fraction of the estimated 700 intracellular transporters have been identified at the molecular genetic level. Furthermore it is only within the last couple of years that critical genes such as those encoding the final transport step required for the long distance transport of sucrose and the first transporter of the core photorespiratory pathway have been identified. Here we will describe how transcriptional coordination between genes of known function and non-annotated genes allows the identification of putative transporters on the premise that such co-expressed genes tend to be functionally related. We will additionally extend this to include the expansion of this approach to include phenotypic information from other levels of cellular organization such as proteomic and metabolomic data and provide case studies wherein this approach has successfully been used to fill knowledge gaps in important metabolic pathways and physiological processes.

  14. Diurnal Transcriptome and Gene Network Represented through Sparse Modeling in Brachypodium distachyon

    Directory of Open Access Journals (Sweden)

    Satoru Koda

    2017-11-01

    Full Text Available We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX model with a group smoothly clipped absolute deviation (SCAD method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon. To reveal the diurnal changes in the transcriptome in B. distachyon, we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon. On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon, aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.

  15. Altered Expression of Genes Implicated in Xylan Biosynthesis Affects Penetration Resistance against Powdery Mildew.

    Science.gov (United States)

    Chowdhury, Jamil; Lück, Stefanie; Rajaraman, Jeyaraman; Douchkov, Dimitar; Shirley, Neil J; Schwerdt, Julian G; Schweizer, Patrick; Fincher, Geoffrey B; Burton, Rachel A; Little, Alan

    2017-01-01

    Heteroxylan has recently been identified as an important component of papillae, which are formed during powdery mildew infection of barley leaves. Deposition of heteroxylan near the sites of attempted fungal penetration in the epidermal cell wall is believed to enhance the physical resistance to the fungal penetration peg and hence to improve pre-invasion resistance. Several glycosyltransferase (GT) families are implicated in the assembly of heteroxylan in the plant cell wall, and are likely to work together in a multi-enzyme complex. Members of key GT families reported to be involved in heteroxylan biosynthesis are up-regulated in the epidermal layer of barley leaves during powdery mildew infection. Modulation of their expression leads to altered susceptibility levels, suggesting that these genes are important for penetration resistance. The highest level of resistance was achieved when a GT43 gene was co-expressed with a GT47 candidate gene, both of which have been predicted to be involved in xylan backbone biosynthesis. Altering the expression level of several candidate heteroxylan synthesis genes can significantly alter disease susceptibility. This is predicted to occur through changes in the amount and structure of heteroxylan in barley papillae.

  16. LncRNAs: emerging players in gene regulation and disease ...

    Indian Academy of Sciences (India)

    and Glavac 2013), accounting for about 20,000 protein coding ... general information on lncRNAs' feature (Da Sacco et al. 2012). ..... mal cells, stabilized Zeb2 intron encompasses an internal ..... cially growth-control genes and cell mobility-induced genes ..... RNAs in development and disease of the central nervous system.

  17. Vitamin D receptor gene variants in Parkinson's disease patients ...

    African Journals Online (AJOL)

    Background: Vitamin D plays an important role in neurodegenerative disorders as a crucial neuro-immunomodulator. Accumulating data provide evidences that vitamin D receptor (VDR) gene is a candidate gene for susceptibility to Parkinson's disease (PD). Aim: To find out whether the risk of the development of sporadic ...

  18. Expression of novel Alzheimer's disease risk genes in control and Alzheimer's disease brains.

    Directory of Open Access Journals (Sweden)

    Celeste M Karch

    Full Text Available Late onset Alzheimer's disease (LOAD etiology is influenced by complex interactions between genetic and environmental risk factors. Large-scale genome wide association studies (GWAS for LOAD have identified 10 novel risk genes: ABCA7, BIN1, CD2AP, CD33, CLU, CR1, EPHA1, MS4A6A, MS4A6E, and PICALM. We sought to measure the influence of GWAS single nucleotide polymorphisms (SNPs and gene expression levels on clinical and pathological measures of AD in brain tissue from the parietal lobe of AD cases and age-matched, cognitively normal controls. We found that ABCA7, CD33, and CR1 expression levels were associated with clinical dementia rating (CDR, with higher expression being associated with more advanced cognitive decline. BIN1 expression levels were associated with disease progression, where higher expression was associated with a delayed age at onset. CD33, CLU, and CR1 expression levels were associated with disease status, where elevated expression levels were associated with AD. Additionally, MS4A6A expression levels were associated with Braak tangle and Braak plaque scores, with elevated expression levels being associated with more advanced brain pathology. We failed to detect an association between GWAS SNPs and gene expression levels in our brain series. The minor allele of rs3764650 in ABCA7 is associated with age at onset and disease duration, and the minor allele of rs670139 in MS4A6E was associated with Braak tangle and Braak plaque score. These findings suggest that expression of some GWAS genes, namely ABCA7, BIN1, CD33, CLU, CR1 and the MS4A family, are altered in AD brains.

  19. Gene therapy and angiogenesis in patients with coronary artery disease

    DEFF Research Database (Denmark)

    Kastrup, Jens

    2010-01-01

    -blind placebo-controlled trials could not confirm the initial high efficacy of either the growth factor protein or the gene therapy approaches observed in earlier small trials. The clinical studies so far have all been without any gene-related serious adverse events. Future trials will focus on whether...... an improvement in clinical results can be obtained with a cocktail of growth factors or by a combination of gene and stem cell therapy in patients with severe coronary artery disease, which cannot be treated effectively with current treatment strategies....... of VEGF and FGF in patients with coronary artery disease. The initial small and unblinded studies with either recombinant growth factor proteins or genes encoding growth factors were encouraging, demonstrating both clinical improvement and evidence of angiogenesis. However, subsequent larger double...

  20. Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data

    Directory of Open Access Journals (Sweden)

    Hu Xiaohua

    2011-07-01

    Full Text Available Abstract Background The wide use of high-throughput DNA microarray technology provide an increasingly detailed view of human transcriptome from hundreds to thousands of genes. Although biomedical researchers typically design microarray experiments to explore specific biological contexts, the relationships between genes are hard to identified because they are complex and noisy high-dimensional data and are often hindered by low statistical power. The main challenge now is to extract valuable biological information from the colossal amount of data to gain insight into biological processes and the mechanisms of human disease. To overcome the challenge requires mathematical and computational methods that are versatile enough to capture the underlying biological features and simple enough to be applied efficiently to large datasets. Methods Unsupervised machine learning approaches provide new and efficient analysis of gene expression profiles. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA and nonnegative matrix factorization (NMF are integrated to identify significant genes and related pathways in microarray gene expression dataset of Alzheimer’s disease. The advantage of these two approaches is they can be performed as a biclustering method by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between these two method lies in ICA assume statistical independence of the expression modes, while NMF need positivity constrains to generate localized gene expression profiles. Results In our work, we performed FastICA and non-smooth NMF methods on DNA microarray gene expression data of Alzheimer’s disease respectively. The simulation results shows that both of the methods can clearly classify severe AD samples from control samples, and

  1. Exploring the Diagnostic Potential of Immune Biomarker Co-expression in Gulf War Illness.

    Science.gov (United States)

    Broderick, Gordon; Fletcher, Mary Ann; Gallagher, Michael; Barnes, Zachary; Vernon, Suzanne D; Klimas, Nancy G

    2018-01-01

    Complex disorders like Gulf War illness (GWI) often defy diagnosis on the basis of a single biomarker and may only be distinguishable by considering the co-expression of multiple markers measured in response to a challenge. We demonstrate the practical application of such an approach using an example where blood was collected from 26 GWI, 13 healthy control subjects, and 9 unhealthy controls with chronic fatigue at three points during a graded exercise challenge. A 3-way multivariate projection model based on 12 markers of endocrine and immune function was constructed using a training set of n = 10 GWI and n = 11 healthy controls. These groups were separated almost completely on the basis of two co-expression patterns. In a separate test set these same features allowed for discrimination of new GWI subjects (n = 16) from unhealthy (n = 9) and healthy control subjects with a sensitivity of 70% and a specificity of 90%.

  2. Analysis of neurodegenerative Mendelian genes in clinically diagnosed Alzheimer Disease.

    Directory of Open Access Journals (Sweden)

    Maria Victoria Fernández

    2017-11-01

    Full Text Available Alzheimer disease (AD, Frontotemporal lobar degeneration (FTD, Amyotrophic lateral sclerosis (ALS and Parkinson disease (PD have a certain degree of clinical, pathological and molecular overlap. Previous studies indicate that causative mutations in AD and FTD/ALS genes can be found in clinical familial AD. We examined the presence of causative and low frequency coding variants in the AD, FTD, ALS and PD Mendelian genes, in over 450 families with clinical history of AD and over 11,710 sporadic cases and cognitive normal participants from North America. Known pathogenic mutations were found in 1.05% of the sporadic cases, in 0.69% of the cognitively normal participants and in 4.22% of the families. A trend towards enrichment, albeit non-significant, was observed for most AD, FTD and PD genes. Only PSEN1 and PINK1 showed consistent association with AD cases when we used ExAC as the control population. These results suggest that current study designs may contain heterogeneity and contamination of the control population, and that current statistical methods for the discovery of novel genes with real pathogenic variants in complex late onset diseases may be inadequate or underpowered to identify genes carrying pathogenic mutations.

  3. Molecular networks and the evolution of human cognitive specializations.

    Science.gov (United States)

    Fontenot, Miles; Konopka, Genevieve

    2014-12-01

    Inroads into elucidating the origins of human cognitive specializations have taken many forms, including genetic, genomic, anatomical, and behavioral assays that typically compare humans to non-human primates. While the integration of all of these approaches is essential for ultimately understanding human cognition, here, we review the usefulness of coexpression network analysis for specifically addressing this question. An increasing number of studies have incorporated coexpression networks into brain expression studies comparing species, disease versus control tissue, brain regions, or developmental time periods. A clearer picture has emerged of the key genes driving brain evolution, as well as the developmental and regional contributions of gene expression patterns important for normal brain development and those misregulated in cognitive diseases. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. Improved soluble expression and characterization of the Hc domain of Clostridium botulinum neurotoxin serotype A in Escherichia coli by using a PCR-synthesized gene and a Trx co-expression strain.

    Science.gov (United States)

    Chen, Rongchang; Shi, Jing; Cai, Kun; Tu, Wei; Hou, Xiaojun; Liu, Hao; Xiao, Le; Wang, Qin; Tang, Yunming; Wang, Hui

    2010-05-01

    Botulinum neurotoxin serotype A (BoNT/A) is an extremely potent bacterial protein toxin. The Hc fragment of BoNT/A (AHc) was shown to be non-toxic, antigenic, and capable of eliciting a protective immunity in animals challenged with homologous BoNT. In this study, we synthesized AHc gene by using T4 DNA ligase and PCR. The AHc was expressed at a high level in Escherichia coli successfully. Because of using the Trx co-expression strain, the expressed AHc is in a soluble and active form. The yield of the purified AHc was about 70mg/L, and its purity was up to 90% through one-step affinity chromatography. The AHc was positively identified by the antibodies raised against BoNT/A using immunological-dot-blot and Western blot assays. AHc was shown to bind with gangliosides and elicit immunity against BoNT/A, indicating that the expressed and purified AHc protein retains a functionally active conformation. Furthermore, the purified AHc has a strong immunogenicity and can be used as a potential subunit candidate vaccine for botulinum toxin serotype A. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  5. [From gene to disease: from the ABCA4 gene to Stargardt disease, cone-rod dystrophy and retinitis pigmentosa

    NARCIS (Netherlands)

    Cremers, F.P.M.; Maugeri, A.; Klevering, B.J.; Hoefsloot, L.H.; Hoyng, C.B.

    2002-01-01

    Autosomal recessive Stargardt disease is caused by mutations in the ABCA4 gene. Mutations in ABCA4 are also found in two-thirds of cases with autosomal recessive cone-rod dystrophy, and a small fraction of patients with autosomal recessive retinitis pigmentosa. Patients with autosomal recessive

  6. Defining the Human Macula Transcriptome and Candidate Retinal Disease Genes UsingEyeSAGE

    Science.gov (United States)

    Rickman, Catherine Bowes; Ebright, Jessica N.; Zavodni, Zachary J.; Yu, Ling; Wang, Tianyuan; Daiger, Stephen P.; Wistow, Graeme; Boon, Kathy; Hauser, Michael A.

    2009-01-01

    Purpose To develop large-scale, high-throughput annotation of the human macula transcriptome and to identify and prioritize candidate genes for inherited retinal dystrophies, based on ocular-expression profiles using serial analysis of gene expression (SAGE). Methods Two human retina and two retinal pigment epithelium (RPE)/choroid SAGE libraries made from matched macula or midperipheral retina and adjacent RPE/choroid of morphologically normal 28- to 66-year-old donors and a human central retina longSAGE library made from 41- to 66-year-old donors were generated. Their transcription profiles were entered into a relational database, EyeSAGE, including microarray expression profiles of retina and publicly available normal human tissue SAGE libraries. EyeSAGE was used to identify retina- and RPE-specific and -associated genes, and candidate genes for retina and RPE disease loci. Differential and/or cell-type specific expression was validated by quantitative and single-cell RT-PCR. Results Cone photoreceptor-associated gene expression was elevated in the macula transcription profiles. Analysis of the longSAGE retina tags enhanced tag-to-gene mapping and revealed alternatively spliced genes. Analysis of candidate gene expression tables for the identified Bardet-Biedl syndrome disease gene (BBS5) in the BBS5 disease region table yielded BBS5 as the top candidate. Compelling candidates for inherited retina diseases were identified. Conclusions The EyeSAGE database, combining three different gene-profiling platforms including the authors’ multidonor-derived retina/RPE SAGE libraries and existing single-donor retina/RPE libraries, is a powerful resource for definition of the retina and RPE transcriptomes. It can be used to identify retina-specific genes, including alternatively spliced transcripts and to prioritize candidate genes within mapped retinal disease regions. PMID:16723438

  7. Molecular detection of disease resistance genes to powdery mildew ...

    African Journals Online (AJOL)

    A study was conducted to detect the presence of disease resistance genes to infection of wheat powdery mildew (Blumeria graminis f. sp. tritici) in selected wheat cultivars from China using molecular markers. Genomic DNA of sixty cultivars was extracted and tested for the presence of selected prominent resistance genes to ...

  8. A computational method based on the integration of heterogeneous networks for predicting disease-gene associations.

    Directory of Open Access Journals (Sweden)

    Xingli Guo

    Full Text Available The identification of disease-causing genes is a fundamental challenge in human health and of great importance in improving medical care, and provides a better understanding of gene functions. Recent computational approaches based on the interactions among human proteins and disease similarities have shown their power in tackling the issue. In this paper, a novel systematic and global method that integrates two heterogeneous networks for prioritizing candidate disease-causing genes is provided, based on the observation that genes causing the same or similar diseases tend to lie close to one another in a network of protein-protein interactions. In this method, the association score function between a query disease and a candidate gene is defined as the weighted sum of all the association scores between similar diseases and neighbouring genes. Moreover, the topological correlation of these two heterogeneous networks can be incorporated into the definition of the score function, and finally an iterative algorithm is designed for this issue. This method was tested with 10-fold cross-validation on all 1,126 diseases that have at least a known causal gene, and it ranked the correct gene as one of the top ten in 622 of all the 1,428 cases, significantly outperforming a state-of-the-art method called PRINCE. The results brought about by this method were applied to study three multi-factorial disorders: breast cancer, Alzheimer disease and diabetes mellitus type 2, and some suggestions of novel causal genes and candidate disease-causing subnetworks were provided for further investigation.

  9. Identifying noncoding risk variants using disease-relevant gene regulatory networks.

    Science.gov (United States)

    Gao, Long; Uzun, Yasin; Gao, Peng; He, Bing; Ma, Xiaoke; Wang, Jiahui; Han, Shizhong; Tan, Kai

    2018-02-16

    Identifying noncoding risk variants remains a challenging task. Because noncoding variants exert their effects in the context of a gene regulatory network (GRN), we hypothesize that explicit use of disease-relevant GRNs can significantly improve the inference accuracy of noncoding risk variants. We describe Annotation of Regulatory Variants using Integrated Networks (ARVIN), a general computational framework for predicting causal noncoding variants. It employs a set of novel regulatory network-based features, combined with sequence-based features to infer noncoding risk variants. Using known causal variants in gene promoters and enhancers in a number of diseases, we show ARVIN outperforms state-of-the-art methods that use sequence-based features alone. Additional experimental validation using reporter assay further demonstrates the accuracy of ARVIN. Application of ARVIN to seven autoimmune diseases provides a holistic view of the gene subnetwork perturbed by the combinatorial action of the entire set of risk noncoding mutations.

  10. Correlation between NFATC1 gene polymorphisms and congenital heart disease in children.

    Science.gov (United States)

    Li, C-L; Niu, L; Fu, M-Y; Tian, J; Wang, Q-W; An, X-J

    2017-08-01

    To analyze the links between NFATC1 gene polymorphism and congenital heart disease in children. In the present study, we selected 85 children patients with congenital heart disease who were hospitalized from February 2013 to February 2015 as research subjects (observation group), and 92 healthy subjects as control group. Restriction fragment length polymorphism (RFLP) was used for analysis of NFATC1 gene in samples from each group. The distribution of NFATC1 genotype and allele between the observation group (children with congenital heart disease) and the control group showed no significant difference (p >0.05), but AA, GG genotypes, and allele frequency between pathological samples of children with congenital heart disease and the control group displayed significant difference (p congenital heart disease in observation group also showed a difference, i.e., homozygote (AA, GG) ratio in children with severe congenital heart disease is relatively high. There is a correlation between NFATC1 genes and the incidence of congenital heart disease in children, and a correlation between different genotypes and allele frequency and the incidence of the disease.

  11. DDMGD: the database of text-mined associations between genes methylated in diseases from different species

    KAUST Repository

    Raies, A. B.

    2014-11-14

    Gathering information about associations between methylated genes and diseases is important for diseases diagnosis and treatment decisions. Recent advancements in epigenetics research allow for large-scale discoveries of associations of genes methylated in diseases in different species. Searching manually for such information is not easy, as it is scattered across a large number of electronic publications and repositories. Therefore, we developed DDMGD database (http://www.cbrc.kaust.edu.sa/ddmgd/) to provide a comprehensive repository of information related to genes methylated in diseases that can be found through text mining. DDMGD\\'s scope is not limited to a particular group of genes, diseases or species. Using the text mining system DEMGD we developed earlier and additional post-processing, we extracted associations of genes methylated in different diseases from PubMed Central articles and PubMed abstracts. The accuracy of extracted associations is 82% as estimated on 2500 hand-curated entries. DDMGD provides a user-friendly interface facilitating retrieval of these associations ranked according to confidence scores. Submission of new associations to DDMGD is provided. A comparison analysis of DDMGD with several other databases focused on genes methylated in diseases shows that DDMGD is comprehensive and includes most of the recent information on genes methylated in diseases.

  12. Human amyloid β peptide and tau co-expression impairs behavior and causes specific gene expression changes in Caenorhabditis elegans.

    Science.gov (United States)

    Wang, Chenyin; Saar, Valeria; Leung, Ka Lai; Chen, Liang; Wong, Garry

    2018-01-01

    Alzheimer's disease (AD) is a progressive neurodegenerative disorder characterized by the presence of extracellular amyloid plaques consisting of Amyloid-β peptide (Aβ) aggregates and neurofibrillary tangles formed by aggregation of hyperphosphorylated microtubule-associated protein tau. We generated a novel invertebrate model of AD by crossing Aβ1-42 (strain CL2355) with either pro-aggregating tau (strain BR5270) or anti-aggregating tau (strain BR5271) pan-neuronal expressing transgenic Caenorhabditis elegans. The lifespan and progeny viability of the double transgenic strains were significantly decreased compared with wild type N2 (P5E-21). RNA interference of 13 available top up-regulated genes in Aβ1-42+pro-aggregating tau animals revealed that F-box family genes and nep-4 could enhance life span deficits and chemotaxis deficits while Y39G8C.2 (TTBK2) could suppress these behaviors. Comparing the list of regulated genes from C. elegans to the top 60 genes related to human AD confirmed an overlap of 8 genes: patched homolog 1, PTCH1 (ptc-3), the Rab GTPase activating protein, TBC1D16 (tbc-16), the WD repeat and FYVE domain-containing protein 3, WDFY3 (wdfy-3), ADP-ribosylation factor guanine nucleotide exchange factor 2, ARFGEF2 (agef-1), Early B-cell Factor, EBF1 (unc-3), d-amino-acid oxidase, DAO (daao-1), glutamate receptor, metabotropic 1, GRM1 (mgl-2), prolyl 4-hydroxylase subunit alpha 2, P4HA2 (dpy-18 and phy-2). Taken together, our C. elegans double transgenic model provides insight on the fundamental neurobiologic processes underlying human AD and recapitulates selected transcriptomic changes observed in human AD brains. Copyright © 2017 Elsevier Inc. All rights reserved.

  13. Integrated in silico Analyses of Regulatory and Metabolic Networks of Synechococcus sp. PCC 7002 Reveal Relationships between Gene Centrality and Essentiality

    Directory of Open Access Journals (Sweden)

    Hyun-Seob Song

    2015-03-01

    Full Text Available Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as “topologically important.” Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termed as “functionally important” genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.

  14. Gene networks specific for innate immunity define post-traumatic stress disorder.

    Science.gov (United States)

    Breen, M S; Maihofer, A X; Glatt, S J; Tylee, D S; Chandler, S D; Tsuang, M T; Risbrough, V B; Baker, D G; O'Connor, D T; Nievergelt, C M; Woelk, C H

    2015-12-01

    The molecular factors involved in the development of Post-Traumatic Stress Disorder (PTSD) remain poorly understood. Previous transcriptomic studies investigating the mechanisms of PTSD apply targeted approaches to identify individual genes under a cross-sectional framework lack a holistic view of the behaviours and properties of these genes at the system-level. Here we sought to apply an unsupervised gene-network based approach to a prospective experimental design using whole-transcriptome RNA-Seq gene expression from peripheral blood leukocytes of U.S. Marines (N=188), obtained both pre- and post-deployment to conflict zones. We identified discrete groups of co-regulated genes (i.e., co-expression modules) and tested them for association to PTSD. We identified one module at both pre- and post-deployment containing putative causal signatures for PTSD development displaying an over-expression of genes enriched for functions of innate-immune response and interferon signalling (Type-I and Type-II). Importantly, these results were replicated in a second non-overlapping independent dataset of U.S. Marines (N=96), further outlining the role of innate immune and interferon signalling genes within co-expression modules to explain at least part of the causal pathophysiology for PTSD development. A second module, consequential of trauma exposure, contained PTSD resiliency signatures and an over-expression of genes involved in hemostasis and wound responsiveness suggesting that chronic levels of stress impair proper wound healing during/after exposure to the battlefield while highlighting the role of the hemostatic system as a clinical indicator of chronic-based stress. These findings provide novel insights for early preventative measures and advanced PTSD detection, which may lead to interventions that delay or perhaps abrogate the development of PTSD.

  15. Systems Pharmacology-Based Approach of Connecting Disease Genes in Genome-Wide Association Studies with Traditional Chinese Medicine.

    Science.gov (United States)

    Kim, Jihye; Yoo, Minjae; Shin, Jimin; Kim, Hyunmin; Kang, Jaewoo; Tan, Aik Choon

    2018-01-01

    Traditional Chinese medicine (TCM) originated in ancient China has been practiced over thousands of years for treating various symptoms and diseases. However, the molecular mechanisms of TCM in treating these diseases remain unknown. In this study, we employ a systems pharmacology-based approach for connecting GWAS diseases with TCM for potential drug repurposing and repositioning. We studied 102 TCM components and their target genes by analyzing microarray gene expression experiments. We constructed disease-gene networks from 2558 GWAS studies. We applied a systems pharmacology approach to prioritize disease-target genes. Using this bioinformatics approach, we analyzed 14,713 GWAS disease-TCM-target gene pairs and identified 115 disease-gene pairs with q value < 0.2. We validated several of these GWAS disease-TCM-target gene pairs with literature evidence, demonstrating that this computational approach could reveal novel indications for TCM. We also develop TCM-Disease web application to facilitate the traditional Chinese medicine drug repurposing efforts. Systems pharmacology is a promising approach for connecting GWAS diseases with TCM for potential drug repurposing and repositioning. The computational approaches described in this study could be easily expandable to other disease-gene network analysis.

  16. Digital Gene Expression Analysis to Screen Disease Resistance-Relevant Genes from Leaves of Herbaceous Peony (Paeonia lactiflora Pall. Infected by Botrytis cinerea.

    Directory of Open Access Journals (Sweden)

    Saijie Gong

    Full Text Available Herbaceous peony (Paeonia lactiflora Pall. is a well-known traditional flower in China and is widely used for landscaping and garden greening due to its high ornamental value. However, disease spots usually appear after the flowering of the plant and may result in the withering of the plant in severe cases. This study examined the disease incidence in an herbaceous peony field in the Yangzhou region, Jiangsu Province. Based on morphological characteristics and molecular data, the disease in this area was identified as a gray mold caused by Botrytis cinerea. Based on previously obtained transcriptome data, eight libraries generated from two herbaceous peony cultivars 'Zifengyu' and 'Dafugui' with different susceptibilities to the disease were then analyzed using digital gene expression profiling (DGE. Thousands of differentially expressed genes (DEGs were screened by comparing the eight samples, and these genes were annotated using the Gene ontology (GO and Kyoto encyclopedia of genes and genomes (KEGG database. The pathways related to plant-pathogen interaction, secondary metabolism synthesis and antioxidant system were concentrated, and 51, 76, and 13 disease resistance-relevant candidate genes were identified, respectively. The expression patterns of these candidate genes differed between the two cultivars: their expression of the disease-resistant cultivar 'Zifengyu' sharply increased during the early stages of infection, while it was relatively subdued in the disease-sensitive cultivar 'Dafugui'. A selection of ten candidate genes was evaluated by quantitative real-time PCR (qRT-PCR to validate the DGE data. These results revealed the transcriptional changes that took place during the interaction of herbaceous peony with B. cinerea, providing insight into the molecular mechanisms of host resistance to gray mold.

  17. Identification of a developmental gene expression signature, including HOX genes, for the normal human colonic crypt stem cell niche: overexpression of the signature parallels stem cell overpopulation during colon tumorigenesis.

    Science.gov (United States)

    Bhatlekar, Seema; Addya, Sankar; Salunek, Moreh; Orr, Christopher R; Surrey, Saul; McKenzie, Steven; Fields, Jeremy Z; Boman, Bruce M

    2014-01-15

    Our goal was to identify a unique gene expression signature for human colonic stem cells (SCs). Accordingly, we determined the gene expression pattern for a known SC-enriched region--the crypt bottom. Colonic crypts and isolated crypt subsections (top, middle, and bottom) were purified from fresh, normal, human, surgical specimens. We then used an innovative strategy that used two-color microarrays (∼18,500 genes) to compare gene expression in the crypt bottom with expression in the other crypt subsections (middle or top). Array results were validated by PCR and immunostaining. About 25% of genes analyzed were expressed in crypts: 88 preferentially in the bottom, 68 in the middle, and 131 in the top. Among genes upregulated in the bottom, ∼30% were classified as growth and/or developmental genes including several in the PI3 kinase pathway, a six-transmembrane protein STAMP1, and two homeobox (HOXA4, HOXD10) genes. qPCR and immunostaining validated that HOXA4 and HOXD10 are selectively expressed in the normal crypt bottom and are overexpressed in colon carcinomas (CRCs). Immunostaining showed that HOXA4 and HOXD10 are co-expressed with the SC markers CD166 and ALDH1 in cells at the normal crypt bottom, and the number of these co-expressing cells is increased in CRCs. Thus, our findings show that these two HOX genes are selectively expressed in colonic SCs and that HOX overexpression in CRCs parallels the SC overpopulation that occurs during CRC development. Our study suggests that developmental genes play key roles in the maintenance of normal SCs and crypt renewal, and contribute to the SC overpopulation that drives colon tumorigenesis.

  18. Targeted delivery of genes to endothelial cells and cell- and gene-based therapy in pulmonary vascular diseases.

    Science.gov (United States)

    Suen, Colin M; Mei, Shirley H J; Kugathasan, Lakshmi; Stewart, Duncan J

    2013-10-01

    Pulmonary arterial hypertension (PAH) is a devastating disease that, despite significant advances in medical therapies over the last several decades, continues to have an extremely poor prognosis. Gene therapy is a method to deliver therapeutic genes to replace defective or mutant genes or supplement existing cellular processes to modify disease. Over the last few decades, several viral and nonviral methods of gene therapy have been developed for preclinical PAH studies with varying degrees of efficacy. However, these gene delivery methods face challenges of immunogenicity, low transduction rates, and nonspecific targeting which have limited their translation to clinical studies. More recently, the emergence of regenerative approaches using stem and progenitor cells such as endothelial progenitor cells (EPCs) and mesenchymal stem cells (MSCs) have offered a new approach to gene therapy. Cell-based gene therapy is an approach that augments the therapeutic potential of EPCs and MSCs and may deliver on the promise of reversal of established PAH. These new regenerative approaches have shown tremendous potential in preclinical studies; however, large, rigorously designed clinical studies will be necessary to evaluate clinical efficacy and safety. © 2013 American Physiological Society. Compr Physiol 3:1749-1779, 2013.

  19. Two Alzheimer’s disease risk genes increase entorhinal cortex volume in young adults

    Directory of Open Access Journals (Sweden)

    Amanda Marie Dibattista

    2014-10-01

    Full Text Available Alzheimer’s disease (AD risk genes alter brain structure and function decades before disease onset. Apolipoprotein E (APOE is the strongest known genetic risk factor for Alzheimer’s disease, and a related gene, apolipoprotein J (APOJ, also affects disease risk. However, the extent to which these genes affect brain structure in young adults remains unclear. Here, we report that AD risk alleles of these two genes, APOE-ε4 and APOJ-C, cumulatively alter brain volume in young adults. Using voxel-based morphometry in 57 individuals, we examined the entorhinal cortex, one of the earliest brain regions affected in AD pathogenesis. APOE-ε4 carriers exhibited higher right entorhinal cortex volume compared to non-carriers. Interestingly, APOJ-C risk genotype was associated with higher bilateral entorhinal cortex volume in non-APOE-ε4 carriers. To determine the combined disease risk of APOE and APOJ status per subject, we used cumulative odds ratios as regressors for volumetric measurements. Higher disease risk corresponded to greater right entorhinal cortex volume. These results suggest that, years before disease onset, two key AD genetic risk factors may exert influence on the structure of a brain region where AD pathogenesis takes root.

  20. DDMGD: the database of text-mined associations between genes methylated in diseases from different species.

    Science.gov (United States)

    Bin Raies, Arwa; Mansour, Hicham; Incitti, Roberto; Bajic, Vladimir B

    2015-01-01

    Gathering information about associations between methylated genes and diseases is important for diseases diagnosis and treatment decisions. Recent advancements in epigenetics research allow for large-scale discoveries of associations of genes methylated in diseases in different species. Searching manually for such information is not easy, as it is scattered across a large number of electronic publications and repositories. Therefore, we developed DDMGD database (http://www.cbrc.kaust.edu.sa/ddmgd/) to provide a comprehensive repository of information related to genes methylated in diseases that can be found through text mining. DDMGD's scope is not limited to a particular group of genes, diseases or species. Using the text mining system DEMGD we developed earlier and additional post-processing, we extracted associations of genes methylated in different diseases from PubMed Central articles and PubMed abstracts. The accuracy of extracted associations is 82% as estimated on 2500 hand-curated entries. DDMGD provides a user-friendly interface facilitating retrieval of these associations ranked according to confidence scores. Submission of new associations to DDMGD is provided. A comparison analysis of DDMGD with several other databases focused on genes methylated in diseases shows that DDMGD is comprehensive and includes most of the recent information on genes methylated in diseases. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Integrating genome-wide genetic variations and monocyte expression data reveals trans-regulated gene modules in humans.

    Directory of Open Access Journals (Sweden)

    Maxime Rotival

    2011-12-01

    Full Text Available One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 European unrelated subjects. We applied a method of extraction of expression patterns-independent component analysis-to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739, previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1 is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178, which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated to a pattern strongly correlating to blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644 was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes. This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the

  2. Tissue-specific functional networks for prioritizing phenotype and disease genes.

    Directory of Open Access Journals (Sweden)

    Yuanfang Guan

    Full Text Available Integrated analyses of functional genomics data have enormous potential for identifying phenotype-associated genes. Tissue-specificity is an important aspect of many genetic diseases, reflecting the potentially different roles of proteins and pathways in diverse cell lineages. Accounting for tissue specificity in global integration of functional genomics data is challenging, as "functionality" and "functional relationships" are often not resolved for specific tissue types. We address this challenge by generating tissue-specific functional networks, which can effectively represent the diversity of protein function for more accurate identification of phenotype-associated genes in the laboratory mouse. Specifically, we created 107 tissue-specific functional relationship networks through integration of genomic data utilizing knowledge of tissue-specific gene expression patterns. Cross-network comparison revealed significantly changed genes enriched for functions related to specific tissue development. We then utilized these tissue-specific networks to predict genes associated with different phenotypes. Our results demonstrate that prediction performance is significantly improved through using the tissue-specific networks as compared to the global functional network. We used a testis-specific functional relationship network to predict genes associated with male fertility and spermatogenesis phenotypes, and experimentally confirmed one top prediction, Mbyl1. We then focused on a less-common genetic disease, ataxia, and identified candidates uniquely predicted by the cerebellum network, which are supported by both literature and experimental evidence. Our systems-level, tissue-specific scheme advances over traditional global integration and analyses and establishes a prototype to address the tissue-specific effects of genetic perturbations, diseases and drugs.

  3. Characterization of Chemically Induced Liver Injuries Using Gene Co-Expression Modules

    Science.gov (United States)

    2014-09-16

    evaluated the periportal fibrosis gene signature in the GEO dataset - GSE13747 [34]. In this dataset, liver fibrosis was induced by bile duct ...dataset, liver fibrosis was induced by bile duct ligation. Figure 10-D shows the observed correlation between log-ratios of periportal fibrosis...at 15 days of exposure obtained from TG-GATEs, and D) liver fibrosis produced by bile duct ligation obtained from GSE13747. doi:10.1371/journal.pone

  4. Evaluation of the norrie disease gene in a family with incontinentia pigmenti.

    Science.gov (United States)

    Shastry, B S; Trese, M T

    2000-01-01

    Incontinentia pigmenti (IP) is an ectodermal multisystem disorder which can affect dental, ocular, cardiac and neurologic structures. The ocular changes of IP can have a very similar appearance to the retinal detachment of X-linked familial exudative vitreoretinopathy, which has been shown to be caused by the mutations in the Norrie disease gene. Therefore, it is of interest to determine whether similar mutations in the gene can account for the retinal pathology in patients with IP. To test our hypothesis, we have analyzed the entire Norrie disease gene for a family with IP, by single strand conformational polymorphism followed by DNA sequencing. The sequencing data revealed no disease-specific sequence alterations. These data suggest that ocular findings of IP are perhaps associated with different genes and there is no direct relationship between the genotype and phenotype. Copyright 2000 S. Karger AG, Basel

  5. Importance of correlation between gene expression levels: application to the type I interferon signature in rheumatoid arthritis.

    Science.gov (United States)

    Reynier, Frédéric; Petit, Fabien; Paye, Malick; Turrel-Davin, Fanny; Imbert, Pierre-Emmanuel; Hot, Arnaud; Mougin, Bruno; Miossec, Pierre

    2011-01-01

    The analysis of gene expression data shows that many genes display similarity in their expression profiles suggesting some co-regulation. Here, we investigated the co-expression patterns in gene expression data and proposed a correlation-based research method to stratify individuals. Using blood from rheumatoid arthritis (RA) patients, we investigated the gene expression profiles from whole blood using Affymetrix microarray technology. Co-expressed genes were analyzed by a biclustering method, followed by gene ontology analysis of the relevant biclusters. Taking the type I interferon (IFN) pathway as an example, a classification algorithm was developed from the 102 RA patients and extended to 10 systemic lupus erythematosus (SLE) patients and 100 healthy volunteers to further characterize individuals. We developed a correlation-based algorithm referred to as Classification Algorithm Based on a Biological Signature (CABS), an alternative to other approaches focused specifically on the expression levels. This algorithm applied to the expression of 35 IFN-related genes showed that the IFN signature presented a heterogeneous expression between RA, SLE and healthy controls which could reflect the level of global IFN signature activation. Moreover, the monitoring of the IFN-related genes during the anti-TNF treatment identified changes in type I IFN gene activity induced in RA patients. In conclusion, we have proposed an original method to analyze genes sharing an expression pattern and a biological function showing that the activation levels of a biological signature could be characterized by its overall state of correlation.

  6. Widespread and highly persistent gene transfer to the CNS by retrovirus vector in utero: implication for gene therapy to Krabbe disease.

    Science.gov (United States)

    Shen, Jin-Song; Meng, Xing-Li; Yokoo, Takashi; Sakurai, Ken; Watabe, Kazuhiko; Ohashi, Toya; Eto, Yoshikatsu

    2005-05-01

    Brain-directed prenatal gene therapy may benefit some lysosomal storage diseases that affect the central nervous system (CNS) before birth. Our previous study showed that intrauterine introduction of recombinant adenoviruses into cerebral ventricles results in efficient gene transfer to the CNS in the mouse. However, transgene expression decreased with time due to the non-integrative property of adenoviral vectors. In this study, in order to obtain permanent gene transduction, we investigated the feasibility of retrovirus-mediated in utero gene transduction. Concentrated retrovirus encoding the LacZ gene was injected into the cerebral ventricles of the embryos of normal and twitcher mice (a murine model of Krabbe disease) at embryonic day 12. The distribution and maintenance of the transgene expression in the recipient brain were analyzed histochemically, biochemically and by the quantitative polymerase chain reaction method pre- and postnatally. Efficient and highly persistent gene transduction to the brain was achieved both in normal and the twitcher mouse. Transduced neurons, astrocytes and oligodendrocytes were distributed throughout the brain. The transduced LacZ gene, its transcript and protein expression in the brain were maintained for 14 months without decrement. In addition, gene transduction to multiple tissues other than the brain was also detected at low levels. This study suggests that brain-directed in utero gene transfer using retrovirus vector may be beneficial to the treatment of lysosomal storage diseases with severe brain damage early in life, such as Krabbe disease. Copyright (c) 2005 John Wiley & Sons, Ltd.

  7. Gene expression patterns in peripheral blood correlate with the extent of coronary artery disease.

    Directory of Open Access Journals (Sweden)

    Peter R Sinnaeve

    Full Text Available Systemic and local inflammation plays a prominent role in the pathogenesis of atherosclerotic coronary artery disease, but the relationship of whole blood gene expression changes with coronary disease remains unclear. We have investigated whether gene expression patterns in peripheral blood correlate with the severity of coronary disease and whether these patterns correlate with the extent of atherosclerosis in the vascular wall. Patients were selected according to their coronary artery disease index (CADi, a validated angiographical measure of the extent of coronary atherosclerosis that correlates with outcome. RNA was extracted from blood of 120 patients with at least a stenosis greater than 50% (CADi > or = 23 and from 121 controls without evidence of coronary stenosis (CADi = 0. 160 individual genes were found to correlate with CADi (rho > 0.2, P<0.003. Prominent differential expression was observed especially in genes involved in cell growth, apoptosis and inflammation. Using these 160 genes, a partial least squares multivariate regression model resulted in a highly predictive model (r(2 = 0.776, P<0.0001. The expression pattern of these 160 genes in aortic tissue also predicted the severity of atherosclerosis in human aortas, showing that peripheral blood gene expression associated with coronary atherosclerosis mirrors gene expression changes in atherosclerotic arteries. In conclusion, the simultaneous expression pattern of 160 genes in whole blood correlates with the severity of coronary artery disease and mirrors expression changes in the atherosclerotic vascular wall.

  8. Imaging the impact of genes on Parkinson's disease

    DEFF Research Database (Denmark)

    van der Vegt, J P M; van Nuenen, B F L; Bloem, B R

    2009-01-01

    by the discovery of mutations in single genes that can cause autosomal dominant (alpha-synuclein (SNCA)) and leucine rich repeat kinase 2 (LRRK2) gene) or recessive (Parkin, PTEN-induced putative kinase 1 (PINK1), DJ-1, and ATP13A2 gene) forms of PD. Here, we review how structural and functional neuroimaging...... of individuals carrying a mutation in one of the PD genes has offered a unique avenue of research into the pathogenesis of PD. In symptomatic mutation carriers (i.e. those with overt disease), brain mapping can help to link the molecular pathogenesis of PD more directly with functional and structural changes...... monogenic forms of PD, common polymorphisms in genes that influence mono-aminergic signaling or synaptic plasticity may have modifying effects on distinct aspects of PD. We also discuss how functional and structural neuroimaging can be used to better characterize these genotype-phenotype correlations....

  9. Co-expression of Nisin Z and Leucocin C as a Basis for Effective Protection Against Listeria monocytogenes in Pasteurized Milk

    Directory of Open Access Journals (Sweden)

    Yuxin Fu

    2018-03-01

    Full Text Available Nisin, an important bacteriocin from Lactococcus lactis subsp., is primarily active against various Gram-positive bacteria. Leucocin C, produced by Leuconostoc carnosum 4010, is a class IIa bacteriocin used to inhibit the growth of Listeria monocytogenes. Because two bacteriocins have different modes of action, the combined use of them could be a potential strategy for effective inhibition of foodborne pathogens. In this study, L. lactis N8-r-lecCI (N8 harboring lecCI gene coexpressing nisin–leucocin C was constructed based on the food-grade carrier L. lactis N8. Production of both bacteriocins was stably maintained. Antimicrobial measurements showed that the recombinant strain is effectively against Listeria monocytogenes and Staphylococcus aureus and moderately against Salmonella enterica serovar Enteritidis and Escherichia coli because of its stronger antibacterial activity than the parental strain, this result first demonstrated that the co-expression of nisin and leucocin C results in highly efficient antimicrobial activity. The checkerboard assay showed that the antibacterial activity of L. lactis N8-r-lecCI supernatant was enhanced in the presence of low concentration of EDTA. Analysis of the scanning electron microscope image showed the biggest cellular morphology change in L. monocytogenes treated with a mixture of EDTA and L. lactis N8-r-lecCI supernatant. The practical effect was verified in pasteurized milk through time-kill assay. The L. lactis N8-r-lecCI strain expressing both nisin and leucocin C has a promising application prospect in pasteurized milk processing and preservation because of its strong antibacterial activity.

  10. Discovering hidden relationships between renal diseases and regulated genes through 3D network visualizations

    Directory of Open Access Journals (Sweden)

    Bhavnani Suresh K

    2010-11-01

    Full Text Available Abstract Background In a recent study, two-dimensional (2D network layouts were used to visualize and quantitatively analyze the relationship between chronic renal diseases and regulated genes. The results revealed complex relationships between disease type, gene specificity, and gene regulation type, which led to important insights about the underlying biological pathways. Here we describe an attempt to extend our understanding of these complex relationships by reanalyzing the data using three-dimensional (3D network layouts, displayed through 2D and 3D viewing methods. Findings The 3D network layout (displayed through the 3D viewing method revealed that genes implicated in many diseases (non-specific genes tended to be predominantly down-regulated, whereas genes regulated in a few diseases (disease-specific genes tended to be up-regulated. This new global relationship was quantitatively validated through comparison to 1000 random permutations of networks of the same size and distribution. Our new finding appeared to be the result of using specific features of the 3D viewing method to analyze the 3D renal network. Conclusions The global relationship between gene regulation and gene specificity is the first clue from human studies that there exist common mechanisms across several renal diseases, which suggest hypotheses for the underlying mechanisms. Furthermore, the study suggests hypotheses for why the 3D visualization helped to make salient a new regularity that was difficult to detect in 2D. Future research that tests these hypotheses should enable a more systematic understanding of when and how to use 3D network visualizations to reveal complex regularities in biological networks.

  11. Identification of novel type 2 diabetes candidate genes involved in the crosstalk between the mitochondrial and the insulin signaling systems.

    Directory of Open Access Journals (Sweden)

    Josep M Mercader

    Full Text Available Type 2 Diabetes (T2D is a highly prevalent chronic metabolic disease with strong co-morbidity with obesity and cardiovascular diseases. There is growing evidence supporting the notion that a crosstalk between mitochondria and the insulin signaling cascade could be involved in the etiology of T2D and insulin resistance. In this study we investigated the molecular basis of this crosstalk by using systems biology approaches. We combined, filtered, and interrogated different types of functional interaction data, such as direct protein-protein interactions, co-expression analyses, and metabolic and signaling dependencies. As a result, we constructed the mitochondria-insulin (MITIN network, which highlights 286 genes as candidate functional linkers between these two systems. The results of internal gene expression analysis of three independent experimental models of mitochondria and insulin signaling perturbations further support the connecting roles of these genes. In addition, we further assessed whether these genes are involved in the etiology of T2D using the genome-wide association study meta-analysis from the DIAGRAM consortium, involving 8,130 T2D cases and 38,987 controls. We found modest enrichment of genes associated with T2D amongst our linker genes (p = 0.0549, including three already validated T2D SNPs and 15 additional SNPs, which, when combined, were collectively associated to increased fasting glucose levels according to MAGIC genome wide meta-analysis (p = 8.12×10(-5. This study highlights the potential of combining systems biology, experimental, and genome-wide association data mining for identifying novel genes and related variants that increase vulnerability to complex diseases.

  12. Genomic analysis of primordial dwarfism reveals novel disease genes.

    Science.gov (United States)

    Shaheen, Ranad; Faqeih, Eissa; Ansari, Shinu; Abdel-Salam, Ghada; Al-Hassnan, Zuhair N; Al-Shidi, Tarfa; Alomar, Rana; Sogaty, Sameera; Alkuraya, Fowzan S

    2014-02-01

    Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in defining clinical subgroups. In this study, we present the results of clinical and genomic characterization of 16 new patients in whom a broad definition of PD was used (e.g., 3M syndrome was included). We report a novel PD syndrome with distinct facies in two unrelated patients, each with a different homozygous truncating mutation in CRIPT. Our analysis also reveals, in addition to mutations in known PD disease genes, the first instance of biallelic truncating BRCA2 mutation causing PD with normal bone marrow analysis. In addition, we have identified a novel locus for Seckel syndrome based on a consanguineous multiplex family and identified a homozygous truncating mutation in DNA2 as the likely cause. An additional novel PD disease candidate gene XRCC4 was identified by autozygome/exome analysis, and the knockout mouse phenotype is highly compatible with PD. Thus, we add a number of novel genes to the growing list of PD-linked genes, including one which we show to be linked to a novel PD syndrome with a distinct facial appearance. PD is extremely heterogeneous genetically and clinically, and genomic tools are often required to reach a molecular diagnosis.

  13. Molecular analysis of the NDP gene in two families with Norrie disease.

    Science.gov (United States)

    Rivera-Vega, M Refugio; Chiñas-Lopez, Silvet; Vaca, Ana Luisa Jimenez; Arenas-Sordo, M Luz; Kofman-Alfaro, Susana; Messina-Baas, Olga; Cuevas-Covarrubias, Sergio Alberto

    2005-04-01

    To describe the molecular defects in the Norrie disease protein (NDP) gene in two families with Norrie disease (ND). We analysed two families with ND at molecular level through polymerase chain reaction, DNA sequence analysis and GeneScan. Two molecular defects found in the NDP gene were: a missense mutation (265C > G) within codon 97 that resulted in the interchange of arginine by proline, and a partial deletion in the untranslated 3' region of exon 3 of the NDP gene. Clinical findings were more severe in the family that presented the partial deletion. We also diagnosed the carrier status of one daughter through GeneScan; this method proved to be a useful tool for establishing female carriers of ND. Here we report two novel mutations in the NDP gene in Mexican patients and propose that GeneScan is a viable mean of establishing ND carrier status.

  14. Latest progress of BIGH3 gene in corneal diseases and diabetic retinopathy

    Directory of Open Access Journals (Sweden)

    Fan-Qian Song

    2017-03-01

    Full Text Available BIGH3 gene plays an important role in ocular diseases. On the one hand, it is closely related to the occurrence of corneal diseases. BIGH3 gene can inhibit corneal neovascularization, lead to corneal dystrophy, participate in keratoconus formation. On the other hand, it can lead to the formation of neovascularization in diabetic retinopathy. The latest experiments show that TGF beta secreted by macrophages can promote the expression of BIGH3 mRNA and BIGH3 protein, and promote apoptosis of retinal endothelial cells and pericytes, which leads to the formation of neovascularization in diabetic retinopathy. This article will describe the new progress of BIGH3 gene in ocular diseases from several aspects as mentioned above.

  15. Gene therapy for sickle cell disease: An update.

    Science.gov (United States)

    Demirci, Selami; Uchida, Naoya; Tisdale, John F

    2018-05-30

    Sickle cell disease (SCD) is one of the most common life-threatening monogenic diseases affecting millions of people worldwide. Allogenic hematopietic stem cell transplantation is the only known cure for the disease with high success rates, but the limited availability of matched sibling donors and the high risk of transplantation-related side effects force the scientific community to envision additional therapies. Ex vivo gene therapy through globin gene addition has been investigated extensively and is currently being tested in clinical trials that have begun reporting encouraging data. Recent improvements in our understanding of the molecular pathways controlling mammalian erythropoiesis and globin switching offer new and exciting therapeutic options. Rapid and substantial advances in genome engineering tools, particularly CRISPR/Cas9, have raised the possibility of genetic correction in induced pluripotent stem cells as well as patient-derived hematopoietic stem and progenitor cells. However, these techniques are still in their infancy, and safety/efficacy issues remain that must be addressed before translating these promising techniques into clinical practice. Published by Elsevier Inc.

  16. Induced Pluripotency and Gene Editing in Disease Modelling: Perspectives and Challenges

    Science.gov (United States)

    Seah, Yu Fen Samantha; EL Farran, Chadi A.; Warrier, Tushar; Xu, Jian; Loh, Yuin-Han

    2015-01-01

    Embryonic stem cells (ESCs) are chiefly characterized by their ability to self-renew and to differentiate into any cell type derived from the three main germ layers. It was demonstrated that somatic cells could be reprogrammed to form induced pluripotent stem cells (iPSCs) via various strategies. Gene editing is a technique that can be used to make targeted changes in the genome, and the efficiency of this process has been significantly enhanced by recent advancements. The use of engineered endonucleases, such as homing endonucleases, zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and Cas9 of the CRISPR system, has significantly enhanced the efficiency of gene editing. The combination of somatic cell reprogramming with gene editing enables us to model human diseases in vitro, in a manner considered superior to animal disease models. In this review, we discuss the various strategies of reprogramming and gene targeting with an emphasis on the current advancements and challenges of using these techniques to model human diseases. PMID:26633382

  17. Induced Pluripotency and Gene Editing in Disease Modelling: Perspectives and Challenges

    Directory of Open Access Journals (Sweden)

    Yu Fen Samantha Seah

    2015-12-01

    Full Text Available Embryonic stem cells (ESCs are chiefly characterized by their ability to self-renew and to differentiate into any cell type derived from the three main germ layers. It was demonstrated that somatic cells could be reprogrammed to form induced pluripotent stem cells (iPSCs via various strategies. Gene editing is a technique that can be used to make targeted changes in the genome, and the efficiency of this process has been significantly enhanced by recent advancements. The use of engineered endonucleases, such as homing endonucleases, zinc finger nucleases (ZFNs, transcription activator-like effector nucleases (TALENs and Cas9 of the CRISPR system, has significantly enhanced the efficiency of gene editing. The combination of somatic cell reprogramming with gene editing enables us to model human diseases in vitro, in a manner considered superior to animal disease models. In this review, we discuss the various strategies of reprogramming and gene targeting with an emphasis on the current advancements and challenges of using these techniques to model human diseases.

  18. Rapid cloning of disease-resistance genes in plants using mutagenesis and sequence capture

    Science.gov (United States)

    Genetic solutions to protect crops against pests and pathogens are preferable to agrichemicals 1. Wild crop relatives carry immense diversity of disease resistance (R) genes that could enable more sustainable disease control. However, recruiting R genes for crop improvement typically involves long b...

  19. Human ETS2 gene on chromosome 21 is not rearranged in Alzheimer disease

    International Nuclear Information System (INIS)

    Sacchi, N.; Nalbantoglu, J.; Sergovich, F.R.; Papas, T.S.

    1988-01-01

    The human ETS2 gene, a member of the ETS gene family, with sequence homology with the retroviral ets sequence of the avian erythroblastosis retrovirus E26 is located on chromosome 21. Molecular genetic analysis of Down syndrome (DS) patients with partial trisomy 21 allowed us to reinforce the supposition that ETS2 may be a gene of the minimal DS genetic region. It was originally proposed that a duplication of a portion of the DS region represents the genetic basis of Alzheimer disease, a condition associated also with DS. No evidence of either rearrangements or duplications of ETS2 could be detected in DNA from fibroblasts and brain tissue of Alzheimer disease patients with either the sporadic or the familiar form of the disease. Thus, an altered ETS2 gene dosage does not seem to be a genetic cause or component of Alzheimer disease

  20. Effective gene therapy in an authentic model of Tay-Sachs-related diseases.

    Science.gov (United States)

    Cachón-González, M Begoña; Wang, Susan Z; Lynch, Andrew; Ziegler, Robin; Cheng, Seng H; Cox, Timothy M

    2006-07-05

    Tay-Sachs disease is a prototypic neurodegenerative disease. Lysosomal storage of GM2 ganglioside in Tay-Sachs and the related disorder, Sandhoff disease, is caused by deficiency of beta-hexosaminidase A, a heterodimeric protein. Tay-Sachs-related diseases (GM2 gangliosidoses) are incurable, but gene therapy has the potential for widespread correction of the underlying lysosomal defect by means of the secretion-recapture cellular pathway for enzymatic complementation. Sandhoff mice, lacking the beta-subunit of hexosaminidase, manifest many signs of classical human Tay-Sachs disease and, with an acute course, die before 20 weeks of age. We treated Sandhoff mice by stereotaxic intracranial inoculation of recombinant adeno-associated viral vectors encoding the complementing human beta-hexosaminidase alpha and beta subunit genes and elements, including an HIV tat sequence, to enhance protein expression and distribution. Animals survived for >1 year with sustained, widespread, and abundant enzyme delivery in the nervous system. Onset of the disease was delayed with preservation of motor function; inflammation and GM2 ganglioside storage in the brain and spinal cord was reduced. Gene delivery of beta-hexosaminidase A by using adeno-associated viral vectors has realistic potential for treating the human Tay-Sachs-related diseases.

  1. Two Alzheimer’s disease risk genes increase entorhinal cortex volume in young adults

    Science.gov (United States)

    DiBattista, Amanda Marie; Stevens, Benson W.; Rebeck, G. William; Green, Adam E.

    2014-01-01

    Alzheimer’s disease (AD) risk genes alter brain structure and function decades before disease onset. Apolipoprotein E (APOE) is the strongest known genetic risk factor for AD, and a related gene, apolipoprotein J (APOJ), also affects disease risk. However, the extent to which these genes affect brain structure in young adults remains unclear. Here, we report that AD risk alleles of these two genes, APOE-ε4 and APOJ-C, cumulatively alter brain volume in young adults. Using voxel-based morphometry (VBM) in 57 individuals, we examined the entorhinal cortex, one of the earliest brain regions affected in AD pathogenesis. Apolipoprotein E-ε4 carriers exhibited higher right entorhinal cortex volume compared to non-carriers. Interestingly, APOJ-C risk genotype was associated with higher bilateral entorhinal cortex volume in non-APOE-ε4 carriers. To determine the combined disease risk of APOE and APOJ status per subject, we used cumulative odds ratios as regressors for volumetric measurements. Higher disease risk corresponded to greater right entorhinal cortex volume. These results suggest that, years before disease onset, two key AD genetic risk factors may exert influence on the structure of a brain region where AD pathogenesis takes root. PMID:25339884

  2. A novel joint analysis framework improves identification of differentially expressed genes in cross disease transcriptomic analysis

    Directory of Open Access Journals (Sweden)

    Wenyi Qin

    2018-02-01

    Full Text Available Abstract Motivation Detecting differentially expressed (DE genes between disease and normal control group is one of the most common analyses in genome-wide transcriptomic data. Since most studies don’t have a lot of samples, researchers have used meta-analysis to group different datasets for the same disease. Even then, in many cases the statistical power is still not enough. Taking into account the fact that many diseases share the same disease genes, it is desirable to design a statistical framework that can identify diseases’ common and specific DE genes simultaneously to improve the identification power. Results We developed a novel empirical Bayes based mixture model to identify DE genes in specific study by leveraging the shared information across multiple different disease expression data sets. The effectiveness of joint analysis was demonstrated through comprehensive simulation studies and two real data applications. The simulation results showed that our method consistently outperformed single data set analysis and two other meta-analysis methods in identification power. In real data analysis, overall our method demonstrated better identification power in detecting DE genes and prioritized more disease related genes and disease related pathways than single data set analysis. Over 150% more disease related genes are identified by our method in application to Huntington’s disease. We expect that our method would provide researchers a new way of utilizing available data sets from different diseases when sample size of the focused disease is limited.

  3. Systems Toxicology of Chemically Induced Liver and Kidney Injuries: Histopathology-Associated Gene Co-Expression Modules

    Science.gov (United States)

    2016-01-04

    Research article Received: 9 October 2015, Revised: 18 November 2015, Accepted: 23 November 2015 Published online in Wiley Online Library: 4 January...Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Materiel Command, Fort...ISAIterate, which requires a starter gene set that is typically built using existing gene-related knowledge; here we used ~200 starter gene sets from

  4. Application of R to investigate common gene regulatory network pathway among bipolar disorder and associate diseases

    Directory of Open Access Journals (Sweden)

    Nahida Habib

    2016-12-01

    Full Text Available Depression, Major Depression or mental disorder creates severe diseases. Mental illness such as Unipolar Major Depression, Bipolar Disorder, Dysthymia, Schizophrenia, Cardiovascular Diseases (Hypertension, Coronary Heart Disease, Stroke etc., are known as Major Depression. Several studies have revealed the possibilities about the association among Bipolar Disorder, Schizophrenia, Coronary Heart Diseases and Stroke with each other. The current study aimed to investigate the relationships between genetic variants in the above four diseases and to create a common pathway or PPI network. The associated genes of each disease are collected from different gene database with verification using R. After performing some preprocessing, mining and operations using R on collected genes, seven (7 common associated genes are discovered on selected four diseases (SZ, BD, CHD and Stroke. In each of the iteration, the numbers of collected genes are reduced up to 51%, 36%, 10%, 2% and finally less than 1% respectively. Moreover, common pathway on selected diseases has been investigated in this research.

  5. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.

    Science.gov (United States)

    Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter

    2014-09-24

    Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data

  6. Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes

    OpenAIRE

    Kreiman, Gabriel

    2004-01-01

    Sequence information and high‐throughput methods to measure gene expression levels open the door to explore transcriptional regulation using computational tools. Combinatorial regulation and sparseness of regulatory elements throughout the genome allow organisms to control the spatial and temporal patterns of gene expression. Here we study the organization of cis‐regulatory elements in sets of co‐regulated genes. We build an algorithm to search for combinations of transcription factor binding...

  7. A novel approach to simulate gene-environment interactions in complex diseases

    Directory of Open Access Journals (Sweden)

    Nicodemi Mario

    2010-01-01

    Full Text Available Abstract Background Complex diseases are multifactorial traits caused by both genetic and environmental factors. They represent the major part of human diseases and include those with largest prevalence and mortality (cancer, heart disease, obesity, etc.. Despite a large amount of information that has been collected about both genetic and environmental risk factors, there are few examples of studies on their interactions in epidemiological literature. One reason can be the incomplete knowledge of the power of statistical methods designed to search for risk factors and their interactions in these data sets. An improvement in this direction would lead to a better understanding and description of gene-environment interactions. To this aim, a possible strategy is to challenge the different statistical methods against data sets where the underlying phenomenon is completely known and fully controllable, for example simulated ones. Results We present a mathematical approach that models gene-environment interactions. By this method it is possible to generate simulated populations having gene-environment interactions of any form, involving any number of genetic and environmental factors and also allowing non-linear interactions as epistasis. In particular, we implemented a simple version of this model in a Gene-Environment iNteraction Simulator (GENS, a tool designed to simulate case-control data sets where a one gene-one environment interaction influences the disease risk. The main aim has been to allow the input of population characteristics by using standard epidemiological measures and to implement constraints to make the simulator behaviour biologically meaningful. Conclusions By the multi-logistic model implemented in GENS it is possible to simulate case-control samples of complex disease where gene-environment interactions influence the disease risk. The user has full control of the main characteristics of the simulated population and a Monte

  8. A transcriptional analysis of carotenoid, chlorophyll and plastidial isoprenoid biosynthesis genes during development and osmotic stress responses in Arabidopsis thaliana

    KAUST Repository

    Meier, Stuart; Tzfadia, Oren; Vallabhaneni, Ratnakar; Gehring, Christoph A; Wurtzel, Eleanore T

    2011-01-01

    Background: The carotenoids are pure isoprenoids that are essential components of the photosynthetic apparatus and are coordinately synthesized with chlorophylls in chloroplasts. However, little is known about the mechanisms that regulate carotenoid biosynthesis or the mechanisms that coordinate this synthesis with that of chlorophylls and other plastidial synthesized isoprenoid-derived compounds, including quinones, gibberellic acid and abscisic acid. Here, a comprehensive transcriptional analysis of individual carotenoid and isoprenoid-related biosynthesis pathway genes was performed in order to elucidate the role of transcriptional regulation in the coordinated synthesis of these compounds and to identify regulatory components that may mediate this process in Arabidopsis thaliana.Results: A global microarray expression correlation analysis revealed that the phytoene synthase gene, which encodes the first dedicated and rate-limiting enzyme of carotenogenesis, is highly co-expressed with many photosynthesis-related genes including many isoprenoid-related biosynthesis pathway genes. Chemical and mutant analysis revealed that induction of the co-expressed genes following germination was dependent on gibberellic acid and brassinosteroids (BR) but was inhibited by abscisic acid (ABA). Mutant analyses further revealed that expression of many of the genes is suppressed in dark grown plants by Phytochrome Interacting transcription Factors (PIFs) and activated by photoactivated phytochromes, which in turn degrade PIFs and mediate a coordinated induction of the genes. The promoters of PSY and the co-expressed genes were found to contain an enrichment in putative BR-auxin response elements and G-boxes, which bind PIFs, further supporting a role for BRs and PIFs in regulating expression of the genes. In osmotically stressed root tissue, transcription of Calvin cycle, methylerythritol 4-phosphate pathway and carotenoid biosynthesis genes is induced and uncoupled from that of

  9. A transcriptional analysis of carotenoid, chlorophyll and plastidial isoprenoid biosynthesis genes during development and osmotic stress responses in Arabidopsis thaliana

    KAUST Repository

    Meier, Stuart

    2011-05-19

    Background: The carotenoids are pure isoprenoids that are essential components of the photosynthetic apparatus and are coordinately synthesized with chlorophylls in chloroplasts. However, little is known about the mechanisms that regulate carotenoid biosynthesis or the mechanisms that coordinate this synthesis with that of chlorophylls and other plastidial synthesized isoprenoid-derived compounds, including quinones, gibberellic acid and abscisic acid. Here, a comprehensive transcriptional analysis of individual carotenoid and isoprenoid-related biosynthesis pathway genes was performed in order to elucidate the role of transcriptional regulation in the coordinated synthesis of these compounds and to identify regulatory components that may mediate this process in Arabidopsis thaliana.Results: A global microarray expression correlation analysis revealed that the phytoene synthase gene, which encodes the first dedicated and rate-limiting enzyme of carotenogenesis, is highly co-expressed with many photosynthesis-related genes including many isoprenoid-related biosynthesis pathway genes. Chemical and mutant analysis revealed that induction of the co-expressed genes following germination was dependent on gibberellic acid and brassinosteroids (BR) but was inhibited by abscisic acid (ABA). Mutant analyses further revealed that expression of many of the genes is suppressed in dark grown plants by Phytochrome Interacting transcription Factors (PIFs) and activated by photoactivated phytochromes, which in turn degrade PIFs and mediate a coordinated induction of the genes. The promoters of PSY and the co-expressed genes were found to contain an enrichment in putative BR-auxin response elements and G-boxes, which bind PIFs, further supporting a role for BRs and PIFs in regulating expression of the genes. In osmotically stressed root tissue, transcription of Calvin cycle, methylerythritol 4-phosphate pathway and carotenoid biosynthesis genes is induced and uncoupled from that of

  10. A transcriptional analysis of carotenoid, chlorophyll and plastidial isoprenoid biosynthesis genes during development and osmotic stress responses in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Vallabhaneni Ratnakar

    2011-05-01

    Full Text Available Abstract Background The carotenoids are pure isoprenoids that are essential components of the photosynthetic apparatus and are coordinately synthesized with chlorophylls in chloroplasts. However, little is known about the mechanisms that regulate carotenoid biosynthesis or the mechanisms that coordinate this synthesis with that of chlorophylls and other plastidial synthesized isoprenoid-derived compounds, including quinones, gibberellic acid and abscisic acid. Here, a comprehensive transcriptional analysis of individual carotenoid and isoprenoid-related biosynthesis pathway genes was performed in order to elucidate the role of transcriptional regulation in the coordinated synthesis of these compounds and to identify regulatory components that may mediate this process in Arabidopsis thaliana. Results A global microarray expression correlation analysis revealed that the phytoene synthase gene, which encodes the first dedicated and rate-limiting enzyme of carotenogenesis, is highly co-expressed with many photosynthesis-related genes including many isoprenoid-related biosynthesis pathway genes. Chemical and mutant analysis revealed that induction of the co-expressed genes following germination was dependent on gibberellic acid and brassinosteroids (BR but was inhibited by abscisic acid (ABA. Mutant analyses further revealed that expression of many of the genes is suppressed in dark grown plants by Phytochrome Interacting transcription Factors (PIFs and activated by photoactivated phytochromes, which in turn degrade PIFs and mediate a coordinated induction of the genes. The promoters of PSY and the co-expressed genes were found to contain an enrichment in putative BR-auxin response elements and G-boxes, which bind PIFs, further supporting a role for BRs and PIFs in regulating expression of the genes. In osmotically stressed root tissue, transcription of Calvin cycle, methylerythritol 4-phosphate pathway and carotenoid biosynthesis genes is induced

  11. The Norrie disease gene maps to a 150 kb region on chromosome Xp11.3.

    Science.gov (United States)

    Sims, K B; Lebo, R V; Benson, G; Shalish, C; Schuback, D; Chen, Z Y; Bruns, G; Craig, I W; Golbus, M S; Breakefield, X O

    1992-05-01

    Norrie disease is a human X-linked recessive disorder of unknown etiology characterized by congenital blindness, sensory neural deafness and mental retardation. This disease gene was previously linked to the DXS7 (L1.28) locus and the MAO genes in band Xp11.3. We report here fine physical mapping of the obligate region containing the Norrie disease gene (NDP) defined by a recombination and by the smallest submicroscopic chromosomal deletion associated with Norrie disease identified to date. Analysis, using in addition two overlapping YAC clones from this region, allowed orientation of the MAOA and MAOB genes in a 5'-3'-3'-5' configuration. A recombination event between a (GT)n polymorphism in intron 2 of the MAOB gene and the NDP locus, in a family previously reported to have a recombination between DXS7 and NDP, delineates a flanking marker telomeric to this disease gene. An anonymous DNA probe, dc12, present in one of the YACs and in a patient with a submicroscopic deletion which includes MAOA and MAOB but not L1.28, serves as a flanking marker centromeric to the disease gene. An Alu-PCR fragment from the right arm of the MAO YAC (YMAO.AluR) is not deleted in this patient and also delineates the centromeric extent of the obligate disease region. The apparent order of these loci is telomere ... DXS7-MAOA-MAOB-NDP-dc12-YMAO.AluR ... centromere. Together these data define the obligate region containing the NDP gene to a chromosomal segment less than 150 kb.

  12. The duplicated genes database: identification and functional annotation of co-localised duplicated genes across genomes.

    Directory of Open Access Journals (Sweden)

    Marion Ouedraogo

    Full Text Available BACKGROUND: There has been a surge in studies linking genome structure and gene expression, with special focus on duplicated genes. Although initially duplicated from the same sequence, duplicated genes can diverge strongly over evolution and take on different functions or regulated expression. However, information on the function and expression of duplicated genes remains sparse. Identifying groups of duplicated genes in different genomes and characterizing their expression and function would therefore be of great interest to the research community. The 'Duplicated Genes Database' (DGD was developed for this purpose. METHODOLOGY: Nine species were included in the DGD. For each species, BLAST analyses were conducted on peptide sequences corresponding to the genes mapped on a same chromosome. Groups of duplicated genes were defined based on these pairwise BLAST comparisons and the genomic location of the genes. For each group, Pearson correlations between gene expression data and semantic similarities between functional GO annotations were also computed when the relevant information was available. CONCLUSIONS: The Duplicated Gene Database provides a list of co-localised and duplicated genes for several species with the available gene co-expression level and semantic similarity value of functional annotation. Adding these data to the groups of duplicated genes provides biological information that can prove useful to gene expression analyses. The Duplicated Gene Database can be freely accessed through the DGD website at http://dgd.genouest.org.

  13. Understanding gene expression in coronary artery disease through ...

    Indian Academy of Sciences (India)

    Understanding gene expression in coronary artery disease through global profiling, network analysis ... A_33_P3249595 B-cell CLL/lymphoma 11A (zinc finger protein). BCL11A. 2.29 ..... It acts as a cytoplasmic sensor for viral infection and ...

  14. Utilization of gene mapping and candidate gene mutation screening for diagnosing clinically equivocal conditions: a Norrie disease case study.

    Science.gov (United States)

    Chini, Vasiliki; Stambouli, Danai; Nedelea, Florina Mihaela; Filipescu, George Alexandru; Mina, Diana; Kambouris, Marios; El-Shantil, Hatem

    2014-06-01

    Prenatal diagnosis was requested for an undiagnosed eye disease showing X-linked inheritance in a family. No medical records existed for the affected family members. Mapping of the X chromosome and candidate gene mutation screening identified a c.C267A[p.F89L] mutation in NPD previously described as possibly causing Norrie disease. The detection of the c.C267A[p.F89L] variant in another unrelated family confirms the pathogenic nature of the mutation for the Norrie disease phenotype. Gene mapping, haplotype analysis, and candidate gene screening have been previously utilized in research applications but were applied here in a diagnostic setting due to the scarcity of available clinical information. The clinical diagnosis and mutation identification were critical for providing proper genetic counseling and prenatal diagnosis for this family.

  15. Identification and expression analyses of MYB and WRKY transcription factor genes in Papaver somniferum L.

    Science.gov (United States)

    Kakeshpour, Tayebeh; Nayebi, Shadi; Rashidi Monfared, Sajad; Moieni, Ahmad; Karimzadeh, Ghasem

    2015-10-01

    Papaver somniferum L. is an herbaceous, annual and diploid plant that is important from pharmacological and strategic point of view. The cDNA clones of two putative MYB and WRKY genes were isolated (GeneBank accession numbers KP411870 and KP203854, respectively) from this plant, via the nested-PCR method, and characterized. The MYB transcription factor (TF) comprises 342 amino acids, and exhibits the structural features of the R2R3MYB protein family. The WRKY TF, a 326 amino acid-long polypeptide, falls structurally into the group II of WRKY protein family. Quantitative real-time PCR (qRT-PCR) analyses indicate the presence of these TFs in all organs of P. somniferum L. and Papaver bracteatum L. Highest expression levels of these two TFs were observed in the leaf tissues of P. somniferum L. while in P. bracteatum L. the espression levels were highest in the root tissues. Promoter analysis of the 10 co-expressed gene clustered involved in noscapine biosynthesis pathway in P. somniferum L. suggested that not only these 10 genes are co-expressed, but also share common regulatory motifs and TFs including MYB and WRKY TFs, and that may explain their common regulation.

  16. Prediction of disease-related genes based on weighted tissue-specific networks by using DNA methylation.

    Science.gov (United States)

    Li, Min; Zhang, Jiayi; Liu, Qing; Wang, Jianxin; Wu, Fang-Xiang

    2014-01-01

    Predicting disease-related genes is one of the most important tasks in bioinformatics and systems biology. With the advances in high-throughput techniques, a large number of protein-protein interactions are available, which make it possible to identify disease-related genes at the network level. However, network-based identification of disease-related genes is still a challenge as the considerable false-positives are still existed in the current available protein interaction networks (PIN). Considering the fact that the majority of genetic disorders tend to manifest only in a single or a few tissues, we constructed tissue-specific networks (TSN) by integrating PIN and tissue-specific data. We further weighed the constructed tissue-specific network (WTSN) by using DNA methylation as it plays an irreplaceable role in the development of complex diseases. A PageRank-based method was developed to identify disease-related genes from the constructed networks. To validate the effectiveness of the proposed method, we constructed PIN, weighted PIN (WPIN), TSN, WTSN for colon cancer and leukemia, respectively. The experimental results on colon cancer and leukemia show that the combination of tissue-specific data and DNA methylation can help to identify disease-related genes more accurately. Moreover, the PageRank-based method was effective to predict disease-related genes on the case studies of colon cancer and leukemia. Tissue-specific data and DNA methylation are two important factors to the study of human diseases. The same method implemented on the WTSN can achieve better results compared to those being implemented on original PIN, WPIN, or TSN. The PageRank-based method outperforms degree centrality-based method for identifying disease-related genes from WTSN.

  17. The quantitative basis of the Arabidopsis innate immune system to endemic pathogens depends on pathogen genetics

    DEFF Research Database (Denmark)

    Corwin, Jason A; Copeland, Daniel; Feusier, Julie

    2016-01-01

    The most established model of the eukaryotic innate immune system is derived from examples of large effect monogenic quantitative resistance to pathogens. However, many host-pathogen interactions involve many genes of small to medium effect and exhibit quantitative resistance. We used the Arabido......The most established model of the eukaryotic innate immune system is derived from examples of large effect monogenic quantitative resistance to pathogens. However, many host-pathogen interactions involve many genes of small to medium effect and exhibit quantitative resistance. We used....... cinerea, we identified a total of 2,982 genes associated with quantitative resistance using lesion area and 3,354 genes associated with camalexin production as measures of the interaction. Most genes were associated with resistance to a specific Botrytis isolate, which demonstrates the influence...... genes associated with quantitative resistance. Using publically available co-expression data, we condensed the quantitative resistance associated genes into co-expressed gene networks. GO analysis of these networks implicated several biological processes commonly connected to disease resistance...

  18. Center for Fetal Monkey Gene Transfer for Heart, Lung, and Blood Diseases: An NHLBI Resource for the Gene Therapy Community

    Science.gov (United States)

    Skarlatos, Sonia I.

    2012-01-01

    Abstract The goals of the National Heart, Lung, and Blood Institute (NHLBI) Center for Fetal Monkey Gene Transfer for Heart, Lung, and Blood Diseases are to conduct gene transfer studies in monkeys to evaluate safety and efficiency; and to provide NHLBI-supported investigators with expertise, resources, and services to actively pursue gene transfer approaches in monkeys in their research programs. NHLBI-supported projects span investigators throughout the United States and have addressed novel approaches to gene delivery; “proof-of-principle”; assessed whether findings in small-animal models could be demonstrated in a primate species; or were conducted to enable new grant or IND submissions. The Center for Fetal Monkey Gene Transfer for Heart, Lung, and Blood Diseases successfully aids the gene therapy community in addressing regulatory barriers, and serves as an effective vehicle for advancing the field. PMID:22974119

  19. Text mining and network analysis to find functional associations of genes in high altitude diseases.

    Science.gov (United States)

    Bhasuran, Balu; Subramanian, Devika; Natarajan, Jeyakumar

    2018-05-02

    Travel to elevations above 2500 m is associated with the risk of developing one or more forms of acute altitude illness such as acute mountain sickness (AMS), high altitude cerebral edema (HACE) or high altitude pulmonary edema (HAPE). Our work aims to identify the functional association of genes involved in high altitude diseases. In this work we identified the gene networks responsible for high altitude diseases by using the principle of gene co-occurrence statistics from literature and network analysis. First, we mined the literature data from PubMed on high-altitude diseases, and extracted the co-occurring gene pairs. Next, based on their co-occurrence frequency, gene pairs were ranked. Finally, a gene association network was created using statistical measures to explore potential relationships. Network analysis results revealed that EPO, ACE, IL6 and TNF are the top five genes that were found to co-occur with 20 or more genes, while the association between EPAS1 and EGLN1 genes is strongly substantiated. The network constructed from this study proposes a large number of genes that work in-toto in high altitude conditions. Overall, the result provides a good reference for further study of the genetic relationships in high altitude diseases. Copyright © 2018 Elsevier Ltd. All rights reserved.

  20. Expression of VP60 gene from rabbit haemorrhagic disease virus ...

    African Journals Online (AJOL)

    The VP60 gene from rabbit haemorrhagic disease virus (RHDV) YL strain in Northeast of China, under control of the ats1A promoter from Rubisco small subunit genes of Arabidopsis thaliana, was introduced into the transfer deoxyribonucleic acid (T-DNA) region of plant transfer vector pCAMBIA1300 and transferred to ...

  1. The chimeric gene CHRFAM7A, a partial duplication of the CHRNA7 gene, is a dominant negative regulator of α7*nAChR function.

    Science.gov (United States)

    Araud, Tanguy; Graw, Sharon; Berger, Ralph; Lee, Michael; Neveu, Estele; Bertrand, Daniel; Leonard, Sherry

    2011-10-15

    The human α7 neuronal nicotinic acetylcholine receptor gene (CHRNA7) is a candidate gene for schizophrenia and an important drug target for cognitive deficits in the disorder. Activation of the α7*nAChR, results in opening of the channel and entry of mono- and divalent cations, including Ca(2+), that presynaptically participates to neurotransmitter release and postsynaptically to down-stream changes in gene expression. Schizophrenic patients have low levels of α7*nAChR, as measured by binding of the ligand [(125)I]-α-bungarotoxin (I-BTX). The structure of the gene, CHRNA7, is complex. During evolution, CHRNA7 was partially duplicated as a chimeric gene (CHRFAM7A), which is expressed in the human brain and elsewhere in the body. The association between a 2bp deletion in CHRFAM7A and schizophrenia suggested that this duplicate gene might contribute to cognitive impairment. To examine the putative contribution of CHRFAM7A on receptor function, co-expression of α7 and the duplicate genes was carried out in cell lines and Xenopus oocytes. Expression of the duplicate alone yielded protein expression but no functional receptor and co-expression with α7 caused a significant reduction of the amplitude of the ACh-evoked currents. Reduced current amplitude was not correlated with a reduction of I-BTX binding, suggesting the presence of non-functional (ACh-silent) receptors. This hypothesis is supported by a larger increase of the ACh-evoked current by the allosteric modulator 1-(5-chloro-2,4-dimethoxy-phenyl)-3-(5-methyl-isoxazol-3-yl)-urea (PNU-120596) in cells expressing the duplicate than in the control. These results suggest that CHRFAM7A acts as a dominant negative modulator of CHRNA7 function and is critical for receptor regulation in humans. Copyright © 2011 Elsevier Inc. All rights reserved.

  2. Rat Models of Cardiovascular Disease Demonstrate Distinctive Pulmonary Gene Expressions for Vascular Response Genes: Impact of Ozone Exposure

    Science.gov (United States)

    Comparative gene expression profiling of multiple tissues from rat strains with genetic predisposition to diverse cardiovascular diseases (CVD) can help decode the transcriptional program that governs organ-specific functions. We examined expressions of CVD genes in the lungs of ...

  3. Identification of Photosynthesis-Associated C4 Candidate Genes through Comparative Leaf Gradient Transcriptome in Multiple Lineages of C3 and C4 Species

    Science.gov (United States)

    Ding, Zehong; Weissmann, Sarit; Wang, Minghui; Du, Baijuan; Huang, Lei; Wang, Lin; Tu, Xiaoyu; Zhong, Silin; Myers, Christopher; Brutnell, Thomas P.; Sun, Qi; Li, Pinghua

    2015-01-01

    Leaves of C4 crops usually have higher radiation, water and nitrogen use efficiencies compared to the C3 species. Engineering C4 traits into C3 crops has been proposed as one of the most promising ways to repeal the biomass yield ceiling. To better understand the function of C4 photosynthesis, and to identify candidate genes that are associated with the C4 pathways, a comparative transcription network analysis was conducted on leaf developmental gradients of three C4 species including maize, green foxtail and sorghum and one C3 species, rice. By combining the methods of gene co-expression and differentially co-expression networks, we identified a total of 128 C4 specific genes. Besides the classic C4 shuttle genes, a new set of genes associated with light reaction, starch and sucrose metabolism, metabolites transportation, as well as transcription regulation, were identified as involved in C4 photosynthesis. These findings will provide important insights into the differential gene regulation between C3 and C4 species, and a good genetic resource for establishing C4 pathways in C3 crops. PMID:26465154

  4. Identification of Photosynthesis-Associated C4 Candidate Genes through Comparative Leaf Gradient Transcriptome in Multiple Lineages of C3 and C4 Species.

    Science.gov (United States)

    Ding, Zehong; Weissmann, Sarit; Wang, Minghui; Du, Baijuan; Huang, Lei; Wang, Lin; Tu, Xiaoyu; Zhong, Silin; Myers, Christopher; Brutnell, Thomas P; Sun, Qi; Li, Pinghua

    2015-01-01

    Leaves of C4 crops usually have higher radiation, water and nitrogen use efficiencies compared to the C3 species. Engineering C4 traits into C3 crops has been proposed as one of the most promising ways to repeal the biomass yield ceiling. To better understand the function of C4 photosynthesis, and to identify candidate genes that are associated with the C4 pathways, a comparative transcription network analysis was conducted on leaf developmental gradients of three C4 species including maize, green foxtail and sorghum and one C3 species, rice. By combining the methods of gene co-expression and differentially co-expression networks, we identified a total of 128 C4 specific genes. Besides the classic C4 shuttle genes, a new set of genes associated with light reaction, starch and sucrose metabolism, metabolites transportation, as well as transcription regulation, were identified as involved in C4 photosynthesis. These findings will provide important insights into the differential gene regulation between C3 and C4 species, and a good genetic resource for establishing C4 pathways in C3 crops.

  5. Identification of Photosynthesis-Associated C4 Candidate Genes through Comparative Leaf Gradient Transcriptome in Multiple Lineages of C3 and C4 Species.

    Directory of Open Access Journals (Sweden)

    Zehong Ding

    Full Text Available Leaves of C4 crops usually have higher radiation, water and nitrogen use efficiencies compared to the C3 species. Engineering C4 traits into C3 crops has been proposed as one of the most promising ways to repeal the biomass yield ceiling. To better understand the function of C4 photosynthesis, and to identify candidate genes that are associated with the C4 pathways, a comparative transcription network analysis was conducted on leaf developmental gradients of three C4 species including maize, green foxtail and sorghum and one C3 species, rice. By combining the methods of gene co-expression and differentially co-expression networks, we identified a total of 128 C4 specific genes. Besides the classic C4 shuttle genes, a new set of genes associated with light reaction, starch and sucrose metabolism, metabolites transportation, as well as transcription regulation, were identified as involved in C4 photosynthesis. These findings will provide important insights into the differential gene regulation between C3 and C4 species, and a good genetic resource for establishing C4 pathways in C3 crops.

  6. Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm

    Science.gov (United States)

    Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E.; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder

    2018-01-01

    Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops. PMID:29672525

  7. Comparative genomic and transcriptomic analysis of selected fatty acid biosynthesis genes and CNL disease resistance genes in oil palm.

    Science.gov (United States)

    Rosli, Rozana; Amiruddin, Nadzirah; Ab Halim, Mohd Amin; Chan, Pek-Lan; Chan, Kuang-Lim; Azizi, Norazah; Morris, Priscilla E; Leslie Low, Eng-Ti; Ong-Abdullah, Meilina; Sambanthamurthi, Ravigadevi; Singh, Rajinder; Murphy, Denis J

    2018-01-01

    Comparative genomics and transcriptomic analyses were performed on two agronomically important groups of genes from oil palm versus other major crop species and the model organism, Arabidopsis thaliana. The first analysis was of two gene families with key roles in regulation of oil quality and in particular the accumulation of oleic acid, namely stearoyl ACP desaturases (SAD) and acyl-acyl carrier protein (ACP) thioesterases (FAT). In both cases, these were found to be large gene families with complex expression profiles across a wide range of tissue types and developmental stages. The detailed classification of the oil palm SAD and FAT genes has enabled the updating of the latest version of the oil palm gene model. The second analysis focused on disease resistance (R) genes in order to elucidate possible candidates for breeding of pathogen tolerance/resistance. Ortholog analysis showed that 141 out of the 210 putative oil palm R genes had homologs in banana and rice. These genes formed 37 clusters with 634 orthologous genes. Classification of the 141 oil palm R genes showed that the genes belong to the Kinase (7), CNL (95), MLO-like (8), RLK (3) and Others (28) categories. The CNL R genes formed eight clusters. Expression data for selected R genes also identified potential candidates for breeding of disease resistance traits. Furthermore, these findings can provide information about the species evolution as well as the identification of agronomically important genes in oil palm and other major crops.

  8. Global gene expression profile progression in Gaucher disease mouse models

    Directory of Open Access Journals (Sweden)

    Zhang Wujuan

    2011-01-01

    Full Text Available Abstract Background Gaucher disease is caused by defective glucocerebrosidase activity and the consequent accumulation of glucosylceramide. The pathogenic pathways resulting from lipid laden macrophages (Gaucher cells in visceral organs and their abnormal functions are obscure. Results To elucidate this pathogenic pathway, developmental global gene expression analyses were conducted in distinct Gba1 point-mutated mice (V394L/V394L and D409 V/null. About 0.9 to 3% of genes had altered expression patterns (≥ ± 1.8 fold change, representing several categories, but particularly macrophage activation and immune response genes. Time course analyses (12 to 28 wk of INFγ-regulated pro-inflammatory (13 and IL-4-regulated anti-inflammatory (11 cytokine/mediator networks showed tissue differential profiles in the lung and liver of the Gba1 mutant mice, implying that the lipid-storage macrophages were not functionally inert. The time course alterations of the INFγ and IL-4 pathways were similar, but varied in degree in these tissues and with the Gba1 mutation. Conclusions Biochemical and pathological analyses demonstrated direct relationships between the degree of tissue glucosylceramides and the gene expression profile alterations. These analyses implicate IFNγ-regulated pro-inflammatory and IL-4-regulated anti-inflammatory networks in differential disease progression with implications for understanding the Gaucher disease course and pathophysiology.

  9. Surfactant gene polymorphisms and interstitial lung diseases

    Directory of Open Access Journals (Sweden)

    Pantelidis Panagiotis

    2001-11-01

    Full Text Available Abstract Pulmonary surfactant is a complex mixture of phospholipids and proteins, which is present in the alveolar lining fluid and is essential for normal lung function. Alterations in surfactant composition have been reported in several interstitial lung diseases (ILDs. Furthermore, a mutation in the surfactant protein C gene that results in complete absence of the protein has been shown to be associated with familial ILD. The role of surfactant in lung disease is therefore drawing increasing attention following the elucidation of the genetic basis underlying its surface expression and the proof of surfactant abnormalities in ILD.

  10. Abnormalities in Alternative Splicing of Apoptotic Genes and Cardiovascular Diseases

    Directory of Open Access Journals (Sweden)

    Zodwa Dlamini

    2015-11-01

    Full Text Available Apoptosis is required for normal heart development in the embryo, but has also been shown to be an important factor in the occurrence of heart disease. Alternative splicing of apoptotic genes is currently emerging as a diagnostic and therapeutic target for heart disease. This review addresses the involvement of abnormalities in alternative splicing of apoptotic genes in cardiac disorders including cardiomyopathy, myocardial ischemia and heart failure. Many pro-apoptotic members of the Bcl-2 family have alternatively spliced isoforms that lack important active domains. These isoforms can play a negative regulatory role by binding to and inhibiting the pro-apoptotic forms. Alternative splicing is observed to be increased in various cardiovascular diseases with the level of alternate transcripts increasing elevated in diseased hearts compared to healthy subjects. In many cases these isoforms appear to be the underlying cause of the disease, while in others they may be induced in response to cardiovascular pathologies. Regardless of this, the detection of alternate splicing events in the heart can serve as useful diagnostic or prognostic tools, while those splicing events that seem to play a causative role in cardiovascular disease make attractive future drug targets.

  11. Understanding Epistatic Interactions between Genes Targeted by Non-coding Regulatory Elements in Complex Diseases

    Directory of Open Access Journals (Sweden)

    Min Kyung Sung

    2014-12-01

    Full Text Available Genome-wide association studies have proven the highly polygenic architecture of complex diseases or traits; therefore, single-locus-based methods are usually unable to detect all involved loci, especially when individual loci exert small effects. Moreover, the majority of associated single-nucleotide polymorphisms resides in non-coding regions, making it difficult to understand their phenotypic contribution. In this work, we studied epistatic interactions associated with three common diseases using Korea Association Resource (KARE data: type 2 diabetes mellitus (DM, hypertension (HT, and coronary artery disease (CAD. We showed that epistatic single-nucleotide polymorphisms (SNPs were enriched in enhancers, as well as in DNase I footprints (the Encyclopedia of DNA Elements [ENCODE] Project Consortium 2012, which suggested that the disruption of the regulatory regions where transcription factors bind may be involved in the disease mechanism. Accordingly, to identify the genes affected by the SNPs, we employed whole-genome multiple-cell-type enhancer data which discovered using DNase I profiles and Cap Analysis Gene Expression (CAGE. Assigned genes were significantly enriched in known disease associated gene sets, which were explored based on the literature, suggesting that this approach is useful for detecting relevant affected genes. In our knowledge-based epistatic network, the three diseases share many associated genes and are also closely related with each other through many epistatic interactions. These findings elucidate the genetic basis of the close relationship between DM, HT, and CAD.

  12. Predicting protein-protein interactions in Arabidopsis thaliana through integration of orthology, gene ontology and co-expression

    Directory of Open Access Journals (Sweden)

    Vandepoele Klaas

    2009-06-01

    Full Text Available Abstract Background Large-scale identification of the interrelationships between different components of the cell, such as the interactions between proteins, has recently gained great interest. However, unraveling large-scale protein-protein interaction maps is laborious and expensive. Moreover, assessing the reliability of the interactions can be cumbersome. Results In this study, we have developed a computational method that exploits the existing knowledge on protein-protein interactions in diverse species through orthologous relations on the one hand, and functional association data on the other hand to predict and filter protein-protein interactions in Arabidopsis thaliana. A highly reliable set of protein-protein interactions is predicted through this integrative approach making use of existing protein-protein interaction data from yeast, human, C. elegans and D. melanogaster. Localization, biological process, and co-expression data are used as powerful indicators for protein-protein interactions. The functional repertoire of the identified interactome reveals interactions between proteins functioning in well-conserved as well as plant-specific biological processes. We observe that although common mechanisms (e.g. actin polymerization and components (e.g. ARPs, actin-related proteins exist between different lineages, they are active in specific processes such as growth, cancer metastasis and trichome development in yeast, human and Arabidopsis, respectively. Conclusion We conclude that the integration of orthology with functional association data is adequate to predict protein-protein interactions. Through this approach, a high number of novel protein-protein interactions with diverse biological roles is discovered. Overall, we have predicted a reliable set of protein-protein interactions suitable for further computational as well as experimental analyses.

  13. Retinal Diseases Caused by Mutations in Genes Not Specifically Associated with the Clinical Diagnosis.

    Directory of Open Access Journals (Sweden)

    Xia Wang

    Full Text Available When seeking a confirmed molecular diagnosis in the research setting, patients with one descriptive diagnosis of retinal disease could carry pathogenic variants in genes not specifically associated with that description. However, this event has not been evaluated systematically in clinical diagnostic laboratories that validate fully all target genes to minimize false negatives/positives.We performed targeted next-generation sequencing analysis on 207 ocular disease-related genes for 42 patients whose DNA had been tested negative for disease-specific panels of genes known to be associated with retinitis pigmentosa, Leber congenital amaurosis, or exudative vitreoretinopathy.Pathogenic variants, including single nucleotide variations and copy number variations, were identified in 9 patients, including 6 with variants in syndromic retinal disease genes and 3 whose molecular diagnosis could not be distinguished easily from their submitted clinical diagnosis, accounting for 21% (9/42 of the unsolved cases.Our study underscores the clinical and genetic heterogeneity of retinal disorders and provides valuable reference to estimate the fraction of clinical samples whose retinal disorders could be explained by genes not specifically associated with the corresponding clinical diagnosis. Our data suggest that sequencing a larger set of retinal disorder related genes can increase the molecular diagnostic yield, especially for clinically hard-to-distinguish cases.

  14. Assessment of brain reference genes for RT-qPCR studies in neurodegenerative diseases

    OpenAIRE

    Rydbirk, Rasmus; Folke, Jonas; Winge, Kristian; Aznar, Susana; Pakkenberg, Bente; Brudek, Tomasz

    2016-01-01

    Evaluation of gene expression levels by reverse transcription quantitative real-time PCR (RT-qPCR) has for many years been the favourite approach for discovering disease-associated alterations. Normalization of results to stably expressed reference genes (RGs) is pivotal to obtain reliable results. This is especially important in relation to neurodegenerative diseases where disease-related structural changes may affect the most commonly used RGs. We analysed 15 candidate RGs in 98 brain sampl...

  15. Gene regulation is governed by a core network in hepatocellular carcinoma.

    Science.gov (United States)

    Gu, Zuguang; Zhang, Chenyu; Wang, Jin

    2012-05-01

    Hepatocellular carcinoma (HCC) is one of the most lethal cancers worldwide, and the mechanisms that lead to the disease are still relatively unclear. However, with the development of high-throughput technologies it is possible to gain a systematic view of biological systems to enhance the understanding of the roles of genes associated with HCC. Thus, analysis of the mechanism of molecule interactions in the context of gene regulatory networks can reveal specific sub-networks that lead to the development of HCC. In this study, we aimed to identify the most important gene regulations that are dysfunctional in HCC generation. Our method for constructing gene regulatory network is based on predicted target interactions, experimentally-supported interactions, and co-expression model. Regulators in the network included both transcription factors and microRNAs to provide a complete view of gene regulation. Analysis of gene regulatory network revealed that gene regulation in HCC is highly modular, in which different sets of regulators take charge of specific biological processes. We found that microRNAs mainly control biological functions related to mitochondria and oxidative reduction, while transcription factors control immune responses, extracellular activity and the cell cycle. On the higher level of gene regulation, there exists a core network that organizes regulations between different modules and maintains the robustness of the whole network. There is direct experimental evidence for most of the regulators in the core gene regulatory network relating to HCC. We infer it is the central controller of gene regulation. Finally, we explored the influence of the core gene regulatory network on biological pathways. Our analysis provides insights into the mechanism of transcriptional and post-transcriptional control in HCC. In particular, we highlight the importance of the core gene regulatory network; we propose that it is highly related to HCC and we believe further

  16. The Role of Nuclear Bodies in Gene Expression and Disease

    Science.gov (United States)

    Morimoto, Marie; Boerkoel, Cornelius F.

    2013-01-01

    This review summarizes the current understanding of the role of nuclear bodies in regulating gene expression. The compartmentalization of cellular processes, such as ribosome biogenesis, RNA processing, cellular response to stress, transcription, modification and assembly of spliceosomal snRNPs, histone gene synthesis and nuclear RNA retention, has significant implications for gene regulation. These functional nuclear domains include the nucleolus, nuclear speckle, nuclear stress body, transcription factory, Cajal body, Gemini of Cajal body, histone locus body and paraspeckle. We herein review the roles of nuclear bodies in regulating gene expression and their relation to human health and disease. PMID:24040563

  17. Neurocarta: aggregating and sharing disease-gene relations for the neurosciences.

    Science.gov (United States)

    Portales-Casamar, Elodie; Ch'ng, Carolyn; Lui, Frances; St-Georges, Nicolas; Zoubarev, Anton; Lai, Artemis Y; Lee, Mark; Kwok, Cathy; Kwok, Willie; Tseng, Luchia; Pavlidis, Paul

    2013-02-26

    Understanding the genetic basis of diseases is key to the development of better diagnoses and treatments. Unfortunately, only a small fraction of the existing data linking genes to phenotypes is available through online public resources and, when available, it is scattered across multiple access tools. Neurocarta is a knowledgebase that consolidates information on genes and phenotypes across multiple resources and allows tracking and exploring of the associations. The system enables automatic and manual curation of evidence supporting each association, as well as user-enabled entry of their own annotations. Phenotypes are recorded using controlled vocabularies such as the Disease Ontology to facilitate computational inference and linking to external data sources. The gene-to-phenotype associations are filtered by stringent criteria to focus on the annotations most likely to be relevant. Neurocarta is constantly growing and currently holds more than 30,000 lines of evidence linking over 7,000 genes to 2,000 different phenotypes. Neurocarta is a one-stop shop for researchers looking for candidate genes for any disorder of interest. In Neurocarta, they can review the evidence linking genes to phenotypes and filter out the evidence they're not interested in. In addition, researchers can enter their own annotations from their experiments and analyze them in the context of existing public annotations. Neurocarta's in-depth annotation of neurodevelopmental disorders makes it a unique resource for neuroscientists working on brain development.

  18. Circumvention of chaperone requirement for aggregate formation of a short polyglutamine tract by the co-expression of a long polyglutamine tract.

    Science.gov (United States)

    Kimura, Yoko; Koitabashi, Sumiko; Kakizuka, Akira; Fujita, Takashi

    2002-10-04

    Polyglutamine disease is now recognized as one of the conformational, amyloid-related diseases. In this disease, polyglutamine expansion in proteins has toxic effects on cells and also results in the formation of aggregates. Polyglutamine aggregate formation is accompanied by conversion of the polyglutamine from a soluble to an insoluble form. In yeast, the efficiency of the aggregate formation is determined by the balance of various parameters, including the length of the polyglutamine tract, the function of Hsp104, and the level of polyglutamine expression. In this study, we found that the co-expression of a long polyglutamine tract, which formed aggregates independently of the function of Hsp104, enhanced the formation of aggregates of a short polyglutamine tract in wild-type cells as well as in Deltahsp104 mutant cells. Thus, the expression of a long polyglutamine tract would be an additional parameter determining the efficiency of aggregate formation of a short polyglutamine tract. The co-localization of aggregates of long and short polyglutamine tracts suggests the possibility that the enhancement occurs due to the seeding of aggregates of the long polyglutamine tracts.

  19. Investigation of genes coding for inflammatory components in Parkinson's disease.

    Science.gov (United States)

    Håkansson, Anna; Westberg, Lars; Nilsson, Staffan; Buervenich, Silvia; Carmine, Andrea; Holmberg, Björn; Sydow, Olof; Olson, Lars; Johnels, Bo; Eriksson, Elias; Nissbrandt, Hans

    2005-05-01

    Several findings obtained recently indicate that inflammation may contribute to the pathogenesis in Parkinson's disease (PD). Genetic variants of genes coding for components involved in immune reactions in the brain might therefore influence the risk of developing PD or the age of disease onset. Five single nucleotide polymorphisms (SNPs) in the genes coding for interferon-gamma (IFN-gamma; T874A in intron 1), interferon-gamma receptor 2 (IFN-gamma R2; Gln64Arg), interleukin-10 (IL-10; G1082A in the promoter region), platelet-activating factor acetylhydrolase (PAF-AH; Val379Ala), and intercellular adhesion molecule 1 (ICAM-1; Lys469Glu) were genotyped, using pyrosequencing, in 265 patients with PD and 308 controls. None of the investigated SNPs was found to be associated with PD; however, the G1082A polymorphism in the IL-10 gene promoter was found to be related to the age of disease onset. Linear regression showed a significantly earlier onset with more A-alleles (P = 0.0095; after Bonferroni correction, P = 0.048), resulting in a 5-year delayed age of onset of the disease for individuals having two G-alleles compared with individuals having two A-alleles. The results indicate that the IL-10 G1082A SNP could possibly be related to the age of onset of PD. Copyright 2005 Movement Disorder Society.

  20. A novel mutation in the Norrie disease gene.

    Science.gov (United States)

    Ott, S; Patel, R J; Appukuttan, B; Wang, X; Stout, J T

    2000-04-01

    Norrie disease is an X-linked recessive disorder characterized by congenital blindness and in some cases mental retardation and deafness.(1) The variability of signs among patients often complicates diagnosis. Signs such as an ocular pseudoglioma, progressive deafness, and mental disturbance are considered classic features.(2) Only one third of patients with Norrie disease have sensorineural deafness, and approximately one half of the affected individuals exhibit mental retardation, often with psychotic features.(3) Histologic analysis has suggested that retinal dysgenesis occurs early in eye development and involves cells in the inner wall of the optic cup.(4) The gene associated with Norrie disease was identified in 1992. (5,6) We report a novel mutation identified in a patient in whom Norrie disease was diagnosed.

  1. Co-expression and characterization of enterocin CRL35 and its mutant in Escherichia coli Rosetta

    Directory of Open Access Journals (Sweden)

    Masías Emilse

    2014-01-01

    Full Text Available Even though many sequences and structures of bacteriocins from lactic acid bacteria have been fully characterized so far, little information is currently available about bacteriocins heterologously produced by Escherichia coli. For this purpose, the structural gene of enterocin CRL35, munA, was PCR-amplified using specific primers and cloned downstream of PelB sequence in the pET22b (+ expression vector. E. coli Rosetta (DE3 pLysS was chosen as the host for production and enterocin was purified by an easy two-step protocol. The bacteriocin was correctly expressed with the expected intramolecular disulfide bond. Nevertheless, it was found that a variant of the enterocin, differing by 12 Da from the native polypeptide, was co-expressed by E. coli Rosetta in comparable amount. Indeed, the mutant bacteriocin contained two amino acid substitutions that were characterized by matrix assisted laser desorption ionization-time of flight (MALDI-TOF and HPLCelectrospray (ESI-Q-TOF tandem mass spectrometry (MS/ MS sequencing. This is the first report regarding the production of mutants of pediocin-like bacteriocins in the E. coli expression system.

  2. Putative carotenoid genes expressed under the regulation of Shine-Dalgarno regions in Escherichia coli for efficient lycopene production.

    Science.gov (United States)

    Jin, Weiyue; Xu, Xian; Jiang, Ling; Zhang, Zhidong; Li, Shuang; Huang, He

    2015-11-01

    Putative genes crtE, crtB, and crtI from Deinococcus wulumiqiensis R12, a novel species, were identified by genome mining and were co-expressed using the optimized Shine-Dalgarno (SD) regions to improve lycopene yield. A lycopene biosynthesis pathway was constructed by co-expressing these three genes in Escherichia coli. After optimizing the upstream SD regions and the culture medium, the recombinant strain EDW11 produced 88 mg lycopene g(-1) dry cell wt (780 mg lycopene l(-1)) after 40 h fermentation without IPTG induction, while the strain EDW without optimized SD regions only produced 49 mg lycopene g(-1) dry cell wt (417 mg lycopene l(-1)). Based on the optimization of the upstream SD regions and culture medium, the yield of the strain EDW11 reached a high level during microbial lycopene production until now.

  3. Gene regulatory networks elucidating huanglongbing disease mechanisms.

    Directory of Open Access Journals (Sweden)

    Federico Martinelli

    Full Text Available Next-generation sequencing was exploited to gain deeper insight into the response to infection by Candidatus liberibacter asiaticus (CaLas, especially the immune disregulation and metabolic dysfunction caused by source-sink disruption. Previous fruit transcriptome data were compared with additional RNA-Seq data in three tissues: immature fruit, and young and mature leaves. Four categories of orchard trees were studied: symptomatic, asymptomatic, apparently healthy, and healthy. Principal component analysis found distinct expression patterns between immature and mature fruits and leaf samples for all four categories of trees. A predicted protein - protein interaction network identified HLB-regulated genes for sugar transporters playing key roles in the overall plant responses. Gene set and pathway enrichment analyses highlight the role of sucrose and starch metabolism in disease symptom development in all tissues. HLB-regulated genes (glucose-phosphate-transporter, invertase, starch-related genes would likely determine the source-sink relationship disruption. In infected leaves, transcriptomic changes were observed for light reactions genes (downregulation, sucrose metabolism (upregulation, and starch biosynthesis (upregulation. In parallel, symptomatic fruits over-expressed genes involved in photosynthesis, sucrose and raffinose metabolism, and downregulated starch biosynthesis. We visualized gene networks between tissues inducing a source-sink shift. CaLas alters the hormone crosstalk, resulting in weak and ineffective tissue-specific plant immune responses necessary for bacterial clearance. Accordingly, expression of WRKYs (including WRKY70 was higher in fruits than in leaves. Systemic acquired responses were inadequately activated in young leaves, generally considered the sites where most new infections occur.

  4. Co-expression of TIMP-1 and its cell surface binding partner CD63 in glioblastomas

    DEFF Research Database (Denmark)

    Aaberg-Jessen, Charlotte; Sørensen, Mia D.; Matos, Ana L.S.A.

    2018-01-01

    scoring. CD63 expression in tumor-associated microglia/macrophages was examined by double-immunofluorescence with ionized calcium-binding adapter molecule 1 (Iba1). The association between CD63 and TIMP-1 was investigated using previously obtained TIMP-1 data from our astrocytoma cohort. Cellular co-expression...... of CD63 was widely distributed in astrocytomas with a significantly increased level in glioblastomas. CD63 levels did not significantly correlate with patient survival at a protein level, and CD63 did not augment the prognostic significance of TIMP-1. Up to 38% of the CD63+ cells expressed Iba1; however......, Iba1 did not appear to impact the prognostic value of CD63. A significant correlation was found between TIMP-1 and CD63, and the TIMP-1 and CD63 proteins were co-expressed at the cellular level and located in close molecular proximity, suggesting that TIMP-1 and CD63 could be co...

  5. Pathways of Lipid Metabolism in Marine Algae, Co-Expression Network, Bottlenecks and Candidate Genes for Enhanced Production of EPA and DHA in Species of Chromista

    Directory of Open Access Journals (Sweden)

    Alice Mühlroth

    2013-11-01

    Full Text Available The importance of n-3 long chain polyunsaturated fatty acids (LC-PUFAs for human health has received more focus the last decades, and the global consumption of n-3 LC-PUFA has increased. Seafood, the natural n-3 LC-PUFA source, is harvested beyond a sustainable capacity, and it is therefore imperative to develop alternative n-3 LC-PUFA sources for both eicosapentaenoic acid (EPA, 20:5n-3 and docosahexaenoic acid (DHA, 22:6n-3. Genera of algae such as Nannochloropsis, Schizochytrium, Isochrysis and Phaedactylum within the kingdom Chromista have received attention due to their ability to produce n-3 LC-PUFAs. Knowledge of LC-PUFA synthesis and its regulation in algae at the molecular level is fragmentary and represents a bottleneck for attempts to enhance the n-3 LC-PUFA levels for industrial production. In the present review, Phaeodactylum tricornutum has been used to exemplify the synthesis and compartmentalization of n-3 LC-PUFAs. Based on recent transcriptome data a co-expression network of 106 genes involved in lipid metabolism has been created. Together with recent molecular biological and metabolic studies, a model pathway for n-3 LC-PUFA synthesis in P. tricornutum has been proposed, and is compared to industrialized species of Chromista. Limitations of the n-3 LC-PUFA synthesis by enzymes such as thioesterases, elongases, acyl-CoA synthetases and acyltransferases are discussed and metabolic bottlenecks are hypothesized such as the supply of the acetyl-CoA and NADPH. A future industrialization will depend on optimization of chemical compositions and increased biomass production, which can be achieved by exploitation of the physiological potential, by selective breeding and by genetic engineering.

  6. MIDBRAIN CATECHOLAMINERGIC NEURONS CO-EXPRESS α-SYNUCLEIN AND TAU IN PROGRESSIVE SUPRANUCLEAR PALSY

    Directory of Open Access Journals (Sweden)

    María Elena eErro Aguirre

    2015-03-01

    Full Text Available Objective: To analyze the frequency and distribution of α-synuclein deposits in progressive supranuclear palsy (PSP.Methods: The brains of 25 cases of pathologically confirmed PSP were evaluated with immunohistochemistry for α-synuclein and tau. Multiple immunofluorescent stains were applied to analyze the expression of tau and α-synuclein aggregates in catecholaminergic neurons. Patients’ clinical symptoms were retrospectively recorded. Results: Deposits α-synuclein in the form of typical Lewy bodies (LBs were only found in two PSP cases (8% that fulfilled the clinical subtype of PSP known as Richardson’s syndrome (RS. LBs were present in the locus ceruleus, substantia nigra pars compacta, basal forebrain, amygdala and cingulated cortex in a distribution mimicking that of Parkinson’s disease. Triple-immunolabeling revealed co-expression of α-synuclein and tau proteins in some tyrosine hydroxilase-positive neurons of the locus ceruleus and substantia nigra pars compacta.Conclusions: There is no apparent clinical correlation between the presence of LBs in PSP. Tau protein co-aggregate with α-synuclein in catecholaminergic neurons of PSP brains suggesting a synergistic interaction between the two proteins. This is in keeping with the current view of neurodegenerative disorders as ‘misfolded protein diseases’.

  7. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes

    DEFF Research Database (Denmark)

    de Jong, Simone; Boks, Marco P M; Fuller, Tova F

    2012-01-01

    Despite large-scale genome-wide association studies (GWAS), the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood...... of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co......, and regulated by the major histocompatibility (MHC) complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes...

  8. Fibrosis-Related Gene Expression in Single Ventricle Heart Disease.

    Science.gov (United States)

    Nakano, Stephanie J; Siomos, Austine K; Garcia, Anastacia M; Nguyen, Hieu; SooHoo, Megan; Galambos, Csaba; Nunley, Karin; Stauffer, Brian L; Sucharov, Carmen C; Miyamoto, Shelley D

    2017-12-01

    To evaluate fibrosis and fibrosis-related gene expression in the myocardium of pediatric subjects with single ventricle with right ventricular failure. Real-time quantitative polymerase chain reaction was performed on explanted right ventricular myocardium of pediatric subjects with single ventricle disease and controls with nonfailing heart disease. Subjects were divided into 3 groups: single ventricle failing (right ventricular failure before or after stage I palliation), single ventricle nonfailing (infants listed for primary transplantation with normal right ventricular function), and stage III (Fontan or right ventricular failure after stage III). To evaluate subjects of similar age and right ventricular volume loading, single ventricle disease with failure was compared with single ventricle without failure and stage III was compared with nonfailing right ventricular disease. Histologic fibrosis was assessed in all hearts. Mann-Whitney tests were performed to identify differences in gene expression. Collagen (Col1α, Col3) expression is decreased in single ventricle congenital heart disease with failure compared with nonfailing single ventricle congenital heart disease (P = .019 and P = .035, respectively), and is equivalent in stage III compared with nonfailing right ventricular heart disease. Tissue inhibitors of metalloproteinase (TIMP-1, TIMP-3, and TIMP-4) are downregulated in stage III compared with nonfailing right ventricular heart disease (P = .0047, P = .013 and P = .013, respectively). Matrix metalloproteinases (MMP-2, MMP-9) are similar between nonfailing single ventricular heart disease and failing single ventricular heart disease, and between stage III heart disease and nonfailing right ventricular heart disease. There is no difference in the prevalence of right ventricular fibrosis by histology in subjects with single ventricular failure heart disease with right ventricular failure (18%) compared with those with normal right

  9. Systems Pharmacogenomics Finds RUNX1 Is an Aspirin-Responsive Transcription Factor Linked to Cardiovascular Disease and Colon Cancer

    Directory of Open Access Journals (Sweden)

    Deepak Voora, MD

    2016-09-01

    Full Text Available Aspirin prevents cardiovascular disease and colon cancer; however aspirin's inhibition of platelet COX-1 only partially explains its diverse effects. We previously identified an aspirin response signature (ARS in blood consisting of 62 co-expressed transcripts that correlated with aspirin's effects on platelets and myocardial infarction (MI. Here we report that 60% of ARS transcripts are regulated by RUNX1 – a hematopoietic transcription factor - and 48% of ARS gene promoters contain a RUNX1 binding site. Megakaryocytic cells exposed to aspirin and its metabolite (salicylic acid, a weak COX-1 inhibitor showed up regulation in the RUNX1 P1 isoform and MYL9, which is transcriptionally regulated by RUNX1. In human subjects, RUNX1 P1 expression in blood and RUNX1-regulated platelet proteins, including MYL9, were aspirin-responsive and associated with platelet function. In cardiovascular disease patients RUNX1 P1 expression was associated with death or MI. RUNX1 acts as a tumor suppressor gene in gastrointestinal malignancies. We show that RUNX1 P1 expression is associated with colon cancer free survival suggesting a role for RUNX1 in aspirin's protective effect in colon cancer. Our studies reveal an effect of aspirin on RUNX1 and gene expression that may additionally explain aspirin's effects in cardiovascular disease and cancer.

  10. Mouse model of Epstein-Barr virus LMP1- and LMP2A-driven germinal center B-cell lymphoproliferative disease.

    Science.gov (United States)

    Minamitani, Takeharu; Ma, Yijie; Zhou, Hufeng; Kida, Hiroshi; Tsai, Chao-Yuan; Obana, Masanori; Okuzaki, Daisuke; Fujio, Yasushi; Kumanogoh, Atsushi; Zhao, Bo; Kikutani, Hitoshi; Kieff, Elliott; Gewurz, Benjamin E; Yasui, Teruhito

    2017-05-02

    Epstein-Barr virus (EBV) is a major cause of immunosuppression-related B-cell lymphomas and Hodgkin lymphoma (HL). In these malignancies, EBV latent membrane protein 1 (LMP1) and LMP2A provide infected B cells with surrogate CD40 and B-cell receptor growth and survival signals. To gain insights into their synergistic in vivo roles in germinal center (GC) B cells, from which most EBV-driven lymphomas arise, we generated a mouse model with conditional GC B-cell LMP1 and LMP2A coexpression. LMP1 and LMP2A had limited effects in immunocompetent mice. However, upon T- and NK-cell depletion, LMP1/2A caused massive plasmablast outgrowth, organ damage, and death. RNA-sequencing analyses identified EBV oncoprotein effects on GC B-cell target genes, including up-regulation of multiple proinflammatory chemokines and master regulators of plasma cell differentiation. LMP1/2A coexpression also up-regulated key HL markers, including CD30 and mixed hematopoietic lineage markers. Collectively, our results highlight synergistic EBV membrane oncoprotein effects on GC B cells and provide a model for studies of their roles in immunosuppression-related lymphoproliferative diseases.

  11. Fluid Mechanics, Arterial Disease, and Gene Expression.

    Science.gov (United States)

    Tarbell, John M; Shi, Zhong-Dong; Dunn, Jessilyn; Jo, Hanjoong

    2014-01-01

    This review places modern research developments in vascular mechanobiology in the context of hemodynamic phenomena in the cardiovascular system and the discrete localization of vascular disease. The modern origins of this field are traced, beginning in the 1960s when associations between flow characteristics, particularly blood flow-induced wall shear stress, and the localization of atherosclerotic plaques were uncovered, and continuing to fluid shear stress effects on the vascular lining endothelial) cells (ECs), including their effects on EC morphology, biochemical production, and gene expression. The earliest single-gene studies and genome-wide analyses are considered. The final section moves from the ECs lining the vessel wall to the smooth muscle cells and fibroblasts within the wall that are fluid me chanically activated by interstitial flow that imposes shear stresses on their surfaces comparable with those of flowing blood on EC surfaces. Interstitial flow stimulates biochemical production and gene expression, much like blood flow on ECs.

  12. Iron overload and HFE gene mutations in Czech patients with chronic liver diseases.

    Science.gov (United States)

    Dostalikova-Cimburova, Marketa; Kratka, Karolina; Stransky, Jaroslav; Putova, Ivana; Cieslarova, Blanka; Horak, Jiri

    2012-01-01

    The aim of the study was to identify the prevalence of HFE gene mutations in Czech patients with chronic liver diseases and the influence of the mutations on iron status. The presence of HFE gene mutations (C282Y, H63D, and S65C) analyzed by the PCR-RFLP method, presence of cirrhosis, and serum iron indices were compared among 454 patients with different chronic liver diseases (51 with chronic hepatitis B, 122 with chronic hepatitis C, 218 with alcoholic liver disease, and 63 patients with hemochromatosis). Chronic liver diseases patients other than hemochromatics did not have an increased frequency of HFE gene mutations compared to controls. Although 33.3% of patients with hepatitis B, 43% of patients with hepatitis C, and 73.2% of patients with alcoholic liver disease had elevated transferrin saturation or serum ferritin levels, the presence of HFE gene mutations was not significantly associated with iron overload in these patients. Additionally, patients with cirrhosis did not have frequencies of HFE mutations different from those without cirrhosis. This study emphasizes the importance, not only of C282Y, but also of the H63D homozygous genetic constellation in Czech hemochromatosis patients. Our findings show that increased iron indices are common in chronic liver diseases but {\\it HFE} mutations do not play an important role in the pathogenesis of chronic hepatitis B, chronic hepatitis C, and alcoholic liver disease.

  13. HerDing: herb recommendation system to treat diseases using genes and chemicals.

    Science.gov (United States)

    Choi, Wonjun; Choi, Chan-Hun; Kim, Young Ran; Kim, Seon-Jong; Na, Chang-Su; Lee, Hyunju

    2016-01-01

    In recent years, herbs have been researched for new drug candidates because they have a long empirical history of treating diseases and are relatively free from side effects. Studies to scientifically prove the medical efficacy of herbs for target diseases often spend a considerable amount of time and effort in choosing candidate herbs and in performing experiments to measure changes of marker genes when treating herbs. A computational approach to recommend herbs for treating diseases might be helpful to promote efficiency in the early stage of such studies. Although several databases related to traditional Chinese medicine have been already developed, there is no specialized Web tool yet recommending herbs to treat diseases based on disease-related genes. Therefore, we developed a novel search engine, HerDing, focused on retrieving candidate herb-related information with user search terms (a list of genes, a disease name, a chemical name or an herb name). HerDing was built by integrating public databases and by applying a text-mining method. The HerDing website is free and open to all users, and there is no login requirement. Database URL: http://combio.gist.ac.kr/herding. © The Author(s) 2016. Published by Oxford University Press.

  14. Global gene expression analysis of peripheral blood mononuclear cells in rhesus monkey infants with CA16 infection-induced HFMD.

    Science.gov (United States)

    Song, Jie; Hu, Yajie; Hu, Yunguang; Wang, Jingjing; Zhang, Xiaolong; Wang, Lichun; Guo, Lei; Wang, Yancui; Ning, Ruotong; Liao, Yun; Zhang, Ying; Zheng, Huiwen; Shi, Haijing; He, Zhanlong; Li, Qihan; Liu, Longding

    2016-03-02

    Coxsackievirus A16 (CA16) is a dominant pathogen that results in hand, foot, and mouth disease and causes outbreaks worldwide, particularly in the Asia-Pacific region. However, the underlying molecular mechanisms remain unclear. Our previous study has demonstrated that the basic CA16 pathogenic process was successfully mimicked in rhesus monkey infant. The present study focused on the global gene expression changes in peripheral blood mononuclear cells of rhesus monkey infants with hand, foot, and mouth disease induced by CA16 infection at different time points. Genome-wide expression analysis was performed with Agilent whole-genome microarrays and established bioinformatics tools. Nine hundred and forty-eight significant differentially expressed genes that were associated with 5 gene ontology categories, including cell communication, cell cycle, immune system process, regulation of transcription and metabolic process were identified. Subsequently, the mapping of genes related to the immune system process by PANTHER pathway analysis revealed the predominance of inflammation mediated by chemokine and cytokine signaling pathways and the interleukin signaling pathway. Ultimately, co-expressed genes and their networks were analyzed. The results revealed the gene expression profile of the immune system in response to CA16 in rhesus monkey infants and suggested that such an immune response was generated as a result of the positive mobilization of the immune system. This initial microarray study will provide insights into the molecular mechanism of CA16 infection and will facilitate the identification of biomarkers for the evaluation of vaccines against this virus. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. A database of annotated promoters of genes associated with common respiratory and related diseases

    KAUST Repository

    Chowdhary, Rajesh

    2012-07-01

    Many genes have been implicated in the pathogenesis of common respiratory and related diseases (RRDs), yet the underlying mechanisms are largely unknown. Differential gene expression patterns in diseased and healthy individuals suggest that RRDs affect or are affected by modified transcription regulation programs. It is thus crucial to characterize implicated genes in terms of transcriptional regulation. For this purpose, we conducted a promoter analysis of genes associated with 11 common RRDs including allergic rhinitis, asthma, bronchiectasis, bronchiolitis, bronchitis, chronic obstructive pulmonary disease, cystic fibrosis, emphysema, eczema, psoriasis, and urticaria, many of which are thought to be genetically related. The objective of the present study was to obtain deeper insight into the transcriptional regulation of these disease-associated genes by annotating their promoter regions with transcription factors (TFs) and TF binding sites (TFBSs). We discovered many TFs that are significantly enriched in the target disease groups including associations that have been documented in the literature. We also identified a number of putative TFs/TFBSs that appear to be novel. The results of our analysis are provided in an online database that is freely accessible to researchers at http://www.respiratorygenomics.com. Promoter-associated TFBS information and related genomic features, such as histone modification sites, microsatellites, CpG islands, and SNPs, are graphically summarized in the database. Users can compare and contrast underlying mechanisms of specific RRDs relative to candidate genes, TFs, gene ontology terms, micro-RNAs, and biological pathways for the conduct of metaanalyses. This database represents a novel, useful resource for RRD researchers. Copyright © 2012 by the American Thoracic Society.

  16. A database of annotated promoters of genes associated with common respiratory and related diseases

    KAUST Repository

    Chowdhary, Rajesh; Tan, Sinlam; Pavesi, Giulio; Jin, Gg; Dong, Difeng; Mathur, Sameer K.; Burkart, Arthur; Narang, Vipin; Glurich, Ingrid E.; Raby, Benjamin A.; Weiss, Scott T.; Limsoon, Wong; Liu, Jun; Bajic, Vladimir B.

    2012-01-01

    Many genes have been implicated in the pathogenesis of common respiratory and related diseases (RRDs), yet the underlying mechanisms are largely unknown. Differential gene expression patterns in diseased and healthy individuals suggest that RRDs affect or are affected by modified transcription regulation programs. It is thus crucial to characterize implicated genes in terms of transcriptional regulation. For this purpose, we conducted a promoter analysis of genes associated with 11 common RRDs including allergic rhinitis, asthma, bronchiectasis, bronchiolitis, bronchitis, chronic obstructive pulmonary disease, cystic fibrosis, emphysema, eczema, psoriasis, and urticaria, many of which are thought to be genetically related. The objective of the present study was to obtain deeper insight into the transcriptional regulation of these disease-associated genes by annotating their promoter regions with transcription factors (TFs) and TF binding sites (TFBSs). We discovered many TFs that are significantly enriched in the target disease groups including associations that have been documented in the literature. We also identified a number of putative TFs/TFBSs that appear to be novel. The results of our analysis are provided in an online database that is freely accessible to researchers at http://www.respiratorygenomics.com. Promoter-associated TFBS information and related genomic features, such as histone modification sites, microsatellites, CpG islands, and SNPs, are graphically summarized in the database. Users can compare and contrast underlying mechanisms of specific RRDs relative to candidate genes, TFs, gene ontology terms, micro-RNAs, and biological pathways for the conduct of metaanalyses. This database represents a novel, useful resource for RRD researchers. Copyright © 2012 by the American Thoracic Society.

  17. Disease susceptibility genes shared by primary biliary cirrhosis and Crohn's disease in the Japanese population.

    Science.gov (United States)

    Aiba, Yoshihiro; Yamazaki, Keiko; Nishida, Nao; Kawashima, Minae; Hitomi, Yuki; Nakamura, Hitomi; Komori, Atsumasa; Fuyuno, Yuta; Takahashi, Atsushi; Kawaguchi, Takaaki; Takazoe, Masakazu; Suzuki, Yasuo; Motoya, Satoshi; Matsui, Toshiyuki; Esaki, Motohiro; Matsumoto, Takayuki; Kubo, Michiaki; Tokunaga, Katsushi; Nakamura, Minoru

    2015-09-01

    We previously identified TNFSF15 as the most significant susceptibility gene at non-HLA loci for both primary biliary cirrhosis (PBC) and Crohn's diseases (CD) in the Japanese population. The aim of this study is to identify further disease susceptibility genes shared by PBC and CD. We selected 15 and 33 genetic variants that were significantly associated with PBC and CD, respectively, based on previously reported genome-wide association studies of the Japanese population. Next, an association study was independently performed for these genetic variants in CD (1312 CD patients and 3331 healthy controls) and PBC (1279 PBC patients and 1015 healthy controls) cohorts. Two CD susceptibility genes, ICOSLG rs2838519 and IL12B rs6556412, were also nominally associated with susceptibility to PBC (P=3.85 × 10(-2) and P=8.40 × 10(-3), respectively). Three PBC susceptibility genes, CXCR5 rs6421571, STAT4 rs7574865 and NFKB1 rs230534, were nominally associated with susceptibility to CD (P=2.82 × 10(-2), P=3.88 × 10(-2) and P=2.04 × 10(-2), respectively). The effect of ICOSLG and CXCR5 variants were concordant but the effect of STAT4, NFKB1 and IL12B variants were discordant for PBC and CD. TNFSF15 and ICOSLG-CXCR5 might constitute a shared pathogenic pathway in the development of PBC and CD in the Japanese population, whereas IL12B-STAT4-NFKB1 might constitute an opposite pathogenic pathway, reflecting the different balance between Th1 and Th17 in the two diseases.

  18. Gene targeted therapeutics for liver disease in alpha-1 antitrypsin deficiency.

    LENUS (Irish Health Repository)

    McLean, Caitriona

    2009-01-01

    Alpha-1 antitrypsin (A1AT) is a 52 kDa serine protease inhibitor that is synthesized in and secreted from the liver. Although it is present in all tissues in the body the present consensus is that its main role is to inhibit neutrophil elastase in the lung. A1AT deficiency occurs due to mutations of the A1AT gene that reduce serum A1AT levels to <35% of normal. The most clinically significant form of A1AT deficiency is caused by the Z mutation (Glu342Lys). ZA1AT polymerizes in the endoplasmic reticulum of liver cells and the resulting accumulation of the mutant protein can lead to liver disease, while the reduction in circulating A1AT can result in lung disease including early onset emphysema. There is currently no available treatment for the liver disease other than transplantation and therapies for the lung manifestations of the disease remain limited. Gene therapy is an evolving field which may be of use as a treatment for A1AT deficiency. As the liver disease associated with A1AT deficiency may represent a gain of function possible gene therapies for this condition include the use of ribozymes, peptide nucleic acids (PNAs) and RNA interference (RNAi), which by decreasing the amount of aberrant protein in cells may impact on the pathogenesis of the condition.

  19. Evidence for somatic gene conversion and deletion in bipolar disorder, Crohn's disease, coronary artery disease, hypertension, rheumatoid arthritis, type-1 diabetes, and type-2 diabetes

    Directory of Open Access Journals (Sweden)

    Ross Kenneth

    2011-02-01

    Full Text Available Abstract Background During gene conversion, genetic information is transferred unidirectionally between highly homologous but non-allelic regions of DNA. While germ-line gene conversion has been implicated in the pathogenesis of some diseases, somatic gene conversion has remained technically difficult to investigate on a large scale. Methods A novel analysis technique is proposed for detecting the signature of somatic gene conversion from SNP microarray data. The Wellcome Trust Case Control Consortium has gathered SNP microarray data for two control populations and cohorts for bipolar disorder (BD, cardiovascular disease (CAD, Crohn's disease (CD, hypertension (HT, rheumatoid arthritis (RA, type-1 diabetes (T1D and type-2 diabetes (T2D. Using the new analysis technique, the seven disease cohorts are analyzed to identify cohort-specific SNPs at which conversion is predicted. The quality of the predictions is assessed by identifying known disease associations for genes in the homologous duplicons, and comparing the frequency of such associations with background rates. Results Of 28 disease/locus pairs meeting stringent conditions, 22 show various degrees of disease association, compared with only 8 of 70 in a mock study designed to measure the background association rate (P -9. Additional candidate genes are identified using less stringent filtering conditions. In some cases, somatic deletions appear likely. RA has a distinctive pattern of events relative to other diseases. Similarities in patterns are apparent between BD and HT. Conclusions The associations derived represent the first evidence that somatic gene conversion could be a significant causative factor in each of the seven diseases. The specific genes provide potential insights about disease mechanisms, and are strong candidates for further study. Please see Commentary: http://www.biomedcentral.com/1741-7015/9/13/abstract.

  20. Vitiligo blood transcriptomics provides new insights into disease mechanisms and identifies potential novel therapeutic targets.

    Science.gov (United States)

    Dey-Rao, Rama; Sinha, Animesh A

    2017-01-28

    Significant gaps remain regarding the pathomechanisms underlying the autoimmune response in vitiligo (VL), where the loss of self-tolerance leads to the targeted killing of melanocytes. Specifically, there is incomplete information regarding alterations in the systemic environment that are relevant to the disease state. We undertook a genome-wide profiling approach to examine gene expression in the peripheral blood of VL patients and healthy controls in the context of our previously published VL-skin gene expression profile. We used several in silico bioinformatics-based analyses to provide new insights into disease mechanisms and suggest novel targets for future therapy. Unsupervised clustering methods of the VL-blood dataset demonstrate a "disease-state"-specific set of co-expressed genes. Ontology enrichment analysis of 99 differentially expressed genes (DEGs) uncovers a down-regulated immune/inflammatory response, B-Cell antigen receptor (BCR) pathways, apoptosis and catabolic processes in VL-blood. There is evidence for both type I and II interferon (IFN) playing a role in VL pathogenesis. We used interactome analysis to identify several key blood associated transcriptional factors (TFs) from within (STAT1, STAT6 and NF-kB), as well as "hidden" (CREB1, MYC, IRF4, IRF1, and TP53) from the dataset that potentially affect disease pathogenesis. The TFs overlap with our reported lesional-skin transcriptional circuitry, underscoring their potential importance to the disease. We also identify a shared VL-blood and -skin transcriptional "hot spot" that maps to chromosome 6, and includes three VL-blood dysregulated genes (PSMB8, PSMB9 and TAP1) described as potential VL-associated genetic susceptibility loci. Finally, we provide bioinformatics-based support for prioritizing dysregulated genes in VL-blood or skin as potential therapeutic targets. We examined the VL-blood transcriptome in context with our (previously published) VL-skin transcriptional profile to address