WorldWideScience

Sample records for based gene-gene interactions

  1. A Nonlinear Model for Gene-Based Gene-Environment Interaction

    Directory of Open Access Journals (Sweden)

    Jian Sa

    2016-06-01

    Full Text Available A vast amount of literature has confirmed the role of gene-environment (G×E interaction in the etiology of complex human diseases. Traditional methods are predominantly focused on the analysis of interaction between a single nucleotide polymorphism (SNP and an environmental variable. Given that genes are the functional units, it is crucial to understand how gene effects (rather than single SNP effects are influenced by an environmental variable to affect disease risk. Motivated by the increasing awareness of the power of gene-based association analysis over single variant based approach, in this work, we proposed a sparse principle component regression (sPCR model to understand the gene-based G×E interaction effect on complex disease. We first extracted the sparse principal components for SNPs in a gene, then the effect of each principal component was modeled by a varying-coefficient (VC model. The model can jointly model variants in a gene in which their effects are nonlinearly influenced by an environmental variable. In addition, the varying-coefficient sPCR (VC-sPCR model has nice interpretation property since the sparsity on the principal component loadings can tell the relative importance of the corresponding SNPs in each component. We applied our method to a human birth weight dataset in Thai population. We analyzed 12,005 genes across 22 chromosomes and found one significant interaction effect using the Bonferroni correction method and one suggestive interaction. The model performance was further evaluated through simulation studies. Our model provides a system approach to evaluate gene-based G×E interaction.

  2. Gene-based testing of interactions in association studies of quantitative traits.

    Directory of Open Access Journals (Sweden)

    Li Ma

    Full Text Available Various methods have been developed for identifying gene-gene interactions in genome-wide association studies (GWAS. However, most methods focus on individual markers as the testing unit, and the large number of such tests drastically erodes statistical power. In this study, we propose novel interaction tests of quantitative traits that are gene-based and that confer advantage in both statistical power and biological interpretation. The framework of gene-based gene-gene interaction (GGG tests combine marker-based interaction tests between all pairs of markers in two genes to produce a gene-level test for interaction between the two. The tests are based on an analytical formula we derive for the correlation between marker-based interaction tests due to linkage disequilibrium. We propose four GGG tests that extend the following P value combining methods: minimum P value, extended Simes procedure, truncated tail strength, and truncated P value product. Extensive simulations point to correct type I error rates of all tests and show that the two truncated tests are more powerful than the other tests in cases of markers involved in the underlying interaction not being directly genotyped and in cases of multiple underlying interactions. We applied our tests to pairs of genes that exhibit a protein-protein interaction to test for gene-level interactions underlying lipid levels using genotype data from the Atherosclerosis Risk in Communities study. We identified five novel interactions that are not evident from marker-based interaction testing and successfully replicated one of these interactions, between SMAD3 and NEDD9, in an independent sample from the Multi-Ethnic Study of Atherosclerosis. We conclude that our GGG tests show improved power to identify gene-level interactions in existing, as well as emerging, association studies.

  3. Ontology-based literature mining of E. coli vaccine-associated gene interaction networks.

    Science.gov (United States)

    Hur, Junguk; Özgür, Arzucan; He, Yongqun

    2017-03-14

    Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i.e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of

  4. Detection of Gene Interactions Based on Syntactic Relations

    Directory of Open Access Journals (Sweden)

    Mi-Young Kim

    2008-01-01

    Full Text Available Interactions between proteins and genes are considered essential in the description of biomolecular phenomena, and networks of interactions are applied in a system's biology approach. Recently, many studies have sought to extract information from biomolecular text using natural language processing technology. Previous studies have asserted that linguistic information is useful for improving the detection of gene interactions. In particular, syntactic relations among linguistic information are good for detecting gene interactions. However, previous systems give a reasonably good precision but poor recall. To improve recall without sacrificing precision, this paper proposes a three-phase method for detecting gene interactions based on syntactic relations. In the first phase, we retrieve syntactic encapsulation categories for each candidate agent and target. In the second phase, we construct a verb list that indicates the nature of the interaction between pairs of genes. In the last phase, we determine direction rules to detect which of two genes is the agent or target. Even without biomolecular knowledge, our method performs reasonably well using a small training dataset. While the first phase contributes to improve recall, the second and third phases contribute to improve precision. In the experimental results using ICML 05 Workshop on Learning Language in Logic (LLL05 data, our proposed method gave an F-measure of 67.2% for the test data, significantly outperforming previous methods. We also describe the contribution of each phase to the performance.

  5. GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies.

    Science.gov (United States)

    Yung, Ling Sing; Yang, Can; Wan, Xiang; Yu, Weichuan

    2011-05-01

    Collecting millions of genetic variations is feasible with the advanced genotyping technology. With a huge amount of genetic variations data in hand, developing efficient algorithms to carry out the gene-gene interaction analysis in a timely manner has become one of the key problems in genome-wide association studies (GWAS). Boolean operation-based screening and testing (BOOST), a recent work in GWAS, completes gene-gene interaction analysis in 2.5 days on a desktop computer. Compared with central processing units (CPUs), graphic processing units (GPUs) are highly parallel hardware and provide massive computing resources. We are, therefore, motivated to use GPUs to further speed up the analysis of gene-gene interactions. We implement the BOOST method based on a GPU framework and name it GBOOST. GBOOST achieves a 40-fold speedup compared with BOOST. It completes the analysis of Wellcome Trust Case Control Consortium Type 2 Diabetes (WTCCC T2D) genome data within 1.34 h on a desktop computer equipped with Nvidia GeForce GTX 285 display card. GBOOST code is available at http://bioinformatics.ust.hk/BOOST.html#GBOOST.

  6. A genetic ensemble approach for gene-gene interaction identification

    Directory of Open Access Journals (Sweden)

    Ho Joshua WK

    2010-10-01

    Full Text Available Abstract Background It has now become clear that gene-gene interactions and gene-environment interactions are ubiquitous and fundamental mechanisms for the development of complex diseases. Though a considerable effort has been put into developing statistical models and algorithmic strategies for identifying such interactions, the accurate identification of those genetic interactions has been proven to be very challenging. Methods In this paper, we propose a new approach for identifying such gene-gene and gene-environment interactions underlying complex diseases. This is a hybrid algorithm and it combines genetic algorithm (GA and an ensemble of classifiers (called genetic ensemble. Using this approach, the original problem of SNP interaction identification is converted into a data mining problem of combinatorial feature selection. By collecting various single nucleotide polymorphisms (SNP subsets as well as environmental factors generated in multiple GA runs, patterns of gene-gene and gene-environment interactions can be extracted using a simple combinatorial ranking method. Also considered in this study is the idea of combining identification results obtained from multiple algorithms. A novel formula based on pairwise double fault is designed to quantify the degree of complementarity. Conclusions Our simulation study demonstrates that the proposed genetic ensemble algorithm has comparable identification power to Multifactor Dimensionality Reduction (MDR and is slightly better than Polymorphism Interaction Analysis (PIA, which are the two most popular methods for gene-gene interaction identification. More importantly, the identification results generated by using our genetic ensemble algorithm are highly complementary to those obtained by PIA and MDR. Experimental results from our simulation studies and real world data application also confirm the effectiveness of the proposed genetic ensemble algorithm, as well as the potential benefits of

  7. Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions.

    Science.gov (United States)

    Hur, Junguk; Özgür, Arzucan; Xiang, Zuoshuang; He, Yongqun

    2015-01-01

    Literature mining of gene-gene interactions has been enhanced by ontology-based name classifications. However, in biomedical literature mining, interaction keywords have not been carefully studied and used beyond a collection of keywords. In this study, we report the development of a new Interaction Network Ontology (INO) that classifies >800 interaction keywords and incorporates interaction terms from the PSI Molecular Interactions (PSI-MI) and Gene Ontology (GO). Using INO-based literature mining results, a modified Fisher's exact test was established to analyze significantly over- and under-represented enriched gene-gene interaction types within a specific area. Such a strategy was applied to study the vaccine-mediated gene-gene interactions using all PubMed abstracts. The Vaccine Ontology (VO) and INO were used to support the retrieval of vaccine terms and interaction keywords from the literature. INO is aligned with the Basic Formal Ontology (BFO) and imports terms from 10 other existing ontologies. Current INO includes 540 terms. In terms of interaction-related terms, INO imports and aligns PSI-MI and GO interaction terms and includes over 100 newly generated ontology terms with 'INO_' prefix. A new annotation property, 'has literature mining keywords', was generated to allow the listing of different keywords mapping to the interaction types in INO. Using all PubMed documents published as of 12/31/2013, approximately 266,000 vaccine-associated documents were identified, and a total of 6,116 gene-pairs were associated with at least one INO term. Out of 78 INO interaction terms associated with at least five gene-pairs of the vaccine-associated sub-network, 14 terms were significantly over-represented (i.e., more frequently used) and 17 under-represented based on our modified Fisher's exact test. These over-represented and under-represented terms share some common top-level terms but are distinct at the bottom levels of the INO hierarchy. The analysis of these

  8. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  9. Gene-based interaction analysis shows GABAergic genes interacting with parenting in adolescent depressive symptoms

    NARCIS (Netherlands)

    Van Assche, Evelien; Moons, Tim; Cinar, Ozan; Viechtbauer, Wolfgang; Oldehinkel, Albertine J.; Van Leeuwen, Karla; Verschueren, Karine; Colpin, Hilde; Lambrechts, Diether; Van den Noortgate, Wim; Goossens, Luc; Claes, Stephan; van Winkel, Ruud

    2017-01-01

    BACKGROUND: Most gene-environment interaction studies (G × E) have focused on single candidate genes. This approach is criticized for its expectations of large effect sizes and occurrence of spurious results. We describe an approach that accounts for the polygenic nature of most psychiatric

  10. Two-Way Gene Interaction From Microarray Data Based on Correlation Methods.

    Science.gov (United States)

    Alavi Majd, Hamid; Talebi, Atefeh; Gilany, Kambiz; Khayyer, Nasibeh

    2016-06-01

    Gene networks have generated a massive explosion in the development of high-throughput techniques for monitoring various aspects of gene activity. Networks offer a natural way to model interactions between genes, and extracting gene network information from high-throughput genomic data is an important and difficult task. The purpose of this study is to construct a two-way gene network based on parametric and nonparametric correlation coefficients. The first step in constructing a Gene Co-expression Network is to score all pairs of gene vectors. The second step is to select a score threshold and connect all gene pairs whose scores exceed this value. In the foundation-application study, we constructed two-way gene networks using nonparametric methods, such as Spearman's rank correlation coefficient and Blomqvist's measure, and compared them with Pearson's correlation coefficient. We surveyed six genes of venous thrombosis disease, made a matrix entry representing the score for the corresponding gene pair, and obtained two-way interactions using Pearson's correlation, Spearman's rank correlation, and Blomqvist's coefficient. Finally, these methods were compared with Cytoscape, based on BIND, and Gene Ontology, based on molecular function visual methods; R software version 3.2 and Bioconductor were used to perform these methods. Based on the Pearson and Spearman correlations, the results were the same and were confirmed by Cytoscape and GO visual methods; however, Blomqvist's coefficient was not confirmed by visual methods. Some results of the correlation coefficients are not the same with visualization. The reason may be due to the small number of data.

  11. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    Science.gov (United States)

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.

  12. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases

    Directory of Open Access Journals (Sweden)

    Ma'ayan Avi

    2007-10-01

    Full Text Available Abstract Background In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP, generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Results Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Conclusion Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.

  13. A Partial Least Square Approach for Modeling Gene-gene and Gene-environment Interactions When Multiple Markers Are Genotyped

    Science.gov (United States)

    Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C.

    2008-01-01

    Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense SNPs in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches: the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey’s 1-df model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women’s Health Initiative (WHI), this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with BMI. PMID:18615621

  14. A partial least-square approach for modeling gene-gene and gene-environment interactions when multiple markers are genotyped.

    Science.gov (United States)

    Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C

    2009-01-01

    Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense single nucleotype polymorphisms (SNPs) in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches, the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey's one-degree-of-freedom model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women's Health Initiative, this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with body mass index.

  15. Double-Bottom Chaotic Map Particle Swarm Optimization Based on Chi-Square Test to Determine Gene-Gene Interactions

    Science.gov (United States)

    Yang, Cheng-Hong; Chang, Hsueh-Wei

    2014-01-01

    Gene-gene interaction studies focus on the investigation of the association between the single nucleotide polymorphisms (SNPs) of genes for disease susceptibility. Statistical methods are widely used to search for a good model of gene-gene interaction for disease analysis, and the previously determined models have successfully explained the effects between SNPs and diseases. However, the huge numbers of potential combinations of SNP genotypes limit the use of statistical methods for analysing high-order interaction, and finding an available high-order model of gene-gene interaction remains a challenge. In this study, an improved particle swarm optimization with double-bottom chaotic maps (DBM-PSO) was applied to assist statistical methods in the analysis of associated variations to disease susceptibility. A big data set was simulated using the published genotype frequencies of 26 SNPs amongst eight genes for breast cancer. Results showed that the proposed DBM-PSO successfully determined two- to six-order models of gene-gene interaction for the risk association with breast cancer (odds ratio > 1.0; P value <0.05). Analysis results supported that the proposed DBM-PSO can identify good models and provide higher chi-square values than conventional PSO. This study indicates that DBM-PSO is a robust and precise algorithm for determination of gene-gene interaction models for breast cancer. PMID:24895547

  16. Research progress in machine learning methods for gene-gene interaction detection.

    Science.gov (United States)

    Peng, Zhe-Ye; Tang, Zi-Jun; Xie, Min-Zhu

    2018-03-20

    Complex diseases are results of gene-gene and gene-environment interactions. However, the detection of high-dimensional gene-gene interactions is computationally challenging. In the last two decades, machine-learning approaches have been developed to detect gene-gene interactions with some successes. In this review, we summarize the progress in research on machine learning methods, as applied to gene-gene interaction detection. It systematically examines the principles and limitations of the current machine learning methods used in genome wide association studies (GWAS) to detect gene-gene interactions, such as neural networks (NN), random forest (RF), support vector machines (SVM) and multifactor dimensionality reduction (MDR), and provides some insights on the future research directions in the field.

  17. A kernel regression approach to gene-gene interaction detection for case-control studies.

    Science.gov (United States)

    Larson, Nicholas B; Schaid, Daniel J

    2013-11-01

    Gene-gene interactions are increasingly being addressed as a potentially important contributor to the variability of complex traits. Consequently, attentions have moved beyond single locus analysis of association to more complex genetic models. Although several single-marker approaches toward interaction analysis have been developed, such methods suffer from very high testing dimensionality and do not take advantage of existing information, notably the definition of genes as functional units. Here, we propose a comprehensive family of gene-level score tests for identifying genetic elements of disease risk, in particular pairwise gene-gene interactions. Using kernel machine methods, we devise score-based variance component tests under a generalized linear mixed model framework. We conducted simulations based upon coalescent genetic models to evaluate the performance of our approach under a variety of disease models. These simulations indicate that our methods are generally higher powered than alternative gene-level approaches and at worst competitive with exhaustive SNP-level (where SNP is single-nucleotide polymorphism) analyses. Furthermore, we observe that simulated epistatic effects resulted in significant marginal testing results for the involved genes regardless of whether or not true main effects were present. We detail the benefits of our methods and discuss potential genome-wide analysis strategies for gene-gene interaction analysis in a case-control study design. © 2013 WILEY PERIODICALS, INC.

  18. Gene-gene interactions and gene polymorphisms of VEGFA and EG-VEGF gene systems in recurrent pregnancy loss.

    Science.gov (United States)

    Su, Mei-Tsz; Lin, Sheng-Hsiang; Chen, Yi-Chi; Kuo, Pao-Lin

    2014-06-01

    Both vascular endothelial growth factor A (VEGFA) and endocrine gland-derived vascular endothelial growth factor (EG-VEGF) systems play major roles in angiogenesis. A body of evidence suggests VEGFs regulate critical processes during pregnancy and have been associated with recurrent pregnancy loss (RPL). However, little information is available regarding the interaction of these two major major angiogenesis-related systems in early human pregnancy. This study was conducted to investigate the association of gene polymorphisms and gene-gene interaction among genes in VEGFA and EG-VEGF systems and idiopathic RPL. A total of 98 women with history of idiopathic RPL and 142 controls were included, and 5 functional SNPs selected from VEGFA, KDR, EG-VEGF (PROK1), PROKR1 and PROKR2 were genotyped. We used multifactor dimensionality reduction (MDR) analysis to choose a best model and evaluate gene-gene interactions. Ingenuity pathways analysis (IPA) was introduced to explore possible complex interactions. Two receptor gene polymorphisms [KDR (Q472H) and PROKR2 (V331M)] were significantly associated with idiopathic RPL (P<0.01). The MDR test revealed that the KDR (Q472H) polymorphism was the best loci to be associated with RPL (P=0.02). IPA revealed EG-VEGF and VEGFA systems shared several canonical signaling pathways that may contribute to gene-gene interactions, including the Akt, IL-8, EGFR, MAPK, SRC, VHL, HIF-1A and STAT3 signaling pathways. Two receptor gene polymorphisms [KDR (Q472H) and PROKR2 (V331M)] were significantly associated with idiopathic RPL. EG-VEGF and VEGFA systems shared several canonical signaling pathways that may contribute to gene-gene interactions, including the Akt, IL-8, EGFR, MAPK, SRC, VHL, HIF-1A and STAT3.

  19. Genome-wide identification of key modulators of gene-gene interaction networks in breast cancer.

    Science.gov (United States)

    Chiu, Yu-Chiao; Wang, Li-Ju; Hsiao, Tzu-Hung; Chuang, Eric Y; Chen, Yidong

    2017-10-03

    With the advances in high-throughput gene profiling technologies, a large volume of gene interaction maps has been constructed. A higher-level layer of gene-gene interaction, namely modulate gene interaction, is composed of gene pairs of which interaction strengths are modulated by (i.e., dependent on) the expression level of a key modulator gene. Systematic investigations into the modulation by estrogen receptor (ER), the best-known modulator gene, have revealed the functional and prognostic significance in breast cancer. However, a genome-wide identification of key modulator genes that may further unveil the landscape of modulated gene interaction is still lacking. We proposed a systematic workflow to screen for key modulators based on genome-wide gene expression profiles. We designed four modularity parameters to measure the ability of a putative modulator to perturb gene interaction networks. Applying the method to a dataset of 286 breast tumors, we comprehensively characterized the modularity parameters and identified a total of 973 key modulator genes. The modularity of these modulators was verified in three independent breast cancer datasets. ESR1, the encoding gene of ER, appeared in the list, and abundant novel modulators were illuminated. For instance, a prognostic predictor of breast cancer, SFRP1, was found the second modulator. Functional annotation analysis of the 973 modulators revealed involvements in ER-related cellular processes as well as immune- and tumor-associated functions. Here we present, as far as we know, the first comprehensive analysis of key modulator genes on a genome-wide scale. The validity of filtering parameters as well as the conservativity of modulators among cohorts were corroborated. Our data bring new insights into the modulated layer of gene-gene interaction and provide candidates for further biological investigations.

  20. GxGrare: gene-gene interaction analysis method for rare variants from high-throughput sequencing data.

    Science.gov (United States)

    Kwon, Minseok; Leem, Sangseob; Yoon, Joon; Park, Taesung

    2018-03-19

    With the rapid advancement of array-based genotyping techniques, genome-wide association studies (GWAS) have successfully identified common genetic variants associated with common complex diseases. However, it has been shown that only a small proportion of the genetic etiology of complex diseases could be explained by the genetic factors identified from GWAS. This missing heritability could possibly be explained by gene-gene interaction (epistasis) and rare variants. There has been an exponential growth of gene-gene interaction analysis for common variants in terms of methodological developments and practical applications. Also, the recent advancement of high-throughput sequencing technologies makes it possible to conduct rare variant analysis. However, little progress has been made in gene-gene interaction analysis for rare variants. Here, we propose GxGrare which is a new gene-gene interaction method for the rare variants in the framework of the multifactor dimensionality reduction (MDR) analysis. The proposed method consists of three steps; 1) collapsing the rare variants, 2) MDR analysis for the collapsed rare variants, and 3) detect top candidate interaction pairs. GxGrare can be used for the detection of not only gene-gene interactions, but also interactions within a single gene. The proposed method is illustrated with 1080 whole exome sequencing data of the Korean population in order to identify causal gene-gene interaction for rare variants for type 2 diabetes. The proposed GxGrare performs well for gene-gene interaction detection with collapsing of rare variants. GxGrare is available at http://bibs.snu.ac.kr/software/gxgrare which contains simulation data and documentation. Supported operating systems include Linux and OS X.

  1. Gene-Gene and Gene-Environment Interactions in the Etiology of Breast Cancer

    National Research Council Canada - National Science Library

    Adegoke, Olufemi

    2003-01-01

    The objective of this CDA is to evaluate the gene-gene and gene-environment interactions in the etiology of breast cancer in two ongoing case-control studies, the Shanghai Breast Cancer Study (SBCS...

  2. Combining many interaction networks to predict gene function and analyze gene lists.

    Science.gov (United States)

    Mostafavi, Sara; Morris, Quaid

    2012-05-01

    In this article, we review how interaction networks can be used alone or in combination in an automated fashion to provide insight into gene and protein function. We describe the concept of a "gene-recommender system" that can be applied to any large collection of interaction networks to make predictions about gene or protein function based on a query list of proteins that share a function of interest. We discuss these systems in general and focus on one specific system, GeneMANIA, that has unique features and uses different algorithms from the majority of other systems. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. Duplicability of self-interacting human genes.

    LENUS (Irish Health Repository)

    Pérez-Bercoff, Asa

    2010-01-01

    BACKGROUND: There is increasing interest in the evolution of protein-protein interactions because this should ultimately be informative of the patterns of evolution of new protein functions within the cell. One model proposes that the evolution of new protein-protein interactions and protein complexes proceeds through the duplication of self-interacting genes. This model is supported by data from yeast. We examined the relationship between gene duplication and self-interaction in the human genome. RESULTS: We investigated the patterns of self-interaction and duplication among 34808 interactions encoded by 8881 human genes, and show that self-interacting proteins are encoded by genes with higher duplicability than genes whose proteins lack this type of interaction. We show that this result is robust against the system used to define duplicate genes. Finally we compared the presence of self-interactions amongst proteins whose genes have duplicated either through whole-genome duplication (WGD) or small-scale duplication (SSD), and show that the former tend to have more interactions in general. After controlling for age differences between the two sets of duplicates this result can be explained by the time since the gene duplication. CONCLUSIONS: Genes encoding self-interacting proteins tend to have higher duplicability than proteins lacking self-interactions. Moreover these duplicate genes have more often arisen through whole-genome rather than small-scale duplication. Finally, self-interacting WGD genes tend to have more interaction partners in general in the PIN, which can be explained by their overall greater age. This work adds to our growing knowledge of the importance of contextual factors in gene duplicability.

  4. Animal models of gene-environment interactions in schizophrenia.

    Science.gov (United States)

    Ayhan, Yavuz; Sawa, Akira; Ross, Christopher A; Pletnikov, Mikhail V

    2009-12-07

    The pathogenesis of schizophrenia and related mental illnesses likely involves multiple interactions between susceptibility genes of small effects and environmental factors. Gene-environment interactions occur across different stages of neurodevelopment to produce heterogeneous clinical and pathological manifestations of the disease. The main obstacle for mechanistic studies of gene-environment interplay has been the paucity of appropriate experimental systems for elucidating the molecular pathways that mediate gene-environment interactions relevant to schizophrenia. Recent advances in psychiatric genetics and a plethora of experimental data from animal studies allow us to suggest a new approach to gene-environment interactions in schizophrenia. We propose that animal models based on identified genetic mutations and measurable environment factors will help advance studies of the molecular mechanisms of gene-environment interplay.

  5. Methodological issues in detecting gene-gene interactions in breast cancer susceptibility: a population-based study in Ontario

    Directory of Open Access Journals (Sweden)

    Onay Venus

    2007-08-01

    Full Text Available Abstract Background There is growing evidence that gene-gene interactions are ubiquitous in determining the susceptibility to common human diseases. The investigation of such gene-gene interactions presents new statistical challenges for studies with relatively small sample sizes as the number of potential interactions in the genome can be large. Breast cancer provides a useful paradigm to study genetically complex diseases because commonly occurring single nucleotide polymorphisms (SNPs may additively or synergistically disturb the system-wide communication of the cellular processes leading to cancer development. Methods In this study, we systematically studied SNP-SNP interactions among 19 SNPs from 18 key genes involved in major cancer pathways in a sample of 398 breast cancer cases and 372 controls from Ontario. We discuss the methodological issues associated with the detection of SNP-SNP interactions in this dataset by applying and comparing three commonly used methods: the logistic regression model, classification and regression trees (CART, and the multifactor dimensionality reduction (MDR method. Results Our analyses show evidence for several simple (two-way and complex (multi-way SNP-SNP interactions associated with breast cancer. For example, all three methods identified XPD-[Lys751Gln]*IL10-[G(-1082A] as the most significant two-way interaction. CART and MDR identified the same critical SNPs participating in complex interactions. Our results suggest that the use of multiple statistical approaches (or an integrated approach rather than a single methodology could be the best strategy to elucidate complex gene interactions that have generally very different patterns. Conclusion The strategy used here has the potential to identify complex biological relationships among breast cancer genes and processes. This will lead to the discovery of novel biological information, which will improve breast cancer risk management.

  6. Gene-gene, gene-environment, gene-nutrient interactions and single nucleotide polymorphisms of inflammatory cytokines.

    Science.gov (United States)

    Nadeem, Amina; Mumtaz, Sadaf; Naveed, Abdul Khaliq; Aslam, Muhammad; Siddiqui, Arif; Lodhi, Ghulam Mustafa; Ahmad, Tausif

    2015-05-15

    Inflammation plays a significant role in the etiology of type 2 diabetes mellitus (T2DM). The rise in the pro-inflammatory cytokines is the essential step in glucotoxicity and lipotoxicity induced mitochondrial injury, oxidative stress and beta cell apoptosis in T2DM. Among the recognized markers are interleukin (IL)-6, IL-1, IL-10, IL-18, tissue necrosis factor-alpha (TNF-α), C-reactive protein, resistin, adiponectin, tissue plasminogen activator, fibrinogen and heptoglobins. Diabetes mellitus has firm genetic and very strong environmental influence; exhibiting a polygenic mode of inheritance. Many single nucleotide polymorphisms (SNPs) in various genes including those of pro and anti-inflammatory cytokines have been reported as a risk for T2DM. Not all the SNPs have been confirmed by unifying results in different studies and wide variations have been reported in various ethnic groups. The inter-ethnic variations can be explained by the fact that gene expression may be regulated by gene-gene, gene-environment and gene-nutrient interactions. This review highlights the impact of these interactions on determining the role of single nucleotide polymorphism of IL-6, TNF-α, resistin and adiponectin in pathogenesis of T2DM.

  7. Systematically characterizing and prioritizing chemosensitivity related gene based on Gene Ontology and protein interaction network

    Directory of Open Access Journals (Sweden)

    Chen Xin

    2012-10-01

    Full Text Available Abstract Background The identification of genes that predict in vitro cellular chemosensitivity of cancer cells is of great importance. Chemosensitivity related genes (CRGs have been widely utilized to guide clinical and cancer chemotherapy decisions. In addition, CRGs potentially share functional characteristics and network features in protein interaction networks (PPIN. Methods In this study, we proposed a method to identify CRGs based on Gene Ontology (GO and PPIN. Firstly, we documented 150 pairs of drug-CCRG (curated chemosensitivity related gene from 492 published papers. Secondly, we characterized CCRGs from the perspective of GO and PPIN. Thirdly, we prioritized CRGs based on CCRGs’ GO and network characteristics. Lastly, we evaluated the performance of the proposed method. Results We found that CCRG enriched GO terms were most often related to chemosensitivity and exhibited higher similarity scores compared to randomly selected genes. Moreover, CCRGs played key roles in maintaining the connectivity and controlling the information flow of PPINs. We then prioritized CRGs using CCRG enriched GO terms and CCRG network characteristics in order to obtain a database of predicted drug-CRGs that included 53 CRGs, 32 of which have been reported to affect susceptibility to drugs. Our proposed method identifies a greater number of drug-CCRGs, and drug-CCRGs are much more significantly enriched in predicted drug-CRGs, compared to a method based on the correlation of gene expression and drug activity. The mean area under ROC curve (AUC for our method is 65.2%, whereas that for the traditional method is 55.2%. Conclusions Our method not only identifies CRGs with expression patterns strongly correlated with drug activity, but also identifies CRGs in which expression is weakly correlated with drug activity. This study provides the framework for the identification of signatures that predict in vitro cellular chemosensitivity and offers a valuable

  8. A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited.

    Science.gov (United States)

    Mielniczuk, Jan; Teisseyre, Paweł

    2018-03-01

    Detection of gene-gene interactions is one of the most important challenges in genome-wide case-control studies. Besides traditional logistic regression analysis, recently the entropy-based methods attracted a significant attention. Among entropy-based methods, interaction information is one of the most promising measures having many desirable properties. Although both logistic regression and interaction information have been used in several genome-wide association studies, the relationship between them has not been thoroughly investigated theoretically. The present paper attempts to fill this gap. We show that although certain connections between the two methods exist, in general they refer two different concepts of dependence and looking for interactions in those two senses leads to different approaches to interaction detection. We introduce ordering between interaction measures and specify conditions for independent and dependent genes under which interaction information is more discriminative measure than logistic regression. Moreover, we show that for so-called perfect distributions those measures are equivalent. The numerical experiments illustrate the theoretical findings indicating that interaction information and its modified version are more universal tools for detecting various types of interaction than logistic regression and linkage disequilibrium measures. © 2017 WILEY PERIODICALS, INC.

  9. Ultrahigh-dimensional variable selection method for whole-genome gene-gene interaction analysis

    Directory of Open Access Journals (Sweden)

    Ueki Masao

    2012-05-01

    Full Text Available Abstract Background Genome-wide gene-gene interaction analysis using single nucleotide polymorphisms (SNPs is an attractive way for identification of genetic components that confers susceptibility of human complex diseases. Individual hypothesis testing for SNP-SNP pairs as in common genome-wide association study (GWAS however involves difficulty in setting overall p-value due to complicated correlation structure, namely, the multiple testing problem that causes unacceptable false negative results. A large number of SNP-SNP pairs than sample size, so-called the large p small n problem, precludes simultaneous analysis using multiple regression. The method that overcomes above issues is thus needed. Results We adopt an up-to-date method for ultrahigh-dimensional variable selection termed the sure independence screening (SIS for appropriate handling of numerous number of SNP-SNP interactions by including them as predictor variables in logistic regression. We propose ranking strategy using promising dummy coding methods and following variable selection procedure in the SIS method suitably modified for gene-gene interaction analysis. We also implemented the procedures in a software program, EPISIS, using the cost-effective GPGPU (General-purpose computing on graphics processing units technology. EPISIS can complete exhaustive search for SNP-SNP interactions in standard GWAS dataset within several hours. The proposed method works successfully in simulation experiments and in application to real WTCCC (Wellcome Trust Case–control Consortium data. Conclusions Based on the machine-learning principle, the proposed method gives powerful and flexible genome-wide search for various patterns of gene-gene interaction.

  10. Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes

    KAUST Repository

    AlShahrani, Mona; Hoehndorf, Robert

    2018-01-01

    In the past years, several methods have been developed to incorporate information about phenotypes into computational disease gene prioritization methods. These methods commonly compute the similarity between a disease's (or patient's) phenotypes and a database of gene-to-phenotype associations to find the phenotypically most similar match. A key limitation of these methods is their reliance on knowledge about phenotypes associated with particular genes which is highly incomplete in humans as well as in many model organisms such as the mouse. Results: We developed SmuDGE, a method that uses feature learning to generate vector-based representations of phenotypes associated with an entity. SmuDGE can be used as a trainable semantic similarity measure to compare two sets of phenotypes (such as between a disease and gene, or a disease and patient). More importantly, SmuDGE can generate phenotype representations for entities that are only indirectly associated with phenotypes through an interaction network; for this purpose, SmuDGE exploits background knowledge in interaction networks comprising of multiple types of interactions. We demonstrate that SmuDGE can match or outperform semantic similarity in phenotype-based disease gene prioritization, and furthermore significantly extends the coverage of phenotype-based methods to all genes in a connected interaction network.

  11. Semantic Disease Gene Embeddings (SmuDGE): phenotype-based disease gene prioritization without phenotypes

    KAUST Repository

    Alshahrani, Mona

    2018-04-30

    In the past years, several methods have been developed to incorporate information about phenotypes into computational disease gene prioritization methods. These methods commonly compute the similarity between a disease\\'s (or patient\\'s) phenotypes and a database of gene-to-phenotype associations to find the phenotypically most similar match. A key limitation of these methods is their reliance on knowledge about phenotypes associated with particular genes which is highly incomplete in humans as well as in many model organisms such as the mouse. Results: We developed SmuDGE, a method that uses feature learning to generate vector-based representations of phenotypes associated with an entity. SmuDGE can be used as a trainable semantic similarity measure to compare two sets of phenotypes (such as between a disease and gene, or a disease and patient). More importantly, SmuDGE can generate phenotype representations for entities that are only indirectly associated with phenotypes through an interaction network; for this purpose, SmuDGE exploits background knowledge in interaction networks comprising of multiple types of interactions. We demonstrate that SmuDGE can match or outperform semantic similarity in phenotype-based disease gene prioritization, and furthermore significantly extends the coverage of phenotype-based methods to all genes in a connected interaction network.

  12. Clustering gene expression data based on predicted differential effects of GV interaction.

    Science.gov (United States)

    Pan, Hai-Yan; Zhu, Jun; Han, Dan-Fu

    2005-02-01

    Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.

  13. Identification of human disease genes from interactome network using graphlet interaction.

    Directory of Open Access Journals (Sweden)

    Xiao-Dong Wang

    Full Text Available Identifying genes related to human diseases, such as cancer and cardiovascular disease, etc., is an important task in biomedical research because of its applications in disease diagnosis and treatment. Interactome networks, especially protein-protein interaction networks, had been used to disease genes identification based on the hypothesis that strong candidate genes tend to closely relate to each other in some kinds of measure on the network. We proposed a new measure to analyze the relationship between network nodes which was called graphlet interaction. The graphlet interaction contained 28 different isomers. The results showed that the numbers of the graphlet interaction isomers between disease genes in interactome networks were significantly larger than random picked genes, while graphlet signatures were not. Then, we designed a new type of score, based on the network properties, to identify disease genes using graphlet interaction. The genes with higher scores were more likely to be disease genes, and all candidate genes were ranked according to their scores. Then the approach was evaluated by leave-one-out cross-validation. The precision of the current approach achieved 90% at about 10% recall, which was apparently higher than the previous three predominant algorithms, random walk, Endeavour and neighborhood based method. Finally, the approach was applied to predict new disease genes related to 4 common diseases, most of which were identified by other independent experimental researches. In conclusion, we demonstrate that the graphlet interaction is an effective tool to analyze the network properties of disease genes, and the scores calculated by graphlet interaction is more precise in identifying disease genes.

  14. Identification of Human Disease Genes from Interactome Network Using Graphlet Interaction

    Science.gov (United States)

    Yang, Lun; Wei, Dong-Qing; Qi, Ying-Xin; Jiang, Zong-Lai

    2014-01-01

    Identifying genes related to human diseases, such as cancer and cardiovascular disease, etc., is an important task in biomedical research because of its applications in disease diagnosis and treatment. Interactome networks, especially protein-protein interaction networks, had been used to disease genes identification based on the hypothesis that strong candidate genes tend to closely relate to each other in some kinds of measure on the network. We proposed a new measure to analyze the relationship between network nodes which was called graphlet interaction. The graphlet interaction contained 28 different isomers. The results showed that the numbers of the graphlet interaction isomers between disease genes in interactome networks were significantly larger than random picked genes, while graphlet signatures were not. Then, we designed a new type of score, based on the network properties, to identify disease genes using graphlet interaction. The genes with higher scores were more likely to be disease genes, and all candidate genes were ranked according to their scores. Then the approach was evaluated by leave-one-out cross-validation. The precision of the current approach achieved 90% at about 10% recall, which was apparently higher than the previous three predominant algorithms, random walk, Endeavour and neighborhood based method. Finally, the approach was applied to predict new disease genes related to 4 common diseases, most of which were identified by other independent experimental researches. In conclusion, we demonstrate that the graphlet interaction is an effective tool to analyze the network properties of disease genes, and the scores calculated by graphlet interaction is more precise in identifying disease genes. PMID:24465923

  15. A review for detecting gene-gene interactions using machine learning methods in genetic epidemiology.

    Science.gov (United States)

    Koo, Ching Lee; Liew, Mei Jing; Mohamad, Mohd Saberi; Salleh, Abdul Hakim Mohamed

    2013-01-01

    Recently, the greatest statistical computational challenge in genetic epidemiology is to identify and characterize the genes that interact with other genes and environment factors that bring the effect on complex multifactorial disease. These gene-gene interactions are also denoted as epitasis in which this phenomenon cannot be solved by traditional statistical method due to the high dimensionality of the data and the occurrence of multiple polymorphism. Hence, there are several machine learning methods to solve such problems by identifying such susceptibility gene which are neural networks (NNs), support vector machine (SVM), and random forests (RFs) in such common and multifactorial disease. This paper gives an overview on machine learning methods, describing the methodology of each machine learning methods and its application in detecting gene-gene and gene-environment interactions. Lastly, this paper discussed each machine learning method and presents the strengths and weaknesses of each machine learning method in detecting gene-gene interactions in complex human disease.

  16. A Review for Detecting Gene-Gene Interactions Using Machine Learning Methods in Genetic Epidemiology

    Directory of Open Access Journals (Sweden)

    Ching Lee Koo

    2013-01-01

    Full Text Available Recently, the greatest statistical computational challenge in genetic epidemiology is to identify and characterize the genes that interact with other genes and environment factors that bring the effect on complex multifactorial disease. These gene-gene interactions are also denoted as epitasis in which this phenomenon cannot be solved by traditional statistical method due to the high dimensionality of the data and the occurrence of multiple polymorphism. Hence, there are several machine learning methods to solve such problems by identifying such susceptibility gene which are neural networks (NNs, support vector machine (SVM, and random forests (RFs in such common and multifactorial disease. This paper gives an overview on machine learning methods, describing the methodology of each machine learning methods and its application in detecting gene-gene and gene-environment interactions. Lastly, this paper discussed each machine learning method and presents the strengths and weaknesses of each machine learning method in detecting gene-gene interactions in complex human disease.

  17. The role of gene-gene interaction in the prediction of criminal behavior.

    Science.gov (United States)

    Boutwell, Brian B; Menard, Scott; Barnes, J C; Beaver, Kevin M; Armstrong, Todd A; Boisvert, Danielle

    2014-04-01

    A host of research has examined the possibility that environmental risk factors might condition the influence of genes on various outcomes. Less research, however, has been aimed at exploring the possibility that genetic factors might interact to impact the emergence of human traits. Even fewer studies exist examining the interaction of genes in the prediction of behavioral outcomes. The current study expands this body of research by testing the interaction between genes involved in neural transmission. Our findings suggest that certain dopamine genes interact to increase the odds of criminogenic outcomes in a national sample of Americans. Copyright © 2014 Elsevier Inc. All rights reserved.

  18. A powerful score-based test statistic for detecting gene-gene co-association.

    Science.gov (United States)

    Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

    2016-01-29

    The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.

  19. Behavioral science and the study of gene-nutrition and gene-physical activity interactions in obesity research.

    Science.gov (United States)

    Faith, Myles S

    2008-12-01

    This report summarizes emerging opportunities for behavioral science to help advance the field of gene-environment and gene-behavior interactions, based on presentations at The National Cancer Institute (NCI) Workshop, "Gene-Nutrition and Gene-Physical Activity Interactions in the Etiology of Obesity." Three opportunities are highlighted: (i) designing potent behavioral "challenges" in experiments, (ii) determining viable behavioral phenotypes for genetics studies, and (iii) identifying specific measures of the environment or environmental exposures. Additional points are underscored, including the need to incorporate novel findings from neuroimaging studies regarding motivation and drive for eating and physical activity. Advances in behavioral science theory and methods can play an important role in advancing understanding of gene-brain-behavior relationships in obesity onset.

  20. Screening for interaction effects in gene expression data.

    Directory of Open Access Journals (Sweden)

    Peter J Castaldi

    Full Text Available Expression quantitative trait (eQTL studies are a powerful tool for identifying genetic variants that affect levels of messenger RNA. Since gene expression is controlled by a complex network of gene-regulating factors, one way to identify these factors is to search for interaction effects between genetic variants and mRNA levels of transcription factors (TFs and their respective target genes. However, identification of interaction effects in gene expression data pose a variety of methodological challenges, and it has become clear that such analyses should be conducted and interpreted with caution. Investigating the validity and interpretability of several interaction tests when screening for eQTL SNPs whose effect on the target gene expression is modified by the expression level of a transcription factor, we characterized two important methodological issues. First, we stress the scale-dependency of interaction effects and highlight that commonly applied transformation of gene expression data can induce or remove interactions, making interpretation of results more challenging. We then demonstrate that, in the setting of moderate to strong interaction effects on the order of what may be reasonably expected for eQTL studies, standard interaction screening can be biased due to heteroscedasticity induced by true interactions. Using simulation and real data analysis, we outline a set of reasonable minimum conditions and sample size requirements for reliable detection of variant-by-environment and variant-by-TF interactions using the heteroscedasticity consistent covariance-based approach.

  1. Gene Environment Interactions and Predictors of Colorectal Cancer in Family-Based, Multi-Ethnic Groups.

    Science.gov (United States)

    Shiao, S Pamela K; Grayson, James; Yu, Chong Ho; Wasek, Brandi; Bottiglieri, Teodoro

    2018-02-16

    For the personalization of polygenic/omics-based health care, the purpose of this study was to examine the gene-environment interactions and predictors of colorectal cancer (CRC) by including five key genes in the one-carbon metabolism pathways. In this proof-of-concept study, we included a total of 54 families and 108 participants, 54 CRC cases and 54 matched family friends representing four major racial ethnic groups in southern California (White, Asian, Hispanics, and Black). We used three phases of data analytics, including exploratory, family-based analyses adjusting for the dependence within the family for sharing genetic heritage, the ensemble method, and generalized regression models for predictive modeling with a machine learning validation procedure to validate the results for enhanced prediction and reproducibility. The results revealed that despite the family members sharing genetic heritage, the CRC group had greater combined gene polymorphism rates than the family controls ( p relation to gene-environment interactions in the prevention of CRC.

  2. Environmental confounding in gene-environment interaction studies.

    Science.gov (United States)

    Vanderweele, Tyler J; Ko, Yi-An; Mukherjee, Bhramar

    2013-07-01

    We show that, in the presence of uncontrolled environmental confounding, joint tests for the presence of a main genetic effect and gene-environment interaction will be biased if the genetic and environmental factors are correlated, even if there is no effect of either the genetic factor or the environmental factor on the disease. When environmental confounding is ignored, such tests will in fact reject the joint null of no genetic effect with a probability that tends to 1 as the sample size increases. This problem with the joint test vanishes under gene-environment independence, but it still persists if estimating the gene-environment interaction parameter itself is of interest. Uncontrolled environmental confounding will bias estimates of gene-environment interaction parameters even under gene-environment independence, but it will not do so if the unmeasured confounding variable itself does not interact with the genetic factor. Under gene-environment independence, if the interaction parameter without controlling for the environmental confounder is nonzero, then there is gene-environment interaction either between the genetic factor and the environmental factor of interest or between the genetic factor and the unmeasured environmental confounder. We evaluate several recently proposed joint tests in a simulation study and discuss the implications of these results for the conduct of gene-environment interaction studies.

  3. HSD3B and gene-gene interactions in a pathway-based analysis of genetic susceptibility to bladder cancer.

    Directory of Open Access Journals (Sweden)

    Angeline S Andrew

    Full Text Available Bladder cancer is the 4(th most common cancer among men in the U.S. We analyzed variant genotypes hypothesized to modify major biological processes involved in bladder carcinogenesis, including hormone regulation, apoptosis, DNA repair, immune surveillance, metabolism, proliferation, and telomere maintenance. Logistic regression was used to assess the relationship between genetic variation affecting these processes and susceptibility in 563 genotyped urothelial cell carcinoma cases and 863 controls enrolled in a case-control study of incident bladder cancer conducted in New Hampshire, U.S. We evaluated gene-gene interactions using Multifactor Dimensionality Reduction (MDR and Statistical Epistasis Network analysis. The 3'UTR flanking variant form of the hormone regulation gene HSD3B2 was associated with increased bladder cancer risk in the New Hampshire population (adjusted OR 1.85 95%CI 1.31-2.62. This finding was successfully replicated in the Texas Bladder Cancer Study with 957 controls, 497 cases (adjusted OR 3.66 95%CI 1.06-12.63. The effect of this prevalent SNP was stronger among males (OR 2.13 95%CI 1.40-3.25 than females (OR 1.56 95%CI 0.83-2.95, (SNP-gender interaction P = 0.048. We also identified a SNP-SNP interaction between T-cell activation related genes GATA3 and CD81 (interaction P = 0.0003. The fact that bladder cancer incidence is 3-4 times higher in males suggests the involvement of hormone levels. This biologic process-based analysis suggests candidate susceptibility markers and supports the theory that disrupted hormone regulation plays a role in bladder carcinogenesis.

  4. The Cumulative Effect of Gene-Gene and Gene-Environment Interactions on the Risk of Prostate Cancer in Chinese Men

    Directory of Open Access Journals (Sweden)

    Ming Liu

    2016-01-01

    Full Text Available Prostate cancer (PCa is a multifactorial disease involving complex genetic and environmental factors interactions. Gene-gene and gene-environment interactions associated with PCa in Chinese men are less studied. We explored the association between 36 SNPs and PCa in 574 subjects from northern China. Body mass index (BMI, smoking, and alcohol consumption were determined through self-administered questionnaires in 134 PCa patients. Then gene-gene and gene-environment interactions among the PCa-associated SNPs were analyzed using the generalized multifactor dimensionality reduction (GMDR and logistic regression methods. Allelic and genotypic association analyses showed that six variants were associated with PCa and the cumulative effect suggested men who carried any combination of 1, 2, or ≥3 risk genotypes had a gradually increased PCa risk (odds ratios (ORs = 1.79–4.41. GMDR analysis identified the best gene-gene interaction model with scores of 10 for both the cross-validation consistency and sign tests. For gene-environment interactions, rs6983561 CC and rs16901966 GG in individuals with a BMI ≥ 28 had ORs of 7.66 (p = 0.032 and 5.33 (p = 0.046, respectively. rs7679673 CC + CA and rs12653946 TT in individuals that smoked had ORs of 2.77 (p = 0.007 and 3.11 (p = 0.024, respectively. rs7679673 CC in individuals that consumed alcohol had an OR of 4.37 (p = 0.041. These results suggest that polymorphisms, either individually or by interacting with other genes or environmental factors, contribute to an increased risk of PCa.

  5. ReliefSeq: a gene-wise adaptive-K nearest-neighbor feature selection tool for finding gene-gene interactions and main effects in mRNA-Seq gene expression data.

    Directory of Open Access Journals (Sweden)

    Brett A McKinney

    Full Text Available Relief-F is a nonparametric, nearest-neighbor machine learning method that has been successfully used to identify relevant variables that may interact in complex multivariate models to explain phenotypic variation. While several tools have been developed for assessing differential expression in sequence-based transcriptomics, the detection of statistical interactions between transcripts has received less attention in the area of RNA-seq analysis. We describe a new extension and assessment of Relief-F for feature selection in RNA-seq data. The ReliefSeq implementation adapts the number of nearest neighbors (k for each gene to optimize the Relief-F test statistics (importance scores for finding both main effects and interactions. We compare this gene-wise adaptive-k (gwak Relief-F method with standard RNA-seq feature selection tools, such as DESeq and edgeR, and with the popular machine learning method Random Forests. We demonstrate performance on a panel of simulated data that have a range of distributional properties reflected in real mRNA-seq data including multiple transcripts with varying sizes of main effects and interaction effects. For simulated main effects, gwak-Relief-F feature selection performs comparably to standard tools DESeq and edgeR for ranking relevant transcripts. For gene-gene interactions, gwak-Relief-F outperforms all comparison methods at ranking relevant genes in all but the highest fold change/highest signal situations where it performs similarly. The gwak-Relief-F algorithm outperforms Random Forests for detecting relevant genes in all simulation experiments. In addition, Relief-F is comparable to the other methods based on computational time. We also apply ReliefSeq to an RNA-Seq study of smallpox vaccine to identify gene expression changes between vaccinia virus-stimulated and unstimulated samples. ReliefSeq is an attractive tool for inclusion in the suite of tools used for analysis of mRNA-Seq data; it has power to

  6. Blood lead levels, iron metabolism gene polymorphisms and homocysteine: a gene-environment interaction study.

    Science.gov (United States)

    Kim, Kyoung-Nam; Lee, Mee-Ri; Lim, Youn-Hee; Hong, Yun-Chul

    2017-12-01

    Homocysteine has been causally associated with various adverse health outcomes. Evidence supporting the relationship between lead and homocysteine levels has been accumulating, but most prior studies have not focused on the interaction with genetic polymorphisms. From a community-based prospective cohort, we analysed 386 participants (aged 41-71 years) with information regarding blood lead and plasma homocysteine levels. Blood lead levels were measured between 2001 and 2003, and plasma homocysteine levels were measured in 2007. Interactions of lead levels with 42 genotyped single-nucleotide polymorphisms (SNPs) in five genes ( TF , HFE , CBS , BHMT and MTR ) were assessed via a 2-degree of freedom (df) joint test and a 1-df interaction test. In secondary analyses using imputation, we further assessed 58 imputed SNPs in the TF and MTHFR genes. Blood lead concentrations were positively associated with plasma homocysteine levels (p=0.0276). Six SNPs in the TF and MTR genes were screened using the 2-df joint test, and among them, three SNPs in the TF gene showed interactions with lead with respect to homocysteine levels through the 1-df interaction test (plead levels. Blood lead levels were positively associated with plasma homocysteine levels measured 4-6 years later, and three SNPs in the TF gene modified the association. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  7. Systematic Search for Gene-Gene Interaction Effect on Prostate Cancer Risk

    Science.gov (United States)

    2013-07-01

    Systematic Search for Gene-Gene Interaction 5a. CONTRACT NUMBER Effect on Prostate Cancer Risk 5b. GRANT NUMBER W81XWH-09-1-0488 5c. PROGRAM...Supported by this grant ) 1. Tao S, Wang Z, Feng J, Hsu FC, Jin G, Kin ST, Zhang Z, Gronberg H, Zheng, SL, Isaacs WB, XU J, Sun J. A Genome-Wide Search for...order interactions among estrogen- metabolism genes in sporadic breast cancer. Am J Hum Genet, 69, 138-47. 48. Marchini, J., Donnelly, P. and Cardon

  8. A model of gene-gene and gene-environment interactions and its implications for targeting environmental interventions by genotype

    Directory of Open Access Journals (Sweden)

    Wallace Helen M

    2006-10-01

    Full Text Available Abstract Background The potential public health benefits of targeting environmental interventions by genotype depend on the environmental and genetic contributions to the variance of common diseases, and the magnitude of any gene-environment interaction. In the absence of prior knowledge of all risk factors, twin, family and environmental data may help to define the potential limits of these benefits in a given population. However, a general methodology to analyze twin data is required because of the potential importance of gene-gene interactions (epistasis, gene-environment interactions, and conditions that break the 'equal environments' assumption for monozygotic and dizygotic twins. Method A new model for gene-gene and gene-environment interactions is developed that abandons the assumptions of the classical twin study, including Fisher's (1918 assumption that genes act as risk factors for common traits in a manner necessarily dominated by an additive polygenic term. Provided there are no confounders, the model can be used to implement a top-down approach to quantifying the potential utility of genetic prediction and prevention, using twin, family and environmental data. The results describe a solution space for each disease or trait, which may or may not include the classical twin study result. Each point in the solution space corresponds to a different model of genotypic risk and gene-environment interaction. Conclusion The results show that the potential for reducing the incidence of common diseases using environmental interventions targeted by genotype may be limited, except in special cases. The model also confirms that the importance of an individual's genotype in determining their risk of complex diseases tends to be exaggerated by the classical twin studies method, owing to the 'equal environments' assumption and the assumption of no gene-environment interaction. In addition, if phenotypes are genetically robust, because of epistasis

  9. [Gene-gene interaction on central obesity in school-aged children in China].

    Science.gov (United States)

    Fu, L W; Zhang, M X; Wu, L J; Gao, L W; Mi, J

    2017-07-10

    Objective: To investigate possible effect of 6 obesity-associated SNPs in contribution to central obesity and examine whether there is an interaction in the 6 SNPs in the cause of central obesity in school-aged children in China. Methods: A total of 3 502 school-aged children who were included in Beijing Child and Adolescent Metabolic Syndrome (BCAMS) Study were selected, and based on the age and sex specific waist circumference (WC) standards in the BCAMS study, 1 196 central obese cases and 2 306 controls were identified. Genomic DNA was extracted from peripheral blood white cells using the salt fractionation method. A total of 6 single nucleotide polymorphisms ( FTO rs9939609, MC4R rs17782313, BDNF rs6265, PCSK1 rs6235, SH2B1 rs4788102, and CSK rs1378942) were genotyped by TaqMan allelic discrimination assays with the GeneAmp 7900 sequence detection system (Applied Biosystems, Foster City, CA, USA). Logistic regression model was used to investigate the association between 6 SNPs and central obesity. Gene-gene interactions among 6 polymorphic loci were analyzed by using the Generalized Multifactor Dimensionality Reduction (GMDR) method, and then logistic regression model was constructed to confirm the best combination of loci identified in the GMDR. Results: After adjusting gender, age, Tanner stage, physical activity and family history of obesity, the FTO rs9939609-A, MC4R rs17782313-C and BDNF rs6265-G alleles were associated with central obesity under additive genetic model ( OR =1.24, 95 %CI : 1.06-1.45, P =0.008; OR =1.26, 95 %CI : 1.11-1.43, P =2.98×10(-4); OR =1.18, 95 % CI : 1.06-1.32, P =0.003). GMDR analysis showed a significant gene-gene interaction between MC4R rs17782313 and BDNF rs6265 ( P =0.001). The best two-locus combination showed the cross-validation consistency of 10/10 and testing accuracy of 0.539. This interaction showed the maximum consistency and minimum prediction error among all gene-gene interaction models evaluated. Moreover, the

  10. A PLSPM-Based Test Statistic for Detecting Gene-Gene Co-Association in Genome-Wide Association Study with Case-Control Design

    Science.gov (United States)

    Zhang, Xiaoshuai; Yang, Xiaowei; Yuan, Zhongshang; Liu, Yanxun; Li, Fangyu; Peng, Bin; Zhu, Dianwen; Zhao, Jinghua; Xue, Fuzhong

    2013-01-01

    For genome-wide association data analysis, two genes in any pathway, two SNPs in the two linked gene regions respectively or in the two linked exons respectively within one gene are often correlated with each other. We therefore proposed the concept of gene-gene co-association, which refers to the effects not only due to the traditional interaction under nearly independent condition but the correlation between two genes. Furthermore, we constructed a novel statistic for detecting gene-gene co-association based on Partial Least Squares Path Modeling (PLSPM). Through simulation, the relationship between traditional interaction and co-association was highlighted under three different types of co-association. Both simulation and real data analysis demonstrated that the proposed PLSPM-based statistic has better performance than single SNP-based logistic model, PCA-based logistic model, and other gene-based methods. PMID:23620809

  11. A combination test for detection of gene-environment interaction in cohort studies.

    Science.gov (United States)

    Coombes, Brandon; Basu, Saonli; McGue, Matt

    2017-07-01

    Identifying gene-environment (G-E) interactions can contribute to a better understanding of disease etiology, which may help researchers develop disease prevention strategies and interventions. One big criticism of studying G-E interaction is the lack of power due to sample size. Studies often restrict the interaction search to the top few hundred hits from a genome-wide association study or focus on potential candidate genes. In this paper, we test interactions between a candidate gene and an environmental factor to improve power by analyzing multiple variants within a gene. We extend recently developed score statistic based genetic association testing approaches to the G-E interaction testing problem. We also propose tests for interaction using gene-based summary measures that pool variants together. Although it has recently been shown that these summary measures can be biased and may lead to inflated type I error, we show that under several realistic scenarios, we can still provide valid tests of interaction. These tests use significantly less degrees of freedom and thus can have much higher power to detect interaction. Additionally, we demonstrate that the iSeq-aSum-min test, which combines a gene-based summary measure test, iSeq-aSum-G, and an interaction-based summary measure test, iSeq-aSum-I, provides a powerful alternative to test G-E interaction. We demonstrate the performance of these approaches using simulation studies and illustrate their performance to study interaction between the SNPs in several candidate genes and family climate environment on alcohol consumption using the Minnesota Center for Twin and Family Research dataset. © 2017 WILEY PERIODICALS, INC.

  12. Inverse gene-for-gene interactions contribute additively to tan spot susceptibility in wheat.

    Science.gov (United States)

    Liu, Zhaohui; Zurn, Jason D; Kariyawasam, Gayan; Faris, Justin D; Shi, Gongjun; Hansen, Jana; Rasmussen, Jack B; Acevedo, Maricelis

    2017-06-01

    Tan spot susceptibility is conferred by multiple interactions of necrotrophic effector and host sensitivity genes. Tan spot of wheat, caused by Pyrenophora tritici-repentis, is an important disease in almost all wheat-growing areas of the world. The disease system is known to involve at least three fungal-produced necrotrophic effectors (NEs) that interact with the corresponding host sensitivity (S) genes in an inverse gene-for-gene manner to induce disease. However, it is unknown if the effects of these NE-S gene interactions contribute additively to the development of tan spot. In this work, we conducted disease evaluations using different races and quantitative trait loci (QTL) analysis in a wheat recombinant inbred line (RIL) population derived from a cross between two susceptible genotypes, LMPG-6 and PI 626573. The two parental lines each harbored a single known NE sensitivity gene with LMPG-6 having the Ptr ToxC sensitivity gene Tsc1 and PI 626573 having the Ptr ToxA sensitivity gene Tsn1. Transgressive segregation was observed in the population for all races. QTL mapping revealed that both loci (Tsn1 and Tsc1) were significantly associated with susceptibility to race 1 isolates, which produce both Ptr ToxA and Ptr ToxC, and the two genes contributed additively to tan spot susceptibility. For isolates of races 2 and 3, which produce only Ptr ToxA and Ptr ToxC, only Tsn1 and Tsc1 were associated with tan spot susceptibility, respectively. This work clearly demonstrates that tan spot susceptibility in this population is due primarily to two NE-S interactions. Breeders should remove both sensitivity genes from wheat lines to obtain high levels of tan spot resistance.

  13. An Interactive Database of Cocaine-Responsive Gene Expression

    Directory of Open Access Journals (Sweden)

    Willard M. Freeman

    2002-01-01

    Full Text Available The postgenomic era of large-scale gene expression studies is inundating drug abuse researchers and many other scientists with findings related to gene expression. This information is distributed across many different journals, and requires laborious literature searches. Here, we present an interactive database that combines existing information related to cocaine-mediated changes in gene expression in an easy-to-use format. The database is limited to statistically significant changes in mRNA or protein expression after cocaine administration. The Flash-based program is integrated into a Web page, and organizes changes in gene expression based on neuroanatomical region, general function, and gene name. Accompanying each gene is a description of the gene, links to the original publications, and a link to the appropriate OMIM (Online Mendelian Inheritance in Man entry. The nature of this review allows for timely modifications and rapid inclusion of new publications, and should help researchers build second-generation hypotheses on the role of gene expression changes in the physiology and behavior of cocaine abuse. Furthermore, this method of organizing large volumes of scientific information can easily be adapted to assist researchers in fields outside of drug abuse.

  14. Association testing to detect gene-gene interactions on sex chromosomes in trio data

    Directory of Open Access Journals (Sweden)

    Yeonok eLee

    2013-11-01

    Full Text Available Autism Spectrum Disorder (ASD occurs more often among males than females in a 4:1 ratio. Among theories used to explain the causes of ASD, the X chromosome and the Y chromosome theories attribute ASD to X-linked mutation and the male-limited gene expressions on the Y chromosome, respectively. Despite the rationale of the theory, studies have failed to attribute the sex-biased ratio to the significant linkage or association on the regions of interest on X chromosome. We further study the gender biased ratio by examining the possible interaction effects between two genes in the sex chromosomes. We propose a logistic regression model with mixed effects to detect gene-gene interactions on sex chromosomes. We investigated the power and type I error rates of the approach for a range of minor allele frequencies and varying linkage disequilibrium between markers and QTLs. We also evaluated the robustness of the model to population stratification. We applied the model to a trio-family data set with an ASD affected male child to study gene-gene interactions on sex chromosomes.

  15. Gene Environment Interactions and Predictors of Colorectal Cancer in Family-Based, Multi-Ethnic Groups

    Directory of Open Access Journals (Sweden)

    S. Pamela K. Shiao

    2018-02-01

    Full Text Available For the personalization of polygenic/omics-based health care, the purpose of this study was to examine the gene–environment interactions and predictors of colorectal cancer (CRC by including five key genes in the one-carbon metabolism pathways. In this proof-of-concept study, we included a total of 54 families and 108 participants, 54 CRC cases and 54 matched family friends representing four major racial ethnic groups in southern California (White, Asian, Hispanics, and Black. We used three phases of data analytics, including exploratory, family-based analyses adjusting for the dependence within the family for sharing genetic heritage, the ensemble method, and generalized regression models for predictive modeling with a machine learning validation procedure to validate the results for enhanced prediction and reproducibility. The results revealed that despite the family members sharing genetic heritage, the CRC group had greater combined gene polymorphism rates than the family controls (p < 0.05, on MTHFR C677T, MTR A2756G, MTRR A66G, and DHFR 19 bp except MTHFR A1298C. Four racial groups presented different polymorphism rates for four genes (all p < 0.05 except MTHFR A1298C. Following the ensemble method, the most influential factors were identified, and the best predictive models were generated by using the generalized regression models, with Akaike’s information criterion and leave-one-out cross validation methods. Body mass index (BMI and gender were consistent predictors of CRC for both models when individual genes versus total polymorphism counts were used, and alcohol use was interactive with BMI status. Body mass index status was also interactive with both gender and MTHFR C677T gene polymorphism, and the exposure to environmental pollutants was an additional predictor. These results point to the important roles of environmental and modifiable factors in relation to gene–environment interactions in the prevention of CRC.

  16. Leveraging gene-environment interactions and endotypes for asthma gene discovery

    DEFF Research Database (Denmark)

    Bønnelykke, Klaus; Ober, Carole

    2016-01-01

    , such as childhood asthma with severe exacerbations, and on relevant exposures that are involved in gene-environment interactions (GEIs), such as rhinovirus infections, will improve detection of asthma genes and our understanding of the underlying mechanisms. We will discuss the challenges of considering GEIs......Asthma is a heterogeneous clinical syndrome that includes subtypes of disease with different underlying causes and disease mechanisms. Asthma is caused by a complex interaction between genes and environmental exposures; early-life exposures in particular play an important role. Asthma is also...... heritable, and a number of susceptibility variants have been discovered in genome-wide association studies, although the known risk alleles explain only a small proportion of the heritability. In this review, we present evidence supporting the hypothesis that focusing on more specific asthma phenotypes...

  17. Predictability of Genetic Interactions from Functional Gene Modules

    Directory of Open Access Journals (Sweden)

    Jonathan H. Young

    2017-02-01

    Full Text Available Characterizing genetic interactions is crucial to understanding cellular and organismal response to gene-level perturbations. Such knowledge can inform the selection of candidate disease therapy targets, yet experimentally determining whether genes interact is technically nontrivial and time-consuming. High-fidelity prediction of different classes of genetic interactions in multiple organisms would substantially alleviate this experimental burden. Under the hypothesis that functionally related genes tend to share common genetic interaction partners, we evaluate a computational approach to predict genetic interactions in Homo sapiens, Drosophila melanogaster, and Saccharomyces cerevisiae. By leveraging knowledge of functional relationships between genes, we cross-validate predictions on known genetic interactions and observe high predictive power of multiple classes of genetic interactions in all three organisms. Additionally, our method suggests high-confidence candidate interaction pairs that can be directly experimentally tested. A web application is provided for users to query genes for predicted novel genetic interaction partners. Finally, by subsampling the known yeast genetic interaction network, we found that novel genetic interactions are predictable even when knowledge of currently known interactions is minimal.

  18. A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer

    Directory of Open Access Journals (Sweden)

    Mary Qu Yang

    Full Text Available Clear cell renal cell carcinoma (ccRCC is the most common and most aggressive form of renal cell cancer (RCC. The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1, as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways. Keywords: ccRCC, Causative mutation, Pathways, Protein-protein interaction, Gene module, eQTL

  19. Gene by Environment Interaction and Resilience: Effects of Child Maltreatment and Serotonin, Corticotropin Releasing Hormone, Dopamine, and Oxytocin Genes

    Science.gov (United States)

    Cicchetti, Dante; Rogosch, Fred A.

    2013-01-01

    In this investigation, gene-environment interaction effects in predicting resilience in adaptive functioning among maltreated and nonmaltreated low-income children (N = 595) were examined. A multi-component index of resilient functioning was derived and levels of resilient functioning were identified. Variants in four genes, 5-HTTLPR, CRHR1, DRD4 -521C/T, and OXTR, were investigated. In a series of ANCOVAs, child maltreatment demonstrated a strong negative main effect on children’s resilient functioning, whereas no main effects for any of the genotypes of the respective genes were found. However, gene-environment interactions involving genotypes of each of the respective genes and maltreatment status were obtained. For each respective gene, among children with a specific genotype, the relative advantage in resilient functioning of nonmaltreated compared to maltreated children was stronger than was the case for nonmaltreated and maltreated children with other genotypes of the respective gene. Across the four genes, a composite of the genotypes that more strongly differentiated resilient functioning between nonmaltreated and maltreated children provided further evidence of genetic variations influencing resilient functioning in nonmaltreated children, whereas genetic variation had a negligible effect on promoting resilience among maltreated children. Additional effects were observed for children based on the number of subtypes of maltreatment children experienced, as well as for abuse and neglect subgroups. Finally, maltreated and nonmaltreated children with high levels of resilience differed in their average number of differentiating genotypes. These results suggest that differential resilient outcomes are based on the interaction between genes and developmental experiences. PMID:22559122

  20. Large-scale extraction of gene interactions from full-text literature using DeepDive.

    Science.gov (United States)

    Mallory, Emily K; Zhang, Ce; Ré, Christopher; Altman, Russ B

    2016-01-01

    A complete repository of gene-gene interactions is key for understanding cellular processes, human disease and drug response. These gene-gene interactions include both protein-protein interactions and transcription factor interactions. The majority of known interactions are found in the biomedical literature. Interaction databases, such as BioGRID and ChEA, annotate these gene-gene interactions; however, curation becomes difficult as the literature grows exponentially. DeepDive is a trained system for extracting information from a variety of sources, including text. In this work, we used DeepDive to extract both protein-protein and transcription factor interactions from over 100,000 full-text PLOS articles. We built an extractor for gene-gene interactions that identified candidate gene-gene relations within an input sentence. For each candidate relation, DeepDive computed a probability that the relation was a correct interaction. We evaluated this system against the Database of Interacting Proteins and against randomly curated extractions. Our system achieved 76% precision and 49% recall in extracting direct and indirect interactions involving gene symbols co-occurring in a sentence. For randomly curated extractions, the system achieved between 62% and 83% precision based on direct or indirect interactions, as well as sentence-level and document-level precision. Overall, our system extracted 3356 unique gene pairs using 724 features from over 100,000 full-text articles. Application source code is publicly available at https://github.com/edoughty/deepdive_genegene_app russ.altman@stanford.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  1. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  2. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature

    Directory of Open Access Journals (Sweden)

    Ning Ye

    2015-01-01

    Full Text Available The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  3. The SKP1-like gene family of Arabidopsis exhibits a high degree of differential gene expression and gene product interaction during development.

    Directory of Open Access Journals (Sweden)

    Mohammad H Dezfulian

    Full Text Available The Arabidopsis thaliana genome encodes several families of polypeptides that are known or predicted to participate in the formation of the SCF-class of E3-ubiquitin ligase complexes. One such gene family encodes the Skp1-like class of polypeptide subunits, where 21 genes have been identified and are known to be expressed in Arabidopsis. Phylogenetic analysis based on deduced polypeptide sequence organizes the family of ASK proteins into 7 clades. The complexity of the ASK gene family, together with the close structural similarity among its members raises the prospect of significant functional redundancy among select paralogs. We have assessed the potential for functional redundancy within the ASK gene family by analyzing an expanded set of criteria that define redundancy with higher resolution. The criteria used include quantitative expression of locus-specific transcripts using qRT-PCR, assessment of the sub-cellular localization of individual ASK:YFP auto-fluorescent fusion proteins expressed in vivo as well as the in planta assessment of individual ASK-F-Box protein interactions using bimolecular fluorescent complementation techniques in combination with confocal imagery in live cells. The results indicate significant functional divergence of steady state transcript abundance and protein-protein interaction specificity involving ASK proteins in a pattern that is poorly predicted by sequence-based phylogeny. The information emerging from this and related studies will prove important for defining the functional intersection of expression, localization and gene product interaction that better predicts the formation of discrete SCF complexes, as a prelude to investigating their molecular mode of action.

  4. Novel interactions between vertebrate Hox genes

    NARCIS (Netherlands)

    Hooiveld, MHW; Morgan, R; Rieden, PID; Houtzager, E; Pannese, M; Damen, K; Boncinelli, E; Durston, AJ

    1999-01-01

    Understanding why metazoan Hox/HOM-C genes are expressed in spatiotemporal sequences showing colinearity with their genomic sequence is a central challenge in developmental biology. Here, we studied the consequences of ectopically expressing Hox genes to investigate whether Hox-Hox interactions

  5. Prioritization of gene regulatory interactions from large-scale modules in yeast

    Directory of Open Access Journals (Sweden)

    Bringas Ricardo

    2008-01-01

    Full Text Available Abstract Background The identification of groups of co-regulated genes and their transcription factors, called transcriptional modules, has been a focus of many studies about biological systems. While methods have been developed to derive numerous modules from genome-wide data, individual links between regulatory proteins and target genes still need experimental verification. In this work, we aim to prioritize regulator-target links within transcriptional modules based on three types of large-scale data sources. Results Starting with putative transcriptional modules from ChIP-chip data, we first derive modules in which target genes show both expression and function coherence. The most reliable regulatory links between transcription factors and target genes are established by identifying intersection of target genes in coherent modules for each enriched functional category. Using a combination of genome-wide yeast data in normal growth conditions and two different reference datasets, we show that our method predicts regulatory interactions with significantly higher predictive power than ChIP-chip binding data alone. A comparison with results from other studies highlights that our approach provides a reliable and complementary set of regulatory interactions. Based on our results, we can also identify functionally interacting target genes, for instance, a group of co-regulated proteins related to cell wall synthesis. Furthermore, we report novel conserved binding sites of a glycoprotein-encoding gene, CIS3, regulated by Swi6-Swi4 and Ndd1-Fkh2-Mcm1 complexes. Conclusion We provide a simple method to prioritize individual TF-gene interactions from large-scale transcriptional modules. In comparison with other published works, we predict a complementary set of regulatory interactions which yields a similar or higher prediction accuracy at the expense of sensitivity. Therefore, our method can serve as an alternative approach to prioritization for

  6. Disease candidate gene identification and prioritization using protein interaction networks

    Directory of Open Access Journals (Sweden)

    Aronow Bruce J

    2009-02-01

    Full Text Available Abstract Background Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN analyses. Results For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds", and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. Conclusion Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.

  7. Gene-Environment Interactions in Severe Mental Illness

    Directory of Open Access Journals (Sweden)

    Rudolf eUher

    2014-05-01

    Full Text Available Severe mental illness is a broad category that includes schizophrenia, bipolar disorder and severe depression. Both genetic disposition and environmental exposures play important roles in the development of severe mental illness. Multiple lines of evidence suggest that the roles of genetic and environmental depend on each other. Gene-environment interactions may underlie the paradox of strong environmental factors for highly heritable disorders, the low estimates of shared environmental influences in twin studies of severe mental illness and the heritability gap between twin and molecular heritability estimates. Sons and daughters of parents with severe mental illness are more vulnerable to the effects of prenatal and postnatal environmental exposures, suggesting that the expression of genetic liability depends on environment. In the last decade, gene-environment interactions involving specific molecular variants in candidate genes have been identified. Replicated findings include an interaction between a polymorphism in the AKT1 gene and cannabis use in the development of psychosis and an interaction between the length polymorphism of the serotonin transporter gene and childhood maltreatment in the development of persistent depressive disorder. Bipolar disorder has been underinvestigated, with only a single study showing an interaction between a functional polymorphism in BDNF and stressful life events triggering bipolar depressive episodes. The first systematic search for gene-environment interactions has found that a polymorphism in CTNNA3 may sensitise the developing brain to the pathogenic effect of cytomegalovirus in utero, leading to schizophrenia in adulthood. Strategies for genome-wide investigations will likely include coordination between epidemiological and genetic research efforts, systematic assessment of multiple environmental factors in large samples, and prioritization of genetic variants.

  8. You've gotta be lucky: Coverage and the elusive gene-gene interaction.

    Science.gov (United States)

    Reimherr, Matthew; Nicolae, Dan L

    2011-01-01

    Genome-wide association studies (GWAS) have led to a large number of single-SNP association findings, but there has been, so far, no investigation resulting in the discovery of a replicable gene-gene interaction. In this paper, we examine some of the possible explanations for the lack of findings, and argue that coverage of causal variation not only has a large effect on the loss in power, but that the effect is larger than in the single-SNP analyses. We show that the product of linkage disequilibrium measures, r², between causal and tested SNPs offers a good approximation to the loss in efficiency as defined by the ratio of sample sizes that lead to similar power. We also demonstrate that, in addition to the huge search space, the loss in power due to coverage when using commercially available platforms makes the search for gene-gene interactions daunting. © 2010 The Authors Annals of Human Genetics © 2010 Blackwell Publishing Ltd/University College London.

  9. Network Graph Analysis of Gene-Gene Interactions in Genome-Wide Association Study Data

    Directory of Open Access Journals (Sweden)

    Sungyoung Lee

    2012-12-01

    Full Text Available Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs. For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR is one of the powerful and efficient methods for detecting high-order gene-gene (GxG interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI. Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.

  10. Network graph analysis of gene-gene interactions in genome-wide association study data.

    Science.gov (United States)

    Lee, Sungyoung; Kwon, Min-Seok; Park, Taesung

    2012-12-01

    Most common complex traits, such as obesity, hypertension, diabetes, and cancers, are known to be associated with multiple genes, environmental factors, and their epistasis. Recently, the development of advanced genotyping technologies has allowed us to perform genome-wide association studies (GWASs). For detecting the effects of multiple genes on complex traits, many approaches have been proposed for GWASs. Multifactor dimensionality reduction (MDR) is one of the powerful and efficient methods for detecting high-order gene-gene (GxG) interactions. However, the biological interpretation of GxG interactions identified by MDR analysis is not easy. In order to aid the interpretation of MDR results, we propose a network graph analysis to elucidate the meaning of identified GxG interactions. The proposed network graph analysis consists of three steps. The first step is for performing GxG interaction analysis using MDR analysis. The second step is to draw the network graph using the MDR result. The third step is to provide biological evidence of the identified GxG interaction using external biological databases. The proposed method was applied to Korean Association Resource (KARE) data, containing 8838 individuals with 327,632 single-nucleotide polymorphisms, in order to perform GxG interaction analysis of body mass index (BMI). Our network graph analysis successfully showed that many identified GxG interactions have known biological evidence related to BMI. We expect that our network graph analysis will be helpful to interpret the biological meaning of GxG interactions.

  11. DGIdb 3.0: a redesign and expansion of the drug-gene interaction database.

    Science.gov (United States)

    Cotto, Kelsy C; Wagner, Alex H; Feng, Yang-Yang; Kiwala, Susanna; Coffman, Adam C; Spies, Gregory; Wollam, Alex; Spies, Nicholas C; Griffith, Obi L; Griffith, Malachi

    2018-01-04

    The drug-gene interaction database (DGIdb, www.dgidb.org) consolidates, organizes and presents drug-gene interactions and gene druggability information from papers, databases and web resources. DGIdb normalizes content from 30 disparate sources and allows for user-friendly advanced browsing, searching and filtering for ease of access through an intuitive web user interface, application programming interface (API) and public cloud-based server image. DGIdb v3.0 represents a major update of the database. Nine of the previously included 24 sources were updated. Six new resources were added, bringing the total number of sources to 30. These updates and additions of sources have cumulatively resulted in 56 309 interaction claims. This has also substantially expanded the comprehensive catalogue of druggable genes and anti-neoplastic drug-gene interactions included in the DGIdb. Along with these content updates, v3.0 has received a major overhaul of its codebase, including an updated user interface, preset interaction search filters, consolidation of interaction information into interaction groups, greatly improved search response times and upgrading the underlying web application framework. In addition, the expanded API features new endpoints which allow users to extract more detailed information about queried drugs, genes and drug-gene interactions, including listings of PubMed IDs, interaction type and other interaction metadata.

  12. A high-resolution gene expression atlas of epistasis between gene-specific transcription factors exposes potential mechanisms for genetic interactions.

    Science.gov (United States)

    Sameith, Katrin; Amini, Saman; Groot Koerkamp, Marian J A; van Leenen, Dik; Brok, Mariel; Brabers, Nathalie; Lijnzaad, Philip; van Hooff, Sander R; Benschop, Joris J; Lenstra, Tineke L; Apweiler, Eva; van Wageningen, Sake; Snel, Berend; Holstege, Frank C P; Kemmeren, Patrick

    2015-12-23

    Genetic interactions, or non-additive effects between genes, play a crucial role in many cellular processes and disease. Which mechanisms underlie these genetic interactions has hardly been characterized. Understanding the molecular basis of genetic interactions is crucial in deciphering pathway organization and understanding the relationship between genotype, phenotype and disease. To investigate the nature of genetic interactions between gene-specific transcription factors (GSTFs) in Saccharomyces cerevisiae, we systematically analyzed 72 GSTF pairs by gene expression profiling double and single deletion mutants. These pairs were selected through previously published growth-based genetic interactions as well as through similarity in DNA binding properties. The result is a high-resolution atlas of gene expression-based genetic interactions that provides systems-level insight into GSTF epistasis. The atlas confirms known genetic interactions and exposes new ones. Importantly, the data can be used to investigate mechanisms that underlie individual genetic interactions. Two molecular mechanisms are proposed, "buffering by induced dependency" and "alleviation by derepression". These mechanisms indicate how negative genetic interactions can occur between seemingly unrelated parallel pathways and how positive genetic interactions can indirectly expose parallel rather than same-pathway relationships. The focus on GSTFs is important for understanding the transcription regulatory network of yeast as it uncovers details behind many redundancy relationships, some of which are completely new. In addition, the study provides general insight into the complex nature of epistasis and proposes mechanistic models for genetic interactions, the majority of which do not fall into easily recognizable within- or between-pathway relationships.

  13. Artificial neural network inference (ANNI: a study on gene-gene interaction for biomarkers in childhood sarcomas.

    Directory of Open Access Journals (Sweden)

    Dong Ling Tong

    Full Text Available OBJECTIVE: To model the potential interaction between previously identified biomarkers in children sarcomas using artificial neural network inference (ANNI. METHOD: To concisely demonstrate the biological interactions between correlated genes in an interaction network map, only 2 types of sarcomas in the children small round blue cell tumors (SRBCTs dataset are discussed in this paper. A backpropagation neural network was used to model the potential interaction between genes. The prediction weights and signal directions were used to model the strengths of the interaction signals and the direction of the interaction link between genes. The ANN model was validated using Monte Carlo cross-validation to minimize the risk of over-fitting and to optimize generalization ability of the model. RESULTS: Strong connection links on certain genes (TNNT1 and FNDC5 in rhabdomyosarcoma (RMS; FCGRT and OLFM1 in Ewing's sarcoma (EWS suggested their potency as central hubs in the interconnection of genes with different functionalities. The results showed that the RMS patients in this dataset are likely to be congenital and at low risk of cardiomyopathy development. The EWS patients are likely to be complicated by EWS-FLI fusion and deficiency in various signaling pathways, including Wnt, Fas/Rho and intracellular oxygen. CONCLUSIONS: The ANN network inference approach and the examination of identified genes in the published literature within the context of the disease highlights the substantial influence of certain genes in sarcomas.

  14. Gene-Gene Interactions in the Folate Metabolic Pathway and the Risk of Conotruncal Heart Defects

    Directory of Open Access Journals (Sweden)

    Philip J. Lupo

    2010-01-01

    Full Text Available Conotruncal and related heart defects (CTRD are common, complex malformations. Although there are few established risk factors, there is evidence that genetic variation in the folate metabolic pathway influences CTRD risk. This study was undertaken to assess the association between inherited (i.e., case and maternal gene-gene interactions in this pathway and the risk of CTRD. Case-parent triads (n=727, ascertained from the Children's Hospital of Philadelphia, were genotyped for ten functional variants of nine folate metabolic genes. Analyses of inherited genotypes were consistent with the previously reported association between MTHFR A1298C and CTRD (adjusted P=.02, but provided no evidence that CTRD was associated with inherited gene-gene interactions. Analyses of the maternal genotypes provided evidence of a MTHFR C677T/CBS 844ins68 interaction and CTRD risk (unadjusted P=.02. This association is consistent with the effects of this genotype combination on folate-homocysteine biochemistry but remains to be confirmed in independent study populations.

  15. Influence of SNPs in nutrient-sensitive candidate genes and gene-diet interactions on blood lipids

    DEFF Research Database (Denmark)

    Brahe, Lena Kirchner; Angquist, Lars; Larsen, Lesli Hingstrup

    2013-01-01

    Blood lipid response to a given dietary intervention could be determined by the effect of diet, gene variants or gene-diet interactions. The objective of the present study was to investigate whether variants in presumed nutrient-sensitive genes involved in lipid metabolism modified lipid profile ...

  16. Discovering disease-associated genes in weighted protein-protein interaction networks

    Science.gov (United States)

    Cui, Ying; Cai, Meng; Stanley, H. Eugene

    2018-04-01

    Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weight - which quantifies their relative strength - into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.

  17. Evidence for plasticity genotypes in a gene-gene-environment interaction : the TRAILS study

    NARCIS (Netherlands)

    Nederhof, E; Bouma, Esther; Riese, Harriette; Laceulle, Odilia; Ormel, J.; Oldehinkel, A.J.

    2010-01-01

    The purpose was to study how functional polymorphisms in the brain derived neurotrophic factor gene (BDNF val66met) and the serotonin transporter gene linked promotor region (5-HTTLPR) interact with childhood adversities in predicting Effortful Control. Effortful Control refers to the ability to

  18. Systemic virus-induced gene silencing allows functional characterization of maize genes during biotrophic interaction with Ustilago maydis.

    Science.gov (United States)

    van der Linde, Karina; Kastner, Christine; Kumlehn, Jochen; Kahmann, Regine; Doehlemann, Gunther

    2011-01-01

    Infection of maize (Zea mays) plants with the corn smut fungus Ustilago maydis leads to the formation of large tumors on the stem, leaves and inflorescences. In this biotrophic interaction, plant defense responses are actively suppressed by the pathogen, and previous transcriptome analyses of infected maize plants showed massive and stage-specific changes in host gene expression during disease progression. To identify maize genes that are functionally involved in the interaction with U. maydis, we adapted a virus-induced gene silencing (VIGS) system based on the brome mosaic virus (BMV) for maize. Conditions were established that allowed successful U. maydis infection of BMV-preinfected maize plants. This set-up enabled quantification of VIGS and its impact on U. maydis infection using a quantitative real-time PCR (qRT-PCR)-based readout. In proof-of-principle experiments, an U. maydis-induced terpene synthase was shown to negatively regulate disease development while a protein involved in cell death inhibition was required for full virulence of U. maydis. The results suggest that this system is a versatile tool for the rapid identification of maize genes that determine compatibility with U. maydis. © (2010) Max Planck Society. Journal compilation © New Phytologist Trust (2010).

  19. FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

    Science.gov (United States)

    Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2018-02-09

    Estimation of functional connectivity in gene sets derived from genome-wide or other biological experiments is one of the essential tasks of bioinformatics. A promising approach for solving this problem is to compare gene networks built using experimental gene sets with random networks. One of the resources that make such an analysis possible is CrossTalkZ, which uses the FunCoup database. However, existing methods, including CrossTalkZ, do not take into account individual types of interactions, such as protein/protein interactions, expression regulation, transport regulation, catalytic reactions, etc., but rather work with generalized types characterizing the existence of any connection between network members. We developed the online tool FunGeneNet, which utilizes the ANDSystem and STRING to reconstruct gene networks using experimental gene sets and to estimate their difference from random networks. To compare the reconstructed networks with random ones, the node permutation algorithm implemented in CrossTalkZ was taken as a basis. To study the FunGeneNet applicability, the functional connectivity analysis of networks constructed for gene sets involved in the Gene Ontology biological processes was conducted. We showed that the method sensitivity exceeds 0.8 at a specificity of 0.95. We found that the significance level of the difference between gene networks of biological processes and random networks is determined by the type of connections considered between objects. At the same time, the highest reliability is achieved for the generalized form of connections that takes into account all the individual types of connections. By taking examples of the thyroid cancer networks and the apoptosis network, it is demonstrated that key participants in these processes are involved in the interactions of those types by which these networks differ from random ones. FunGeneNet is a web tool aimed at proving the functionality of networks in a wide range of sizes of

  20. Genes2FANs: connecting genes through functional association networks

    Science.gov (United States)

    2012-01-01

    Background Protein-protein, cell signaling, metabolic, and transcriptional interaction networks are useful for identifying connections between lists of experimentally identified genes/proteins. However, besides physical or co-expression interactions there are many ways in which pairs of genes, or their protein products, can be associated. By systematically incorporating knowledge on shared properties of genes from diverse sources to build functional association networks (FANs), researchers may be able to identify additional functional interactions between groups of genes that are not readily apparent. Results Genes2FANs is a web based tool and a database that utilizes 14 carefully constructed FANs and a large-scale protein-protein interaction (PPI) network to build subnetworks that connect lists of human and mouse genes. The FANs are created from mammalian gene set libraries where mouse genes are converted to their human orthologs. The tool takes as input a list of human or mouse Entrez gene symbols to produce a subnetwork and a ranked list of intermediate genes that are used to connect the query input list. In addition, users can enter any PubMed search term and then the system automatically converts the returned results to gene lists using GeneRIF. This gene list is then used as input to generate a subnetwork from the user’s PubMed query. As a case study, we applied Genes2FANs to connect disease genes from 90 well-studied disorders. We find an inverse correlation between the counts of links connecting disease genes through PPI and links connecting diseases genes through FANs, separating diseases into two categories. Conclusions Genes2FANs is a useful tool for interpreting the relationships between gene/protein lists in the context of their various functions and networks. Combining functional association interactions with physical PPIs can be useful for revealing new biology and help form hypotheses for further experimentation. Our finding that disease genes in

  1. Spatially Uniform ReliefF (SURF for computationally-efficient filtering of gene-gene interactions

    Directory of Open Access Journals (Sweden)

    Greene Casey S

    2009-09-01

    Full Text Available Abstract Background Genome-wide association studies are becoming the de facto standard in the genetic analysis of common human diseases. Given the complexity and robustness of biological networks such diseases are unlikely to be the result of single points of failure but instead likely arise from the joint failure of two or more interacting components. The hope in genome-wide screens is that these points of failure can be linked to single nucleotide polymorphisms (SNPs which confer disease susceptibility. Detecting interacting variants that lead to disease in the absence of single-gene effects is difficult however, and methods to exhaustively analyze sets of these variants for interactions are combinatorial in nature thus making them computationally infeasible. Efficient algorithms which can detect interacting SNPs are needed. ReliefF is one such promising algorithm, although it has low success rate for noisy datasets when the interaction effect is small. ReliefF has been paired with an iterative approach, Tuned ReliefF (TuRF, which improves the estimation of weights in noisy data but does not fundamentally change the underlying ReliefF algorithm. To improve the sensitivity of studies using these methods to detect small effects we introduce Spatially Uniform ReliefF (SURF. Results SURF's ability to detect interactions in this domain is significantly greater than that of ReliefF. Similarly SURF, in combination with the TuRF strategy significantly outperforms TuRF alone for SNP selection under an epistasis model. It is important to note that this success rate increase does not require an increase in algorithmic complexity and allows for increased success rate, even with the removal of a nuisance parameter from the algorithm. Conclusion Researchers performing genetic association studies and aiming to discover gene-gene interactions associated with increased disease susceptibility should use SURF in place of ReliefF. For instance, SURF should be

  2. Gene-physical activity interactions and their impact on diabetes

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas; Franks, Paul W

    2014-01-01

    to an equal bout of physical activity. Individuals with specific genetic profiles are also expected to be more responsive to the beneficial effects of physical activity in the prevention of type 2 diabetes. Identification of such gene-physical activity interactions could give new insights into the biological...... the reader to the recent advances in the genetics of type 2 diabetes, summarize the current evidence on gene-physical activity interactions in relation to type 2 diabetes, and outline how information on gene-physical activity interactions might help improve the prevention and treatment of type 2 diabetes....... Finally, we will discuss the existing and emerging strategies that might enhance our ability to identify and exploit gene-physical activity interactions in the etiology of type 2 diabetes. © 2014 S. Karger AG, Basel....

  3. Gene-environment interaction and behavioral disorders: a developmental perspective based on endophenotypes.

    Science.gov (United States)

    Battaglia, Marco; Marino, Cecilia; Maziade, Michel; Molteni, Massimo; D'Amato, Francesca

    2008-01-01

    It has been observed that 'No aspect of human behavioral genetics has caused more confusion and generated more obscurantism than the analysis and interpretation of various types of non-additivity and non-independence of gene and environmental action and interaction' (Eaves LJ et al 1977 Br J Math Stat Psychol 30:1-42). On the other hand, a bulk of newly published studies appear to speak in favour of common and frequent interplay--and possibly interaction--between identified genetic polymorphisms and specified environmental variables in shaping behavior and behavioral disorders. Considerable interest has arisen from the introduction of putative functional 'endophenotypes' which would represent a more proximate biological link to genes, as well as an obligatory intermediate of behavior. While explicit criteria to identify valid endophenotypes have been offered, a number of new 'alternative phenotypes' are now being proposed as possible 'endophenotypes' for behavioral and psychiatric genetics research, sometimes with less than optimal stringency. Nonetheless, we suggest that some endophenotypes can be helpful in investigating several instances of gene-environment interactions and be employed as additional tools to reduce the risk for spurious results in this controversial area.

  4. Insulators form gene loops by interacting with promoters in Drosophila.

    Science.gov (United States)

    Erokhin, Maksim; Davydova, Anna; Kyrchanova, Olga; Parshikov, Alexander; Georgiev, Pavel; Chetverina, Darya

    2011-09-01

    Chromatin insulators are regulatory elements involved in the modulation of enhancer-promoter communication. The 1A2 and Wari insulators are located immediately downstream of the Drosophila yellow and white genes, respectively. Using an assay based on the yeast GAL4 activator, we have found that both insulators are able to interact with their target promoters in transgenic lines, forming gene loops. The existence of an insulator-promoter loop is confirmed by the fact that insulator proteins could be detected on the promoter only in the presence of an insulator in the transgene. The upstream promoter regions, which are required for long-distance stimulation by enhancers, are not essential for promoter-insulator interactions. Both insulators support basal activity of the yellow and white promoters in eyes. Thus, the ability of insulators to interact with promoters might play an important role in the regulation of basal gene transcription.

  5. Protein-Protein Interactions Prediction Based on Iterative Clique Extension with Gene Ontology Filtering

    Directory of Open Access Journals (Sweden)

    Lei Yang

    2014-01-01

    Full Text Available Cliques (maximal complete subnets in protein-protein interaction (PPI network are an important resource used to analyze protein complexes and functional modules. Clique-based methods of predicting PPI complement the data defection from biological experiments. However, clique-based predicting methods only depend on the topology of network. The false-positive and false-negative interactions in a network usually interfere with prediction. Therefore, we propose a method combining clique-based method of prediction and gene ontology (GO annotations to overcome the shortcoming and improve the accuracy of predictions. According to different GO correcting rules, we generate two predicted interaction sets which guarantee the quality and quantity of predicted protein interactions. The proposed method is applied to the PPI network from the Database of Interacting Proteins (DIP and most of the predicted interactions are verified by another biological database, BioGRID. The predicted protein interactions are appended to the original protein network, which leads to clique extension and shows the significance of biological meaning.

  6. Integration of human adipocyte chromosomal interactions with adipose gene expression prioritizes obesity-related genes from GWAS.

    Science.gov (United States)

    Pan, David Z; Garske, Kristina M; Alvarez, Marcus; Bhagat, Yash V; Boocock, James; Nikkola, Elina; Miao, Zong; Raulerson, Chelsea K; Cantor, Rita M; Civelek, Mete; Glastonbury, Craig A; Small, Kerrin S; Boehnke, Michael; Lusis, Aldons J; Sinsheimer, Janet S; Mohlke, Karen L; Laakso, Markku; Pajukanta, Päivi; Ko, Arthur

    2018-04-17

    Increased adiposity is a hallmark of obesity and overweight, which affect 2.2 billion people world-wide. Understanding the genetic and molecular mechanisms that underlie obesity-related phenotypes can help to improve treatment options and drug development. Here we perform promoter Capture Hi-C in human adipocytes to investigate interactions between gene promoters and distal elements as a transcription-regulating mechanism contributing to these phenotypes. We find that promoter-interacting elements in human adipocytes are enriched for adipose-related transcription factor motifs, such as PPARG and CEBPB, and contribute to heritability of cis-regulated gene expression. We further intersect these data with published genome-wide association studies for BMI and BMI-related metabolic traits to identify the genes that are under genetic cis regulation in human adipocytes via chromosomal interactions. This integrative genomics approach identifies four cis-eQTL-eGene relationships associated with BMI or obesity-related traits, including rs4776984 and MAP2K5, which we further confirm by EMSA, and highlights 38 additional candidate genes.

  7. Identification of potential crucial genes associated with steroid-induced necrosis of femoral head based on gene expression profile.

    Science.gov (United States)

    Lin, Zhe; Lin, Yongsheng

    2017-09-05

    The aim of this study was to explore potential crucial genes associated with the steroid-induced necrosis of femoral head (SINFH) and to provide valid biological information for further investigation of SINFH. Gene expression profile of GSE26316, generated from 3 SINFH rat samples and 3 normal rat samples were downloaded from Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) were identified using LIMMA package. After functional enrichment analyses of DEGs, protein-protein interaction (PPI) network and sub-PPI network analyses were conducted based on the STRING database and cytoscape. In total, 59 up-regulated DEGs and 156 downregulated DEGs were identified. The up-regulated DEGs were mainly involved in functions about immunity (e.g. Fcer1A and Il7R), and the downregulated DEGs were mainly enriched in muscle system process (e.g. Tnni2, Mylpf and Myl1). The PPI network of DEGs consisted of 123 nodes and 300 interactions. Tnni2, Mylpf, and Myl1 were the top 3 outstanding genes based on both subgraph centrality and degree centrality evaluation. These three genes interacted with each other in the network. Furthermore, the significant network module was composed of 22 downregulated genes (e.g. Tnni2, Mylpf and Myl1). These genes were mainly enriched in functions like muscle system process. The DEGs related to the regulation of immune system process (e.g. Fcer1A and Il7R), and DEGs correlated with muscle system process (e.g. Tnni2, Mylpf and Myl1) may be closely associated with the progress of SINFH, which is still needed to be confirmed by experiments. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

    Science.gov (United States)

    Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

    2018-02-23

    Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.

  9. Robust Tests for Additive Gene-Environment Interaction in Case-Control Studies Using Gene-Environment Independence

    DEFF Research Database (Denmark)

    Liu, Gang; Lee, Seunggeun; Lee, Alice W

    2018-01-01

    test with case-control data. Our simulation studies suggest that the EB approach uses the gene-environment independence assumption in a data-adaptive way and provides power gain compared to the standard logistic regression analysis and better control of Type I error when compared to the analysis......There have been recent proposals advocating the use of additive gene-environment interaction instead of the widely used multiplicative scale, as a more relevant public health measure. Using gene-environment independence enhances the power for testing multiplicative interaction in case......-control studies. However, under departure from this assumption, substantial bias in the estimates and inflated Type I error in the corresponding tests can occur. This paper extends the empirical Bayes (EB) approach previously developed for multiplicative interaction that trades off between bias and efficiency...

  10. fabp4 is central to eight obesity associated genes: a functional gene network-based polymorphic study.

    Science.gov (United States)

    Bag, Susmita; Ramaiah, Sudha; Anbarasu, Anand

    2015-01-07

    Network study on genes and proteins offers functional basics of the complexity of gene and protein, and its interacting partners. The gene fatty acid-binding protein 4 (fabp4) is found to be highly expressed in adipose tissue, and is one of the most abundant proteins in mature adipocytes. Our investigations on functional modules of fabp4 provide useful information on the functional genes interacting with fabp4, their biochemical properties and their regulatory functions. The present study shows that there are eight set of candidate genes: acp1, ext2, insr, lipe, ostf1, sncg, usp15, and vim that are strongly and functionally linked up with fabp4. Gene ontological analysis of network modules of fabp4 provides an explicit idea on the functional aspect of fabp4 and its interacting nodes. The hierarchal mapping on gene ontology indicates gene specific processes and functions as well as their compartmentalization in tissues. The fabp4 along with its interacting genes are involved in lipid metabolic activity and are integrated in multi-cellular processes of tissues and organs. They also have important protein/enzyme binding activity. Our study elucidated disease-associated nsSNP prediction for fabp4 and it is interesting to note that there are four rsID׳s (rs1051231, rs3204631, rs140925685 and rs141169989) with disease allelic variation (T104P, T126P, G27D and G90V respectively). On the whole, our gene network analysis presents a clear insight about the interactions and functions associated with fabp4 gene network. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Exploring Plant Co-Expression and Gene-Gene Interactions with CORNET 3.0.

    Science.gov (United States)

    Van Bel, Michiel; Coppens, Frederik

    2017-01-01

    Selecting and filtering a reference expression and interaction dataset when studying specific pathways and regulatory interactions can be a very time-consuming and error-prone task. In order to reduce the duplicated efforts required to amass such datasets, we have created the CORNET (CORrelation NETworks) platform which allows for easy access to a wide variety of data types: coexpression data, protein-protein interactions, regulatory interactions, and functional annotations. The CORNET platform outputs its results in either text format or through the Cytoscape framework, which is automatically launched by the CORNET website.CORNET 3.0 is the third iteration of the web platform designed for the user exploration of the coexpression space of plant genomes, with a focus on the model species Arabidopsis thaliana. Here we describe the platform: the tools, data, and best practices when using the platform. We indicate how the platform can be used to infer networks from a set of input genes, such as upregulated genes from an expression experiment. By exploring the network, new target and regulator genes can be discovered, allowing for follow-up experiments and more in-depth study. We also indicate how to avoid common pitfalls when evaluating the networks and how to avoid over interpretation of the results.All CORNET versions are available at http://bioinformatics.psb.ugent.be/cornet/ .

  12. Gene × Smoking Interactions on Human Brain Gene Expression: Finding Common Mechanisms in Adolescents and Adults

    Science.gov (United States)

    Wolock, Samuel L.; Yates, Andrew; Petrill, Stephen A.; Bohland, Jason W.; Blair, Clancy; Li, Ning; Machiraju, Raghu; Huang, Kun; Bartlett, Christopher W.

    2013-01-01

    Background: Numerous studies have examined gene × environment interactions (G × E) in cognitive and behavioral domains. However, these studies have been limited in that they have not been able to directly assess differential patterns of gene expression in the human brain. Here, we assessed G × E interactions using two publically available datasets…

  13. 5C analysis of the Epidermal Differentiation Complex locus reveals distinct chromatin interaction networks between gene-rich and gene-poor TADs in skin epithelial cells.

    Directory of Open Access Journals (Sweden)

    Krzysztof Poterlowicz

    2017-09-01

    Full Text Available Mammalian genomes contain several dozens of large (>0.5 Mbp lineage-specific gene loci harbouring functionally related genes. However, spatial chromatin folding, organization of the enhancer-promoter networks and their relevance to Topologically Associating Domains (TADs in these loci remain poorly understood. TADs are principle units of the genome folding and represents the DNA regions within which DNA interacts more frequently and less frequently across the TAD boundary. Here, we used Chromatin Conformation Capture Carbon Copy (5C technology to characterize spatial chromatin interaction network in the 3.1 Mb Epidermal Differentiation Complex (EDC locus harbouring 61 functionally related genes that show lineage-specific activation during terminal keratinocyte differentiation in the epidermis. 5C data validated by 3D-FISH demonstrate that the EDC locus is organized into several TADs showing distinct lineage-specific chromatin interaction networks based on their transcription activity and the gene-rich or gene-poor status. Correlation of the 5C results with genome-wide studies for enhancer-specific histone modifications (H3K4me1 and H3K27ac revealed that the majority of spatial chromatin interactions that involves the gene-rich TADs at the EDC locus in keratinocytes include both intra- and inter-TAD interaction networks, connecting gene promoters and enhancers. Compared to thymocytes in which the EDC locus is mostly transcriptionally inactive, these interactions were found to be keratinocyte-specific. In keratinocytes, the promoter-enhancer anchoring regions in the gene-rich transcriptionally active TADs are enriched for the binding of chromatin architectural proteins CTCF, Rad21 and chromatin remodeler Brg1. In contrast to gene-rich TADs, gene-poor TADs show preferential spatial contacts with each other, do not contain active enhancers and show decreased binding of CTCF, Rad21 and Brg1 in keratinocytes. Thus, spatial interactions between gene

  14. Discovering implicit entity relation with the gene-citation-gene network.

    Directory of Open Access Journals (Sweden)

    Min Song

    Full Text Available In this paper, we apply the entitymetrics model to our constructed Gene-Citation-Gene (GCG network. Based on the premise there is a hidden, but plausible, relationship between an entity in one article and an entity in its citing article, we constructed a GCG network of gene pairs implicitly connected through citation. We compare the performance of this GCG network to a gene-gene (GG network constructed over the same corpus but which uses gene pairs explicitly connected through traditional co-occurrence. Using 331,411 MEDLINE abstracts collected from 18,323 seed articles and their references, we identify 25 gene pairs. A comparison of these pairs with interactions found in BioGRID reveal that 96% of the gene pairs in the GCG network have known interactions. We measure network performance using degree, weighted degree, closeness, betweenness centrality and PageRank. Combining all measures, we find the GCG network has more gene pairs, but a lower matching rate than the GG network. However, combining top ranked genes in both networks produces a matching rate of 35.53%. By visualizing both the GG and GCG networks, we find that cancer is the most dominant disease associated with the genes in both networks. Overall, the study indicates that the GCG network can be useful for detecting gene interaction in an implicit manner.

  15. Genome-Wide Constitutively Expressed Gene Analysis and New Reference Gene Selection Based on Transcriptome Data: A Case Study from Poplar/Canker Disease Interaction

    Directory of Open Access Journals (Sweden)

    Jiaping Zhao

    2017-10-01

    Full Text Available A number of transcriptome datasets for differential expression (DE genes have been widely used for understanding organismal biology, but these datasets also contain untapped information that can be used to develop more precise analytical tools. With the use of transcriptome data generated from poplar/canker disease interaction system, we describe a methodology to identify candidate reference genes from high-throughput sequencing data. This methodology will improve the accuracy of RT-qPCR and will lead to better standards for the normalization of expression data. Expression stability analysis from xylem and phloem of Populus bejingensis inoculated with the fungal canker pathogen Botryosphaeria dothidea revealed that 729 poplar transcripts (1.11% were stably expressed, at a threshold level of coefficient of variance (CV of FPKM < 20% and maximum fold change (MFC of FPKM < 2.0. Expression stability and bioinformatics analysis suggested that commonly used house-keeping (HK genes were not the most appropriate internal controls: 70 of the 72 commonly used HK genes were not stably expressed, 45 of the 72 produced multiple isoform transcripts, and some of their reported primers produced unspecific amplicons in PCR amplification. RT-qPCR analysis to compare and evaluate the expression stability of 10 commonly used poplar HK genes and 20 of the 729 newly-identified stably expressed transcripts showed that some of the newly-identified genes (such as SSU_S8e, LSU_L5e, and 20S_PSU had higher stability ranking than most of commonly used HK genes. Based on these results, we recommend a pipeline for deriving reference genes from transcriptome data. An appropriate candidate gene should have a unique transcript, constitutive expression, CV value of expression < 20% (or possibly 30% and MFC value of expression <2, and an expression level of 50–1,000 units. Lastly, when four of the newly identified HK genes were used in the normalization of expression data for 20

  16. A gene-gene interaction between polymorphisms in the OCT2 and MATE1 genes influences the renal clearance of metformin

    DEFF Research Database (Denmark)

    Hougaard Christensen, Mette Marie; Pedersen, Rasmus Steen; Stage, Tore Bjerregaard

    2013-01-01

    The aim of this study was to determine the association between the renal clearance (CL(renal)) of metformin in healthy Caucasian volunteers and the single-nucleotide polymorphism (SNP) c.808G>T (rs316019) in OCT2 as well as the relevance of the gene-gene interactions between this SNP and (a) the ...

  17. Gene-gene interactions of IRF5, STAT4, IKZF1 and ETS1 in systemic lupus erythematosus.

    Science.gov (United States)

    Dang, J; Shan, S; Li, J; Zhao, H; Xin, Q; Liu, Y; Bian, X; Liu, Q

    2014-06-01

    Interferon (IFN) activation signaling and T helper 17 (Th17)-cell/B-cell regulation play a critical role in the pathogenesis of systemic lupus erythematosus (SLE). Several studies have provided convincing evidence that polymorphisms in IRF5, STAT4, IKZF1 and ETS1 from these pathways may be involved in SLE by affecting gene expression or epistasis. We analyzed the genetic interaction in known SLE susceptibility loci from the four genes in northern Han Chinese. A total of 946 northern Han Chinese participated in this study (370 unrelated SLE patients and 576 healthy controls). Subjects underwent genotyping for the single-nucleotide polymorphisms (SNPs) rs2004640 in IRF5, rs7574865 in STAT4, rs4917014 in IKZF1 and rs1128334 in ETS1 by use of a TaqMan SNP genotyping assay and direct sequencing. Gene-gene interaction analysis involved direct counting, multifactor dimensionality reduction (MDR) and linear regression analysis. SLE patients and controls differed in allele frequencies of rs7574865, rs1128334 (P < 0.001) and rs4917014 (P < 0.01). Direct counting revealed that the frequency of risk homozygote combinations was higher for SLE patients than controls (P < 0.01). Furthermore, 2-, 3- and 4-way gene-gene epistasis in SLE was confirmed by parametric methods and MDR analysis. Gene expression analysis partially supported the findings. Our study confirmed the association of the IFN pathway or Th17/B-cells and the pathogenesis of SLE, and gene-gene interaction in this pathway may increase the risk of SLE. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  18. Gene-Lifestyle Interactions in Obesity.

    Science.gov (United States)

    van Vliet-Ostaptchouk, Jana V; Snieder, Harold; Lagou, Vasiliki

    2012-01-01

    Obesity is a complex multifaceted disease resulting from interactions between genetics and lifestyle. The proportion of phenotypic variance ascribed to genetic variance is 0.4 to 0.7 for obesity and recent years have seen considerable success in identifying disease-susceptibility variants. Although with the advent of genome-wide association studies the list of genetic variants predisposing to obesity has significantly increased the identified variants only explain a fraction of disease heritability. Studies of gene-environment interactions can provide more insight into the biological mechanisms involved in obesity despite the challenges associated with such designs. Epigenetic changes that affect gene function without DNA sequence modifications may be a key factor explaining interindividual differences in obesity, with both genetic and environmental factors influencing the epigenome. Disentangling the relative contributions of genetic, environmental and epigenetic marks to the establishment of obesity is a major challenge given the complex interplay between these determinants.

  19. Genome-wide analysis of E. coli cell-gene interactions.

    Science.gov (United States)

    Cardinale, S; Cambray, G

    2017-11-23

    The pursuit of standardization and reliability in synthetic biology has achieved, in recent years, a number of advances in the design of more predictable genetic parts for biological circuits. However, even with the development of high-throughput screening methods and whole-cell models, it is still not possible to predict reliably how a synthetic genetic construct interacts with all cellular endogenous systems. This study presents a genome-wide analysis of how the expression of synthetic genes is affected by systematic perturbations of cellular functions. We found that most perturbations modulate expression indirectly through an effect on cell size, putting forward the existence of a generic Size-Expression interaction in the model prokaryote Escherichia coli. The Size-Expression interaction was quantified by inserting a dual fluorescent reporter gene construct into each of the 3822 single-gene deletion strains comprised in the KEIO collection. Cellular size was measured for single cells via flow cytometry. Regression analyses were used to discriminate between expression-specific and gene-specific effects. Functions of the deleted genes broadly mapped onto three systems with distinct primary influence on the Size-Expression map. Perturbations in the Division and Biosynthesis (DB) system led to a large-cell and high-expression phenotype. In contrast, disruptions of the Membrane and Motility (MM) system caused small-cell and low-expression phenotypes. The Energy, Protein synthesis and Ribosome (EPR) system was predominantly associated with smaller cells and positive feedback on ribosome function. Feedback between cell growth and gene expression is widespread across cell systems. Even though most gene disruptions proximally affect one component of the Size-Expression interaction, the effect therefore ultimately propagates to both. More specifically, we describe the dual impact of growth on cell size and gene expression through cell division and ribosomal content

  20. Risk score modeling of multiple gene to gene interactions using aggregated-multifactor dimensionality reduction

    Directory of Open Access Journals (Sweden)

    Dai Hongying

    2013-01-01

    Full Text Available Abstract Background Multifactor Dimensionality Reduction (MDR has been widely applied to detect gene-gene (GxG interactions associated with complex diseases. Existing MDR methods summarize disease risk by a dichotomous predisposing model (high-risk/low-risk from one optimal GxG interaction, which does not take the accumulated effects from multiple GxG interactions into account. Results We propose an Aggregated-Multifactor Dimensionality Reduction (A-MDR method that exhaustively searches for and detects significant GxG interactions to generate an epistasis enriched gene network. An aggregated epistasis enriched risk score, which takes into account multiple GxG interactions simultaneously, replaces the dichotomous predisposing risk variable and provides higher resolution in the quantification of disease susceptibility. We evaluate this new A-MDR approach in a broad range of simulations. Also, we present the results of an application of the A-MDR method to a data set derived from Juvenile Idiopathic Arthritis patients treated with methotrexate (MTX that revealed several GxG interactions in the folate pathway that were associated with treatment response. The epistasis enriched risk score that pooled information from 82 significant GxG interactions distinguished MTX responders from non-responders with 82% accuracy. Conclusions The proposed A-MDR is innovative in the MDR framework to investigate aggregated effects among GxG interactions. New measures (pOR, pRR and pChi are proposed to detect multiple GxG interactions.

  1. A novel approach to simulate gene-environment interactions in complex diseases

    Directory of Open Access Journals (Sweden)

    Nicodemi Mario

    2010-01-01

    Full Text Available Abstract Background Complex diseases are multifactorial traits caused by both genetic and environmental factors. They represent the major part of human diseases and include those with largest prevalence and mortality (cancer, heart disease, obesity, etc.. Despite a large amount of information that has been collected about both genetic and environmental risk factors, there are few examples of studies on their interactions in epidemiological literature. One reason can be the incomplete knowledge of the power of statistical methods designed to search for risk factors and their interactions in these data sets. An improvement in this direction would lead to a better understanding and description of gene-environment interactions. To this aim, a possible strategy is to challenge the different statistical methods against data sets where the underlying phenomenon is completely known and fully controllable, for example simulated ones. Results We present a mathematical approach that models gene-environment interactions. By this method it is possible to generate simulated populations having gene-environment interactions of any form, involving any number of genetic and environmental factors and also allowing non-linear interactions as epistasis. In particular, we implemented a simple version of this model in a Gene-Environment iNteraction Simulator (GENS, a tool designed to simulate case-control data sets where a one gene-one environment interaction influences the disease risk. The main aim has been to allow the input of population characteristics by using standard epidemiological measures and to implement constraints to make the simulator behaviour biologically meaningful. Conclusions By the multi-logistic model implemented in GENS it is possible to simulate case-control samples of complex disease where gene-environment interactions influence the disease risk. The user has full control of the main characteristics of the simulated population and a Monte

  2. Analysis of the robustness of network-based disease-gene prioritization methods reveals redundancy in the human interactome and functional diversity of disease-genes.

    Directory of Open Access Journals (Sweden)

    Emre Guney

    Full Text Available Complex biological systems usually pose a trade-off between robustness and fragility where a small number of perturbations can substantially disrupt the system. Although biological systems are robust against changes in many external and internal conditions, even a single mutation can perturb the system substantially, giving rise to a pathophenotype. Recent advances in identifying and analyzing the sequential variations beneath human disorders help to comprehend a systemic view of the mechanisms underlying various disease phenotypes. Network-based disease-gene prioritization methods rank the relevance of genes in a disease under the hypothesis that genes whose proteins interact with each other tend to exhibit similar phenotypes. In this study, we have tested the robustness of several network-based disease-gene prioritization methods with respect to the perturbations of the system using various disease phenotypes from the Online Mendelian Inheritance in Man database. These perturbations have been introduced either in the protein-protein interaction network or in the set of known disease-gene associations. As the network-based disease-gene prioritization methods are based on the connectivity between known disease-gene associations, we have further used these methods to categorize the pathophenotypes with respect to the recoverability of hidden disease-genes. Our results have suggested that, in general, disease-genes are connected through multiple paths in the human interactome. Moreover, even when these paths are disturbed, network-based prioritization can reveal hidden disease-gene associations in some pathophenotypes such as breast cancer, cardiomyopathy, diabetes, leukemia, parkinson disease and obesity to a greater extend compared to the rest of the pathophenotypes tested in this study. Gene Ontology (GO analysis highlighted the role of functional diversity for such diseases.

  3. Gene-gene interaction between MSX1 and TP63 in Asian case-parent trios with nonsyndromic cleft lip with or without cleft palate.

    Science.gov (United States)

    Liu, Dongjing; Schwender, Holger; Wang, Mengying; Wang, Hong; Wang, Ping; Zhu, Hongping; Zhou, Zhibo; Li, Jing; Wu, Tao; Beaty, Terri H

    2018-03-01

    Small ubiquitin-like modification, also known as sumoylation, is a crucial post-translational regulatory mechanisms involved in development of the lip and palate. Recent studies reported two sumoylation target genes, MSX1 and TP63, to have achieved genome-wide level significance in tests of association with nonsyndromic clefts. Here, we performed a candidate gene analysis considering gene-gene and gene-environment interaction for SUMO1, MSX1, and TP63 to further explore the etiology of nonsyndromic cleft lip with or without cleft palate (NSCL/P). A total of 130 single-nucleotide polymorphisms (SNPs) in or near SUMO1, MSX1, and TP63 was analyzed among 1,038 Asian NSCL/P trios ascertained through an international consortium. Conditional logistic regression models were used to explore gene-gene (G × G) and gene-environment (G × E) interaction involving maternal environmental tobacco smoke and multivitamin supplementation. Bonferroni correction was used for G × E analysis and permutation tests were used for G × G analysis. While transmission disequilibrium tests and gene-environment interaction analysis showed no significant results, we did find signals of gene-gene interaction between SNPs near MSX1 and TP63. Three pairwise interactions yielded significant p values in permutation tests (rs884690 and rs9290890 with p = 9.34 × 10 -5 and empirical p = 1.00 × 10 -4 , rs1022136 and rs4687098 with p = 2.41 × 10 -4 and empirical p = 2.95 × 10 -4 , rs6819546 and rs9681004 with p = 5.15 × 10 -4 and empirical p = 3.02 × 10 -4 ). Gene-gene interaction between MSX1 and TP63 may influence the risk of NSCL/P in Asian populations. Our study provided additional understanding of the genetic etiology of NSCL/P and underlined the importance of considering gene-gene interaction in the etiology of this common craniofacial malformation. © 2018 Wiley Periodicals, Inc.

  4. Multiple Gene-Environment Interactions on the Angiogenesis Gene-Pathway Impact Rectal Cancer Risk and Survival

    Directory of Open Access Journals (Sweden)

    Noha Sharafeldin

    2017-09-01

    Full Text Available Characterization of gene-environment interactions (GEIs in cancer is limited. We aimed at identifying GEIs in rectal cancer focusing on a relevant biologic process involving the angiogenesis pathway and relevant environmental exposures: cigarette smoking, alcohol consumption, and animal protein intake. We analyzed data from 747 rectal cancer cases and 956 controls from the Diet, Activity and Lifestyle as a Risk Factor for Rectal Cancer study. We applied a 3-step analysis approach: first, we searched for interactions among single nucleotide polymorphisms on the pathway genes; second, we searched for interactions among the genes, both steps using Logic regression; third, we examined the GEIs significant at the 5% level using logistic regression for cancer risk and Cox proportional hazards models for survival. Permutation-based test was used for multiple testing adjustment. We identified 8 significant GEIs associated with risk among 6 genes adjusting for multiple testing: TNF (OR = 1.85, 95% CI: 1.10, 3.11, TLR4 (OR = 2.34, 95% CI: 1.38, 3.98, and EGR2 (OR = 2.23, 95% CI: 1.04, 4.78 with smoking; IGF1R (OR = 1.69, 95% CI: 1.04, 2.72, TLR4 (OR = 2.10, 95% CI: 1.22, 3.60 and EGR2 (OR = 2.12, 95% CI: 1.01, 4.46 with alcohol; and PDGFB (OR = 1.75, 95% CI: 1.04, 2.92 and MMP1 (OR = 2.44, 95% CI: 1.24, 4.81 with protein. Five GEIs were associated with survival at the 5% significance level but not after multiple testing adjustment: CXCR1 (HR = 2.06, 95% CI: 1.13, 3.75 with smoking; and KDR (HR = 4.36, 95% CI: 1.62, 11.73, TLR2 (HR = 9.06, 95% CI: 1.14, 72.11, EGR2 (HR = 2.45, 95% CI: 1.42, 4.22, and EGFR (HR = 6.33, 95% CI: 1.95, 20.54 with protein. GEIs between angiogenesis genes and smoking, alcohol, and animal protein impact rectal cancer risk. Our results support the importance of considering the biologic hypothesis to characterize GEIs associated with cancer outcomes.

  5. Genotet: An Interactive Web-based Visual Exploration Framework to Support Validation of Gene Regulatory Networks.

    Science.gov (United States)

    Yu, Bowen; Doraiswamy, Harish; Chen, Xi; Miraldi, Emily; Arrieta-Ortiz, Mario Luis; Hafemeister, Christoph; Madar, Aviv; Bonneau, Richard; Silva, Cláudio T

    2014-12-01

    Elucidation of transcriptional regulatory networks (TRNs) is a fundamental goal in biology, and one of the most important components of TRNs are transcription factors (TFs), proteins that specifically bind to gene promoter and enhancer regions to alter target gene expression patterns. Advances in genomic technologies as well as advances in computational biology have led to multiple large regulatory network models (directed networks) each with a large corpus of supporting data and gene-annotation. There are multiple possible biological motivations for exploring large regulatory network models, including: validating TF-target gene relationships, figuring out co-regulation patterns, and exploring the coordination of cell processes in response to changes in cell state or environment. Here we focus on queries aimed at validating regulatory network models, and on coordinating visualization of primary data and directed weighted gene regulatory networks. The large size of both the network models and the primary data can make such coordinated queries cumbersome with existing tools and, in particular, inhibits the sharing of results between collaborators. In this work, we develop and demonstrate a web-based framework for coordinating visualization and exploration of expression data (RNA-seq, microarray), network models and gene-binding data (ChIP-seq). Using specialized data structures and multiple coordinated views, we design an efficient querying model to support interactive analysis of the data. Finally, we show the effectiveness of our framework through case studies for the mouse immune system (a dataset focused on a subset of key cellular functions) and a model bacteria (a small genome with high data-completeness).

  6. Gene-Based Genome-Wide Association Analysis in European and Asian Populations Identified Novel Genes for Rheumatoid Arthritis.

    Directory of Open Access Journals (Sweden)

    Hong Zhu

    Full Text Available Rheumatoid arthritis (RA is a complex autoimmune disease. Using a gene-based association research strategy, the present study aims to detect unknown susceptibility to RA and to address the ethnic differences in genetic susceptibility to RA between European and Asian populations.Gene-based association analyses were performed with KGG 2.5 by using publicly available large RA datasets (14,361 RA cases and 43,923 controls of European subjects, 4,873 RA cases and 17,642 controls of Asian Subjects. For the newly identified RA-associated genes, gene set enrichment analyses and protein-protein interactions analyses were carried out with DAVID and STRING version 10.0, respectively. Differential expression verification was conducted using 4 GEO datasets. The expression levels of three selected 'highly verified' genes were measured by ELISA among our in-house RA cases and controls.A total of 221 RA-associated genes were newly identified by gene-based association study, including 71'overlapped', 76 'European-specific' and 74 'Asian-specific' genes. Among them, 105 genes had significant differential expressions between RA patients and health controls at least in one dataset, especially for 20 genes including 11 'overlapped' (ABCF1, FLOT1, HLA-F, IER3, TUBB, ZKSCAN4, BTN3A3, HSP90AB1, CUTA, BRD2, HLA-DMA, 5 'European-specific' (PHTF1, RPS18, BAK1, TNFRSF14, SUOX and 4 'Asian-specific' (RNASET2, HFE, BTN2A2, MAPK13 genes whose differential expressions were significant at least in three datasets. The protein expressions of two selected genes FLOT1 (P value = 1.70E-02 and HLA-DMA (P value = 4.70E-02 in plasma were significantly different in our in-house samples.Our study identified 221 novel RA-associated genes and especially highlighted the importance of 20 candidate genes on RA. The results addressed ethnic genetic background differences for RA susceptibility between European and Asian populations and detected a long list of overlapped or ethnic specific RA

  7. Functional modules by relating protein interaction networks and gene expression.

    Science.gov (United States)

    Tornow, Sabine; Mewes, H W

    2003-11-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.

  8. Assessment of gene-by-sex interaction effect on bone mineral density

    DEFF Research Database (Denmark)

    Liu, Ching-Ti; Estrada, Karol; Yerges-Armstrong, Laura M

    2012-01-01

    Sexual dimorphism in various bone phenotypes, including bone mineral density (BMD), is widely observed; however, the extent to which genes explain these sex differences is unclear. To identify variants with different effects by sex, we examined gene-by-sex autosomal interactions genome-wide, and ......Sexual dimorphism in various bone phenotypes, including bone mineral density (BMD), is widely observed; however, the extent to which genes explain these sex differences is unclear. To identify variants with different effects by sex, we examined gene-by-sex autosomal interactions genome...

  9. Chemical-Gene Interactions from ToxCast Bioactivity Data Expands Universe of Literature Network-Based Associations (SOT)

    Science.gov (United States)

    Characterizing the effects of chemicals in biological systems is often summarized by chemical-gene interactions, which have sparse coverage in the literature. The ToxCast chemical screening program has produced bioactivity data for nearly 2000 chemicals and over 450 gene targets....

  10. Differential gene expression and Hog1 interaction with osmoresponsive genes in the extremely halotolerant black yeast Hortaea werneckii

    Directory of Open Access Journals (Sweden)

    Plemenitaš Ana

    2007-08-01

    Full Text Available Abstract Background Fluctuations in external salinity force eukaryotic cells to respond by changes in the gene expression of proteins acting in protective biochemical processes, thus counteracting the changing osmotic pressure. The high-osmolarity glycerol (HOG signaling pathway is essential for the efficient up-regulation of the osmoresponsive genes. In this study, the differential gene expression of the extremely halotolerant black yeast Hortaea werneckii was explored. Furthermore, the interaction of mitogen-activated protein kinase HwHog1 and RNA polymerase II with the chromatin in cells adapted to an extremely hypersaline environment was analyzed. Results A cDNA subtraction library was constructed for H. werneckii, adapted to moderate salinity or an extremely hypersaline environment of 4.5 M NaCl. An uncommon osmoresponsive set of 95 differentially expressed genes was identified. The majority of these had not previously been connected with the adaptation of salt-sensitive S. cerevisiae to hypersaline conditions. The transcriptional response in hypersaline-adapted and hypersaline-stressed cells showed that only a subset of the identified genes responded to acute salt-stress, whereas all were differentially expressed in adapted cells. Interaction with HwHog1 was shown for 36 of the 95 differentially expressed genes. The majority of the identified osmoresponsive and HwHog1-dependent genes in H. werneckii have not been previously reported as Hog1-dependent genes in the salt-sensitive S. cerevisiae. The study further demonstrated the co-occupancy of HwHog1 and RNA polymerase II on the chromatin of 17 up-regulated and 2 down-regulated genes in 4.5 M NaCl-adapted H. werneckii cells. Conclusion Extremely halotolerant H. werneckii represents a suitable and highly relevant organism to study cellular responses to environmental salinity. In comparison with the salt-sensitive S. cerevisiae, this yeast shows a different set of genes being expressed at

  11. Genome-wide search for gene-gene interactions in colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Shuo Jiao

    Full Text Available Genome-wide association studies (GWAS have successfully identified a number of single-nucleotide polymorphisms (SNPs associated with colorectal cancer (CRC risk. However, these susceptibility loci known today explain only a small fraction of the genetic risk. Gene-gene interaction (GxG is considered to be one source of the missing heritability. To address this, we performed a genome-wide search for pair-wise GxG associated with CRC risk using 8,380 cases and 10,558 controls in the discovery phase and 2,527 cases and 2,658 controls in the replication phase. We developed a simple, but powerful method for testing interaction, which we term the Average Risk Due to Interaction (ARDI. With this method, we conducted a genome-wide search to identify SNPs showing evidence for GxG with previously identified CRC susceptibility loci from 14 independent regions. We also conducted a genome-wide search for GxG using the marginal association screening and examining interaction among SNPs that pass the screening threshold (p<10(-4. For the known locus rs10795668 (10p14, we found an interacting SNP rs367615 (5q21 with replication p = 0.01 and combined p = 4.19×10(-8. Among the top marginal SNPs after LD pruning (n = 163, we identified an interaction between rs1571218 (20p12.3 and rs10879357 (12q21.1 (nominal combined p = 2.51×10(-6; Bonferroni adjusted p = 0.03. Our study represents the first comprehensive search for GxG in CRC, and our results may provide new insight into the genetic etiology of CRC.

  12. A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction.

    Science.gov (United States)

    Marceau, Rachel; Lu, Wenbin; Holloway, Shannon; Sale, Michèle M; Worrall, Bradford B; Williams, Stephen R; Hsu, Fang-Chi; Tzeng, Jung-Ying

    2015-09-01

    Kernel machine (KM) models are a powerful tool for exploring associations between sets of genetic variants and complex traits. Although most KM methods use a single kernel function to assess the marginal effect of a variable set, KM analyses involving multiple kernels have become increasingly popular. Multikernel analysis allows researchers to study more complex problems, such as assessing gene-gene or gene-environment interactions, incorporating variance-component based methods for population substructure into rare-variant association testing, and assessing the conditional effects of a variable set adjusting for other variable sets. The KM framework is robust, powerful, and provides efficient dimension reduction for multifactor analyses, but requires the estimation of high dimensional nuisance parameters. Traditional estimation techniques, including regularization and the "expectation-maximization (EM)" algorithm, have a large computational cost and are not scalable to large sample sizes needed for rare variant analysis. Therefore, under the context of gene-environment interaction, we propose a computationally efficient and statistically rigorous "fastKM" algorithm for multikernel analysis that is based on a low-rank approximation to the nuisance effect kernel matrices. Our algorithm is applicable to various trait types (e.g., continuous, binary, and survival traits) and can be implemented using any existing single-kernel analysis software. Through extensive simulation studies, we show that our algorithm has similar performance to an EM-based KM approach for quantitative traits while running much faster. We also apply our method to the Vitamin Intervention for Stroke Prevention (VISP) clinical trial, examining gene-by-vitamin effects on recurrent stroke risk and gene-by-age effects on change in homocysteine level. © 2015 WILEY PERIODICALS, INC.

  13. BPhyOG: An interactive server for genome-wide inference of bacterial phylogenies based on overlapping genes

    Directory of Open Access Journals (Sweden)

    Lin Kui

    2007-07-01

    Full Text Available Abstract Background Overlapping genes (OGs in bacterial genomes are pairs of adjacent genes of which the coding sequences overlap partly or entirely. With the rapid accumulation of sequence data, many OGs in bacterial genomes have now been identified. Indeed, these might prove a consistent feature across all microbial genomes. Our previous work suggests that OGs can be considered as robust markers at the whole genome level for the construction of phylogenies. An online, interactive web server for inferring phylogenies is needed for biologists to analyze phylogenetic relationships among a set of bacterial genomes of interest. Description BPhyOG is an online interactive server for reconstructing the phylogenies of completely sequenced bacterial genomes on the basis of their shared overlapping genes. It provides two tree-reconstruction methods: Neighbor Joining (NJ and Unweighted Pair-Group Method using Arithmetic averages (UPGMA. Users can apply the desired method to generate phylogenetic trees, which are based on an evolutionary distance matrix for the selected genomes. The distance between two genomes is defined by the normalized number of their shared OG pairs. BPhyOG also allows users to browse the OGs that were used to infer the phylogenetic relationships. It provides detailed annotation for each OG pair and the features of the component genes through hyperlinks. Users can also retrieve each of the homologous OG pairs that have been determined among 177 genomes. It is a useful tool for analyzing the tree of life and overlapping genes from a genomic standpoint. Conclusion BPhyOG is a useful interactive web server for genome-wide inference of any potential evolutionary relationship among the genomes selected by users. It currently includes 177 completely sequenced bacterial genomes containing 79,855 OG pairs, the annotation and homologous OG pairs of which are integrated comprehensively. The reliability of phylogenies complemented by

  14. Gene interactions and genetics for yield and its attributes in grass ...

    Indian Academy of Sciences (India)

    A. K. PARIHAR

    explaining the manifestation of complex traits such as yield. ... interactions (i, j, l) contributed towards the inheritance of traits in the given crosses. ... Keywords. grass pea; scaling test; gene interactions; gene effects; heritability; Lathyrus sativus.

  15. Diet-gene interactions between dietary fat intake and common polymorphisms in determining lipid metabolism

    Energy Technology Data Exchange (ETDEWEB)

    Corella, D.

    2009-07-01

    Current dietary guidelines for fat intake have not taken into consideration the possible genetic differences underlying the individual variability in responsiveness to dietary components. Genetic variability has been identified in humans for all the known lipid metabolism-related genes resulting in a plethora of candidate genes and genetic variants to examine in diet-gene interaction studies focused on fat consumption. Some examples of fat-gene interaction are reviewed. These include: the interaction between total intake and the 14C/T in the hepatic lipase gene promoter in determining high-density lipoprotein cholesterol (HDL-C) metabolism; the interaction between polyunsaturated fatty acids (PUFA) and the 5G/A polymorphism in the APOA1 gene plasma HDL-C concentrations; the interaction between PUFA and the L162V polymorphism in the PPARA gene in determining triglycerides and APOC3 concentrations; and the interaction between PUFA intake and the -1131T>C in the APOA5 gene in determining triglyceride metabolism. Although hundreds of diet-gene interaction studies in lipid metabolism have been published, the level of evidence to make specific nutritional recommendations to the population is still low and more research in nutrigenetics has to be undertaken. (Author) 31 refs.

  16. Genetic interaction motif finding by expectation maximization – a novel statistical model for inferring gene modules from synthetic lethality

    Directory of Open Access Journals (Sweden)

    Ye Ping

    2005-12-01

    Full Text Available Abstract Background Synthetic lethality experiments identify pairs of genes with complementary function. More direct functional associations (for example greater probability of membership in a single protein complex may be inferred between genes that share synthetic lethal interaction partners than genes that are directly synthetic lethal. Probabilistic algorithms that identify gene modules based on motif discovery are highly appropriate for the analysis of synthetic lethal genetic interaction data and have great potential in integrative analysis of heterogeneous datasets. Results We have developed Genetic Interaction Motif Finding (GIMF, an algorithm for unsupervised motif discovery from synthetic lethal interaction data. Interaction motifs are characterized by position weight matrices and optimized through expectation maximization. Given a seed gene, GIMF performs a nonlinear transform on the input genetic interaction data and automatically assigns genes to the motif or non-motif category. We demonstrate the capacity to extract known and novel pathways for Saccharomyces cerevisiae (budding yeast. Annotations suggested for several uncharacterized genes are supported by recent experimental evidence. GIMF is efficient in computation, requires no training and automatically down-weights promiscuous genes with high degrees. Conclusion GIMF effectively identifies pathways from synthetic lethality data with several unique features. It is mostly suitable for building gene modules around seed genes. Optimal choice of one single model parameter allows construction of gene networks with different levels of confidence. The impact of hub genes the generic probabilistic framework of GIMF may be used to group other types of biological entities such as proteins based on stochastic motifs. Analysis of the strongest motifs discovered by the algorithm indicates that synthetic lethal interactions are depleted between genes within a motif, suggesting that synthetic

  17. Gene-particulate matter-health interactions

    International Nuclear Information System (INIS)

    Kleeberger, Steven R.; Ohtsuka, Yoshinori

    2005-01-01

    Inter-individual variation in human responses to air pollutants suggests that some subpopulations are at increased risk to the detrimental effects of pollutant exposure. Extrinsic factors such as previous exposure and nutritional status may influence individual susceptibility. Intrinsic (host) factors that determine susceptibility include age, gender, and pre-existing disease (e.g., asthma), and it is becoming clear that genetic background also contributes to individual susceptibility. Environmental exposures to particulates and genetic factors associated with disease risk likely interact in a complex fashion that varies from one population and one individual to another. The relationships between genetic background and disease risk and severity are often evaluated through traditional family-based linkage studies and positional cloning techniques. However, case-control studies based on association of disease or disease subphenotypes with candidate genes have advantages over family pedigree studies for complex disease phenotypes. This is based in part on continued development of quantitative analysis and the discovery and availability of simple sequence repeats and single nucleotide polymorphisms. Linkage analyses with genetically standardized animal models also provide a useful tool to identify genetic determinants of responses to environmental pollutants. These approaches have identified significant susceptibility quantitative trait loci on mouse chromosomes 1, 6, 11, and 17. Physical mapping and comparative mapping between human and mouse genomes will yield candidate susceptibility genes that may be tested by association studies in human subjects. Human studies and mouse modeling will provide important insight to understanding genetic factors that contribute to differential susceptibility to air pollutants

  18. Interactions of renin-angiotensin system gene polymorphisms and antihypertensive effect of benazepril in Chinese population.

    Science.gov (United States)

    Chen, Qing; Yu, Can-Qing; Tang, Xun; Chen, Da-Fang; Tian, Jun; Cao, Yang; Fan, Wen-Yi; Cao, Wei-Hua; Zhan, Si-Yan; Lv, Jun; Guo, Xiao-Xia; Hu, Yong-Hua; Lee, Li-Ming

    2011-05-01

    Angiotensin-converting enzyme inhibitors are widely used antihypertensive drugs with individual response variation. We studied whether interactions of AGT, AGTR1 and ACE2 gene polymorphisms affect this response. Our study is based on a 3-year field trial with 1831 hypertensive patients prescribed benazepril. Generalized multifactor dimensionality reduction was used to explore interaction models and logistic regressions were used to confirm them. A two-locus model involving the AGT and ACE2 genes was found in males, the sensitive genotypes showed an odds ratio (OR) of 1.9 (95% CI: 1.3-2.8) when compared with nonsensitive genotypes. Two AGT-AGTR1 models were found in females, with an OR of 3.5 (95% CI: 2.0-5.9) and 3.1 (95% CI: 1.8-5.3). Gender-specific gene-gene interactions of the AGT, AGTR1 and ACE2 genes were associated with individual variation of response to benazepril. Further studies are needed to confirm this finding.

  19. Gene-environment interactions involving functional variants

    DEFF Research Database (Denmark)

    Barrdahl, Myrto; Rudolph, Anja; Hopper, John L

    2017-01-01

    .36, 95% CI: 1.16-1.59, pint  = 1.9 × 10(-5) ) in relation to ER- disease risk. The remaining two gene-environment interactions were also identified in relation to ER- breast cancer risk and were found between 3p21-rs6796502 and age at menarche (ORint  = 1.26, 95% CI: 1.12-1.43, pint =1.8 × 10...... epidemiological breast cancer risk factors in relation to breast cancer. Analyses were conducted on up to 58,573 subjects (26,968 cases and 31,605 controls) from the Breast Cancer Association Consortium, in one of the largest studies of its kind. Analyses were carried out separately for estrogen receptor (ER......) positive (ER+) and ER negative (ER-) disease. The Bayesian False Discovery Probability (BFDP) was computed to assess the noteworthiness of the results. Four potential gene-environment interactions were identified as noteworthy (BFDP 

  20. DTFP-Growth: Dynamic Threshold-Based FP-Growth Rule Mining Algorithm Through Integrating Gene Expression, Methylation, and Protein-Protein Interaction Profiles.

    Science.gov (United States)

    Mallik, Saurav; Bhadra, Tapas; Mukherji, Ayan; Mallik, Saurav; Bhadra, Tapas; Mukherji, Ayan; Mallik, Saurav; Bhadra, Tapas; Mukherji, Ayan

    2018-04-01

    Association rule mining is an important technique for identifying interesting relationships between gene pairs in a biological data set. Earlier methods basically work for a single biological data set, and, in maximum cases, a single minimum support cutoff can be applied globally, i.e., across all genesets/itemsets. To overcome this limitation, in this paper, we propose dynamic threshold-based FP-growth rule mining algorithm that integrates gene expression, methylation and protein-protein interaction profiles based on weighted shortest distance to find the novel associations among different pairs of genes in multi-view data sets. For this purpose, we introduce three new thresholds, namely, Distance-based Variable/Dynamic Supports (DVS), Distance-based Variable Confidences (DVC), and Distance-based Variable Lifts (DVL) for each rule by integrating co-expression, co-methylation, and protein-protein interactions existed in the multi-omics data set. We develop the proposed algorithm utilizing these three novel multiple threshold measures. In the proposed algorithm, the values of , , and are computed for each rule separately, and subsequently it is verified whether the support, confidence, and lift of each evolved rule are greater than or equal to the corresponding individual , , and values, respectively, or not. If all these three conditions for a rule are found to be true, the rule is treated as a resultant rule. One of the major advantages of the proposed method compared with other related state-of-the-art methods is that it considers both the quantitative and interactive significance among all pairwise genes belonging to each rule. Moreover, the proposed method generates fewer rules, takes less running time, and provides greater biological significance for the resultant top-ranking rules compared to previous methods.

  1. Association of peroxisome proliferator-activated receptor single-nucleotide polymorphisms and gene-gene interactions with the lipoprotein(a)

    Institute of Scientific and Technical Information of China (English)

    解惠坚

    2014-01-01

    Objective To examine the associations of 10 singlenucleotide polymorphisms(SNPs)in peroxisome proliferator-activated receptor(PPARs)gene with lipoprotein(a)level,and to investigate if there is gene-gene interaction among the SNPs on lipoprotein(a)level.Methods Totally 644 subjects(234 men and 410 women)were enrolled from Prevention of Multiple Metabolic Disorders and Metabolic Syndrome Study Cohort,which was an urban community survey study conducted in Jiangsu province.Ten SNPs in PPARα(rs135539,rs4253778,

  2. Novel gene sets improve set-level classification of prokaryotic gene expression data.

    Science.gov (United States)

    Holec, Matěj; Kuželka, Ondřej; Železný, Filip

    2015-10-28

    Set-level classification of gene expression data has received significant attention recently. In this setting, high-dimensional vectors of features corresponding to genes are converted into lower-dimensional vectors of features corresponding to biologically interpretable gene sets. The dimensionality reduction brings the promise of a decreased risk of overfitting, potentially resulting in improved accuracy of the learned classifiers. However, recent empirical research has not confirmed this expectation. Here we hypothesize that the reported unfavorable classification results in the set-level framework were due to the adoption of unsuitable gene sets defined typically on the basis of the Gene ontology and the KEGG database of metabolic networks. We explore an alternative approach to defining gene sets, based on regulatory interactions, which we expect to collect genes with more correlated expression. We hypothesize that such more correlated gene sets will enable to learn more accurate classifiers. We define two families of gene sets using information on regulatory interactions, and evaluate them on phenotype-classification tasks using public prokaryotic gene expression data sets. From each of the two gene-set families, we first select the best-performing subtype. The two selected subtypes are then evaluated on independent (testing) data sets against state-of-the-art gene sets and against the conventional gene-level approach. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. The novel gene sets are indeed more correlated than the conventional ones, and lead to significantly more accurate classifiers. Novel gene sets defined on the basis of regulatory interactions improve set-level classification of gene expression data. The experimental scripts and other material needed to reproduce the experiments are available at http://ida.felk.cvut.cz/novelgenesets.tar.gz.

  3. A hybrid network-based method for the detection of disease-related genes

    Science.gov (United States)

    Cui, Ying; Cai, Meng; Dai, Yang; Stanley, H. Eugene

    2018-02-01

    Detecting disease-related genes is crucial in disease diagnosis and drug design. The accepted view is that neighbors of a disease-causing gene in a molecular network tend to cause the same or similar diseases, and network-based methods have been recently developed to identify novel hereditary disease-genes in available biomedical networks. Despite the steady increase in the discovery of disease-associated genes, there is still a large fraction of disease genes that remains under the tip of the iceberg. In this paper we exploit the topological properties of the protein-protein interaction (PPI) network to detect disease-related genes. We compute, analyze, and compare the topological properties of disease genes with non-disease genes in PPI networks. We also design an improved random forest classifier based on these network topological features, and a cross-validation test confirms that our method performs better than previous similar studies.

  4. Synergistic interactions of biotic and abiotic environmental stressors on gene expression.

    Science.gov (United States)

    Altshuler, Ianina; McLeod, Anne M; Colbourne, John K; Yan, Norman D; Cristescu, Melania E

    2015-03-01

    Understanding the response of organisms to multiple stressors is critical for predicting if populations can adapt to rapid environmental change. Natural and anthropogenic stressors often interact, complicating general predictions. In this study, we examined the interactive and cumulative effects of two common environmental stressors, lowered calcium concentration, an anthropogenic stressor, and predator presence, a natural stressor, on the water flea Daphnia pulex. We analyzed expression changes of five genes involved in calcium homeostasis - cuticle proteins (Cutie, Icp2), calbindin (Calb), and calcium pump and channel (Serca and Ip3R) - using real-time quantitative PCR (RT-qPCR) in a full factorial experiment. We observed strong synergistic interactions between low calcium concentration and predator presence. While the Ip3R gene was not affected by the stressors, the other four genes were affected in their transcriptional levels by the combination of the stressors. Transcriptional patterns of genes that code for cuticle proteins (Cutie and Icp2) and a sarcoplasmic calcium pump (Serca) only responded to the combination of stressors, changing their relative expression levels in a synergistic response, while a calcium-binding protein (Calb) responded to low calcium stress and the combination of both stressors. The expression pattern of these genes (Cutie, Icp2, and Serca) were nonlinear, yet they were dose dependent across the calcium gradient. Multiple stressors can have complex, often unexpected effects on ecosystems. This study demonstrates that the dominant interaction for the set of tested genes appears to be synergism. We argue that gene expression patterns can be used to understand and predict the type of interaction expected when organisms are exposed simultaneously to natural and anthropogenic stressors.

  5. Trends in gastrectomy and ADH1B and ALDH2 genotypes in Japanese alcoholic men and their gene-gastrectomy, gene-gene and gene-age interactions for risk of alcoholism.

    Science.gov (United States)

    Yokoyama, Akira; Yokoyama, Tetsuji; Matsui, Toshifumi; Mizukami, Takeshi; Kimura, Mitsuru; Matsushita, Sachio; Higuchi, Susumu; Maruyama, Katsuya

    2013-01-01

    The life-time drinking profiles of Japanese alcoholics have shown that gastrectomy increases susceptibility to alcoholism. We investigated the trends in gastrectomy and alcohol dehydrogenase-1B (ADH1B) and aldehyde dehydrogenase-2 (ALDH2) genotypes and their interactions in alcoholics. This survey was conducted on 4879 Japanese alcoholic men 40 years of age or older who underwent routine gastrointestinal endoscopic screening during the period 1996-2010. ADH1B/ALDH2 genotyping was performed in 3702 patients. A history of gastrectomy was found in 508 (10.4%) patients. The reason for the gastrectomy was peptic ulcer in 317 patients and gastric cancer in 187 patients. The frequency of gastrectomy had gradually decreased from 13.3% in 1996-2000 to 10.5% in 2001-2005 and to 7.8% in 2006-2010 (P alcoholism-susceptibility genotypes, ADH1B*1/*1 and ALDH2*1/*1, modestly but significantly tended not to occur in the same individual (P = 0.026). The frequency of ADH1B*1/*1 decreased with ascending age groups. The high frequency of history of gastrectomy suggested that gastrectomy is still a risk factor for alcoholism, although the percentage decreased during the period. The alcoholism-susceptibility genotype ADH1B*1/*1 was less frequent in the gastrectomy group, suggesting a competitive gene-gastrectomy interaction for alcoholism. A gene-gene interaction and gene-age interactions regarding the ADH1B genotype were observed.

  6. Understanding Epistatic Interactions between Genes Targeted by Non-coding Regulatory Elements in Complex Diseases

    Directory of Open Access Journals (Sweden)

    Min Kyung Sung

    2014-12-01

    Full Text Available Genome-wide association studies have proven the highly polygenic architecture of complex diseases or traits; therefore, single-locus-based methods are usually unable to detect all involved loci, especially when individual loci exert small effects. Moreover, the majority of associated single-nucleotide polymorphisms resides in non-coding regions, making it difficult to understand their phenotypic contribution. In this work, we studied epistatic interactions associated with three common diseases using Korea Association Resource (KARE data: type 2 diabetes mellitus (DM, hypertension (HT, and coronary artery disease (CAD. We showed that epistatic single-nucleotide polymorphisms (SNPs were enriched in enhancers, as well as in DNase I footprints (the Encyclopedia of DNA Elements [ENCODE] Project Consortium 2012, which suggested that the disruption of the regulatory regions where transcription factors bind may be involved in the disease mechanism. Accordingly, to identify the genes affected by the SNPs, we employed whole-genome multiple-cell-type enhancer data which discovered using DNase I profiles and Cap Analysis Gene Expression (CAGE. Assigned genes were significantly enriched in known disease associated gene sets, which were explored based on the literature, suggesting that this approach is useful for detecting relevant affected genes. In our knowledge-based epistatic network, the three diseases share many associated genes and are also closely related with each other through many epistatic interactions. These findings elucidate the genetic basis of the close relationship between DM, HT, and CAD.

  7. A novel test for gene-ancestry interactions in genome-wide association data.

    Directory of Open Access Journals (Sweden)

    Joanna L Davies

    Full Text Available Genome-wide association study (GWAS data on a disease are increasingly available from multiple related populations. In this scenario, meta-analyses can improve power to detect homogeneous genetic associations, but if there exist ancestry-specific effects, via interactions on genetic background or with a causal effect that co-varies with genetic background, then these will typically be obscured. To address this issue, we have developed a robust statistical method for detecting susceptibility gene-ancestry interactions in multi-cohort GWAS based on closely-related populations. We use the leading principal components of the empirical genotype matrix to cluster individuals into "ancestry groups" and then look for evidence of heterogeneous genetic associations with disease or other trait across these clusters. Robustness is improved when there are multiple cohorts, as the signal from true gene-ancestry interactions can then be distinguished from gene-collection artefacts by comparing the observed interaction effect sizes in collection groups relative to ancestry groups. When applied to colorectal cancer, we identified a missense polymorphism in iron-absorption gene CYBRD1 that associated with disease in individuals of English, but not Scottish, ancestry. The association replicated in two additional, independently-collected data sets. Our method can be used to detect associations between genetic variants and disease that have been obscured by population genetic heterogeneity. It can be readily extended to the identification of genetic interactions on other covariates such as measured environmental exposures. We envisage our methodology being of particular interest to researchers with existing GWAS data, as ancestry groups can be easily defined and thus tested for interactions.

  8. Methylobacterium-plant interaction genes regulated by plant exudate and quorum sensing molecules

    Directory of Open Access Journals (Sweden)

    Manuella Nóbrega Dourado

    2013-12-01

    Full Text Available Bacteria from the genus Methylobacterium interact symbiotically (endophytically and epiphytically with different plant species. These interactions can promote plant growth or induce systemic resistance, increasing plant fitness. The plant colonization is guided by molecular communication between bacteria-bacteria and bacteria-plants, where the bacteria recognize specific exuded compounds by other bacteria (e.g. homoserine molecules and/or by the plant roots (e.g. flavonoids, ethanol and methanol, respectively. In this context, the aim of this study was to evaluate the effect of quorum sensing molecules (N-acyl-homoserine lactones and plant exudates (including ethanol in the expression of a series of bacterial genes involved in Methylobacterium-plant interaction. The selected genes are related to bacterial metabolism (mxaF, adaptation to stressful environment (crtI, phoU and sss, to interactions with plant metabolism compounds (acdS and pathogenicity (patatin and phoU. Under in vitro conditions, our results showed the differential expression of some important genes related to metabolism, stress and pathogenesis, thereby AHL molecules up-regulate all tested genes, except phoU, while plant exudates induce only mxaF gene expression. In the presence of plant exudates there is a lower bacterial density (due the endophytic and epiphytic colonization, which produce less AHL, leading to down regulation of genes when compared to the control. Therefore, bacterial density, more than plant exudate, influences the expression of genes related to plant-bacteria interaction.

  9. Comparative study on gene set and pathway topology-based enrichment methods.

    Science.gov (United States)

    Bayerlová, Michaela; Jung, Klaus; Kramer, Frank; Klemm, Florian; Bleckmann, Annalen; Beißbarth, Tim

    2015-10-22

    Enrichment analysis is a popular approach to identify pathways or sets of genes which are significantly enriched in the context of differentially expressed genes. The traditional gene set enrichment approach considers a pathway as a simple gene list disregarding any knowledge of gene or protein interactions. In contrast, the new group of so called pathway topology-based methods integrates the topological structure of a pathway into the analysis. We comparatively investigated gene set and pathway topology-based enrichment approaches, considering three gene set and four topological methods. These methods were compared in two extensive simulation studies and on a benchmark of 36 real datasets, providing the same pathway input data for all methods. In the benchmark data analysis both types of methods showed a comparable ability to detect enriched pathways. The first simulation study was conducted with KEGG pathways, which showed considerable gene overlaps between each other. In this study with original KEGG pathways, none of the topology-based methods outperformed the gene set approach. Therefore, a second simulation study was performed on non-overlapping pathways created by unique gene IDs. Here, methods accounting for pathway topology reached higher accuracy than the gene set methods, however their sensitivity was lower. We conducted one of the first comprehensive comparative works on evaluating gene set against pathway topology-based enrichment methods. The topological methods showed better performance in the simulation scenarios with non-overlapping pathways, however, they were not conclusively better in the other scenarios. This suggests that simple gene set approach might be sufficient to detect an enriched pathway under realistic circumstances. Nevertheless, more extensive studies and further benchmark data are needed to systematically evaluate these methods and to assess what gain and cost pathway topology information introduces into enrichment analysis. Both

  10. The Gene-Lifestyle Interaction on Leptin Sensitivity and Lipid Metabolism in Adults: A Population Based Study.

    Science.gov (United States)

    Luglio, Harry Freitag; Sulistyoningrum, Dian Caturini; Huriyati, Emy; Lee, Yi Yi; Wan Muda, Wan Abdul Manan

    2017-07-07

    Obesity has been associated with leptin resistance and this might be caused by genetic factors. The aim of this study was to investigate the gene-lifestyle interaction between -866G/A UCP2 (uncoupling protein 2) gene polymorphism, dietary intake and leptin in a population based study. This is a cross sectional study conducted in adults living at urban area of Yogyakarta, Indonesia. Data of adiposity, lifestyle, triglyceride, high density lipoprotein (HDL) cholesterol, leptin and UCP2 gene polymorphism were obtained in 380 men and female adults. UCP2 gene polymorphism was not significantly associated with adiposity, leptin, triglyceride, HDL cholesterol, dietary intake and physical activity (all p > 0.05). Leptin was lower in overweight subjects with AA + GA genotypes than those with GG genotype counterparts ( p = 0.029). In subjects with AA + GA genotypes there was a negative correlation between leptin concentration ( r = -0.324; p correlation was not seen in GG genotype ( r = -0.111; p = 0.188). In summary, we showed how genetic variation in -866G/A UCP2 affected individual response to leptin production. AA + GA genotype had a better leptin sensitivity shown by its response in dietary intake and body mass index (BMI) and this explained the protective effect of A allele to obesity.

  11. DNMT1-interacting RNAs block gene specific DNA methylation

    Science.gov (United States)

    Di Ruscio, Annalisa; Ebralidze, Alexander K.; Benoukraf, Touati; Amabile, Giovanni; Goff, Loyal A.; Terragni, Joylon; Figueroa, Maria Eugenia; De Figureido Pontes, Lorena Lobo; Alberich-Jorda, Meritxell; Zhang, Pu; Wu, Mengchu; D’Alò, Francesco; Melnick, Ari; Leone, Giuseppe; Ebralidze, Konstantin K.; Pradhan, Sriharsa; Rinn, John L.; Tenen, Daniel G.

    2013-01-01

    Summary DNA methylation was described almost a century ago. However, the rules governing its establishment and maintenance remain elusive. Here, we present data demonstrating that active transcription regulates levels of genomic methylation. We identified a novel RNA arising from the CEBPA gene locus critical in regulating the local DNA methylation profile. This RNA binds to DNMT1 and prevents CEBPA gene locus methylation. Deep sequencing of transcripts associated with DNMT1 combined with genome-scale methylation and expression profiling extended the generality of this finding to numerous gene loci. Collectively, these results delineate the nature of DNMT1-RNA interactions and suggest strategies for gene selective demethylation of therapeutic targets in disease. PMID:24107992

  12. Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease

    Directory of Open Access Journals (Sweden)

    Maria V. Fernández

    2018-04-01

    Full Text Available Gene-based tests to study the combined effect of rare variants on a particular phenotype have been widely developed for case-control studies, but their evolution and adaptation for family-based studies, especially studies of complex incomplete families, has been slower. In this study, we have performed a practical examination of all the latest gene-based methods available for family-based study designs using both simulated and real datasets. We examined the performance of several collapsing, variance-component, and transmission disequilibrium tests across eight different software packages and 22 models utilizing a cohort of 285 families (N = 1,235 with late-onset Alzheimer disease (LOAD. After a thorough examination of each of these tests, we propose a methodological approach to identify, with high confidence, genes associated with the tested phenotype and we provide recommendations to select the best software and model for family-based gene-based analyses. Additionally, in our dataset, we identified PTK2B, a GWAS candidate gene for sporadic AD, along with six novel genes (CHRD, CLCN2, HDLBP, CPAMD8, NLRP9, and MAS1L as candidate genes for familial LOAD.

  13. Prediction of the Ebola Virus Infection Related Human Genes Using Protein-Protein Interaction Network.

    Science.gov (United States)

    Cao, HuanHuan; Zhang, YuHang; Zhao, Jia; Zhu, Liucun; Wang, Yi; Li, JiaRui; Feng, Yuan-Ming; Zhang, Ning

    2017-01-01

    Ebola hemorrhagic fever (EHF) is caused by Ebola virus (EBOV). It is reported that human could be infected by EBOV with a high fatality rate. However, association factors between EBOV and host still tend to be ambiguous. According to the "guilt by association" (GBA) principle, proteins interacting with each other are very likely to function similarly or the same. Based on this assumption, we tried to obtain EBOV infection-related human genes in a protein-protein interaction network using Dijkstra algorithm. We hope it could contribute to the discovery of novel effective treatments. Finally, 15 genes were selected as potential EBOV infection-related human genes. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  14. ToxCast Data Expands Universe of Chemical-Gene Interactions (SOT)

    Science.gov (United States)

    Characterizing the effects of chemicals in biological systems is often summarized by chemical-gene interactions, which have sparse coverage in literature. The ToxCast chemical screening program has produced bioactivity data for nearly 2000 chemicals and over 450 gene targets. Thi...

  15. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool.

    Science.gov (United States)

    Chen, Edward Y; Tan, Christopher M; Kou, Yan; Duan, Qiaonan; Wang, Zichen; Meirelles, Gabriela Vaz; Clark, Neil R; Ma'ayan, Avi

    2013-04-15

    System-wide profiling of genes and proteins in mammalian cells produce lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set libraries databases have been developed, there is still room for improvement. Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of up regulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interlukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Enrichr is an easy to use intuitive enrichment analysis web-based tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.

  16. KBERG: KnowledgeBase for Estrogen Responsive Genes

    DEFF Research Database (Denmark)

    Tang, Suisheng; Zhang, Zhuo; Tan, Sin Lam

    2007-01-01

    Estrogen has a profound impact on human physiology affecting transcription of numerous genes. To decipher functional characteristics of estrogen responsive genes, we developed KnowledgeBase for Estrogen Responsive Genes (KBERG). Genes in KBERG were derived from Estrogen Responsive Gene Database...... (ERGDB) and were analyzed from multiple aspects. We explored the possible transcription regulation mechanism by capturing highly conserved promoter motifs across orthologous genes, using promoter regions that cover the range of [-1200, +500] relative to the transcription start sites. The motif detection...... is based on ab initio discovery of common cis-elements from the orthologous gene cluster from human, mouse and rat, thus reflecting a degree of promoter sequence preservation during evolution. The identified motifs are linked to transcription factor binding sites based on the TRANSFAC database. In addition...

  17. Characterization of Genes for Beef Marbling Based on Applying Gene Coexpression Network

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2014-01-01

    Full Text Available Marbling is an important trait in characterization beef quality and a major factor for determining the price of beef in the Korean beef market. In particular, marbling is a complex trait and needs a system-level approach for identifying candidate genes related to the trait. To find the candidate gene associated with marbling, we used a weighted gene coexpression network analysis from the expression value of bovine genes. Hub genes were identified; they were topologically centered with large degree and BC values in the global network. We performed gene expression analysis to detect candidate genes in M. longissimus with divergent marbling phenotype (marbling scores 2 to 7 using qRT-PCR. The results demonstrate that transmembrane protein 60 (TMEM60 and dihydropyrimidine dehydrogenase (DPYD are associated with increasing marbling fat. We suggest that the network-based approach in livestock may be an important method for analyzing the complex effects of candidate genes associated with complex traits like marbling or tenderness.

  18. Identification and comprehensive evaluation of reference genes for RT-qPCR analysis of host gene-expression in Brassica juncea-aphid interaction using microarray data.

    Science.gov (United States)

    Ram, Chet; Koramutla, Murali Krishna; Bhattacharya, Ramcharan

    2017-07-01

    Brassica juncea is a chief oil yielding crop in many parts of the world including India. With advancement of molecular techniques, RT-qPCR based study of gene-expression has become an integral part of experimentations in crop breeding. In RT-qPCR, use of appropriate reference gene(s) is pivotal. The virtue of the reference genes, being constant in expression throughout the experimental treatments, needs to be validated case by case. Appropriate reference gene(s) for normalization of gene-expression data in B. juncea during the biotic stress of aphid infestation is not known. In the present investigation, 11 reference genes identified from microarray database of Arabidopsis-aphid interaction at a cut off FDR ≤0.1, along with two known reference genes of B. juncea, were analyzed for their expression stability upon aphid infestation. These included 6 frequently used and 5 newly identified reference genes. Ranking orders of the reference genes in terms of expression stability were calculated using advanced statistical approaches such as geNorm, NormFinder, delta Ct and BestKeeper. The analysis suggested CAC, TUA and DUF179 as the most suitable reference genes. Further, normalization of the gene-expression data of STP4 and PR1 by the most and the least stable reference gene, respectively has demonstrated importance and applicability of the recommended reference genes in aphid infested samples of B. juncea. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  19. Finding gene regulatory network candidates using the gene expression knowledge base.

    Science.gov (United States)

    Venkatesan, Aravind; Tripathi, Sushil; Sanz de Galdeano, Alejandro; Blondé, Ward; Lægreid, Astrid; Mironov, Vladimir; Kuiper, Martin

    2014-12-10

    Network-based approaches for the analysis of large-scale genomics data have become well established. Biological networks provide a knowledge scaffold against which the patterns and dynamics of 'omics' data can be interpreted. The background information required for the construction of such networks is often dispersed across a multitude of knowledge bases in a variety of formats. The seamless integration of this information is one of the main challenges in bioinformatics. The Semantic Web offers powerful technologies for the assembly of integrated knowledge bases that are computationally comprehensible, thereby providing a potentially powerful resource for constructing biological networks and network-based analysis. We have developed the Gene eXpression Knowledge Base (GeXKB), a semantic web technology based resource that contains integrated knowledge about gene expression regulation. To affirm the utility of GeXKB we demonstrate how this resource can be exploited for the identification of candidate regulatory network proteins. We present four use cases that were designed from a biological perspective in order to find candidate members relevant for the gastrin hormone signaling network model. We show how a combination of specific query definitions and additional selection criteria derived from gene expression data and prior knowledge concerning candidate proteins can be used to retrieve a set of proteins that constitute valid candidates for regulatory network extensions. Semantic web technologies provide the means for processing and integrating various heterogeneous information sources. The GeXKB offers biologists such an integrated knowledge resource, allowing them to address complex biological questions pertaining to gene expression. This work illustrates how GeXKB can be used in combination with gene expression results and literature information to identify new potential candidates that may be considered for extending a gene regulatory network.

  20. Learning gene regulatory networks from gene expression data using weighted consensus

    KAUST Repository

    Fujii, Chisato; Kuwahara, Hiroyuki; Yu, Ge; Guo, Lili; Gao, Xin

    2016-01-01

    An accurate determination of the network structure of gene regulatory systems from high-throughput gene expression data is an essential yet challenging step in studying how the expression of endogenous genes is controlled through a complex interaction of gene products and DNA. While numerous methods have been proposed to infer the structure of gene regulatory networks, none of them seem to work consistently over different data sets with high accuracy. A recent study to compare gene network inference methods showed that an average-ranking-based consensus method consistently performs well under various settings. Here, we propose a linear programming-based consensus method for the inference of gene regulatory networks. Unlike the average-ranking-based one, which treats the contribution of each individual method equally, our new consensus method assigns a weight to each method based on its credibility. As a case study, we applied the proposed consensus method on synthetic and real microarray data sets, and compared its performance to that of the average-ranking-based consensus and individual inference methods. Our results show that our weighted consensus method achieves superior performance over the unweighted one, suggesting that assigning weights to different individual methods rather than giving them equal weights improves the accuracy. © 2016 Elsevier B.V.

  1. Learning gene regulatory networks from gene expression data using weighted consensus

    KAUST Repository

    Fujii, Chisato

    2016-08-25

    An accurate determination of the network structure of gene regulatory systems from high-throughput gene expression data is an essential yet challenging step in studying how the expression of endogenous genes is controlled through a complex interaction of gene products and DNA. While numerous methods have been proposed to infer the structure of gene regulatory networks, none of them seem to work consistently over different data sets with high accuracy. A recent study to compare gene network inference methods showed that an average-ranking-based consensus method consistently performs well under various settings. Here, we propose a linear programming-based consensus method for the inference of gene regulatory networks. Unlike the average-ranking-based one, which treats the contribution of each individual method equally, our new consensus method assigns a weight to each method based on its credibility. As a case study, we applied the proposed consensus method on synthetic and real microarray data sets, and compared its performance to that of the average-ranking-based consensus and individual inference methods. Our results show that our weighted consensus method achieves superior performance over the unweighted one, suggesting that assigning weights to different individual methods rather than giving them equal weights improves the accuracy. © 2016 Elsevier B.V.

  2. Using the Pathogen-Host Interactions database (PHI-base to investigate plant pathogen genomes and genes implicated in virulence

    Directory of Open Access Journals (Sweden)

    Martin eUrban

    2015-08-01

    Full Text Available New pathogen-host interaction mechanisms can be revealed by integrating mutant phenotype data with genetic information. PHI-base is a multi-species manually curated database combining peer-reviewed published phenotype data from plant and animal pathogens and gene/protein information in a single database.

  3. Gene-Environment Interactions in the Development of Complex Disease Phenotypes

    Directory of Open Access Journals (Sweden)

    Kenneth Olden

    2008-03-01

    Full Text Available The lack of knowledge about the earliest events in disease development is due to the multi-factorial nature of disease risk. This information gap is the consequence of the lack of appreciation for the fact that most diseases arise from the complex interactions between genes and the environment as a function of the age or stage of development of the individual. Whether an environmental exposure causes illness or not is dependent on the efficiency of the so-called “environmental response machinery” (i.e., the complex of metabolic pathways that can modulate response to environmental perturbations that one has inherited. Thus, elucidating the causes of most chronic diseases will require an understanding of both the genetic and environmental contribution to their etiology. Unfortunately, the exploration of the relationship between genes and the environment has been hampered in the past by the limited knowledge of the human genome, and by the inclination of scientists to study disease development using experimental models that consider exposure to a single environmental agent. Rarely in the past were interactions between multiple genes or between genes and environmental agents considered in studies of human disease etiology. The most critical issue is how to relate exposure-disease association studies to pathways and mechanisms. To understand how genes and environmental factors interact to perturb biological pathways to cause injury or disease, scientists will need tools with the capacity to monitor the global expression of thousands of genes, proteins and metabolites simultaneously. The generation of such data in multiple species can be used to identify conserved and functionally significant genes and pathways involved in geneenvironment interactions. Ultimately, it is this knowledge that will be used to guide agencies such as the U.S. Department of Health and Human Services in decisions regarding biomedical research funding

  4. Assessment of Multifactor Gene-Environment Interactions and Ovarian Cancer Risk

    DEFF Research Database (Denmark)

    Usset, Joseph L; Raghavan, Rama; Tyrer, Jonathan P

    2016-01-01

    and non-obese women. METHODS: We considered interactions between 11,441 SNPs within 80 candidate genes related to hormone biosynthesis and metabolism and insulin-like growth factors with six hormone-related factors (oral contraceptive use, parity, endometriosis, tubal ligation, hormone replacement therapy...... Future work is needed to develop powerful statistical methods able to detect these complex interactions. IMPACT: Assessment of multifactor interaction is feasible, and, here, suggests that the relationship between genetic variants within candidate genes and hormone-related risk factors may vary EOC...

  5. LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes.

    Science.gov (United States)

    Wang, Dapeng; Zhang, Yubin; Fan, Zhonghua; Liu, Guiming; Yu, Jun

    2012-01-01

    Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database-LCGbase (a comprehensive database for lineage-based co-regulated genes)-hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warm-blooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene

  6. Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.

    Science.gov (United States)

    Speiser, Daniel I; Pankey, M Sabrina; Zaharoff, Alexander K; Battelle, Barbara A; Bracken-Grissom, Heather D; Breinholt, Jesse W; Bybee, Seth M; Cronin, Thomas W; Garm, Anders; Lindgren, Annie R; Patel, Nipam H; Porter, Megan L; Protas, Meredith E; Rivera, Ajna S; Serb, Jeanne M; Zigler, Kirk S; Crandall, Keith A; Oakley, Todd H

    2014-11-19

    Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families. We generated maximum likelihood trees for 109 genes from a Light Interaction Toolkit (LIT), a collection of genes that underlie the function or development of light-interacting structures in metazoans. To do so, we searched protein sequences predicted from 29 fully-sequenced genomes and built trees using tools for phylogenetic analysis in the Osiris package of Galaxy (an open-source workflow management system). Next, to rapidly annotate transcriptomes from organisms that lack sequenced genomes, we repurposed a maximum likelihood-based Evolutionary Placement Algorithm (implemented in RAxML) to place sequences of potential LIT genes on to our pre-calculated gene trees. Finally, we implemented PIA in Galaxy and used it to search for LIT genes in 28 newly-sequenced transcriptomes from the light-interacting tissues of a range of cephalopod mollusks, arthropods, and cubozoan cnidarians. Our new trees for LIT genes are available on the Bitbucket public repository ( http://bitbucket.org/osiris_phylogenetics/pia/ ) and we demonstrate PIA on a publicly-accessible web server ( http://galaxy-dev.cnsi.ucsb.edu/pia/ ). Our new

  7. Candidate genes and pathogenesis investigation for sepsis-related acute respiratory distress syndrome based on gene expression profile.

    Science.gov (United States)

    Wang, Min; Yan, Jingjun; He, Xingxing; Zhong, Qiang; Zhan, Chengye; Li, Shusheng

    2016-04-18

    Acute respiratory distress syndrome (ARDS) is a potentially devastating form of acute inflammatory lung injury as well as a major cause of acute respiratory failure. Although researchers have made significant progresses in elucidating the pathophysiology of this complex syndrome over the years, the absence of a universal detail disease mechanism up until now has led to a series of practical problems for a definitive treatment. This study aimed to predict some genes or pathways associated with sepsis-related ARDS based on a public microarray dataset and to further explore the molecular mechanism of ARDS. A total of 122 up-regulated DEGs and 91 down-regulated differentially expressed genes (DEGs) were obtained. The up- and down-regulated DEGs were mainly involved in functions like mitotic cell cycle and pathway like cell cycle. Protein-protein interaction network of ARDS analysis revealed 20 hub genes including cyclin B1 (CCNB1), cyclin B2 (CCNB2) and topoisomerase II alpha (TOP2A). A total of seven transcription factors including forkhead box protein M1 (FOXM1) and 30 target genes were revealed in the transcription factor-target gene regulation network. Furthermore, co-cited genes including CCNB2-CCNB1 were revealed in literature mining for the relations ARDS related genes. Pathways like mitotic cell cycle were closed related with the development of ARDS. Genes including CCNB1, CCNB2 and TOP2A, as well as transcription factors like FOXM1 might be used as the novel gene therapy targets for sepsis related ARDS.

  8. Link-based quantitative methods to identify differentially coexpressed genes and gene Pairs

    Directory of Open Access Journals (Sweden)

    Ye Zhi-Qiang

    2011-08-01

    Full Text Available Abstract Background Differential coexpression analysis (DCEA is increasingly used for investigating the global transcriptional mechanisms underlying phenotypic changes. Current DCEA methods mostly adopt a gene connectivity-based strategy to estimate differential coexpression, which is characterized by comparing the numbers of gene neighbors in different coexpression networks. Although it simplifies the calculation, this strategy mixes up the identities of different coexpression neighbors of a gene, and fails to differentiate significant differential coexpression changes from those trivial ones. Especially, the correlation-reversal is easily missed although it probably indicates remarkable biological significance. Results We developed two link-based quantitative methods, DCp and DCe, to identify differentially coexpressed genes and gene pairs (links. Bearing the uniqueness of exploiting the quantitative coexpression change of each gene pair in the coexpression networks, both methods proved to be superior to currently popular methods in simulation studies. Re-mining of a publicly available type 2 diabetes (T2D expression dataset from the perspective of differential coexpression analysis led to additional discoveries than those from differential expression analysis. Conclusions This work pointed out the critical weakness of current popular DCEA methods, and proposed two link-based DCEA algorithms that will make contribution to the development of DCEA and help extend it to a broader spectrum.

  9. Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks.

    Science.gov (United States)

    Deeter, Anthony; Dalman, Mark; Haddad, Joseph; Duan, Zhong-Hui

    2017-01-01

    The PubMed database offers an extensive set of publication data that can be useful, yet inherently complex to use without automated computational techniques. Data repositories such as the Genomic Data Commons (GDC) and the Gene Expression Omnibus (GEO) offer experimental data storage and retrieval as well as curated gene expression profiles. Genetic interaction databases, including Reactome and Ingenuity Pathway Analysis, offer pathway and experiment data analysis using data curated from these publications and data repositories. We have created a method to generate and analyze consensus networks, inferring potential gene interactions, using large numbers of Bayesian networks generated by data mining publications in the PubMed database. Through the concept of network resolution, these consensus networks can be tailored to represent possible genetic interactions. We designed a set of experiments to confirm that our method is stable across variation in both sample and topological input sizes. Using gene product interactions from the KEGG pathway database and data mining PubMed publication abstracts, we verify that regardless of the network resolution or the inferred consensus network, our method is capable of inferring meaningful gene interactions through consensus Bayesian network generation with multiple, randomized topological orderings. Our method can not only confirm the existence of currently accepted interactions, but has the potential to hypothesize new ones as well. We show our method confirms the existence of known gene interactions such as JAK-STAT-PI3K-AKT-mTOR, infers novel gene interactions such as RAS- Bcl-2 and RAS-AKT, and found significant pathway-pathway interactions between the JAK-STAT signaling and Cardiac Muscle Contraction KEGG pathways.

  10. Pharmacogenomics of Hypertension and Preeclampsia: Focus on Gene–Gene Interactions

    Directory of Open Access Journals (Sweden)

    Marcelo R. Luizon

    2018-02-01

    Full Text Available Hypertension is a leading cause of cardiovascular mortality, but only about half of patients on antihypertensive therapy achieve blood pressure control. Preeclampsia is defined as pregnancy-induced hypertension and proteinuria, and is associated with increased maternal and perinatal mortality and morbidity. Similarly, a large number of patients with preeclampsia are non-responsive to antihypertensive therapy. Pharmacogenomics may help to guide the personalized treatment for non-responsive hypertensive patients. There is evidence for the association of genetic variants with variable response to the most commonly used antihypertensive drugs. However, further replication is needed to confirm these associations in different populations. The failure to replicate findings from single-locus association studies has prompted the search for novel statistical methods for data analysis, which are required to detect the complex effects from multiple genes to drug response phenotypes. Notably, gene–gene interaction analyses have been applied to pharmacogenetic studies, including antihypertensive drug response. In this perspective article, we present advances of considering the interactions among genetic polymorphisms of different candidate genes within pathways relevant to antihypertensive drug response, and we highlight recent findings related to gene–gene interactions on pharmacogenetics of hypertension and preeclampsia. Finally, we discuss the future directions that are needed to unravel additional genes and variants involved in the responsiveness to antihypertensive drugs.

  11. Screening key genes for abdominal aortic aneurysm based on gene expression omnibus dataset.

    Science.gov (United States)

    Wan, Li; Huang, Jingyong; Ni, Haizhen; Yu, Guanfeng

    2018-02-13

    Abdominal aortic aneurysm (AAA) is a common cardiovascular system disease with high mortality. The aim of this study was to identify potential genes for diagnosis and therapy in AAA. We searched and downloaded mRNA expression data from the Gene Expression Omnibus (GEO) database to identify differentially expressed genes (DEGs) from AAA and normal individuals. Then, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway analysis, transcriptional factors (TFs) network and protein-protein interaction (PPI) network were used to explore the function of genes. Additionally, immunohistochemical (IHC) staining was used to validate the expression of identified genes. Finally, the diagnostic value of identified genes was accessed by receiver operating characteristic (ROC) analysis in GEO database. A total of 1199 DEGs (188 up-regulated and 1011 down-regulated) were identified between AAA and normal individual. KEGG pathway analysis displayed that vascular smooth muscle contraction and pathways in cancer were significantly enriched signal pathway. The top 10 up-regulated and top 10 down-regulated DEGs were used to construct TFs and PPI networks. Some genes with high degrees such as NELL2, CCR7, MGAM, HBB, CSNK2A2, ZBTB16 and FOXO1 were identified to be related to AAA. The consequences of IHC staining showed that CCR7 and PDGFA were up-regulated in tissue samples of AAA. ROC analysis showed that NELL2, CCR7, MGAM, HBB, CSNK2A2, ZBTB16, FOXO1 and PDGFA had the potential diagnostic value for AAA. The identified genes including NELL2, CCR7, MGAM, HBB, CSNK2A2, ZBTB16, FOXO1 and PDGFA might be involved in the pathology of AAA.

  12. PCR-based detection of gene transfer vectors: application to gene doping surveillance.

    Science.gov (United States)

    Perez, Irene C; Le Guiner, Caroline; Ni, Weiyi; Lyles, Jennifer; Moullier, Philippe; Snyder, Richard O

    2013-12-01

    Athletes who illicitly use drugs to enhance their athletic performance are at risk of being banned from sports competitions. Consequently, some athletes may seek new doping methods that they expect to be capable of circumventing detection. With advances in gene transfer vector design and therapeutic gene transfer, and demonstrations of safety and therapeutic benefit in humans, there is an increased probability of the pursuit of gene doping by athletes. In anticipation of the potential for gene doping, assays have been established to directly detect complementary DNA of genes that are top candidates for use in doping, as well as vector control elements. The development of molecular assays that are capable of exposing gene doping in sports can serve as a deterrent and may also identify athletes who have illicitly used gene transfer for performance enhancement. PCR-based methods to detect foreign DNA with high reliability, sensitivity, and specificity include TaqMan real-time PCR, nested PCR, and internal threshold control PCR.

  13. C-State: an interactive web app for simultaneous multi-gene visualization and comparative epigenetic pattern search.

    Science.gov (United States)

    Sowpati, Divya Tej; Srivastava, Surabhi; Dhawan, Jyotsna; Mishra, Rakesh K

    2017-09-13

    Comparative epigenomic analysis across multiple genes presents a bottleneck for bench biologists working with NGS data. Despite the development of standardized peak analysis algorithms, the identification of novel epigenetic patterns and their visualization across gene subsets remains a challenge. We developed a fast and interactive web app, C-State (Chromatin-State), to query and plot chromatin landscapes across multiple loci and cell types. C-State has an interactive, JavaScript-based graphical user interface and runs locally in modern web browsers that are pre-installed on all computers, thus eliminating the need for cumbersome data transfer, pre-processing and prior programming knowledge. C-State is unique in its ability to extract and analyze multi-gene epigenetic information. It allows for powerful GUI-based pattern searching and visualization. We include a case study to demonstrate its potential for identifying user-defined epigenetic trends in context of gene expression profiles.

  14. The Gene-Lifestyle Interaction on Leptin Sensitivity and Lipid Metabolism in Adults: A Population Based Study

    Directory of Open Access Journals (Sweden)

    Harry Freitag Luglio

    2017-07-01

    Full Text Available Background: Obesity has been associated with leptin resistance and this might be caused by genetic factors. The aim of this study was to investigate the gene-lifestyle interaction between −866G/A UCP2 (uncoupling protein 2 gene polymorphism, dietary intake and leptin in a population based study. Methods: This is a cross sectional study conducted in adults living at urban area of Yogyakarta, Indonesia. Data of adiposity, lifestyle, triglyceride, high density lipoprotein (HDL cholesterol, leptin and UCP2 gene polymorphism were obtained in 380 men and female adults. Results: UCP2 gene polymorphism was not significantly associated with adiposity, leptin, triglyceride, HDL cholesterol, dietary intake and physical activity (all p > 0.05. Leptin was lower in overweight subjects with AA + GA genotypes than those with GG genotype counterparts (p = 0.029. In subjects with AA + GA genotypes there was a negative correlation between leptin concentration (r = −0.324; p < 0.0001 and total energy intake and this correlation was not seen in GG genotype (r = −0.111; p = 0.188. Conclusions: In summary, we showed how genetic variation in −866G/A UCP2 affected individual response to leptin production. AA + GA genotype had a better leptin sensitivity shown by its response in dietary intake and body mass index (BMI and this explained the protective effect of A allele to obesity.

  15. The Interaction of TXNIP and AFq1 Genes Increases the Susceptibility of Schizophrenia.

    Science.gov (United States)

    Su, Yousong; Ding, Wenhua; Xing, Mengjuan; Qi, Dake; Li, Zezhi; Cui, Donghong

    2017-08-01

    Although previous studies showed the reduced risk of cancer in patients with schizophrenia, whether patients with schizophrenia possess genetic factors that also contribute to tumor suppressor is still unknown. In the present study, based on our previous microarray data, we focused on the tumor suppressor genes TXNIP and AF1q, which differentially expressed in patients with schizophrenia. A total of 413 patients and 578 healthy controls were recruited. We found no significant differences in genotype, allele, or haplotype frequencies at the selected five single nucleotide polymorphisms (SNPs) (rs2236566 and rs7211 in TXNIP gene; rs10749659, rs2140709, and rs3738481 in AF1q gene) between patients with schizophrenia and controls. However, we found the association between the interaction of TXNIP and AF1q with schizophrenia by using the MDR method followed by traditional statistical analysis. The best gene-gene interaction model identified was a three-locus model TXNIP (rs2236566, rs7211)-AF1q (rs2140709). After traditional statistical analysis, we found the high-risk genotype combination was rs2236566 (GG)-rs7211(CC)-rs2140709(CC) (OR = 1.35 [1.03-1.76]). The low-risk genotype combination was rs2236566 (GT)-rs7211(CC)-rs2140709(CC) (OR = 0.67 [0.49-0.91]). Our finding suggested statistically significant role of interaction of TXNIP and AF1q polymorphisms (TXNIP-rs2236566, TXNIP-rs7211, and AF1q-rs2769605) in schizophrenia susceptibility.

  16. Genome-wide diet-gene interaction analyses for risk of colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Jane C Figueiredo

    2014-04-01

    Full Text Available Dietary factors, including meat, fruits, vegetables and fiber, are associated with colorectal cancer; however, there is limited information as to whether these dietary factors interact with genetic variants to modify risk of colorectal cancer. We tested interactions between these dietary factors and approximately 2.7 million genetic variants for colorectal cancer risk among 9,287 cases and 9,117 controls from ten studies. We used logistic regression to investigate multiplicative gene-diet interactions, as well as our recently developed Cocktail method that involves a screening step based on marginal associations and gene-diet correlations and a testing step for multiplicative interactions, while correcting for multiple testing using weighted hypothesis testing. Per quartile increment in the intake of red and processed meat were associated with statistically significant increased risks of colorectal cancer and vegetable, fruit and fiber intake with lower risks. From the case-control analysis, we detected a significant interaction between rs4143094 (10p14/near GATA3 and processed meat consumption (OR = 1.17; p = 8.7E-09, which was consistently observed across studies (p heterogeneity = 0.78. The risk of colorectal cancer associated with processed meat was increased among individuals with the rs4143094-TG and -TT genotypes (OR = 1.20 and OR = 1.39, respectively and null among those with the GG genotype (OR = 1.03. Our results identify a novel gene-diet interaction with processed meat for colorectal cancer, highlighting that diet may modify the effect of genetic variants on disease risk, which may have important implications for prevention.

  17. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    Science.gov (United States)

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. Protein-protein interaction inference based on semantic similarity of Gene Ontology terms.

    Science.gov (United States)

    Zhang, Shu-Bo; Tang, Qiang-Rong

    2016-07-21

    Identifying protein-protein interactions is important in molecular biology. Experimental methods to this issue have their limitations, and computational approaches have attracted more and more attentions from the biological community. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most powerful indicators for protein interaction. However, conventional methods based on GO similarity fail to take advantage of the specificity of GO terms in the ontology graph. We proposed a GO-based method to predict protein-protein interaction by integrating different kinds of similarity measures derived from the intrinsic structure of GO graph. We extended five existing methods to derive the semantic similarity measures from the descending part of two GO terms in the GO graph, then adopted a feature integration strategy to combines both the ascending and the descending similarity scores derived from the three sub-ontologies to construct various kinds of features to characterize each protein pair. Support vector machines (SVM) were employed as discriminate classifiers, and five-fold cross validation experiments were conducted on both human and yeast protein-protein interaction datasets to evaluate the performance of different kinds of integrated features, the experimental results suggest the best performance of the feature that combines information from both the ascending and the descending parts of the three ontologies. Our method is appealing for effective prediction of protein-protein interaction. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Gene × Environment Interactions in Schizophrenia: Evidence from Genetic Mouse Models.

    Science.gov (United States)

    Moran, Paula; Stokes, Jennifer; Marr, Julia; Bock, Gavin; Desbonnet, Lieve; Waddington, John; O'Tuathaigh, Colm

    2016-01-01

    The study of gene × environment, as well as epistatic interactions in schizophrenia, has provided important insight into the complex etiopathologic basis of schizophrenia. It has also increased our understanding of the role of susceptibility genes in the disorder and is an important consideration as we seek to translate genetic advances into novel antipsychotic treatment targets. This review summarises data arising from research involving the modelling of gene × environment interactions in schizophrenia using preclinical genetic models. Evidence for synergistic effects on the expression of schizophrenia-relevant endophenotypes will be discussed. It is proposed that valid and multifactorial preclinical models are important tools for identifying critical areas, as well as underlying mechanisms, of convergence of genetic and environmental risk factors, and their interaction in schizophrenia.

  20. Gene × Environment Interactions in Schizophrenia: Evidence from Genetic Mouse Models

    Science.gov (United States)

    Marr, Julia; Bock, Gavin; Desbonnet, Lieve; Waddington, John

    2016-01-01

    The study of gene × environment, as well as epistatic interactions in schizophrenia, has provided important insight into the complex etiopathologic basis of schizophrenia. It has also increased our understanding of the role of susceptibility genes in the disorder and is an important consideration as we seek to translate genetic advances into novel antipsychotic treatment targets. This review summarises data arising from research involving the modelling of gene × environment interactions in schizophrenia using preclinical genetic models. Evidence for synergistic effects on the expression of schizophrenia-relevant endophenotypes will be discussed. It is proposed that valid and multifactorial preclinical models are important tools for identifying critical areas, as well as underlying mechanisms, of convergence of genetic and environmental risk factors, and their interaction in schizophrenia. PMID:27725886

  1. Expression-based clustering of CAZyme-encoding genes of Aspergillus niger.

    Science.gov (United States)

    Gruben, Birgit S; Mäkelä, Miia R; Kowalczyk, Joanna E; Zhou, Miaomiao; Benoit-Gelber, Isabelle; De Vries, Ronald P

    2017-11-23

    The Aspergillus niger genome contains a large repertoire of genes encoding carbohydrate active enzymes (CAZymes) that are targeted to plant polysaccharide degradation enabling A. niger to grow on a wide range of plant biomass substrates. Which genes need to be activated in certain environmental conditions depends on the composition of the available substrate. Previous studies have demonstrated the involvement of a number of transcriptional regulators in plant biomass degradation and have identified sets of target genes for each regulator. In this study, a broad transcriptional analysis was performed of the A. niger genes encoding (putative) plant polysaccharide degrading enzymes. Microarray data focusing on the initial response of A. niger to the presence of plant biomass related carbon sources were analyzed of a wild-type strain N402 that was grown on a large range of carbon sources and of the regulatory mutant strains ΔxlnR, ΔaraR, ΔamyR, ΔrhaR and ΔgalX that were grown on their specific inducing compounds. The cluster analysis of the expression data revealed several groups of co-regulated genes, which goes beyond the traditionally described co-regulated gene sets. Additional putative target genes of the selected regulators were identified, based on their expression profile. Notably, in several cases the expression profile puts questions on the function assignment of uncharacterized genes that was based on homology searches, highlighting the need for more extensive biochemical studies into the substrate specificity of enzymes encoded by these non-characterized genes. The data also revealed sets of genes that were upregulated in the regulatory mutants, suggesting interaction between the regulatory systems and a therefore even more complex overall regulatory network than has been reported so far. Expression profiling on a large number of substrates provides better insight in the complex regulatory systems that drive the conversion of plant biomass by fungi. In

  2. MINER: exploratory analysis of gene interaction networks by machine learning from expression data

    Directory of Open Access Journals (Sweden)

    Sivieng Jane

    2009-12-01

    Full Text Available Abstract Background The reconstruction of gene regulatory networks from high-throughput "omics" data has become a major goal in the modelling of living systems. Numerous approaches have been proposed, most of which attempt only "one-shot" reconstruction of the whole network with no intervention from the user, or offer only simple correlation analysis to infer gene dependencies. Results We have developed MINER (Microarray Interactive Network Exploration and Representation, an application that combines multivariate non-linear tree learning of individual gene regulatory dependencies, visualisation of these dependencies as both trees and networks, and representation of known biological relationships based on common Gene Ontology annotations. MINER allows biologists to explore the dependencies influencing the expression of individual genes in a gene expression data set in the form of decision, model or regression trees, using their domain knowledge to guide the exploration and formulate hypotheses. Multiple trees can then be summarised in the form of a gene network diagram. MINER is being adopted by several of our collaborators and has already led to the discovery of a new significant regulatory relationship with subsequent experimental validation. Conclusion Unlike most gene regulatory network inference methods, MINER allows the user to start from genes of interest and build the network gene-by-gene, incorporating domain expertise in the process. This approach has been used successfully with RNA microarray data but is applicable to other quantitative data produced by high-throughput technologies such as proteomics and "next generation" DNA sequencing.

  3. Gene-environment interaction: Does fluoride influence the reproductive hormones in male farmers modified by ERα gene polymorphisms?

    Science.gov (United States)

    Ma, Qiang; Huang, Hui; Sun, Long; Zhou, Tong; Zhu, Jingyuan; Cheng, Xuemin; Duan, Lijv; Li, Zhiyuan; Cui, Liuxin; Ba, Yue

    2017-12-01

    The occurrence of endemic fluorosis is derived from high fluoride levels in drinking water and industrial fumes or dust. Reproductive disruption is also a major harm caused by fluoride exposure besides dental and skeletal lesions. However, few studies focus on the mechanism of fluoride exposure on male reproductive function, especially the possible interaction of fluoride exposure and gene polymorphism on male reproductive hormones. Therefore, we conducted a cross-sectional study in rural areas of Henan province in China to explore the interaction between the estrogen receptor alpha (ERα) gene and fluoride exposure on reproductive hormone levels in male farmers living in the endemic fluorosis villages. The results showed that fluoride exposure significantly increased the serum level of estradiol in the hypothalamic-pituitary-testicular (HPT) axis in male farmers. Moreover, the observations indicated that fluoride exposure and genetic markers had an interaction on serum concentration of follicle-stimulating hormone and estradiol, and the interaction among different loci of the ERα gene could impact the serum testosterone level. Findings in the present work suggest that chronic fluoride exposure in drinking water could modulate the levels of reproductive hormones in males living in endemic fluorosis areas, and the interaction between fluoride exposure and ERα polymorphisms might affect the serum levels of hormones in the HPT axis in male farmers. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. An environmental analysis of genes associated with schizophrenia: hypoxia and vascular factors as interacting elements in the neurodevelopmental model.

    Science.gov (United States)

    Schmidt-Kastner, R; van Os, J; Esquivel, G; Steinbusch, H W M; Rutten, B P F

    2012-12-01

    Investigating and understanding gene-environment interaction (G × E) in a neurodevelopmentally and biologically plausible manner is a major challenge for schizophrenia research. Hypoxia during neurodevelopment is one of several environmental factors related to the risk of schizophrenia, and links between schizophrenia candidate genes and hypoxia regulation or vascular expression have been proposed. Given the availability of a wealth of complex genetic information on schizophrenia in the literature without knowledge on the connections to environmental factors, we now systematically collected genes from candidate studies (using SzGene), genome-wide association studies (GWAS) and copy number variation (CNV) analyses, and then applied four criteria to test for a (theoretical) link to ischemia-hypoxia and/or vascular factors. In all, 55% of the schizophrenia candidate genes (n=42 genes) met the criteria for a link to ischemia-hypoxia and/or vascular factors. Genes associated with schizophrenia showed a significant, threefold enrichment among genes that were derived from microarray studies of the ischemia-hypoxia response (IHR) in the brain. Thus, the finding of a considerable match between genes associated with the risk of schizophrenia and IHR and/or vascular factors is reproducible. An additional survey of genes identified by GWAS and CNV analyses suggested novel genes that match the criteria. Findings for interactions between specific variants of genes proposed to be IHR and/or vascular factors with obstetric complications in patients with schizophrenia have been reported in the literature. Therefore, the extended gene set defined here may form a reasonable and evidence-based starting point for hypothesis-based testing of G × E interactions in clinical genetic and translational neuroscience studies.

  5. Measured Gene-by-Environment Interaction in Relation to Attention-Deficit/Hyperactivity Disorder

    Science.gov (United States)

    Nigg, Joel; Nikolas, Molly; Burt, S. Alexandra

    2010-01-01

    Objective: To summarize and evaluate the state of knowledge regarding the role of measured gene-by-environment interactions in relation to attention-deficit/hyperactivity disorder. Method: A selective review of methodologic issues was followed by a systematic search for relevant articles on measured gene-by-environment interactions; the search…

  6. Discovery of time-delayed gene regulatory networks based on temporal gene expression profiling

    Directory of Open Access Journals (Sweden)

    Guo Zheng

    2006-01-01

    Full Text Available Abstract Background It is one of the ultimate goals for modern biological research to fully elucidate the intricate interplays and the regulations of the molecular determinants that propel and characterize the progression of versatile life phenomena, to name a few, cell cycling, developmental biology, aging, and the progressive and recurrent pathogenesis of complex diseases. The vast amount of large-scale and genome-wide time-resolved data is becoming increasing available, which provides the golden opportunity to unravel the challenging reverse-engineering problem of time-delayed gene regulatory networks. Results In particular, this methodological paper aims to reconstruct regulatory networks from temporal gene expression data by using delayed correlations between genes, i.e., pairwise overlaps of expression levels shifted in time relative each other. We have thus developed a novel model-free computational toolbox termed TdGRN (Time-delayed Gene Regulatory Network to address the underlying regulations of genes that can span any unit(s of time intervals. This bioinformatics toolbox has provided a unified approach to uncovering time trends of gene regulations through decision analysis of the newly designed time-delayed gene expression matrix. We have applied the proposed method to yeast cell cycling and human HeLa cell cycling and have discovered most of the underlying time-delayed regulations that are supported by multiple lines of experimental evidence and that are remarkably consistent with the current knowledge on phase characteristics for the cell cyclings. Conclusion We established a usable and powerful model-free approach to dissecting high-order dynamic trends of gene-gene interactions. We have carefully validated the proposed algorithm by applying it to two publicly available cell cycling datasets. In addition to uncovering the time trends of gene regulations for cell cycling, this unified approach can also be used to study the complex

  7. Network-Based Method for Identifying Co- Regeneration Genes in Bone, Dentin, Nerve and Vessel Tissues.

    Science.gov (United States)

    Chen, Lei; Pan, Hongying; Zhang, Yu-Hang; Feng, Kaiyan; Kong, XiangYin; Huang, Tao; Cai, Yu-Dong

    2017-10-02

    Bone and dental diseases are serious public health problems. Most current clinical treatments for these diseases can produce side effects. Regeneration is a promising therapy for bone and dental diseases, yielding natural tissue recovery with few side effects. Because soft tissues inside the bone and dentin are densely populated with nerves and vessels, the study of bone and dentin regeneration should also consider the co-regeneration of nerves and vessels. In this study, a network-based method to identify co-regeneration genes for bone, dentin, nerve and vessel was constructed based on an extensive network of protein-protein interactions. Three procedures were applied in the network-based method. The first procedure, searching, sought the shortest paths connecting regeneration genes of one tissue type with regeneration genes of other tissues, thereby extracting possible co-regeneration genes. The second procedure, testing, employed a permutation test to evaluate whether possible genes were false discoveries; these genes were excluded by the testing procedure. The last procedure, screening, employed two rules, the betweenness ratio rule and interaction score rule, to select the most essential genes. A total of seventeen genes were inferred by the method, which were deemed to contribute to co-regeneration of at least two tissues. All these seventeen genes were extensively discussed to validate the utility of the method.

  8. Scuba: scalable kernel-based gene prioritization.

    Science.gov (United States)

    Zampieri, Guido; Tran, Dinh Van; Donini, Michele; Navarin, Nicolò; Aiolli, Fabio; Sperduti, Alessandro; Valle, Giorgio

    2018-01-25

    The uncovering of genes linked to human diseases is a pressing challenge in molecular biology and precision medicine. This task is often hindered by the large number of candidate genes and by the heterogeneity of the available information. Computational methods for the prioritization of candidate genes can help to cope with these problems. In particular, kernel-based methods are a powerful resource for the integration of heterogeneous biological knowledge, however, their practical implementation is often precluded by their limited scalability. We propose Scuba, a scalable kernel-based method for gene prioritization. It implements a novel multiple kernel learning approach, based on a semi-supervised perspective and on the optimization of the margin distribution. Scuba is optimized to cope with strongly unbalanced settings where known disease genes are few and large scale predictions are required. Importantly, it is able to efficiently deal both with a large amount of candidate genes and with an arbitrary number of data sources. As a direct consequence of scalability, Scuba integrates also a new efficient strategy to select optimal kernel parameters for each data source. We performed cross-validation experiments and simulated a realistic usage setting, showing that Scuba outperforms a wide range of state-of-the-art methods. Scuba achieves state-of-the-art performance and has enhanced scalability compared to existing kernel-based approaches for genomic data. This method can be useful to prioritize candidate genes, particularly when their number is large or when input data is highly heterogeneous. The code is freely available at https://github.com/gzampieri/Scuba .

  9. An Efficient Test for Gene-Environment Interaction in Generalized Linear Mixed Models with Family Data.

    Science.gov (United States)

    Mazo Lopera, Mauricio A; Coombes, Brandon J; de Andrade, Mariza

    2017-09-27

    Gene-environment (GE) interaction has important implications in the etiology of complex diseases that are caused by a combination of genetic factors and environment variables. Several authors have developed GE analysis in the context of independent subjects or longitudinal data using a gene-set. In this paper, we propose to analyze GE interaction for discrete and continuous phenotypes in family studies by incorporating the relatedness among the relatives for each family into a generalized linear mixed model (GLMM) and by using a gene-based variance component test. In addition, we deal with collinearity problems arising from linkage disequilibrium among single nucleotide polymorphisms (SNPs) by considering their coefficients as random effects under the null model estimation. We show that the best linear unbiased predictor (BLUP) of such random effects in the GLMM is equivalent to the ridge regression estimator. This equivalence provides a simple method to estimate the ridge penalty parameter in comparison to other computationally-demanding estimation approaches based on cross-validation schemes. We evaluated the proposed test using simulation studies and applied it to real data from the Baependi Heart Study consisting of 76 families. Using our approach, we identified an interaction between BMI and the Peroxisome Proliferator Activated Receptor Gamma ( PPARG ) gene associated with diabetes.

  10. Gene × Environment Interactions in Schizophrenia: Evidence from Genetic Mouse Models

    Directory of Open Access Journals (Sweden)

    Paula Moran

    2016-01-01

    Full Text Available The study of gene × environment, as well as epistatic interactions in schizophrenia, has provided important insight into the complex etiopathologic basis of schizophrenia. It has also increased our understanding of the role of susceptibility genes in the disorder and is an important consideration as we seek to translate genetic advances into novel antipsychotic treatment targets. This review summarises data arising from research involving the modelling of gene × environment interactions in schizophrenia using preclinical genetic models. Evidence for synergistic effects on the expression of schizophrenia-relevant endophenotypes will be discussed. It is proposed that valid and multifactorial preclinical models are important tools for identifying critical areas, as well as underlying mechanisms, of convergence of genetic and environmental risk factors, and their interaction in schizophrenia.

  11. Constructing an integrated gene similarity network for the identification of disease genes.

    Science.gov (United States)

    Tian, Zhen; Guo, Maozu; Wang, Chunyu; Xing, LinLin; Wang, Lei; Zhang, Yin

    2017-09-20

    Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .

  12. Mining tissue specificity, gene connectivity and disease association to reveal a set of genes that modify the action of disease causing genes

    Directory of Open Access Journals (Sweden)

    Reverter Antonio

    2008-09-01

    Full Text Available Abstract Background The tissue specificity of gene expression has been linked to a number of significant outcomes including level of expression, and differential rates of polymorphism, evolution and disease association. Recent studies have also shown the importance of exploring differential gene connectivity and sequence conservation in the identification of disease-associated genes. However, no study relates gene interactions with tissue specificity and disease association. Methods We adopted an a priori approach making as few assumptions as possible to analyse the interplay among gene-gene interactions with tissue specificity and its subsequent likelihood of association with disease. We mined three large datasets comprising expression data drawn from massively parallel signature sequencing across 32 tissues, describing a set of 55,606 true positive interactions for 7,197 genes, and microarray expression results generated during the profiling of systemic inflammation, from which 126,543 interactions among 7,090 genes were reported. Results Amongst the myriad of complex relationships identified between expression, disease, connectivity and tissue specificity, some interesting patterns emerged. These include elevated rates of expression and network connectivity in housekeeping and disease-associated tissue-specific genes. We found that disease-associated genes are more likely to show tissue specific expression and most frequently interact with other disease genes. Using the thresholds defined in these observations, we develop a guilt-by-association algorithm and discover a group of 112 non-disease annotated genes that predominantly interact with disease-associated genes, impacting on disease outcomes. Conclusion We conclude that parameters such as tissue specificity and network connectivity can be used in combination to identify a group of genes, not previously confirmed as disease causing, that are involved in interactions with disease causing

  13. Rice-arsenate interactions in hydroponics: a three-gene model for tolerance.

    Science.gov (United States)

    Norton, Gareth J; Nigar, Meher; Williams, Paul N; Dasgupta, Tapash; Meharg, Andrew A; Price, Adam H

    2008-01-01

    In this study, the genetic mapping of the tolerance of root growth to 13.3 muM arsenate [As(V)] using the BalaxAzucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice.

  14. Rice–arsenate interactions in hydroponics: a three-gene model for tolerance

    Science.gov (United States)

    Norton, Gareth J.; Nigar, Meher; Dasgupta, Tapash; Meharg, Andrew A.; Price, Adam H.

    2008-01-01

    In this study, the genetic mapping of the tolerance of root growth to 13.3 μM arsenate [As(V)] using the Bala×Azucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice. PMID:18453529

  15. Interactive visualization of gene regulatory networks with associated gene expression time series data

    NARCIS (Netherlands)

    Westenberg, M.A.; Hijum, van S.A.F.T.; Lulko, A.T.; Kuipers, O.P.; Roerdink, J.B.T.M.; Linsen, L.; Hagen, H.; Hamann, B.

    2008-01-01

    We present GENeVis, an application to visualize gene expression time series data in a gene regulatory network context. This is a network of regulator proteins that regulate the expression of their respective target genes. The networks are represented as graphs, in which the nodes represent genes,

  16. Radiopharmaceuticals to monitor the expression of transferred genes in gene transfer therapy

    International Nuclear Information System (INIS)

    Wiebe, L. I.

    1997-01-01

    The development and application of radiopharmaceuticals has, in many instances, been based on the pharmacological properties of therapeutic agents. The molecular biology-biotechnology revolution has had an important impact on treatment of diseases, in part through the reduced toxicity of 'biologicals', in part because of their specificity for interaction at unique molecular sites and in part because of their selective delivery to the target site. Immunotherapeutic approaches include the use of monoclonal antibodies (MABs), MAB-fragments and chemotactic peptides. Such agents currently form the basis of both diagnostic and immunotherapeutic radiopharmaceuticals. More recently, gene transfer techniques have been advanced to the point that a new molecular approach, gene therapy, has become a reality. Gene therapy offers an opportunity to attack disease at its most fundamental level. The therapeutic mechanism is based on the expression of a specific gene or genes, the product of which will invoke immunological, receptor-based or enzyme-based therapeutic modalities. Several approaches to gene therapy of cancer have been envisioned, the most clinically-advanced concepts involving the introduction of genes that will encode for molecular targets nor normally found in healthy mammalian cells. A number of gene therapy clinical trials are based on the introduction of the Herpes simplex virus type-1 (HSV-1) gene that encodes for viral thymidine kinase (tk+). Once HSV-1 tk+ is expressed in the target (cancer) cell, therapy can be effected by the administration of a highly molecularly-targeted and systemically non-toxic antiviral drug such as ganciclovir. The development of radiodiagnostic imaging in gene therapy will be reviewed, using HSV-1 tk+ and radioiodinated IVFRU as a basis for development of the theme. Molecular targets that could be exploited in gene therapy, other than tk+, will be identified

  17. Radiopharmaceuticals to monitor the expression of transferred genes in gene transfer therapy

    Energy Technology Data Exchange (ETDEWEB)

    Wiebe, L I [University of Alberta, Edmonton (Canada). Noujaim Institute for Pharmaceutical Oncology Research

    1997-10-01

    The development and application of radiopharmaceuticals has, in many instances, been based on the pharmacological properties of therapeutic agents. The molecular biology-biotechnology revolution has had an important impact on treatment of diseases, in part through the reduced toxicity of `biologicals`, in part because of their specificity for interaction at unique molecular sites and in part because of their selective delivery to the target site. Immunotherapeutic approaches include the use of monoclonal antibodies (MABs), MAB-fragments and chemotactic peptides. Such agents currently form the basis of both diagnostic and immunotherapeutic radiopharmaceuticals. More recently, gene transfer techniques have been advanced to the point that a new molecular approach, gene therapy, has become a reality. Gene therapy offers an opportunity to attack disease at its most fundamental level. The therapeutic mechanism is based on the expression of a specific gene or genes, the product of which will invoke immunological, receptor-based or enzyme-based therapeutic modalities. Several approaches to gene therapy of cancer have been envisioned, the most clinically-advanced concepts involving the introduction of genes that will encode for molecular targets nor normally found in healthy mammalian cells. A number of gene therapy clinical trials are based on the introduction of the Herpes simplex virus type-1 (HSV-1) gene that encodes for viral thymidine kinase (tk+). Once HSV-1 tk+ is expressed in the target (cancer) cell, therapy can be effected by the administration of a highly molecularly-targeted and systemically non-toxic antiviral drug such as ganciclovir. The development of radiodiagnostic imaging in gene therapy will be reviewed, using HSV-1 tk+ and radioiodinated IVFRU as a basis for development of the theme. Molecular targets that could be exploited in gene therapy, other than tk+, will be identified

  18. Influenza NA and PB1 Gene Segments Interact during the Formation of Viral Progeny: Localization of the Binding Region within the PB1 Gene

    Directory of Open Access Journals (Sweden)

    Brad Gilbertson

    2016-08-01

    Full Text Available The influenza A virus genome comprises eight negative-sense viral RNAs (vRNAs that form individual ribonucleoprotein (RNP complexes. In order to incorporate a complete set of each of these vRNAs, the virus uses a selective packaging mechanism that facilitates co-packaging of specific gene segments but whose molecular basis is still not fully understood. Recently, we used a competitive transfection model where plasmids encoding the A/Puerto Rico/8/34 (PR8 and A/Udorn/307/72 (Udorn PB1 gene segments were competed to show that the Udorn PB1 gene segment is preferentially co-packaged into progeny virions with the Udorn NA gene segment. Here we created chimeric PB1 genes combining both Udorn and PR8 PB1 sequences to further define the location within the Udorn PB1 gene that drives co-segregation of these genes and show that nucleotides 1776–2070 of the PB1 gene are crucial for preferential selection. In vitro assays examining specific interactions between Udorn NA vRNA and purified vRNAs transcribed from chimeric PB1 genes also supported the importance of this region in the PB1-NA interaction. Hence, this work identifies an association between viral genes that are co-selected during packaging. It also reveals a region potentially important in the RNP-RNP interactions within the supramolecular complex that is predicted to form prior to budding to allow one of each segment to be packaged in the viral progeny. Our study lays the foundation to understand the co-selection of specific genes, which may be critical to the emergence of new viruses with pandemic potential.

  19. Network Diffusion-Based Prioritization of Autism Risk Genes Identifies Significantly Connected Gene Modules

    Directory of Open Access Journals (Sweden)

    Ettore Mosca

    2017-09-01

    Full Text Available Autism spectrum disorder (ASD is marked by a strong genetic heterogeneity, which is underlined by the low overlap between ASD risk gene lists proposed in different studies. In this context, molecular networks can be used to analyze the results of several genome-wide studies in order to underline those network regions harboring genetic variations associated with ASD, the so-called “disease modules.” In this work, we used a recent network diffusion-based approach to jointly analyze multiple ASD risk gene lists. We defined genome-scale prioritizations of human genes in relation to ASD genes from multiple studies, found significantly connected gene modules associated with ASD and predicted genes functionally related to ASD risk genes. Most of them play a role in synapsis and neuronal development and function; many are related to syndromes that can be in comorbidity with ASD and the remaining are involved in epigenetics, cell cycle, cell adhesion and cancer.

  20. Development of gene diagnosis for diabetes and cholecystitis based on gene analysis of CCK-A receptor

    International Nuclear Information System (INIS)

    Kono, Akira

    1999-01-01

    Base sequence analysis of CCKAR gene (a gene of A-type receptor for cholecystokinin) from OLETF rat, a model rat for insulin-independent diabetes was made based on the base sequence of wild CCKAR gene, which had been clarified in the previous year. From the pancreas of OLETF rat, DNA was extracted and transduced into λphage after fragmentation to construct the gene library of OLETF. Then, λphage DNA clone bound with labelled cDNA of CCKAR gene was analyzed and the gene structure was compared with that of the wild gene. It was demonstrated that CCKAR gene of OLETF had a deletion (6800 b.p.) ranging from the promoter region to the Exon 2, suggesting that CCKAR gene is not functional in OLETF rat. The whole sequence of this mutant gene was registered into Japan DNA Bank (D 50610). Then, F 2 offspring rats were obtained through crossing OLETF (female) and F344 (male) and the time course-changes in the blood glucose level after glucose loading were compared among them. The blood glucose level after glucose loading was significantly higher in the homo-mutant F 2 (CCKAR,-/-) as well as the parent OLETF rat than hetero-mutant F 2 (CCKARm-/+) or the wild rat (CCKAR,+/+). This suggests that CCKAR gene might be involved in the control of blood glucose level and an alteration of the expression level or the functions of CCKAR gene might affect the blood glucose level. (M.N.)

  1. Gene-Diet Interactions in Type 2 Diabetes: The Chicken and Egg Debate

    Science.gov (United States)

    Ortega, Ángeles; Berná, Genoveva; Rojas, Anabel; Martín, Franz; Soria, Bernat

    2017-01-01

    Consistent evidence from both experimental and human studies indicates that Type 2 diabetes mellitus (T2DM) is a complex disease resulting from the interaction of genetic, epigenetic, environmental, and lifestyle factors. Nutrients and dietary patterns are important environmental factors to consider in the prevention, development and treatment of this disease. Nutritional genomics focuses on the interaction between bioactive food components and the genome and includes studies of nutrigenetics, nutrigenomics and epigenetic modifications caused by nutrients. There is evidence supporting the existence of nutrient-gene and T2DM interactions coming from animal studies and family-based intervention studies. Moreover, many case-control, cohort, cross-sectional cohort studies and clinical trials have identified relationships between individual genetic load, diet and T2DM. Some of these studies were on a large scale. In addition, studies with animal models and human observational studies, in different countries over periods of time, support a causative relationship between adverse nutritional conditions during in utero development, persistent epigenetic changes and T2DM. This review provides comprehensive information on the current state of nutrient-gene interactions and their role in T2DM pathogenesis, the relationship between individual genetic load and diet, and the importance of epigenetic factors in influencing gene expression and defining the individual risk of T2DM. PMID:28574454

  2. Differential reconstructed gene interaction networks for deriving toxicity threshold in chemical risk assessment.

    Science.gov (United States)

    Yang, Yi; Maxwell, Andrew; Zhang, Xiaowei; Wang, Nan; Perkins, Edward J; Zhang, Chaoyang; Gong, Ping

    2013-01-01

    Pathway alterations reflected as changes in gene expression regulation and gene interaction can result from cellular exposure to toxicants. Such information is often used to elucidate toxicological modes of action. From a risk assessment perspective, alterations in biological pathways are a rich resource for setting toxicant thresholds, which may be more sensitive and mechanism-informed than traditional toxicity endpoints. Here we developed a novel differential networks (DNs) approach to connect pathway perturbation with toxicity threshold setting. Our DNs approach consists of 6 steps: time-series gene expression data collection, identification of altered genes, gene interaction network reconstruction, differential edge inference, mapping of genes with differential edges to pathways, and establishment of causal relationships between chemical concentration and perturbed pathways. A one-sample Gaussian process model and a linear regression model were used to identify genes that exhibited significant profile changes across an entire time course and between treatments, respectively. Interaction networks of differentially expressed (DE) genes were reconstructed for different treatments using a state space model and then compared to infer differential edges/interactions. DE genes possessing differential edges were mapped to biological pathways in databases such as KEGG pathways. Using the DNs approach, we analyzed a time-series Escherichia coli live cell gene expression dataset consisting of 4 treatments (control, 10, 100, 1000 mg/L naphthenic acids, NAs) and 18 time points. Through comparison of reconstructed networks and construction of differential networks, 80 genes were identified as DE genes with a significant number of differential edges, and 22 KEGG pathways were altered in a concentration-dependent manner. Some of these pathways were perturbed to a degree as high as 70% even at the lowest exposure concentration, implying a high sensitivity of our DNs approach

  3. [Dopamine and excessive alcohol consumption: how genes interact with their environment

    NARCIS (Netherlands)

    Schellekens, A.F.A.; Scholte, R.H.J.; Engels, R.C.M.E.; Verkes, R.J.

    2013-01-01

    SUMMARY BACKGROUND: Hereditary factors account for approximately 50% of the risk of developing alcohol dependence. Genes that affect the dopamine function in the brain have been extensively studied as candidate genes. AIM: To present the results of recent Dutch studies on the interaction between

  4. Inference of gene-phenotype associations via protein-protein interaction and orthology.

    Directory of Open Access Journals (Sweden)

    Panwen Wang

    Full Text Available One of the fundamental goals of genetics is to understand gene functions and their associated phenotypes. To achieve this goal, in this study we developed a computational algorithm that uses orthology and protein-protein interaction information to infer gene-phenotype associations for multiple species. Furthermore, we developed a web server that provides genome-wide phenotype inference for six species: fly, human, mouse, worm, yeast, and zebrafish. We evaluated our inference method by comparing the inferred results with known gene-phenotype associations. The high Area Under the Curve values suggest a significant performance of our method. By applying our method to two human representative diseases, Type 2 Diabetes and Breast Cancer, we demonstrated that our method is able to identify related Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways. The web server can be used to infer functions and putative phenotypes of a gene along with the candidate genes of a phenotype, and thus aids in disease candidate gene discovery. Our web server is available at http://jjwanglab.org/PhenoPPIOrth.

  5. Evolutionary signatures amongst disease genes permit novel methods for gene prioritization and construction of informative gene-based networks.

    Directory of Open Access Journals (Sweden)

    Nolan Priedigkeit

    2015-02-01

    Full Text Available Genes involved in the same function tend to have similar evolutionary histories, in that their rates of evolution covary over time. This coevolutionary signature, termed Evolutionary Rate Covariation (ERC, is calculated using only gene sequences from a set of closely related species and has demonstrated potential as a computational tool for inferring functional relationships between genes. To further define applications of ERC, we first established that roughly 55% of genetic diseases posses an ERC signature between their contributing genes. At a false discovery rate of 5% we report 40 such diseases including cancers, developmental disorders and mitochondrial diseases. Given these coevolutionary signatures between disease genes, we then assessed ERC's ability to prioritize known disease genes out of a list of unrelated candidates. We found that in the presence of an ERC signature, the true disease gene is effectively prioritized to the top 6% of candidates on average. We then apply this strategy to a melanoma-associated region on chromosome 1 and identify MCL1 as a potential causative gene. Furthermore, to gain global insight into disease mechanisms, we used ERC to predict molecular connections between 310 nominally distinct diseases. The resulting "disease map" network associates several diseases with related pathogenic mechanisms and unveils many novel relationships between clinically distinct diseases, such as between Hirschsprung's disease and melanoma. Taken together, these results demonstrate the utility of molecular evolution as a gene discovery platform and show that evolutionary signatures can be used to build informative gene-based networks.

  6. Bioinformatics, interaction network analysis, and neural networks to characterize gene expression of radicular cyst and periapical granuloma.

    Science.gov (United States)

    Poswar, Fabiano de Oliveira; Farias, Lucyana Conceição; Fraga, Carlos Alberto de Carvalho; Bambirra, Wilson; Brito-Júnior, Manoel; Sousa-Neto, Manoel Damião; Santos, Sérgio Henrique Souza; de Paula, Alfredo Maurício Batista; D'Angelo, Marcos Flávio Silveira Vasconcelos; Guimarães, André Luiz Sena

    2015-06-01

    Bioinformatics has emerged as an important tool to analyze the large amount of data generated by research in different diseases. In this study, gene expression for radicular cysts (RCs) and periapical granulomas (PGs) was characterized based on a leader gene approach. A validated bioinformatics algorithm was applied to identify leader genes for RCs and PGs. Genes related to RCs and PGs were first identified in PubMed, GenBank, GeneAtlas, and GeneCards databases. The Web-available STRING software (The European Molecular Biology Laboratory [EMBL], Heidelberg, Baden-Württemberg, Germany) was used in order to build the interaction map among the identified genes by a significance score named weighted number of links. Based on the weighted number of links, genes were clustered using k-means. The genes in the highest cluster were considered leader genes. Multilayer perceptron neural network analysis was used as a complementary supplement for gene classification. For RCs, the suggested leader genes were TP53 and EP300, whereas PGs were associated with IL2RG, CCL2, CCL4, CCL5, CCR1, CCR3, and CCR5 genes. Our data revealed different gene expression for RCs and PGs, suggesting that not only the inflammatory nature but also other biological processes might differentiate RCs and PGs. Copyright © 2015 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  7. Genomewide Expression and Functional Interactions of Genes under Drought Stress in Maize

    Directory of Open Access Journals (Sweden)

    Nepolean Thirunavukkarasu

    2017-01-01

    Full Text Available A genomewide transcriptome assay of two subtropical genotypes of maize was used to observe the expression of genes at seedling stage of drought stress. The number of genes expressed differentially was greater in HKI1532 (a drought tolerant genotype than in PC3 (a drought sensitive genotype, indicating primary differences at the transcriptional level in stress tolerance. The global coexpression networks of the two genotypes differed significantly with respect to the number of modules and the coexpression pattern within the modules. A total of 174 drought-responsive genes were selected from HKI1532, and their coexpression network revealed key correlations between different adaptive pathways, each cluster of the network representing a specific biological function. Transcription factors related to ABA-dependent stomatal closure, signalling, and phosphoprotein cascades work in concert to compensate for reduced photosynthesis. Under stress, water balance was maintained by coexpression of the genes involved in osmotic adjustments and transporter proteins. Metabolism was maintained by the coexpression of genes involved in cell wall modification and protein and lipid metabolism. The interaction of genes involved in crucial biological functions during stress was identified and the results will be useful in targeting important gene interactions to understand drought tolerance in greater detail.

  8. Interaction of two photoreceptors in the regulation of bacterial photosynthesis genes.

    Science.gov (United States)

    Metz, Sebastian; Haberzettl, Kerstin; Frühwirth, Sebastian; Teich, Kristin; Hasewinkel, Christian; Klug, Gabriele

    2012-07-01

    The expression of photosynthesis genes in the facultatively photosynthetic bacterium Rhodobacter sphaeroides is controlled by the oxygen tension and by light quantity. Two photoreceptor proteins, AppA and CryB, have been identified in the past, which are involved in this regulation. AppA senses light by its N-terminal BLUF domain, its C-terminal part binds heme and is redox-responsive. Through its interaction to the transcriptional repressor PpsR the AppA photoreceptor controls expression of photosynthesis genes. The cryptochrome-like protein CryB was shown to affect regulation of photosynthesis genes, but the underlying signal chain remained unknown. Here we show that CryB interacts with the C-terminal domain of AppA and modulates the binding of AppA to the transcriptional repressor PpsR in a light-dependent manner. Consequently, binding of the transcription factor PpsR to its DNA target is affected by CryB. In agreement with this, all genes of the PpsR regulon showed altered expression levels in a CryB deletion strain after blue-light illumination. These results elucidate for the first time how a bacterial cryptochrome affects gene expression.

  9. Gene-environment interaction between the oxytocin receptor (OXTR) gene and parenting behaviour on children's theory of mind.

    Science.gov (United States)

    Wade, Mark; Hoffmann, Thomas J; Jenkins, Jennifer M

    2015-12-01

    Theory of mind (ToM) is the ability to interpret and understand human behaviour by representing the mental states of others. Like many human capacities, ToM is thought to develop through both complex biological and socialization mechanisms. However, no study has examined the joint effect of genetic and environmental influences on ToM. This study examined how variability in the oxytocin receptor gene (OXTR) and parenting behavior--two widely studied factors in ToM development-interacted to predict ToM in pre-school-aged children. Participants were 301 children who were part of an ongoing longitudinal birth cohort study. ToM was assessed at age 4.5 using a previously validated scale. Parenting was assessed through observations of mothers' cognitively sensitive behaviours. Using a family-based association design, it was suggestive that a particular variant (rs11131149) interacted with maternal cognitive sensitivity on children's ToM (P = 0.019). More copies of the major allele were associated with higher ToM as a function of increasing cognitive sensitivity. A sizeable 26% of the variability in ToM was accounted for by this interaction. This study provides the first empirical evidence of gene-environment interactions on ToM, supporting the notion that genetic factors may be modulated by potent environmental influences early in development. © The Author (2015). Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  10. tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles.

    Science.gov (United States)

    Cejuela, Juan Miguel; McQuilton, Peter; Ponting, Laura; Marygold, Steven J; Stefancsik, Raymund; Millburn, Gillian H; Rost, Burkhard

    2014-01-01

    The breadth and depth of biomedical literature are increasing year upon year. To keep abreast of these increases, FlyBase, a database for Drosophila genomic and genetic information, is constantly exploring new ways to mine the published literature to increase the efficiency and accuracy of manual curation and to automate some aspects, such as triaging and entity extraction. Toward this end, we present the 'tagtog' system, a web-based annotation framework that can be used to mark up biological entities (such as genes) and concepts (such as Gene Ontology terms) in full-text articles. tagtog leverages manual user annotation in combination with automatic machine-learned annotation to provide accurate identification of gene symbols and gene names. As part of the BioCreative IV Interactive Annotation Task, FlyBase has used tagtog to identify and extract mentions of Drosophila melanogaster gene symbols and names in full-text biomedical articles from the PLOS stable of journals. We show here the results of three experiments with different sized corpora and assess gene recognition performance and curation speed. We conclude that tagtog-named entity recognition improves with a larger corpus and that tagtog-assisted curation is quicker than manual curation. DATABASE URL: www.tagtog.net, www.flybase.org.

  11. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes

    Science.gov (United States)

    Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung

    2016-01-01

    Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods. Through analysis of

  12. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes.

    Directory of Open Access Journals (Sweden)

    Samuel Sunghwan Cho

    Full Text Available Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs. However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods

  13. Discovering genes underlying QTL

    Energy Technology Data Exchange (ETDEWEB)

    Vanavichit, Apichart [Kasetsart University, Kamphaengsaen, Nakorn Pathom (Thailand)

    2002-02-01

    A map-based approach has allowed scientists to discover few genes at a time. In addition, the reproductive barrier between cultivated rice and wild relatives has prevented us from utilizing the germ plasm by a map-based approach. Most genetic traits important to agriculture or human diseases are manifested as observable, quantitative phenotypes called Quantitative Trait Loci (QTL). In many instances, the complexity of the phenotype/genotype interaction and the general lack of clearly identifiable gene products render the direct molecular cloning approach ineffective, thus additional strategies like genome mapping are required to identify the QTL in question. Genome mapping requires no prior knowledge of the gene function, but utilizes statistical methods to identify the most likely gene location. To completely characterize genes of interest, the initially mapped region of a gene location will have to be narrowed down to a size that is suitable for cloning and sequencing. Strategies for gene identification within the critical region have to be applied after the sequencing of a potentially large clone or set of clones that contains this gene(s). Tremendous success of positional cloning has been shown for cloning many genes responsible for human diseases, including cystic fibrosis and muscular dystrophy as well as plant disease resistance genes. Genome and QTL mapping, positional cloning: the pre-genomics era, comparative approaches to gene identification, and positional cloning: the genomics era are discussed in the report. (M. Suetake)

  14. Novel candidate genes important for asthma and hypertension comorbidity revealed from associative gene networks.

    Science.gov (United States)

    Saik, Olga V; Demenkov, Pavel S; Ivanisenko, Timofey V; Bragina, Elena Yu; Freidin, Maxim B; Goncharova, Irina A; Dosenko, Victor E; Zolotareva, Olga I; Hofestaedt, Ralf; Lavrik, Inna N; Rogaev, Evgeny I; Ivanisenko, Vladimir A

    2018-02-13

    Hypertension and bronchial asthma are a major issue for people's health. As of 2014, approximately one billion adults, or ~ 22% of the world population, have had hypertension. As of 2011, 235-330 million people globally have been affected by asthma and approximately 250,000-345,000 people have died each year from the disease. The development of the effective treatment therapies against these diseases is complicated by their comorbidity features. This is often a major problem in diagnosis and their treatment. Hence, in this study the bioinformatical methodology for the analysis of the comorbidity of these two diseases have been developed. As such, the search for candidate genes related to the comorbid conditions of asthma and hypertension can help in elucidating the molecular mechanisms underlying the comorbid condition of these two diseases, and can also be useful for genotyping and identifying new drug targets. Using ANDSystem, the reconstruction and analysis of gene networks associated with asthma and hypertension was carried out. The gene network of asthma included 755 genes/proteins and 62,603 interactions, while the gene network of hypertension - 713 genes/proteins and 45,479 interactions. Two hundred and five genes/proteins and 9638 interactions were shared between asthma and hypertension. An approach for ranking genes implicated in the comorbid condition of two diseases was proposed. The approach is based on nine criteria for ranking genes by their importance, including standard methods of gene prioritization (Endeavor, ToppGene) as well as original criteria that take into account the characteristics of an associative gene network and the presence of known polymorphisms in the analysed genes. According to the proposed approach, the genes IL10, TLR4, and CAT had the highest priority in the development of comorbidity of these two diseases. Additionally, it was revealed that the list of top genes is enriched with apoptotic genes and genes involved in

  15. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

    Science.gov (United States)

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-03-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.

  16. Clock genes × stress × reward interactions in alcohol and substance use disorders.

    Science.gov (United States)

    Perreau-Lenz, Stéphanie; Spanagel, Rainer

    2015-06-01

    Adverse life events and highly stressful environments have deleterious consequences for mental health. Those environmental factors can potentiate alcohol and drug abuse in vulnerable individuals carrying specific genetic risk factors, hence producing the final risk for alcohol- and substance-use disorders development. The nature of these genes remains to be fully determined, but studies indicate their direct or indirect relation to the stress hypothalamo-pituitary-adrenal (HPA) axis and/or reward systems. Over the past decade, clock genes have been revealed to be key-players in influencing acute and chronic alcohol/drug effects. In parallel, the influence of chronic stress and stressful life events in promoting alcohol and substance use and abuse has been demonstrated. Furthermore, the reciprocal interaction of clock genes with various HPA-axis components, as well as the evidence for an implication of clock genes in stress-induced alcohol abuse, have led to the idea that clock genes, and Period genes in particular, may represent key genetic factors to consider when examining gene × environment interaction in the etiology of addiction. The aim of the present review is to summarize findings linking clock genes, stress, and alcohol and substance abuse, and to propose potential underlying neurobiological mechanisms. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. Impact of Maspin Polymorphism rs2289520 G/C and Its Interaction with Gene to Gene, Alcohol Consumption Increase Susceptibility to Oral Cancer Occurrence.

    Science.gov (United States)

    Yang, Po-Yu; Miao, Nae-Fang; Lin, Chiao-Wen; Chou, Ying-Erh; Yang, Shun-Fa; Huang, Hui-Chuan; Chang, Hsiu-Ju; Tsai, Hsiu-Ting

    2016-01-01

    The purpose of this study was to identify gene polymorphisms of mammary serine protease inhibitor (Maspin) specific to patients with oral cancer susceptibility and clinicopathological status. Three single-nucleotide polymorphisms (SNPs) of the Maspin gene from 741 patients with oral cancer and 601 non-cancer controls were analyzed by real-time PCR. The participants with G/G homozygotes or with G/C heterozygotes of Maspin rs2289520 polymorphism had a 2.07-fold (p = 0.01) and a 2.01-fold (p = 0.02) risk of developing oral cancer compared to those with C/C homozygotes. Moreover, gene-gene interaction increased the risk of oral cancer susceptibility among subjects expose to oral cancer related risk factors, including areca, alcohol, and tobacco consumption. G allele of Maspin rs2289520 polymorphism may be a factor that increases the susceptibility to oral cancer. The interactions of gene to oral cancer-related environmental risk factors have a synergetic effect that can further enhance oral cancer development.

  18. Learning Gene Regulatory Networks Computationally from Gene Expression Data Using Weighted Consensus

    KAUST Repository

    Fujii, Chisato

    2015-04-16

    Gene regulatory networks analyze the relationships between genes allowing us to un- derstand the gene regulatory interactions in systems biology. Gene expression data from the microarray experiments is used to obtain the gene regulatory networks. How- ever, the microarray data is discrete, noisy and non-linear which makes learning the networks a challenging problem and existing gene network inference methods do not give consistent results. Current state-of-the-art study uses the average-ranking-based consensus method to combine and average the ranked predictions from individual methods. However each individual method has an equal contribution to the consen- sus prediction. We have developed a linear programming-based consensus approach which uses learned weights from linear programming among individual methods such that the methods have di↵erent weights depending on their performance. Our result reveals that assigning di↵erent weights to individual methods rather than giving them equal weights improves the performance of the consensus. The linear programming- based consensus method is evaluated and it had the best performance on in silico and Saccharomyces cerevisiae networks, and the second best on the Escherichia coli network outperformed by Inferelator Pipeline method which gives inconsistent results across a wide range of microarray data sets.

  19. Childhood temperament: passive gene-environment correlation, gene-environment interaction, and the hidden importance of the family environment.

    Science.gov (United States)

    Lemery-Chalfant, Kathryn; Kao, Karen; Swann, Gregory; Goldsmith, H Hill

    2013-02-01

    Biological parents pass on genotypes to their children, as well as provide home environments that correlate with their genotypes; thus, the association between the home environment and children's temperament can be genetically (i.e., passive gene-environment correlation) or environmentally mediated. Furthermore, family environments may suppress or facilitate the heritability of children's temperament (i.e., gene-environment interaction). The sample comprised 807 twin pairs (mean age = 7.93 years) from the longitudinal Wisconsin Twin Project. Important passive gene-environment correlations emerged, such that home environments were less chaotic for children with high effortful control, and this association was genetically mediated. Children with high extraversion/surgency experienced more chaotic home environments, and this correlation was also genetically mediated. In addition, heritability of children's temperament was moderated by home environments, such that effortful control and extraversion/surgency were more heritable in chaotic homes, and negative affectivity was more heritable under crowded or unsafe home conditions. Modeling multiple types of gene-environment interplay uncovered the complex role of genetic factors and the hidden importance of the family environment for children's temperament and development more generally.

  20. Development of gene diagnosis for diabetes and cholecystis based on gene analysis of CCK-A receptor

    International Nuclear Information System (INIS)

    Kono, Akira

    1998-01-01

    The gene structures of CCK, A type receptor in human, the rat and the mouse were investigated aiming to clarify that the aberration of the gene is involved in the incidences of diabetes and cholecystis. In this fiscal year, 1997, the normal structure of the gene and the accurate base sequence were analyzed using DNA fragments bound to 32 P-labelled cDNA of human CCKAR originated from the gene library of leucocyte. This gene contained about 2.2 x 10 5 base pairs and the base sequence was completely determined and registered to Japan DNA data bank (D85606). In addition, the genome structures and base sequences of mouse and rat CCKAR were analyzed and registered (D 85605 and D 50608, respectively). The differences in the base sequence of CCKAR among the species were found in the promotor region and the intron regions, suggesting that there might be differences in splicing among species. (M.N.)

  1. Enhancing the gene-environment interaction framework through a quasi-experimental research design: evidence from differential responses to September 11.

    Science.gov (United States)

    Fletcher, Jason M

    2014-01-01

    This article uses a gene-environment interaction framework to examine the differential responses to an objective external stressor based on genetic variation in the production of depressive symptoms. This article advances the literature by utilizing a quasi-experimental environmental exposure design, as well as a regression discontinuity design, to control for seasonal trends, which limit the potential for gene-environment correlation and allow stronger causal claims. Replications are attempted for two prominent genes (5-HTT and MAOA), and three additional genes are explored (DRD2, DRD4, and DAT1). This article provides evidence of a main effect of 9/11 on reports of feelings of sadness and fails to replicate a common finding of interaction using 5-HTT but does show support for interaction with MAOA in men. It also provides new evidence that variation in the DRD4 gene modifies an individual's response to the exposure, with individuals with no 7-repeats found to have a muted response.

  2. Finding gene-environment interactions for Phobias

    OpenAIRE

    Gregory, Alice M.; Lau, Jennifer Y. F.; Eley, Thalia C.

    2008-01-01

    Phobias are common disorders causing a great deal of suffering. Studies of gene-environment interaction (G × E) have revealed much about the complex processes underlying the development of various psychiatric disorders but have told us little about phobias. This article describes what is already known about genetic and environmental influences upon phobias and suggests how this information can be used to optimise the chances of discovering G × Es for phobias. In addition to the careful concep...

  3. Bayesian logistic regression in detection of gene-steroid interaction for cancer at PDLIM5 locus.

    Science.gov (United States)

    Wang, Ke-Sheng; Owusu, Daniel; Pan, Yue; Xie, Changchun

    2016-06-01

    The PDZ and LIM domain 5 (PDLIM5) gene may play a role in cancer, bipolar disorder, major depression, alcohol dependence and schizophrenia; however, little is known about the interaction effect of steroid and PDLIM5 gene on cancer. This study examined 47 single-nucleotide polymorphisms (SNPs) within the PDLIM5 gene in the Marshfield sample with 716 cancer patients (any diagnosed cancer, excluding minor skin cancer) and 2848 noncancer controls. Multiple logistic regression model in PLINK software was used to examine the association of each SNP with cancer. Bayesian logistic regression in PROC GENMOD in SAS statistical software, ver. 9.4 was used to detect gene- steroid interactions influencing cancer. Single marker analysis using PLINK identified 12 SNPs associated with cancer (Plogistic regression in PROC GENMOD showed that both rs6532496 and rs951613 revealed strong gene-steroid interaction effects (OR=2.18, 95% CI=1.31-3.63 with P = 2.9 × 10⁻³ for rs6532496 and OR=2.07, 95% CI=1.24-3.45 with P = 5.43 × 10⁻³ for rs951613, respectively). Results from Bayesian logistic regression showed stronger interaction effects (OR=2.26, 95% CI=1.2-3.38 for rs6532496 and OR=2.14, 95% CI=1.14-3.2 for rs951613, respectively). All the 12 SNPs associated with cancer revealed significant gene-steroid interaction effects (P logistic regression and OR=2.59, 95% CI=1.4-3.97 from Bayesian logistic regression; respectively). This study provides evidence of common genetic variants within the PDLIM5 gene and interactions between PLDIM5 gene polymorphisms and steroid use influencing cancer.

  4. A new essential protein discovery method based on the integration of protein-protein interaction and gene expression data

    Directory of Open Access Journals (Sweden)

    Li Min

    2012-03-01

    Full Text Available Abstract Background Identification of essential proteins is always a challenging task since it requires experimental approaches that are time-consuming and laborious. With the advances in high throughput technologies, a large number of protein-protein interactions are available, which have produced unprecedented opportunities for detecting proteins' essentialities from the network level. There have been a series of computational approaches proposed for predicting essential proteins based on network topologies. However, the network topology-based centrality measures are very sensitive to the robustness of network. Therefore, a new robust essential protein discovery method would be of great value. Results In this paper, we propose a new centrality measure, named PeC, based on the integration of protein-protein interaction and gene expression data. The performance of PeC is validated based on the protein-protein interaction network of Saccharomyces cerevisiae. The experimental results show that the predicted precision of PeC clearly exceeds that of the other fifteen previously proposed centrality measures: Degree Centrality (DC, Betweenness Centrality (BC, Closeness Centrality (CC, Subgraph Centrality (SC, Eigenvector Centrality (EC, Information Centrality (IC, Bottle Neck (BN, Density of Maximum Neighborhood Component (DMNC, Local Average Connectivity-based method (LAC, Sum of ECC (SoECC, Range-Limited Centrality (RL, L-index (LI, Leader Rank (LR, Normalized α-Centrality (NC, and Moduland-Centrality (MC. Especially, the improvement of PeC over the classic centrality measures (BC, CC, SC, EC, and BN is more than 50% when predicting no more than 500 proteins. Conclusions We demonstrate that the integration of protein-protein interaction network and gene expression data can help improve the precision of predicting essential proteins. The new centrality measure, PeC, is an effective essential protein discovery method.

  5. Community Structure Analysis of Gene Interaction Networks in Duchenne Muscular Dystrophy.

    Directory of Open Access Journals (Sweden)

    Tejaswini Narayanan

    Full Text Available Duchenne Muscular Dystrophy (DMD is an important pathology associated with the human skeletal muscle and has been studied extensively. Gene expression measurements on skeletal muscle of patients afflicted with DMD provides the opportunity to understand the underlying mechanisms that lead to the pathology. Community structure analysis is a useful computational technique for understanding and modeling genetic interaction networks. In this paper, we leverage this technique in combination with gene expression measurements from normal and DMD patient skeletal muscle tissue to study the structure of genetic interactions in the context of DMD. We define a novel framework for transforming a raw dataset of gene expression measurements into an interaction network, and subsequently apply algorithms for community structure analysis for the extraction of topological communities. The emergent communities are analyzed from a biological standpoint in terms of their constituent biological pathways, and an interpretation that draws correlations between functional and structural organization of the genetic interactions is presented. We also compare these communities and associated functions in pathology against those in normal human skeletal muscle. In particular, differential enhancements are observed in the following pathways between pathological and normal cases: Metabolic, Focal adhesion, Regulation of actin cytoskeleton and Cell adhesion, and implication of these mechanisms are supported by prior work. Furthermore, our study also includes a gene-level analysis to identify genes that are involved in the coupling between the pathways of interest. We believe that our results serve to highlight important distinguishing features in the structural/functional organization of constituent biological pathways, as it relates to normal and DMD cases, and provide the mechanistic basis for further biological investigations into specific pathways differently regulated

  6. Comparison of information-theoretic to statistical methods for gene-gene interactions in the presence of genetic heterogeneity

    Directory of Open Access Journals (Sweden)

    Sucheston Lara

    2010-09-01

    Full Text Available Abstract Background Multifactorial diseases such as cancer and cardiovascular diseases are caused by the complex interplay between genes and environment. The detection of these interactions remains challenging due to computational limitations. Information theoretic approaches use computationally efficient directed search strategies and thus provide a feasible solution to this problem. However, the power of information theoretic methods for interaction analysis has not been systematically evaluated. In this work, we compare power and Type I error of an information-theoretic approach to existing interaction analysis methods. Methods The k-way interaction information (KWII metric for identifying variable combinations involved in gene-gene interactions (GGI was assessed using several simulated data sets under models of genetic heterogeneity driven by susceptibility increasing loci with varying allele frequency, penetrance values and heritability. The power and proportion of false positives of the KWII was compared to multifactor dimensionality reduction (MDR, restricted partitioning method (RPM and logistic regression. Results The power of the KWII was considerably greater than MDR on all six simulation models examined. For a given disease prevalence at high values of heritability, the power of both RPM and KWII was greater than 95%. For models with low heritability and/or genetic heterogeneity, the power of the KWII was consistently greater than RPM; the improvements in power for the KWII over RPM ranged from 4.7% to 14.2% at for α = 0.001 in the three models at the lowest heritability values examined. KWII performed similar to logistic regression. Conclusions Information theoretic models are flexible and have excellent power to detect GGI under a variety of conditions that characterize complex diseases.

  7. Genetic interaction analysis of point mutations enables interrogation of gene function at a residue-level resolution

    Science.gov (United States)

    Braberg, Hannes; Moehle, Erica A.; Shales, Michael; Guthrie, Christine; Krogan, Nevan J.

    2014-01-01

    We have achieved a residue-level resolution of genetic interaction mapping – a technique that measures how the function of one gene is affected by the alteration of a second gene – by analyzing point mutations. Here, we describe how to interpret point mutant genetic interactions, and outline key applications for the approach, including interrogation of protein interaction interfaces and active sites, and examination of post-translational modifications. Genetic interaction analysis has proven effective for characterizing cellular processes; however, to date, systematic high-throughput genetic interaction screens have relied on gene deletions or knockdowns, which limits the resolution of gene function analysis and poses problems for multifunctional genes. Our point mutant approach addresses these issues, and further provides a tool for in vivo structure-function analysis that complements traditional biophysical methods. We also discuss the potential for genetic interaction mapping of point mutations in human cells and its application to personalized medicine. PMID:24842270

  8. Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network

    Science.gov (United States)

    2011-01-01

    Background Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-house literature mining system that supports literature indexing and gene name tagging. We hypothesize that application of VO in SciMiner will aid vaccine literature indexing and mining of vaccine-gene interaction networks. As a test case, we have examined vaccines for Brucella, the causative agent of brucellosis in humans and animals. Results The VO-based SciMiner (VO-SciMiner) was developed to incorporate a total of 67 Brucella vaccine terms. A set of rules for term expansion of VO terms were learned from training data, consisting of 90 biomedical articles related to Brucella vaccine terms. VO-SciMiner demonstrated high recall (91%) and precision (99%) from testing a separate set of 100 manually selected biomedical articles. VO-SciMiner indexing exhibited superior performance in retrieving Brucella vaccine-related papers over that obtained with MeSH-based PubMed literature search. For example, a VO-SciMiner search of "live attenuated Brucella vaccine" returned 922 hits as of April 20, 2011, while a PubMed search of the same query resulted in only 74 hits. Using the abstracts of 14,947 Brucella-related papers, VO-SciMiner identified 140 Brucella genes associated with Brucella vaccines. These genes included known protective antigens, virulence factors, and genes closely related to Brucella vaccines. These VO-interacting Brucella genes were significantly over-represented in biological functional categories, including metabolite transport and metabolism, replication and repair, cell wall biogenesis, intracellular trafficking and secretion, posttranslational modification, and chaperones. Furthermore, a comprehensive interaction network of Brucella vaccines and genes were

  9. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes

    OpenAIRE

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-01-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix–loop–helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks e...

  10. Interaction between CRHR1 and BDNF genes increases the risk of recurrent major depressive disorder in Chinese population.

    Directory of Open Access Journals (Sweden)

    Zheman Xiao

    Full Text Available BACKGROUND: An important etiological hypothesis about depression is stress has neurotoxic effects that damage the hippocampal cells. Corticotropin-releasing hormone (CRH regulates brain-derived neurotrophic factor (BDNF expression through influencing cAMP and Ca2+ signaling pathways during the course. The aim of this study is to examine the single and combined effects of CRH receptor 1 (CRHR1 and BDNF genes in recurrent major depressive disorder (MDD. METHODOLOGY/PRINCIPAL FINDING: The sample consists of 181 patients with recurrent MDD and 186 healthy controls. Whether genetic variations interaction between CRHR1 and BDNF genes might be associated with increased susceptibility to recurrent MDD was studied by using a gene-based association analysis of single-nucleotide polymorphisms (SNPs. CRHR1 gene (rs1876828, rs242939 and rs242941 and BDNF gene (rs6265 were identified in the samples of patients diagnosed with recurrent MDD and matched controls. Allelic association between CRHR1 rs242939 and recurrent MDD was found in our sample (allelic: p = 0.018, genotypic: p = 0.022 with an Odds Ratio 0.454 (95% CI 0.266-0.775. A global test of these four haplotypes showed a significant difference between recurrent MDD group and control group (chi-2 = 13.117, df = 3, P = 0.016. Furthermore, BDNF and CRHR1 interactions were found in the significant 2-locus, gene-gene interaction models (p = 0.05 using a generalized multifactor dimensionality reduction (GMDR method. CONCLUSION: Our results suggest that an interaction between CRHR1 and BDNF genes constitutes susceptibility to recurrent MDD.

  11. Biomarker Identification for Prostate Cancer and Lymph Node Metastasis from Microarray Data and Protein Interaction Network Using Gene Prioritization Method

    Directory of Open Access Journals (Sweden)

    Carlos Roberto Arias

    2012-01-01

    Full Text Available Finding a genetic disease-related gene is not a trivial task. Therefore, computational methods are needed to present clues to the biomedical community to explore genes that are more likely to be related to a specific disease as biomarker. We present biomarker identification problem using gene prioritization method called gene prioritization from microarray data based on shortest paths, extended with structural and biological properties and edge flux using voting scheme (GP-MIDAS-VXEF. The method is based on finding relevant interactions on protein interaction networks, then scoring the genes using shortest paths and topological analysis, integrating the results using a voting scheme and a biological boosting. We applied two experiments, one is prostate primary and normal samples and the other is prostate primary tumor with and without lymph nodes metastasis. We used 137 truly prostate cancer genes as benchmark. In the first experiment, GP-MIDAS-VXEF outperforms all the other state-of-the-art methods in the benchmark by retrieving the truest related genes from the candidate set in the top 50 scores found. We applied the same technique to infer the significant biomarkers in prostate cancer with lymph nodes metastasis which is not established well.

  12. Antisocial peer affiliation and externalizing disorders: Evidence for Gene × Environment × Development interaction.

    Science.gov (United States)

    Samek, Diana R; Hicks, Brian M; Keyes, Margaret A; Iacono, William G; McGue, Matt

    2017-02-01

    Gene × Environment interaction contributes to externalizing disorders in childhood and adolescence, but little is known about whether such effects are long lasting or present in adulthood. We examined gene-environment interplay in the concurrent and prospective associations between antisocial peer affiliation and externalizing disorders (antisocial behavior and substance use disorders) at ages 17, 20, 24, and 29. The sample included 1,382 same-sex twin pairs participating in the Minnesota Twin Family Study. We detected a Gene × Environment interaction at age 17, such that additive genetic influences on antisocial behavior and substance use disorders were greater in the context of greater antisocial peer affiliation. This Gene × Environment interaction was not present for antisocial behavior symptoms after age 17, but it was for substance use disorder symptoms through age 29 (though effect sizes were largest at age 17). The results suggest adolescence is a critical period for the development of externalizing disorders wherein exposure to greater environmental adversity is associated with a greater expression of genetic risk. This form of Gene × Environment interaction may persist through young adulthood for substance use disorders, but it appears to be limited to adolescence for antisocial behavior.

  13. Antisocial Peer Affiliation and Externalizing Disorders: Evidence for Gene × Environment × Development Interaction

    Science.gov (United States)

    Samek, Diana R.; Hicks, Brian M.; Keyes, Margaret A.; Iacono, William G.; McGue, Matt

    2016-01-01

    Gene × environment interaction contributes to externalizing disorders in adolescence, but little is known about whether such effects are long-lasting or present in adulthood. We examined gene-environment interplay in the concurrent and prospective associations between antisocial peer affiliation and externalizing disorders (antisocial behavior and substance use disorders) at ages 17, 20, 24, and 29. The sample included 1,382 same-sex twin pairs participating in the Minnesota Twin Family Study. We detected a gene × environment interaction at age 17, such that additive genetic influences on antisocial behavior and substance use disorders were greater in the context of greater antisocial peer affiliation. This gene × environment interaction was not present for antisocial behavior symptoms after age 17, but was for substance use disorder symptoms through age 29 (though effect sizes were largest at age 17). Results suggest adolescence is a critical period for the development of externalizing disorders wherein exposure to greater environmental adversity is associated with a greater expression of genetic risk. This form of gene × environment interaction may persist through young adulthood for substance use disorders, but is limited to adolescence for antisocial behavior. PMID:27580681

  14. Representing virus-host interactions and other multi-organism processes in the Gene Ontology.

    Science.gov (United States)

    Foulger, R E; Osumi-Sutherland, D; McIntosh, B K; Hulo, C; Masson, P; Poux, S; Le Mercier, P; Lomax, J

    2015-07-28

    The Gene Ontology project is a collaborative effort to provide descriptions of gene products in a consistent and computable language, and in a species-independent manner. The Gene Ontology is designed to be applicable to all organisms but up to now has been largely under-utilized for prokaryotes and viruses, in part because of a lack of appropriate ontology terms. To address this issue, we have developed a set of Gene Ontology classes that are applicable to microbes and their hosts, improving both coverage and quality in this area of the Gene Ontology. Describing microbial and viral gene products brings with it the additional challenge of capturing both the host and the microbe. Recognising this, we have worked closely with annotation groups to test and optimize the GO classes, and we describe here a set of annotation guidelines that allow the controlled description of two interacting organisms. Building on the microbial resources already in existence such as ViralZone, UniProtKB keywords and MeGO, this project provides an integrated ontology to describe interactions between microbial species and their hosts, with mappings to the external resources above. Housing this information within the freely-accessible Gene Ontology project allows the classes and annotation structure to be utilized by a large community of biologists and users.

  15. Early life adversity and serotonin transporter gene variation interact to affect DNA methylation of the corticotropin-releasing factor gene promoter region in the adult rat brain

    NARCIS (Netherlands)

    Doelen, R.H.A. van der; Arnoldussen, I.A.C.; Ghareh, H.; Och, L. van; Homberg, J.R.; Kozicz, L.T.

    2015-01-01

    The interaction between childhood maltreatment and the serotonin transporter (5-HTT) gene linked polymorphic region has been associated with increased risk to develop major depression. This Gene x Environment interaction has furthermore been linked with increased levels of anxiety and glucocorticoid

  16. Gene therapy prospects--intranasal delivery of therapeutic genes.

    Science.gov (United States)

    Podolska, Karolina; Stachurska, Anna; Hajdukiewicz, Karolina; Małecki, Maciej

    2012-01-01

    Gene therapy is recognized to be a novel method for the treatment of various disorders. Gene therapy strategies involve gene manipulation on broad biological processes responsible for the spreading of diseases. Cancer, monogenic diseases, vascular and infectious diseases are the main targets of gene therapy. In order to obtain valuable experimental and clinical results, sufficient gene transfer methods are required. Therapeutic genes can be administered into target tissues via gene carriers commonly defined as vectors. The retroviral, adenoviral and adeno-associated virus based vectors are most frequently used in the clinic. So far, gene preparations may be administered directly into target organs or by intravenous, intramuscular, intratumor or intranasal injections. It is common knowledge that the number of gene therapy clinical trials has rapidly increased. However, some limitations such as transfection efficiency and stable and long-term gene expression are still not resolved. Consequently, great effort is focused on the evaluation of new strategies of gene delivery. There are many expectations associated with intranasal delivery of gene preparations for the treatment of diseases. Intranasal delivery of therapeutic genes is regarded as one of the most promising forms of pulmonary gene therapy research. Gene therapy based on inhalation of gene preparations offers an alternative way for the treatment of patients suffering from such lung diseases as cystic fibrosis, alpha-1-antitrypsin defect, or cancer. Experimental and first clinical trials based on plasmid vectors or recombinant viruses have revealed that gene preparations can effectively deliver therapeutic or marker genes to the cells of the respiratory tract. The noninvasive intranasal delivery of gene preparations or conventional drugs seems to be very encouraging, although basic scientific research still has to continue.

  17. Paper-based synthetic gene networks.

    Science.gov (United States)

    Pardee, Keith; Green, Alexander A; Ferrante, Tom; Cameron, D Ewen; DaleyKeyser, Ajay; Yin, Peng; Collins, James J

    2014-11-06

    Synthetic gene networks have wide-ranging uses in reprogramming and rewiring organisms. To date, there has not been a way to harness the vast potential of these networks beyond the constraints of a laboratory or in vivo environment. Here, we present an in vitro paper-based platform that provides an alternate, versatile venue for synthetic biologists to operate and a much-needed medium for the safe deployment of engineered gene circuits beyond the lab. Commercially available cell-free systems are freeze dried onto paper, enabling the inexpensive, sterile, and abiotic distribution of synthetic-biology-based technologies for the clinic, global health, industry, research, and education. For field use, we create circuits with colorimetric outputs for detection by eye and fabricate a low-cost, electronic optical interface. We demonstrate this technology with small-molecule and RNA actuation of genetic switches, rapid prototyping of complex gene circuits, and programmable in vitro diagnostics, including glucose sensors and strain-specific Ebola virus sensors.

  18. Paper-based Synthetic Gene Networks

    Science.gov (United States)

    Pardee, Keith; Green, Alexander A.; Ferrante, Tom; Cameron, D. Ewen; DaleyKeyser, Ajay; Yin, Peng; Collins, James J.

    2014-01-01

    Synthetic gene networks have wide-ranging uses in reprogramming and rewiring organisms. To date, there has not been a way to harness the vast potential of these networks beyond the constraints of a laboratory or in vivo environment. Here, we present an in vitro paper-based platform that provides a new venue for synthetic biologists to operate, and a much-needed medium for the safe deployment of engineered gene circuits beyond the lab. Commercially available cell-free systems are freeze-dried onto paper, enabling the inexpensive, sterile and abiotic distribution of synthetic biology-based technologies for the clinic, global health, industry, research and education. For field use, we create circuits with colorimetric outputs for detection by eye, and fabricate a low-cost, electronic optical interface. We demonstrate this technology with small molecule and RNA actuation of genetic switches, rapid prototyping of complex gene circuits, and programmable in vitro diagnostics, including glucose sensors and strain-specific Ebola virus sensors. PMID:25417167

  19. Prediction of disease-related genes based on weighted tissue-specific networks by using DNA methylation.

    Science.gov (United States)

    Li, Min; Zhang, Jiayi; Liu, Qing; Wang, Jianxin; Wu, Fang-Xiang

    2014-01-01

    Predicting disease-related genes is one of the most important tasks in bioinformatics and systems biology. With the advances in high-throughput techniques, a large number of protein-protein interactions are available, which make it possible to identify disease-related genes at the network level. However, network-based identification of disease-related genes is still a challenge as the considerable false-positives are still existed in the current available protein interaction networks (PIN). Considering the fact that the majority of genetic disorders tend to manifest only in a single or a few tissues, we constructed tissue-specific networks (TSN) by integrating PIN and tissue-specific data. We further weighed the constructed tissue-specific network (WTSN) by using DNA methylation as it plays an irreplaceable role in the development of complex diseases. A PageRank-based method was developed to identify disease-related genes from the constructed networks. To validate the effectiveness of the proposed method, we constructed PIN, weighted PIN (WPIN), TSN, WTSN for colon cancer and leukemia, respectively. The experimental results on colon cancer and leukemia show that the combination of tissue-specific data and DNA methylation can help to identify disease-related genes more accurately. Moreover, the PageRank-based method was effective to predict disease-related genes on the case studies of colon cancer and leukemia. Tissue-specific data and DNA methylation are two important factors to the study of human diseases. The same method implemented on the WTSN can achieve better results compared to those being implemented on original PIN, WPIN, or TSN. The PageRank-based method outperforms degree centrality-based method for identifying disease-related genes from WTSN.

  20. Geographical patterns of adaptation within a species' range : Interactions between drift and gene flow

    NARCIS (Netherlands)

    Alleaume-Benharira, M; Pen, IR; Ronce, O

    We use individual-based stochastic simulations and analytical deterministic predictions to investigate the interaction between drift, natural selection and gene flow on the patterns of local adaptation across a fragmented species' range under clinally varying selection. Migration between populations

  1. Frequency-based time-series gene expression recomposition using PRIISM

    Directory of Open Access Journals (Sweden)

    Rosa Bruce A

    2012-06-01

    Full Text Available Abstract Background Circadian rhythm pathways influence the expression patterns of as much as 31% of the Arabidopsis genome through complicated interaction pathways, and have been found to be significantly disrupted by biotic and abiotic stress treatments, complicating treatment-response gene discovery methods due to clock pattern mismatches in the fold change-based statistics. The PRIISM (Pattern Recomposition for the Isolation of Independent Signals in Microarray data algorithm outlined in this paper is designed to separate pattern changes induced by different forces, including treatment-response pathways and circadian clock rhythm disruptions. Results Using the Fourier transform, high-resolution time-series microarray data is projected to the frequency domain. By identifying the clock frequency range from the core circadian clock genes, we separate the frequency spectrum to different sections containing treatment-frequency (representing up- or down-regulation by an adaptive treatment response, clock-frequency (representing the circadian clock-disruption response and noise-frequency components. Then, we project the components’ spectra back to the expression domain to reconstruct isolated, independent gene expression patterns representing the effects of the different influences. By applying PRIISM on a high-resolution time-series Arabidopsis microarray dataset under a cold treatment, we systematically evaluated our method using maximum fold change and principal component analyses. The results of this study showed that the ranked treatment-frequency fold change results produce fewer false positives than the original methodology, and the 26-hour timepoint in our dataset was the best statistic for distinguishing the most known cold-response genes. In addition, six novel cold-response genes were discovered. PRIISM also provides gene expression data which represents only circadian clock influences, and may be useful for circadian clock studies

  2. Gene environment interaction studies in depression and suicidal behavior: An update.

    Science.gov (United States)

    Mandelli, Laura; Serretti, Alessandro

    2013-12-01

    Increasing evidence supports the involvement of both heritable and environmental risk factors in major depression (MD) and suicidal behavior (SB). Studies investigating gene-environment interaction (G × E) may be useful for elucidating the role of biological mechanisms in the risk for mental disorders. In the present paper, we review the literature regarding the interaction between genes modulating brain functions and stressful life events in the etiology of MD and SB and discuss their potential added benefit compared to genetic studies only. Within the context of G × E investigation, thus far, only a few reliable results have been obtained, although some genes have consistently shown interactive effects with environmental risk in MD and, to a lesser extent, in SB. Further investigation is required to disentangle the direct and mediated effects that are common or specific to MD and SB. Since traditional G × E studies overall suffer from important methodological limitations, further effort is required to develop novel methodological strategies with an interdisciplinary approach. Copyright © 2013 Elsevier Ltd. All rights reserved.

  3. A computational method based on the integration of heterogeneous networks for predicting disease-gene associations.

    Directory of Open Access Journals (Sweden)

    Xingli Guo

    Full Text Available The identification of disease-causing genes is a fundamental challenge in human health and of great importance in improving medical care, and provides a better understanding of gene functions. Recent computational approaches based on the interactions among human proteins and disease similarities have shown their power in tackling the issue. In this paper, a novel systematic and global method that integrates two heterogeneous networks for prioritizing candidate disease-causing genes is provided, based on the observation that genes causing the same or similar diseases tend to lie close to one another in a network of protein-protein interactions. In this method, the association score function between a query disease and a candidate gene is defined as the weighted sum of all the association scores between similar diseases and neighbouring genes. Moreover, the topological correlation of these two heterogeneous networks can be incorporated into the definition of the score function, and finally an iterative algorithm is designed for this issue. This method was tested with 10-fold cross-validation on all 1,126 diseases that have at least a known causal gene, and it ranked the correct gene as one of the top ten in 622 of all the 1,428 cases, significantly outperforming a state-of-the-art method called PRINCE. The results brought about by this method were applied to study three multi-factorial disorders: breast cancer, Alzheimer disease and diabetes mellitus type 2, and some suggestions of novel causal genes and candidate disease-causing subnetworks were provided for further investigation.

  4. Chemical-gene interaction networks and causal reasoning for ...

    Science.gov (United States)

    Evaluating the potential human health and ecological risks associated with exposures to complex chemical mixtures in the environment is one of the main challenges of chemical safety assessment and environmental protection. There is a need for approaches that can help to integrate chemical monitoring and biological effects data to evaluate risks associated with chemicals present in the environment. Here, we used prior knowledge about chemical-gene interactions to develop a knowledge assembly model for detected chemicals at five locations near the North Branch and Chisago wastewater treatment plants (WWTP) in the St. Croix River Basin, MN and WI. The assembly model was used to generate hypotheses about the biological impacts of the chemicals at each location. The hypotheses were tested using empirical hepatic gene expression data from fathead minnows exposed for 12 d at each location. Empirical gene expression data were also mapped to the assembly models to evaluate the likelihood of a chemical contributing to the observed biological responses using richness and concordance statistics. The prior knowledge approach was able predict the observed biological pathways impacted at one site but not the other. Atrazine was identified as a potential contributor to the observed gene expression responses at a location upstream of the North Branch WTTP. Four chemicals were identified as contributors to the observed biological responses at the effluent and downstream o

  5. Fast gene ontology based clustering for microarray experiments.

    Science.gov (United States)

    Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa

    2008-11-21

    Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  6. rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks.

    Science.gov (United States)

    Guo, Liyuan; Wang, Jing

    2018-01-04

    Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. The interaction of BDNF and NTRK2 gene increases the susceptibility of paranoid schizophrenia.

    Directory of Open Access Journals (Sweden)

    Zheng Lin

    Full Text Available The association between BDNF gene functional Val66Met polymorphism rs6265 and the schizophrenia is far from being consistent. In addition to the heterogeneous in schizophrenia per se leading to the inconsistent results, the interaction among multi-genes is probably playing the main role in the pathogenesis of schizophrenia, but not a single gene. Neurotrophic tyrosine kinase receptor 2 (NTRK2 is the high-affinity receptor of BDNF, and was reported to be associated with mood disorders, though no literature reported the association with schizophrenia. Thus, in the present study, total 402 patients with paranoid schizophrenia (the most common subtype of schizophrenia and matched 406 healthy controls were recruited to investigate the role of rs6265 in BDNF, three polymorphisms in NTRK2 gene (rs1387923, rs2769605 and rs1565445 and their interaction in the susceptibility to paranoid schizophrenia in a Chinese Han population. We did not observe significant differences in allele and genotype frequencies between patients and healthy controls for all four polymorphisms separately. The haplotype analysis also showed no association between haplotype of NTRK2 genes (rs1387923, rs2769605, and rs1565445 and paranoid schizophrenia. However, we found the association between the interaction of BDNF and NTRK2 with paranoid schizophrenia by using the MDR method followed by conventional statistical analysis. The best gene-gene interaction model was a three-locus model (BDNF rs6265, NTRK2 rs1387923 and NTRK2 rs2769605, in which one low-risk and three high-risk four-locus genotype combinations were identified. Our findings implied that single polymorphism of rs6265 rs1387923, rs2769605, and rs1565445 in BDNF and NTRK2 were not associated with the development of paranoid schizophrenia in a Han population, however, the interaction of BDNF and NTRK2 genes polymorphisms (BDNF-rs6265, NTRK2-rs1387923 and NTRK2-rs2769605 may be involved in the susceptibility to paranoid

  8. The interaction of BDNF and NTRK2 gene increases the susceptibility of paranoid schizophrenia.

    Science.gov (United States)

    Lin, Zheng; Su, Yousong; Zhang, Chengfang; Xing, Mengjuan; Ding, Wenhua; Liao, Liwei; Guan, Yangtai; Li, Zezhi; Cui, Donghong

    2013-01-01

    The association between BDNF gene functional Val66Met polymorphism rs6265 and the schizophrenia is far from being consistent. In addition to the heterogeneous in schizophrenia per se leading to the inconsistent results, the interaction among multi-genes is probably playing the main role in the pathogenesis of schizophrenia, but not a single gene. Neurotrophic tyrosine kinase receptor 2 (NTRK2) is the high-affinity receptor of BDNF, and was reported to be associated with mood disorders, though no literature reported the association with schizophrenia. Thus, in the present study, total 402 patients with paranoid schizophrenia (the most common subtype of schizophrenia) and matched 406 healthy controls were recruited to investigate the role of rs6265 in BDNF, three polymorphisms in NTRK2 gene (rs1387923, rs2769605 and rs1565445) and their interaction in the susceptibility to paranoid schizophrenia in a Chinese Han population. We did not observe significant differences in allele and genotype frequencies between patients and healthy controls for all four polymorphisms separately. The haplotype analysis also showed no association between haplotype of NTRK2 genes (rs1387923, rs2769605, and rs1565445) and paranoid schizophrenia. However, we found the association between the interaction of BDNF and NTRK2 with paranoid schizophrenia by using the MDR method followed by conventional statistical analysis. The best gene-gene interaction model was a three-locus model (BDNF rs6265, NTRK2 rs1387923 and NTRK2 rs2769605), in which one low-risk and three high-risk four-locus genotype combinations were identified. Our findings implied that single polymorphism of rs6265 rs1387923, rs2769605, and rs1565445 in BDNF and NTRK2 were not associated with the development of paranoid schizophrenia in a Han population, however, the interaction of BDNF and NTRK2 genes polymorphisms (BDNF-rs6265, NTRK2-rs1387923 and NTRK2-rs2769605) may be involved in the susceptibility to paranoid schizophrenia.

  9. Beyond main effects of gene-sets: harsh parenting moderates the association between a dopamine gene-set and child externalizing behavior.

    Science.gov (United States)

    Windhorst, Dafna A; Mileva-Seitz, Viara R; Rippe, Ralph C A; Tiemeier, Henning; Jaddoe, Vincent W V; Verhulst, Frank C; van IJzendoorn, Marinus H; Bakermans-Kranenburg, Marian J

    2016-08-01

    In a longitudinal cohort study, we investigated the interplay of harsh parenting and genetic variation across a set of functionally related dopamine genes, in association with children's externalizing behavior. This is one of the first studies to employ gene-based and gene-set approaches in tests of Gene by Environment (G × E) effects on complex behavior. This approach can offer an important alternative or complement to candidate gene and genome-wide environmental interaction (GWEI) studies in the search for genetic variation underlying individual differences in behavior. Genetic variants in 12 autosomal dopaminergic genes were available in an ethnically homogenous part of a population-based cohort. Harsh parenting was assessed with maternal (n = 1881) and paternal (n = 1710) reports at age 3. Externalizing behavior was assessed with the Child Behavior Checklist (CBCL) at age 5 (71 ± 3.7 months). We conducted gene-set analyses of the association between variation in dopaminergic genes and externalizing behavior, stratified for harsh parenting. The association was statistically significant or approached significance for children without harsh parenting experiences, but was absent in the group with harsh parenting. Similarly, significant associations between single genes and externalizing behavior were only found in the group without harsh parenting. Effect sizes in the groups with and without harsh parenting did not differ significantly. Gene-environment interaction tests were conducted for individual genetic variants, resulting in two significant interaction effects (rs1497023 and rs4922132) after correction for multiple testing. Our findings are suggestive of G × E interplay, with associations between dopamine genes and externalizing behavior present in children without harsh parenting, but not in children with harsh parenting experiences. Harsh parenting may overrule the role of genetic factors in externalizing behavior. Gene-based and gene

  10. Evidence for gene-environment interaction in a genome wide study of nonsyndromic cleft palate

    DEFF Research Database (Denmark)

    Beaty, Terri H; Ruczinski, Ingo; Murray, Jeffrey C

    2011-01-01

    Nonsyndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome-wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international...... consortium. Family-based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption, and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G × E) interaction simultaneously, plus...... multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G × E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G × E interaction when...

  11. Global map of physical interactions among differentially expressed genes in multiple sclerosis relapses and remissions.

    Science.gov (United States)

    Tuller, Tamir; Atar, Shimshi; Ruppin, Eytan; Gurevich, Michael; Achiron, Anat

    2011-09-15

    Multiple sclerosis (MS) is a central nervous system autoimmune inflammatory T-cell-mediated disease with a relapsing-remitting course in the majority of patients. In this study, we performed a high-resolution systems biology analysis of gene expression and physical interactions in MS relapse and remission. To this end, we integrated 164 large-scale measurements of gene expression in peripheral blood mononuclear cells of MS patients in relapse or remission and healthy subjects, with large-scale information about the physical interactions between these genes obtained from public databases. These data were analyzed with a variety of computational methods. We find that there is a clear and significant global network-level signal that is related to the changes in gene expression of MS patients in comparison to healthy subjects. However, despite the clear differences in the clinical symptoms of MS patients in relapse versus remission, the network level signal is weaker when comparing patients in these two stages of the disease. This result suggests that most of the genes have relatively similar expression levels in the two stages of the disease. In accordance with previous studies, we found that the pathways related to regulation of cell death, chemotaxis and inflammatory response are differentially expressed in the disease in comparison to healthy subjects, while pathways related to cell adhesion, cell migration and cell-cell signaling are activated in relapse in comparison to remission. However, the current study includes a detailed report of the exact set of genes involved in these pathways and the interactions between them. For example, we found that the genes TP53 and IL1 are 'network-hub' that interacts with many of the differentially expressed genes in MS patients versus healthy subjects, and the epidermal growth factor receptor is a 'network-hub' in the case of MS patients with relapse versus remission. The statistical approaches employed in this study enabled us

  12. Overview of diet-gene interactions and the example of xanthophylls.

    Science.gov (United States)

    Demmig-Adams, Barbara; Adams, William W

    2010-01-01

    This chapter provides an overview of diet-gene interaction and the role of dietary factors in human health and disease. Human master control genes that regulate processes of fundamental importance, such as cell proliferation and the immune response, are introduced and their modulation by nutraceuticals, produced by plants and photosynthetic microbes, is reviewed. Emphasis is placed on antioxidants and polyunsaturated fatty acids as regulators of master control genes. Furthermore, a case study is presented on xanthophylls, a group of carotenoids with multiple health benefits in the protection against eye disease and other chronic diseases, as well as the synergism between xanthophylls and other dietary factors. Lastly, dietary sources of the xanthophylls zeaxanthin and lutein are reviewed and their enhancement via genetic engineering is discussed.

  13. Global gene expression analysis of the zoonotic parasite Trichinella spiralis revealed novel genes in host parasite interaction.

    Directory of Open Access Journals (Sweden)

    Xiaolei Liu

    Full Text Available BACKGROUND: Trichinellosis is a typical food-borne zoonotic disease which is epidemic worldwide and the nematode Trichinella spiralis is the main pathogen. The life cycle of T. spiralis contains three developmental stages, i.e. adult worms, new borne larva (new borne L1 larva and muscular larva (infective L1 larva. Stage-specific gene expression in the parasites has been investigated with various immunological and cDNA cloning approaches, whereas the genome-wide transcriptome and expression features of the parasite have been largely unknown. The availability of the genome sequence information of T. spiralis has made it possible to deeply dissect parasite biology in association with global gene expression and pathogenesis. METHODOLOGY AND PRINCIPAL FINDINGS: In this study, we analyzed the global gene expression patterns in the three developmental stages of T. spiralis using digital gene expression (DGE analysis. Almost 15 million sequence tags were generated with the Illumina RNA-seq technology, producing expression data for more than 9,000 genes, covering 65% of the genome. The transcriptome analysis revealed thousands of differentially expressed genes within the genome, and importantly, a panel of genes encoding functional proteins associated with parasite invasion and immuno-modulation were identified. More than 45% of the genes were found to be transcribed from both strands, indicating the importance of RNA-mediated gene regulation in the development of the parasite. Further, based on gene ontological analysis, over 3000 genes were functionally categorized and biological pathways in the three life cycle stage were elucidated. CONCLUSIONS AND SIGNIFICANCE: The global transcriptome of T. spiralis in three developmental stages has been profiled, and most gene activity in the genome was found to be developmentally regulated. Many metabolic and biological pathways have been revealed. The findings of the differential expression of several protein

  14. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.

  15. Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks.

    Science.gov (United States)

    Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing

    2009-03-11

    Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene

  16. Dynamic changes in the interchromosomal interaction of early histone gene loci during development of sea urchin.

    Science.gov (United States)

    Matsushita, Masaya; Ochiai, Hiroshi; Suzuki, Ken-Ichi T; Hayashi, Sayaka; Yamamoto, Takashi; Awazu, Akinori; Sakamoto, Naoaki

    2017-12-15

    The nuclear positioning and chromatin dynamics of eukaryotic genes are closely related to the regulation of gene expression, but they have not been well examined during early development, which is accompanied by rapid cell cycle progression and dynamic changes in nuclear organization, such as nuclear size and chromatin constitution. In this study, we focused on the early development of the sea urchin Hemicentrotus pulcherrimus and performed three-dimensional fluorescence in situ hybridization of gene loci encoding early histones (one of the types of histone in sea urchin). There are two non-allelic early histone gene loci per sea urchin genome. We found that during the morula stage, when the early histone gene expression levels are at their maximum, interchromosomal interactions were often formed between the early histone gene loci on separate chromosomes and that the gene loci were directed to locate to more interior positions. Furthermore, these interactions were associated with the active transcription of the early histone genes. Thus, such dynamic interchromosomal interactions may contribute to the efficient synthesis of early histone mRNA during the morula stage of sea urchin development. © 2017. Published by The Company of Biologists Ltd.

  17. Influence of IL1B, IL6 and IL10 gene variants and plasma fatty acid interaction on metabolic syndrome risk in a cross-sectional population-based study.

    Science.gov (United States)

    Maintinguer Norde, Marina; Oki, Erica; Ferreira Carioca, Antonio Augusto; Teixeira Damasceno, Nágila Raquel; Fisberg, Regina Mara; Lobo Marchioni, Dirce Maria; Rogero, Marcelo Macedo

    2018-04-01

    Metabolic syndrome (MetS) is a cluster of interrelated risk factors for type 2 diabetes mellitus, and cardiovascular disease, with underlying inflammatory pathophysiology. Genetic variations and diet are well-known risk factor for MetS, but the interaction between these two factors is less explored. The aim of the study was to evaluate the influence of interaction between SNP of inflammatory genes (encoding interleukin (IL)-6, IL-1β and IL-10) and plasma fatty acids on the odds of MetS, in a population-based cross-sectional study. Among participants of the Health Survey - São Paulo, 301 adults (19-59 y) from whom a blood sample was collected were included. Individuals with and without MetS were compared according to their plasma inflammatory biomarkers, fatty acid profile, and genotype frequency of the IL1B (rs16944, rs1143623, rs1143627, rs1143634 and rs1143643), IL6 (rs1800795, rs1800796 and rs1800797) and IL10 (rs1554286, rs1800871, rs1800872, rs1800890 and rs3024490) genes SNP. The influence of gene-fatty acids interaction on MetS risk was investigated. IL6 gene SNP rs1800795 G allele was associated with higher odds for MetS (OR = 1.88; p = 0.017). Gene-fatty acid interaction was found between the IL1B gene SNP rs116944 and stearic acid (p inter = 0.043), and between rs1143634 and EPA (p inter = 0.017). For the IL10 gene SNP rs1800896, an interaction was found for arachidonic acid (p inter = 0.007) and estimated D5D activity (p inter = 0.019). The IL6 gene SNP rs1800795 G allele is associated with increased odds for MetS. Plasma fatty acid profile interacts with the IL1B and IL10 gene variants to modulate the odds for MetS. This and other interactions of risk factors can account for the unexplained heritability of MetS, and their elucidation can lead to new strategies for genome-customized prevention of MetS. Copyright © 2017 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.

  18. Modeling Gene-Environment Interactions With Quasi-Natural Experiments.

    Science.gov (United States)

    Schmitz, Lauren; Conley, Dalton

    2017-02-01

    This overview develops new empirical models that can effectively document Gene × Environment (G×E) interactions in observational data. Current G×E studies are often unable to support causal inference because they use endogenous measures of the environment or fail to adequately address the nonrandom distribution of genes across environments, confounding estimates. Comprehensive measures of genetic variation are incorporated into quasi-natural experimental designs to exploit exogenous environmental shocks or isolate variation in environmental exposure to avoid potential confounders. In addition, we offer insights from population genetics that improve upon extant approaches to address problems from population stratification. Together, these tools offer a powerful way forward for G×E research on the origin and development of social inequality across the life course. © 2015 Wiley Periodicals, Inc.

  19. Gene expression and gene therapy imaging

    International Nuclear Information System (INIS)

    Rome, Claire; Couillaud, Franck; Moonen, Chrit T.W.

    2007-01-01

    The fast growing field of molecular imaging has achieved major advances in imaging gene expression, an important element of gene therapy. Gene expression imaging is based on specific probes or contrast agents that allow either direct or indirect spatio-temporal evaluation of gene expression. Direct evaluation is possible with, for example, contrast agents that bind directly to a specific target (e.g., receptor). Indirect evaluation may be achieved by using specific substrate probes for a target enzyme. The use of marker genes, also called reporter genes, is an essential element of MI approaches for gene expression in gene therapy. The marker gene may not have a therapeutic role itself, but by coupling the marker gene to a therapeutic gene, expression of the marker gene reports on the expression of the therapeutic gene. Nuclear medicine and optical approaches are highly sensitive (detection of probes in the picomolar range), whereas MRI and ultrasound imaging are less sensitive and require amplification techniques and/or accumulation of contrast agents in enlarged contrast particles. Recently developed MI techniques are particularly relevant for gene therapy. Amongst these are the possibility to track gene therapy vectors such as stem cells, and the techniques that allow spatiotemporal control of gene expression by non-invasive heating (with MRI guided focused ultrasound) and the use of temperature sensitive promoters. (orig.)

  20. A multilevel prediction of physiological response to challenge: Interactions among child maltreatment, neighborhood crime, endothelial nitric oxide synthase gene (eNOS), and GABA(A) receptor subunit alpha-6 gene (GABRA6).

    Science.gov (United States)

    Lynch, Michael; Manly, Jody Todd; Cicchetti, Dante

    2015-11-01

    Physiological response to stress has been linked to a variety of healthy and pathological conditions. The current study conducted a multilevel examination of interactions among environmental toxins (i.e., neighborhood crime and child maltreatment) and specific genetic polymorphisms of the endothelial nitric oxide synthase gene (eNOS) and GABA(A) receptor subunit alpha-6 gene (GABRA6). One hundred eighty-six children were recruited at age 4. The presence or absence of child maltreatment as well as the amount of crime that occurred in their neighborhood during the previous year were determined at that time. At age 9, the children were brought to the lab, where their physiological response to a cognitive challenge (i.e., change in the amplitude of the respiratory sinus arrhythmia) was assessed and DNA samples were collected for subsequent genotyping. The results confirmed that complex Gene × Gene, Environment × Environment, and Gene × Environment interactions were associated with different patterns of respiratory sinus arrhythmia reactivity. The implications for future research and evidence-based intervention are discussed.

  1. Fast Gene Ontology based clustering for microarray experiments

    Directory of Open Access Journals (Sweden)

    Ovaska Kristian

    2008-11-01

    Full Text Available Abstract Background Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. Results We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Conclusion Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.

  2. Distilling a Visual Network of Retinitis Pigmentosa Gene-Protein Interactions to Uncover New Disease Candidates.

    Directory of Open Access Journals (Sweden)

    Daniel Boloc

    Full Text Available Retinitis pigmentosa (RP is a highly heterogeneous genetic visual disorder with more than 70 known causative genes, some of them shared with other non-syndromic retinal dystrophies (e.g. Leber congenital amaurosis, LCA. The identification of RP genes has increased steadily during the last decade, and the 30% of the cases that still remain unassigned will soon decrease after the advent of exome/genome sequencing. A considerable amount of genetic and functional data on single RD genes and mutations has been gathered, but a comprehensive view of the RP genes and their interacting partners is still very fragmentary. This is the main gap that needs to be filled in order to understand how mutations relate to progressive blinding disorders and devise effective therapies.We have built an RP-specific network (RPGeNet by merging data from different sources: high-throughput data from BioGRID and STRING databases, manually curated data for interactions retrieved from iHOP, as well as interactions filtered out by syntactical parsing from up-to-date abstracts and full-text papers related to the RP research field. The paths emerging when known RP genes were used as baits over the whole interactome have been analysed, and the minimal number of connections among the RP genes and their close neighbors were distilled in order to simplify the search space.In contrast to the analysis of single isolated genes, finding the networks linking disease genes renders powerful etiopathological insights. We here provide an interactive interface, RPGeNet, for the molecular biologist to explore the network centered on the non-syndromic and syndromic RP and LCA causative genes. By integrating tissue-specific expression levels and phenotypic data on top of that network, a more comprehensive biological view will highlight key molecular players of retinal degeneration and unveil new RP disease candidates.

  3. Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee.

    OpenAIRE

    Taye H Hamza; Honglei Chen; Erin M Hill-Burns; Shannon L Rhodes; Jennifer Montimurro; Denise M Kay; Albert Tenesa; Victoria I Kusel; Patricia Sheehan; Muthukrishnan Eaaswarkhanth; Dora Yearout; Ali Samii; John W Roberts; Pinky Agarwal; Yvette Bordelon

    2011-01-01

    Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal compo...

  4. Gene-Lifestyle Interactions in Complex Diseases: Design and Description of the GLACIER and VIKING Studies.

    Science.gov (United States)

    Kurbasic, Azra; Poveda, Alaitz; Chen, Yan; Agren, Asa; Engberg, Elisabeth; Hu, Frank B; Johansson, Ingegerd; Barroso, Ines; Brändström, Anders; Hallmans, Göran; Renström, Frida; Franks, Paul W

    2014-12-01

    Most complex diseases have well-established genetic and non-genetic risk factors. In some instances, these risk factors are likely to interact, whereby their joint effects convey a level of risk that is either significantly more or less than the sum of these risks. Characterizing these gene-environment interactions may help elucidate the biology of complex diseases, as well as to guide strategies for their targeted prevention. In most cases, the detection of gene-environment interactions will require sample sizes in excess of those needed to detect the marginal effects of the genetic and environmental risk factors. Although many consortia have been formed, comprising multiple diverse cohorts to detect gene-environment interactions, few robust examples of such interactions have been discovered. This may be because combining data across studies, usually through meta-analysis of summary data from the contributing cohorts, is often a statistically inefficient approach for the detection of gene-environment interactions. Ideally, single, very large and well-genotyped prospective cohorts, with validated measures of environmental risk factor and disease outcomes should be used to study interactions. The presence of strong founder effects within those cohorts might further strengthen the capacity to detect novel genetic effects and gene-environment interactions. Access to accurate genealogical data would also aid in studying the diploid nature of the human genome, such as genomic imprinting (parent-of-origin effects). Here we describe two studies from northern Sweden (the GLACIER and VIKING studies) that fulfill these characteristics.

  5. GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores

    Directory of Open Access Journals (Sweden)

    Wang Kai

    2011-05-01

    Full Text Available Abstract Background Gene-gene interaction in genetic association studies is computationally intensive when a large number of SNPs are involved. Most of the latest Central Processing Units (CPUs have multiple cores, whereas Graphics Processing Units (GPUs also have hundreds of cores and have been recently used to implement faster scientific software. However, currently there are no genetic analysis software packages that allow users to fully utilize the computing power of these multi-core devices for genetic interaction analysis for binary traits. Findings Here we present a novel software package GENIE, which utilizes the power of multiple GPU or CPU processor cores to parallelize the interaction analysis. GENIE reads an entire genetic association study dataset into memory and partitions the dataset into fragments with non-overlapping sets of SNPs. For each fragment, GENIE analyzes: 1 the interaction of SNPs within it in parallel, and 2 the interaction between the SNPs of the current fragment and other fragments in parallel. We tested GENIE on a large-scale candidate gene study on high-density lipoprotein cholesterol. Using an NVIDIA Tesla C1060 graphics card, the GPU mode of GENIE achieves a speedup of 27 times over its single-core CPU mode run. Conclusions GENIE is open-source, economical, user-friendly, and scalable. Since the computing power and memory capacity of graphics cards are increasing rapidly while their cost is going down, we anticipate that GENIE will achieve greater speedups with faster GPU cards. Documentation, source code, and precompiled binaries can be downloaded from http://www.cceb.upenn.edu/~mli/software/GENIE/.

  6. dictyExpress: a Dictyostelium discoideum gene expression database with an explorative data analysis web-based interface

    Science.gov (United States)

    Rot, Gregor; Parikh, Anup; Curk, Tomaz; Kuspa, Adam; Shaulsky, Gad; Zupan, Blaz

    2009-01-01

    Background Bioinformatics often leverages on recent advancements in computer science to support biologists in their scientific discovery process. Such efforts include the development of easy-to-use web interfaces to biomedical databases. Recent advancements in interactive web technologies require us to rethink the standard submit-and-wait paradigm, and craft bioinformatics web applications that share analytical and interactive power with their desktop relatives, while retaining simplicity and availability. Results We have developed dictyExpress, a web application that features a graphical, highly interactive explorative interface to our database that consists of more than 1000 Dictyostelium discoideum gene expression experiments. In dictyExpress, the user can select experiments and genes, perform gene clustering, view gene expression profiles across time, view gene co-expression networks, perform analyses of Gene Ontology term enrichment, and simultaneously display expression profiles for a selected gene in various experiments. Most importantly, these tasks are achieved through web applications whose components are seamlessly interlinked and immediately respond to events triggered by the user, thus providing a powerful explorative data analysis environment. Conclusion dictyExpress is a precursor for a new generation of web-based bioinformatics applications with simple but powerful interactive interfaces that resemble that of the modern desktop. While dictyExpress serves mainly the Dictyostelium research community, it is relatively easy to adapt it to other datasets. We propose that the design ideas behind dictyExpress will influence the development of similar applications for other model organisms. PMID:19706156

  7. A postprocessing method in the HMC framework for predicting gene function based on biological instrumental data

    Science.gov (United States)

    Feng, Shou; Fu, Ping; Zheng, Wenbin

    2018-03-01

    Predicting gene function based on biological instrumental data is a complicated and challenging hierarchical multi-label classification (HMC) problem. When using local approach methods to solve this problem, a preliminary results processing method is usually needed. This paper proposed a novel preliminary results processing method called the nodes interaction method. The nodes interaction method revises the preliminary results and guarantees that the predictions are consistent with the hierarchy constraint. This method exploits the label dependency and considers the hierarchical interaction between nodes when making decisions based on the Bayesian network in its first phase. In the second phase, this method further adjusts the results according to the hierarchy constraint. Implementing the nodes interaction method in the HMC framework also enhances the HMC performance for solving the gene function prediction problem based on the Gene Ontology (GO), the hierarchy of which is a directed acyclic graph that is more difficult to tackle. The experimental results validate the promising performance of the proposed method compared to state-of-the-art methods on eight benchmark yeast data sets annotated by the GO.

  8. Protein-Protein Interaction Network and Gene Ontology

    Science.gov (United States)

    Choi, Yunkyu; Kim, Seok; Yi, Gwan-Su; Park, Jinah

    Evolution of computer technologies makes it possible to access a large amount and various kinds of biological data via internet such as DNA sequences, proteomics data and information discovered about them. It is expected that the combination of various data could help researchers find further knowledge about them. Roles of a visualization system are to invoke human abilities to integrate information and to recognize certain patterns in the data. Thus, when the various kinds of data are examined and analyzed manually, an effective visualization system is an essential part. One instance of these integrated visualizations can be combination of protein-protein interaction (PPI) data and Gene Ontology (GO) which could help enhance the analysis of PPI network. We introduce a simple but comprehensive visualization system that integrates GO and PPI data where GO and PPI graphs are visualized side-by-side and supports quick reference functions between them. Furthermore, the proposed system provides several interactive visualization methods for efficiently analyzing the PPI network and GO directedacyclic- graph such as context-based browsing and common ancestors finding.

  9. Interactive effects of antioxidant genes and air pollution on respiratory function and airway disease: a HuGE review.

    Science.gov (United States)

    Minelli, Cosetta; Wei, Igor; Sagoo, Gurdeep; Jarvis, Debbie; Shaheen, Seif; Burney, Peter

    2011-03-15

    Susceptibility to the respiratory effects of air pollution varies between individuals. Although some evidence suggests higher susceptibility for subjects carrying variants of antioxidant genes, findings from gene-pollution interaction studies conflict in terms of the presence and direction of interactions. The authors conducted a systematic review on antioxidant gene-pollution interactions which included 15 studies, with 12 supporting the presence of interactions. For the glutathione S-transferase M1 gene (GSTM1) (n=10 studies), only 1 study found interaction with the null genotype alone, although 5 observed interactions when GSTM1 was evaluated jointly with other genes (mainly NAD(P)H dehydrogenase [quinone] 1 (NQO1)). All studies on the glutathione S-transferase P1 (GSTP1) Ile105Val polymorphism (n=11) provided some evidence of interaction, but findings conflicted in terms of risk allele. Results were negative for glutathione S-transferase T1 (GSTT1) (n=3) and positive for heme oxygenase 1 (HMOX-1) (n=2). Meta-analysis could not be performed because there were insufficient data available for any specific gene-pollutant-outcome combination. Overall the evidence supports the presence of gene-pollution interactions, although which pollutant interacts with which gene is unclear. However, issues regarding multiple testing, selective reporting, and publication bias raise the possibility of false-positive findings. Larger studies with greater accuracy of pollution assessment and improved quality of conduct and reporting are required. © The Author 2011. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved.

  10. A network-based gene expression signature informs prognosis and treatment for colorectal cancer patients.

    Directory of Open Access Journals (Sweden)

    Mingguang Shi

    Full Text Available Several studies have reported gene expression signatures that predict recurrence risk in stage II and III colorectal cancer (CRC patients with minimal gene membership overlap and undefined biological relevance. The goal of this study was to investigate biological themes underlying these signatures, to infer genes of potential mechanistic importance to the CRC recurrence phenotype and to test whether accurate prognostic models can be developed using mechanistically important genes.We investigated eight published CRC gene expression signatures and found no functional convergence in Gene Ontology enrichment analysis. Using a random walk-based approach, we integrated these signatures and publicly available somatic mutation data on a protein-protein interaction network and inferred 487 genes that were plausible candidate molecular underpinnings for the CRC recurrence phenotype. We named the list of 487 genes a NEM signature because it integrated information from Network, Expression, and Mutation. The signature showed significant enrichment in four biological processes closely related to cancer pathophysiology and provided good coverage of known oncogenes, tumor suppressors, and CRC-related signaling pathways. A NEM signature-based Survival Support Vector Machine prognostic model was trained using a microarray gene expression dataset and tested on an independent dataset. The model-based scores showed a 75.7% concordance with the real survival data and separated patients into two groups with significantly different relapse-free survival (p = 0.002. Similar results were obtained with reversed training and testing datasets (p = 0.007. Furthermore, adjuvant chemotherapy was significantly associated with prolonged survival of the high-risk patients (p = 0.006, but not beneficial to the low-risk patients (p = 0.491.The NEM signature not only reflects CRC biology but also informs patient prognosis and treatment response. Thus, the network-based

  11. Radioresistance related genes screened by protein-protein interaction network analysis in nasopharyngeal carcinoma

    International Nuclear Information System (INIS)

    Zhu Xiaodong; Guo Ya; Qu Song; Li Ling; Huang Shiting; Li Danrong; Zhang Wei

    2012-01-01

    Objective: To discover radioresistance associated molecular biomarkers and its mechanism in nasopharyngeal carcinoma by protein-protein interaction network analysis. Methods: Whole genome expression microarray was applied to screen out differentially expressed genes in two cell lines CNE-2R and CNE-2 with different radiosensitivity. Four differentially expressed genes were randomly selected for further verification by the semi-quantitative RT-PCR analysis with self-designed primers. The common differentially expressed genes from two experiments were analyzed with the SNOW online database in order to find out the central node related to the biomarkers of nasopharyngeal carcinoma radioresistance. The expression of STAT1 in CNE-2R and CNE-2 cells was measured by Western blot. Results: Compared with CNE-2 cells, 374 genes in CNE-2R cells were differentially expressed while 197 genes showed significant differences. Four randomly selected differentially expressed genes were verified by RT-PCR and had same change trend in consistent with the results of chip assay. Analysis with the SNOW database demonstrated that those 197 genes could form a complicated interaction network where STAT1 and JUN might be two key nodes. Indeed, the STAT1-α expression in CNE-2R was higher than that in CNE-2 (t=4.96, P<0.05). Conclusions: The key nodes of STAT1 and JUN may be the molecular biomarkers leading to radioresistance in nasopharyngeal carcinoma, and STAT1-α might have close relationship with radioresistance. (authors)

  12. The role of genes involved in neuroplasticity and neurogenesis in the observation of a gene-environment interaction (GxE) in schizophrenia.

    Science.gov (United States)

    Le Strat, Yann; Ramoz, Nicolas; Gorwood, Philip

    2009-05-01

    Schizophrenia is a multifactorial disease characterized by a high heritability. Several candidate genes have been suggested, with the strongest evidences for genes encoding dystrobrevin binding protein 1 (DTNBP1), neuregulin 1 (NRG1), neuregulin 1 receptor (ERBB4) and disrupted in schizophrenia 1 (DISC1), as well as several neurotrophic factors. These genes are involved in neuronal plasticity and play also a role in adult neurogenesis. Therefore, the genetic basis of schizophrenia could involve different factors more or less specifically required for neuroplasticity, including the synapse maturation, potentiation and plasticity as well as neurogenesis. Following the model of Knudson in tumors, we propose a two-hit hypothesis of schizophrenia. In this model of gene-environment interaction, a variant in a gene related to neurogenesis is transmitted to the descent (first hit), and, secondarily, an environmental factor occurs during the development of the central nervous system (second hit). Both of these vulnerability and trigger factors are probably necessary to generate a deficit in neurogenesis and therefore to cause schizophrenia. The literature supporting this gene x environment hypothesis is reviewed, with emphasis on some molecular pathways, raising the possibility to propose more specific molecular medicine.

  13. Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria.

    Directory of Open Access Journals (Sweden)

    Mario Fruzangohar

    Full Text Available The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO, which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in PostgreSQL database server and developed a Java application to extract data from source files and loaded into database automatically. We developed a PHP web application based on Model-View-Control architecture, used a specific data structure as well as current and novel algorithms to estimate GO graphs parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways, and mechanism(s of infection. It can also aid in the discovery of genes associated with specific function(s for investigation as a novel vaccine or therapeutic targets.http://turing.ersa.edu.au/BacteriaGO.

  14. Pollen Killer Gene S35 Function Requires Interaction with an Activator That Maps Close to S24, Another Pollen Killer Gene in Rice

    Directory of Open Access Journals (Sweden)

    Takahiko Kubo

    2016-05-01

    Full Text Available Pollen killer genes disable noncarrier pollens, and are responsible for male sterility and segregation distortion in hybrid populations of distantly related plant species. The genetic networks and the molecular mechanisms underlying the pollen killer system remain largely unknown. Two pollen killer genes, S24 and S35, have been found in an intersubspecific cross of Oryza sativa ssp. indica and japonica. The effect of S24 is counteracted by an unlinked locus EFS. Additionally, S35 has been proposed to interact with S24 to induce pollen sterility. These genetic interactions are suggestive of a single S24-centric genetic pathway (EFS–S24–S35 for the pollen killer system. To examine this hypothetical genetic pathway, the S35 and the S24 regions were further characterized and genetically dissected in this study. Our results indicated that S35 causes pollen sterility independently of both the EFS and S24 genes, but is dependent on a novel gene close to the S24 locus, named incentive for killing pollen (INK. We confirmed the phenotypic effect of the INK gene separately from the S24 gene, and identified the INK locus within an interval of less than 0.6 Mb on rice chromosome 5. This study characterized the genetic effect of the two independent genetic pathways of INK–S35 and EFS–S24 in indica–japonica hybrid progeny. Our results provide clear evidence that hybrid male sterility in rice is caused by several pollen killer networks with multiple factors positively and negatively regulating pollen killer genes.

  15. Pollen Killer Gene S35 Function Requires Interaction with an Activator That Maps Close to S24, Another Pollen Killer Gene in Rice.

    Science.gov (United States)

    Kubo, Takahiko; Yoshimura, Atsushi; Kurata, Nori

    2016-05-03

    Pollen killer genes disable noncarrier pollens, and are responsible for male sterility and segregation distortion in hybrid populations of distantly related plant species. The genetic networks and the molecular mechanisms underlying the pollen killer system remain largely unknown. Two pollen killer genes, S24 and S35, have been found in an intersubspecific cross of Oryza sativa ssp. indica and japonica The effect of S24 is counteracted by an unlinked locus EFS Additionally, S35 has been proposed to interact with S24 to induce pollen sterility. These genetic interactions are suggestive of a single S24-centric genetic pathway (EFS-S24-S35) for the pollen killer system. To examine this hypothetical genetic pathway, the S35 and the S24 regions were further characterized and genetically dissected in this study. Our results indicated that S35 causes pollen sterility independently of both the EFS and S24 genes, but is dependent on a novel gene close to the S24 locus, named incentive for killing pollen (INK). We confirmed the phenotypic effect of the INK gene separately from the S24 gene, and identified the INK locus within an interval of less than 0.6 Mb on rice chromosome 5. This study characterized the genetic effect of the two independent genetic pathways of INK-S35 and EFS-S24 in indica-japonica hybrid progeny. Our results provide clear evidence that hybrid male sterility in rice is caused by several pollen killer networks with multiple factors positively and negatively regulating pollen killer genes. Copyright © 2016 Kubo et al.

  16. Diet-gene interactions between dietary fat intake and common polymorphisms in determining lipid metabolism

    Directory of Open Access Journals (Sweden)

    Corella, Dolores

    2009-03-01

    Full Text Available Current dietary guidelines for fat intake have not taken into consideration the possible genetic differences underlying the individual variability in responsiveness to dietary components. Genetic variability has been identified in humans for all the known lipid metabolim-related genes resulting in a plethora of candidate genes and genetic variants to examine in diet-gene interaction studies focused on fat consumption. Some examples of fat-gene interaction are reviewed. These include: the interaction between total intake and the 514C/T in the hepatic lipase gene promoter in determining high-density lipoprotein cholesterol (HDL-C metabolism; the interaction between polyunsaturated fatty acids (PUFA and the 75G/A polymorphism in the APOA1 gene plasma HDL-C concentrations; the interaction between PUFA and the L162V polymorphism in the PPARA gene in determining triglycerides and APOC3 concentrations; and the interaction between PUFA intake and the 1131TC in the APOA5 gene in determining triglyceride metabolism. Although hundreds of diet-gene interaction studies in lipid metabolism have been published, the level of evidence to make specific nutritional recommendations to the population is still low and more research in nutrigenetics has to be undertaken.Las recomendaciones dietéticas actuales referentes al consumo de grasas en la dieta han sido realizadas sin tener en cuenta las posibles diferencias genéticas de las personas que podrían ser las responsables de las diferentes respuestas interindividuales que frecuentemente se observan ante la misma dieta. La presencia de variabilidad genética ha sido puesta de manifiesto para todos los genes relacionados con el metabolismo lipídico, por lo que existe un ingente número de genes y de variantes genéticas para ser incluidas en los estudios sobre interacciones dieta-genotipo en el ámbito específico del consumo de grasas y aceites. Se revisarán algunos ejemplos sobre interacciones grasa

  17. Inferring gene dependency network specific to phenotypic alteration based on gene expression data and clinical information of breast cancer.

    Science.gov (United States)

    Zhou, Xionghui; Liu, Juan

    2014-01-01

    Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for

  18. Reference gene selection for quantitative gene expression studies during biological invasions: A test on multiple genes and tissues in a model ascidian Ciona savignyi.

    Science.gov (United States)

    Huang, Xuena; Gao, Yangchun; Jiang, Bei; Zhou, Zunchun; Zhan, Aibin

    2016-01-15

    As invasive species have successfully colonized a wide range of dramatically different local environments, they offer a good opportunity to study interactions between species and rapidly changing environments. Gene expression represents one of the primary and crucial mechanisms for rapid adaptation to local environments. Here, we aim to select reference genes for quantitative gene expression analysis based on quantitative Real-Time PCR (qRT-PCR) for a model invasive ascidian, Ciona savignyi. We analyzed the stability of ten candidate reference genes in three tissues (siphon, pharynx and intestine) under two key environmental stresses (temperature and salinity) in the marine realm based on three programs (geNorm, NormFinder and delta Ct method). Our results demonstrated only minor difference for stability rankings among the three methods. The use of different single reference gene might influence the data interpretation, while multiple reference genes could minimize possible errors. Therefore, reference gene combinations were recommended for different tissues - the optimal reference gene combination for siphon was RPS15 and RPL17 under temperature stress, and RPL17, UBQ and TubA under salinity treatment; for pharynx, TubB, TubA and RPL17 were the most stable genes under temperature stress, while TubB, TubA and UBQ were the best under salinity stress; for intestine, UBQ, RPS15 and RPL17 were the most reliable reference genes under both treatments. Our results suggest that the necessity of selection and test of reference genes for different tissues under varying environmental stresses. The results obtained here are expected to reveal mechanisms of gene expression-mediated invasion success using C. savignyi as a model species. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. Crowdsourcing the nodulation gene network discovery environment.

    Science.gov (United States)

    Li, Yupeng; Jackson, Scott A

    2016-05-26

    The Legumes (Fabaceae) are an economically and ecologically important group of plant species with the conspicuous capacity for symbiotic nitrogen fixation in root nodules, specialized plant organs containing symbiotic microbes. With the aim of understanding the underlying molecular mechanisms leading to nodulation, many efforts are underway to identify nodulation-related genes and determine how these genes interact with each other. In order to accurately and efficiently reconstruct nodulation gene network, a crowdsourcing platform, CrowdNodNet, was created. The platform implements the jQuery and vis.js JavaScript libraries, so that users are able to interactively visualize and edit the gene network, and easily access the information about the network, e.g. gene lists, gene interactions and gene functional annotations. In addition, all the gene information is written on MediaWiki pages, enabling users to edit and contribute to the network curation. Utilizing the continuously updated, collaboratively written, and community-reviewed Wikipedia model, the platform could, in a short time, become a comprehensive knowledge base of nodulation-related pathways. The platform could also be used for other biological processes, and thus has great potential for integrating and advancing our understanding of the functional genomics and systems biology of any process for any species. The platform is available at http://crowd.bioops.info/ , and the source code can be openly accessed at https://github.com/bioops/crowdnodnet under MIT License.

  20. Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

    Science.gov (United States)

    Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

    2018-04-23

    Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis

  1. Imaging gene expression in gene therapy

    International Nuclear Information System (INIS)

    Wiebe, Leonard I.

    1997-01-01

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on 'suicide gene therapy' of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k + ) has been use for 'suicide' in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k + gene expression where the H S V-1 t k + gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([ 18 F]F H P G; [ 18 F]-A C V), and pyrimidine- ([ 123 / 131 I]I V R F U; [ 124 / 131I ]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [ 123 / 131I ]I V R F U imaging with the H S V-1 t k + reporter gene will be presented

  2. Gene interactions in the DNA damage-response pathway identified by genome-wide RNA-interference analysis of synthetic lethality

    NARCIS (Netherlands)

    van Haaften, Gijs; Vastenhouw, Nadine L; Nollen, Ellen A A; Plasterk, Ronald H A; Tijsterman, Marcel

    2004-01-01

    Here, we describe a systematic search for synthetic gene interactions in a multicellular organism, the nematode Caenorhabditis elegans. We established a high-throughput method to determine synthetic gene interactions by genome-wide RNA interference and identified genes that are required to protect

  3. Evidence for gene-environment interaction in a genome wide study of isolated, non-syndromic cleft palate

    Science.gov (United States)

    Beaty, Terri H.; Ruczinski, Ingo; Murray, Jeffrey C.; Marazita, Mary L.; Munger, Ronald G.; Hetmanski, Jacqueline B.; Murray, Tanda; Redett, Richard J.; Fallin, M. Daniele; Liang, Kung Yee; Wu, Tao; Patel, Poorav J.; Jin, Sheng C.; Zhang, Tian Xiao; Schwender, Holger; Wu-Chou, Yah Huei; Chen, Philip K; Chong, Samuel S; Cheah, Felicia; Yeow, Vincent; Ye, Xiaoqian; Wang, Hong; Huang, Shangzhi; Jabs, Ethylin W.; Shi, Bing; Wilcox, Allen J.; Lie, Rolv T.; Jee, Sun Ha; Christensen, Kaare; Doheny, Kimberley F.; Pugh, Elizabeth W.; Ling, Hua; Scott, Alan F.

    2011-01-01

    Non-syndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international consortium. Family based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G×E) interaction simultaneously, plus a separate 1 df test for G×E interaction alone. Conditional logistic regression models were used to estimate effects on risk to exposed and unexposed children. While no SNP achieved genome wide significance when considered alone, markers in several genes attained or approached genome wide significance when G×E interaction was included. Among these, MLLT3 and SMC2 on chromosome 9 showed multiple SNPs resulting in increased risk if the mother consumed alcohol during the peri-conceptual period (3 months prior to conception through the first trimester). TBK1 on chr. 12 and ZNF236 on chr. 18 showed multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G×E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G×E interaction when searching for genes influencing risk to complex and heterogeneous disorders, such as non-syndromic CP. PMID:21618603

  4. Improving functional modules discovery by enriching interaction networks with gene profiles

    KAUST Repository

    Salem, Saeed

    2013-05-01

    Recent advances in proteomic and transcriptomic technologies resulted in the accumulation of vast amount of high-throughput data that span multiple biological processes and characteristics in different organisms. Much of the data come in the form of interaction networks and mRNA expression arrays. An important task in systems biology is functional modules discovery where the goal is to uncover well-connected sub-networks (modules). These discovered modules help to unravel the underlying mechanisms of the observed biological processes. While most of the existing module discovery methods use only the interaction data, in this work we propose, CLARM, which discovers biological modules by incorporating gene profiles data with protein-protein interaction networks. We demonstrate the effectiveness of CLARM on Yeast and Human interaction datasets, and gene expression and molecular function profiles. Experiments on these real datasets show that the CLARM approach is competitive to well established functional module discovery methods.

  5. Network Based Integrated Analysis of Phenotype-Genotype Data for Prioritization of Candidate Symptom Genes

    Directory of Open Access Journals (Sweden)

    Xing Li

    2014-01-01

    Full Text Available Background. Symptoms and signs (symptoms in brief are the essential clinical manifestations for individualized diagnosis and treatment in traditional Chinese medicine (TCM. To gain insights into the molecular mechanism of symptoms, we develop a computational approach to identify the candidate genes of symptoms. Methods. This paper presents a network-based approach for the integrated analysis of multiple phenotype-genotype data sources and the prediction of the prioritizing genes for the associated symptoms. The method first calculates the similarities between symptoms and diseases based on the symptom-disease relationships retrieved from the PubMed bibliographic database. Then the disease-gene associations and protein-protein interactions are utilized to construct a phenotype-genotype network. The PRINCE algorithm is finally used to rank the potential genes for the associated symptoms. Results. The proposed method gets reliable gene rank list with AUC (area under curve 0.616 in classification. Some novel genes like CALCA, ESR1, and MTHFR were predicted to be associated with headache symptoms, which are not recorded in the benchmark data set, but have been reported in recent published literatures. Conclusions. Our study demonstrated that by integrating phenotype-genotype relationships into a complex network framework it provides an effective approach to identify candidate genes of symptoms.

  6. Identification of novel risk genes associated with type 1 diabetes mellitus using a genome-wide gene-based association analysis.

    Science.gov (United States)

    Qiu, Ying-Hua; Deng, Fei-Yan; Li, Min-Jing; Lei, Shu-Feng

    2014-11-01

    Type 1 diabetes mellitus is a serious disorder characterized by destruction of pancreatic β-cells, culminating in absolute insulin deficiency. Genetic factors contribute to the susceptibility of type 1 diabetes mellitus. The aim of the present study was to identify more susceptibility genes of type 1 diabetes mellitus. We carried out an initial gene-based genome-wide association study in a total of 4,075 type 1 diabetes mellitus cases and 2,604 controls by using the Gene-based Association Test using Extended Simes procedure. Furthermore, we carried out replication studies, differential expression analysis and functional annotation clustering analysis to support the significance of the identified susceptibility genes. We identified 452 genes associated with type 1 diabetes mellitus, even after adapting the genome-wide threshold for significance (P diabetes mellitus, which were ignored in single-nucleotide polymorphism-based association analysis and were not previously reported. We found that 53 genes have supportive evidence from replication studies and/or differential expression studies. In particular, seven genes including four non-human leukocyte antigen (HLA) genes (RASIP1, STRN4, BCAR1 and MYL2) are replicated in at least one independent population and also differentially expressed in peripheral blood mononuclear cells or monocytes. Furthermore, the associated genes tend to enrich in immune-related pathways or Gene Ontology project terms. The present results suggest the high power of gene-based association analysis in detecting disease-susceptibility genes. Our findings provide more insights into the genetic basis of type 1 diabetes mellitus.

  7. Gene-Environment Interactions in Genome-Wide Association Studies: Current Approaches and New Directions

    Science.gov (United States)

    Winham, Stacey J.; Biernacka, Joanna M.

    2013-01-01

    Background: Complex psychiatric traits have long been thought to be the result of a combination of genetic and environmental factors, and gene-environment interactions are thought to play a crucial role in behavioral phenotypes and the susceptibility and progression of psychiatric disorders. Candidate gene studies to investigate hypothesized…

  8. Genes related to antioxidant metabolism are involved in Methylobacterium mesophilicum-soybean interaction.

    Science.gov (United States)

    Araújo, Welington Luiz; Santos, Daiene Souza; Dini-Andreote, Francisco; Salgueiro-Londoño, Jennifer Katherine; Camargo-Neves, Aline Aparecida; Andreote, Fernando Dini; Dourado, Manuella Nóbrega

    2015-10-01

    The genus Methylobacterium is composed of pink-pigmented methylotrophic bacterial species that are widespread in natural environments, such as soils, stream water and plants. When in association with plants, this genus colonizes the host plant epiphytically and/or endophytically. This association is known to promote plant growth, induce plant systemic resistance and inhibit plant infection by phytopathogens. In the present study, we focused on evaluating the colonization of soybean seedling-roots by Methylobacterium mesophilicum strain SR1.6/6. We focused on the identification of the key genes involved in the initial step of soybean colonization by methylotrophic bacteria, which includes the plant exudate recognition and adaptation by planktonic bacteria. Visualization by scanning electron microscopy revealed that M. mesophilicum SR1.6/6 colonizes soybean roots surface effectively at 48 h after inoculation, suggesting a mechanism for root recognition and adaptation before this period. The colonization proceeds by the development of a mature biofilm on roots at 96 h after inoculation. Transcriptomic analysis of the planktonic bacteria (with plant) revealed the expression of several genes involved in membrane transport, thus confirming an initial metabolic activation of bacterial responses when in the presence of plant root exudates. Moreover, antioxidant genes were mostly expressed during the interaction with the plant exudates. Further evaluation of stress- and methylotrophic-related genes expression by qPCR showed that glutathione peroxidase and glutathione synthetase genes were up-regulated during the Methylobacterium-soybean interaction. These findings support that glutathione (GSH) is potentially a key molecule involved in cellular detoxification during plant root colonization. In addition to methylotrophic metabolism, antioxidant genes, mainly glutathione-related genes, play a key role during soybean exudate recognition and adaptation, the first step in

  9. Finding gene-environment interactions for phobias.

    Science.gov (United States)

    Gregory, Alice M; Lau, Jennifer Y F; Eley, Thalia C

    2008-03-01

    Phobias are common disorders causing a great deal of suffering. Studies of gene-environment interaction (G x E) have revealed much about the complex processes underlying the development of various psychiatric disorders but have told us little about phobias. This article describes what is already known about genetic and environmental influences upon phobias and suggests how this information can be used to optimise the chances of discovering G x Es for phobias. In addition to the careful conceptualisation of new studies, it is suggested that data already collected should be re-analysed in light of increased understanding of processes influencing phobias.

  10. Gene-environment interactions linking air pollution and inflammation in Parkinson's disease.

    Science.gov (United States)

    Lee, Pei-Chen; Raaschou-Nielsen, Ole; Lill, Christina M; Bertram, Lars; Sinsheimer, Janet S; Hansen, Johnni; Ritz, Beate

    2016-11-01

    Both air pollution exposure and systemic inflammation have been linked to Parkinson's disease (PD). In the PASIDA study, 408 incident cases of PD diagnosed in 2006-2009 and their 495 population controls were interviewed and provided DNA samples. Markers of long term traffic related air pollution measures were derived from geographic information systems (GIS)-based modeling. Furthermore, we genotyped functional polymorphisms in genes encoding proinflammatory cytokines, namely rs1800629 in TNFα (tumor necrosis factor alpha) and rs16944 in IL1B (interleukin-1β). In logistic regression models, long-term exposure to NO 2 increased PD risk overall (odds ratio (OR)=1.06 per 2.94μg/m 3 increase, 95% CI=1.00-1.13). The OR for PD in individuals with high NO 2 exposure (≧75th percentile) and the AA genotype of IL1B rs16944 was 3.10 (95% CI=1.14-8.38) compared with individuals with lower NO 2 exposure (<75th percentile) and the GG genotype. The interaction term was nominally significant on the multiplicative scale (p=0.01). We did not find significant gene-environment interactions with TNF rs1800629. Our finds may provide suggestive evidence that a combination of traffic-related air pollution and genetic variation in the proinflammatory cytokine gene IL1B contribute to risk of developing PD. However, as statistical evidence was only modest in this large sample we cannot rule out that these results represent a chance finding, and additional replication efforts are warranted. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Extracting Gene Networks for Low-Dose Radiation Using Graph Theoretical Algorithms

    Energy Technology Data Exchange (ETDEWEB)

    Voy, Brynn H [ORNL; Scharff, Jon [University of Tennessee, Knoxville (UTK); Perkins, Andy [University of Tennessee, Knoxville (UTK); Saxton, Arnold [University of Tennessee, Knoxville (UTK); Borate, Bhavesh [University of Tennessee, Knoxville (UTK); Chesler, Elissa J [ORNL; Branstetter, Lisa R [ORNL; Langston, Michael A [University of Tennessee, Knoxville (UTK)

    2006-01-01

    Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most relevant gene interactions. We describe a graph theoretical approach to extracting co-expressed sets of genes, based on the computation of cliques. Unlike the results of traditional clustering algorithms, cliques are not disjoint and allow genes to be assigned to multiple sets of interacting partners, consistent with biological reality. A graph is created by thresholding the correlation matrix to include only the correlations most likely to signify functional relationships. Cliques computed from the graph correspond to sets of genes for which significant edges are present between all members of the set, representing potential members of common or interacting pathways. Clique membership can be used to infer function about poorly annotated genes, based on the known functions of better-annotated genes with which they share clique membership (i.e., ''guilt-by-association''). We illustrate our method by applying it to microarray data collected from the spleens of mice exposed to low-dose ionizing radiation. Differential analysis is used to identify sets of genes whose interactions are impacted by radiation exposure. The correlation graph is also queried independently of clique to extract edges that are impacted by radiation. We present several examples of multiple gene interactions that are altered by radiation exposure and thus represent potential molecular pathways that mediate the radiation response.

  12. Extracting gene networks for low-dose radiation using graph theoretical algorithms.

    Directory of Open Access Journals (Sweden)

    Brynn H Voy

    2006-07-01

    Full Text Available Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most relevant gene interactions. We describe a graph theoretical approach to extracting co-expressed sets of genes, based on the computation of cliques. Unlike the results of traditional clustering algorithms, cliques are not disjoint and allow genes to be assigned to multiple sets of interacting partners, consistent with biological reality. A graph is created by thresholding the correlation matrix to include only the correlations most likely to signify functional relationships. Cliques computed from the graph correspond to sets of genes for which significant edges are present between all members of the set, representing potential members of common or interacting pathways. Clique membership can be used to infer function about poorly annotated genes, based on the known functions of better-annotated genes with which they share clique membership (i.e., "guilt-by-association". We illustrate our method by applying it to microarray data collected from the spleens of mice exposed to low-dose ionizing radiation. Differential analysis is used to identify sets of genes whose interactions are impacted by radiation exposure. The correlation graph is also queried independently of clique to extract edges that are impacted by radiation. We present several examples of multiple gene interactions that are altered by radiation exposure and thus represent potential molecular pathways that mediate the radiation response.

  13. Mining gene expression data of multiple sclerosis.

    Directory of Open Access Journals (Sweden)

    Pi Guo

    Full Text Available Microarray produces a large amount of gene expression data, containing various biological implications. The challenge is to detect a panel of discriminative genes associated with disease. This study proposed a robust classification model for gene selection using gene expression data, and performed an analysis to identify disease-related genes using multiple sclerosis as an example.Gene expression profiles based on the transcriptome of peripheral blood mononuclear cells from a total of 44 samples from 26 multiple sclerosis patients and 18 individuals with other neurological diseases (control were analyzed. Feature selection algorithms including Support Vector Machine based on Recursive Feature Elimination, Receiver Operating Characteristic Curve, and Boruta algorithms were jointly performed to select candidate genes associating with multiple sclerosis. Multiple classification models categorized samples into two different groups based on the identified genes. Models' performance was evaluated using cross-validation methods, and an optimal classifier for gene selection was determined.An overlapping feature set was identified consisting of 8 genes that were differentially expressed between the two phenotype groups. The genes were significantly associated with the pathways of apoptosis and cytokine-cytokine receptor interaction. TNFSF10 was significantly associated with multiple sclerosis. A Support Vector Machine model was established based on the featured genes and gave a practical accuracy of ∼86%. This binary classification model also outperformed the other models in terms of Sensitivity, Specificity and F1 score.The combined analytical framework integrating feature ranking algorithms and Support Vector Machine model could be used for selecting genes for other diseases.

  14. Gene-environment interaction and male reproductive function

    Science.gov (United States)

    Axelsson, Jonatan; Bonde, Jens Peter; Giwercman, Yvonne L.; Rylander, Lars; Giwercman, Aleksander

    2010-01-01

    As genetic factors can hardly explain the changes taking place during short time spans, environmental and lifestyle-related factors have been suggested as the causes of time-related deterioration of male reproductive function. However, considering the strong heterogeneity of male fecundity between and within populations, genetic variants might be important determinants of the individual susceptibility to the adverse effects of environment or lifestyle. Although the possible mechanisms of such interplay in relation to the reproductive system are largely unknown, some recent studies have indicated that specific genotypes may confer a larger risk of male reproductive disorders following certain exposures. This paper presents a critical review of animal and human evidence on how genes may modify environmental effects on male reproductive function. Some examples have been found that support this mechanism, but the number of studies is still limited. This type of interaction studies may improve our understanding of normal physiology and help us to identify the risk factors to male reproductive malfunction. We also shortly discuss other aspects of gene-environment interaction specifically associated with the issue of reproduction, namely environmental and lifestyle factors as the cause of sperm DNA damage. It remains to be investigated to what extent such genetic changes, by natural conception or through the use of assisted reproductive techniques, are transmitted to the next generation, thereby causing increased morbidity in the offspring. PMID:20348940

  15. Gene-diet-interactions in folate-mediated one-carbon metabolism modify colon cancer risk.

    Science.gov (United States)

    Liu, Amy Y; Scherer, Dominique; Poole, Elizabeth; Potter, John D; Curtin, Karen; Makar, Karen; Slattery, Martha L; Caan, Bette J; Ulrich, Cornelia M

    2013-04-01

    The importance of folate-mediated one-carbon metabolism (FOCM) in colorectal carcinogenesis is emphasized by observations that high dietary folate intake is associated with decreased risk of colon cancer (CC) and its precursors. Additionally, polymorphisms in FOCM-related genes have been repeatedly associated with risk, supporting a causal relationship between folate and colorectal carcinogenesis. We investigated ten candidate polymorphisms with defined or probable functional impact in eight FOCM-related genes (SHMT1, DHFR, DNMT1, MTHFD1, MTHFR, MTRR, TCN2, and TDG) in 1609 CC cases and 1974 controls for association with CC risk and for interaction with dietary factors. No polymorphism was statistically significantly associated with overall risk of CC. However, statistically significant interactions modifying CC risk were observed for DNMT1 I311V with dietary folate, methionine, vitamin B2 , and vitamin B12 intake and for MTRR I22M with dietary folate, a predefined one-carbon dietary pattern, and vitamin B6 intake. We observed statistically significant gene-diet interactions with five additional polymorphisms. Our results provide evidence that FOCM-related dietary intakes modify the association between CC risk and FOCM allelic variants. These findings add to observations showing that folate-related gene-nutrient interactions play an important role in modifying the risk of CC. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Imaging gene expression in gene therapy

    Energy Technology Data Exchange (ETDEWEB)

    Wiebe, Leonard I. [Alberta Univ., Edmonton (Canada). Noujaim Institute for Pharmaceutical Oncology Research

    1997-12-31

    Full text. Gene therapy can be used to introduce new genes, or to supplement the function of indigenous genes. At the present time, however, there is non-invasive test to demonstrate efficacy of the gene transfer and expression processes. It has been postulated that scintigraphic imaging can offer unique information on both the site at which the transferred gene is expressed, and the degree of expression, both of which are critical issue for safety and clinical efficacy. Many current studies are based on `suicide gene therapy` of cancer. Cells modified to express these genes commit metabolic suicide in the presence of an enzyme encoded by the transferred gene and a specifically-convertible pro drug. Pro drug metabolism can lead to selective metabolic trapping, required for scintigraphy. Herpes simplex virus type-1 thymidine kinase (H S V-1 t k{sup +}) has been use for `suicide` in vivo tumor gene therapy. It has been proposed that radiolabelled nucleosides can be used as radiopharmaceuticals to detect H S V-1 t k{sup +} gene expression where the H S V-1 t k{sup +} gene serves a reporter or therapeutic function. Animal gene therapy models have been studied using purine-([{sup 18} F]F H P G; [{sup 18} F]-A C V), and pyrimidine- ([{sup 123}/{sup 131} I]I V R F U; [{sup 124}/{sup 131I}]) antiviral nucleosides. Principles of gene therapy and gene therapy imaging will be reviewed and experimental data for [{sup 123}/{sup 131I}]I V R F U imaging with the H S V-1 t k{sup +} reporter gene will be presented

  17. Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

    Science.gov (United States)

    Osato, Naoki

    2018-01-19

    Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional

  18. Network-based differential gene expression analysis suggests cell cycle related genes regulated by E2F1 underlie the molecular difference between smoker and non-smoker lung adenocarcinoma

    Science.gov (United States)

    2013-01-01

    Background Differential gene expression (DGE) analysis is commonly used to reveal the deregulated molecular mechanisms of complex diseases. However, traditional DGE analysis (e.g., the t test or the rank sum test) tests each gene independently without considering interactions between them. Top-ranked differentially regulated genes prioritized by the analysis may not directly relate to the coherent molecular changes underlying complex diseases. Joint analyses of co-expression and DGE have been applied to reveal the deregulated molecular modules underlying complex diseases. Most of these methods consist of separate steps: first to identify gene-gene relationships under the studied phenotype then to integrate them with gene expression changes for prioritizing signature genes, or vice versa. It is warrant a method that can simultaneously consider gene-gene co-expression strength and corresponding expression level changes so that both types of information can be leveraged optimally. Results In this paper, we develop a gene module based method for differential gene expression analysis, named network-based differential gene expression (nDGE) analysis, a one-step integrative process for prioritizing deregulated genes and grouping them into gene modules. We demonstrate that nDGE outperforms existing methods in prioritizing deregulated genes and discovering deregulated gene modules using simulated data sets. When tested on a series of smoker and non-smoker lung adenocarcinoma data sets, we show that top differentially regulated genes identified by the rank sum test in different sets are not consistent while top ranked genes defined by nDGE in different data sets significantly overlap. nDGE results suggest that a differentially regulated gene module, which is enriched for cell cycle related genes and E2F1 targeted genes, plays a role in the molecular differences between smoker and non-smoker lung adenocarcinoma. Conclusions In this paper, we develop nDGE to prioritize

  19. Shame and Guilt-Proneness in Adolescents: Gene-Environment Interactions.

    Science.gov (United States)

    Szentágotai-Tătar, Aurora; Chiș, Adina; Vulturar, Romana; Dobrean, Anca; Cândea, Diana Mirela; Miu, Andrei C

    2015-01-01

    Rooted in people's preoccupation with how they are perceived and evaluated, shame and guilt are self-conscious emotions that play adaptive roles in social behavior, but can also contribute to psychopathology when dysregulated. Shame and guilt-proneness develop during childhood and adolescence, and are influenced by genetic and environmental factors that are little known to date. This study investigated the effects of early traumatic events and functional polymorphisms in the brain-derived neurotrophic factor (BDNF) gene and the serotonin transporter gene promoter (5-HTTLPR) on shame and guilt in adolescents. A sample of N = 271 healthy adolescents between 14 and 17 years of age filled in measures of early traumatic events and proneness to shame and guilt, and were genotyped for the BDNF Val66Met and 5-HTTLPR polymorphisms. Results of moderator analyses indicated that trauma intensity was positively associated with guilt-proneness only in carriers of the low-expressing Met allele of BDNF Val66Met. This is the first study that identifies a gene-environment interaction that significantly contributes to guilt proneness in adolescents, with potential implications for developmental psychopathology.

  20. A comparison of 100 human genes using an alu element-based instability model.

    Science.gov (United States)

    Cook, George W; Konkel, Miriam K; Walker, Jerilyn A; Bourgeois, Matthew G; Fullerton, Mitchell L; Fussell, John T; Herbold, Heath D; Batzer, Mark A

    2013-01-01

    The human retrotransposon with the highest copy number is the Alu element. The human genome contains over one million Alu elements that collectively account for over ten percent of our DNA. Full-length Alu elements are randomly distributed throughout the genome in both forward and reverse orientations. However, full-length widely spaced Alu pairs having two Alus in the same (direct) orientation are statistically more prevalent than Alu pairs having two Alus in the opposite (inverted) orientation. The cause of this phenomenon is unknown. It has been hypothesized that this imbalance is the consequence of anomalous inverted Alu pair interactions. One proposed mechanism suggests that inverted Alu pairs can ectopically interact, exposing both ends of each Alu element making up the pair to a potential double-strand break, or "hit". This hypothesized "two-hit" (two double-strand breaks) potential per Alu element was used to develop a model for comparing the relative instabilities of human genes. The model incorporates both 1) the two-hit double-strand break potential of Alu elements and 2) the probability of exon-damaging deletions extending from these double-strand breaks. This model was used to compare the relative instabilities of 50 deletion-prone cancer genes and 50 randomly selected genes from the human genome. The output of the Alu element-based genomic instability model developed here is shown to coincide with the observed instability of deletion-prone cancer genes. The 50 cancer genes are collectively estimated to be 58% more unstable than the randomly chosen genes using this model. Seven of the deletion-prone cancer genes, ATM, BRCA1, FANCA, FANCD2, MSH2, NCOR1 and PBRM1, were among the most unstable 10% of the 100 genes analyzed. This algorithm may lay the foundation for comparing genetic risks posed by structural variations that are unique to specific individuals, families and people groups.

  1. Speeding disease gene discovery by sequence based candidate prioritization

    Directory of Open Access Journals (Sweden)

    Porteous David J

    2005-03-01

    Full Text Available Abstract Background Regions of interest identified through genetic linkage studies regularly exceed 30 centimorgans in size and can contain hundreds of genes. Traditionally this number is reduced by matching functional annotation to knowledge of the disease or phenotype in question. However, here we show that disease genes share patterns of sequence-based features that can provide a good basis for automatic prioritization of candidates by machine learning. Results We examined a variety of sequence-based features and found that for many of them there are significant differences between the sets of genes known to be involved in human hereditary disease and those not known to be involved in disease. We have created an automatic classifier called PROSPECTR based on those features using the alternating decision tree algorithm which ranks genes in the order of likelihood of involvement in disease. On average, PROSPECTR enriches lists for disease genes two-fold 77% of the time, five-fold 37% of the time and twenty-fold 11% of the time. Conclusion PROSPECTR is a simple and effective way to identify genes involved in Mendelian and oligogenic disorders. It performs markedly better than the single existing sequence-based classifier on novel data. PROSPECTR could save investigators looking at large regions of interest time and effort by prioritizing positional candidate genes for mutation detection and case-control association studies.

  2. Identifying Novel Candidate Genes Related to Apoptosis from a Protein-Protein Interaction Network

    Directory of Open Access Journals (Sweden)

    Baoman Wang

    2015-01-01

    Full Text Available Apoptosis is the process of programmed cell death (PCD that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature.

  3. Partial Least Squares Based Gene Expression Analysis in EBV- Positive and EBV-Negative Posttransplant Lymphoproliferative Disorders.

    Science.gov (United States)

    Wu, Sa; Zhang, Xin; Li, Zhi-Ming; Shi, Yan-Xia; Huang, Jia-Jia; Xia, Yi; Yang, Hang; Jiang, Wen-Qi

    2013-01-01

    Post-transplant lymphoproliferative disorder (PTLD) is a common complication of therapeutic immunosuppression after organ transplantation. Gene expression profile facilitates the identification of biological difference between Epstein-Barr virus (EBV) positive and negative PTLDs. Previous studies mainly implemented variance/regression analysis without considering unaccounted array specific factors. The aim of this study is to investigate the gene expression difference between EBV positive and negative PTLDs through partial least squares (PLS) based analysis. With a microarray data set from the Gene Expression Omnibus database, we performed PLS based analysis. We acquired 1188 differentially expressed genes. Pathway and Gene Ontology enrichment analysis identified significantly over-representation of dysregulated genes in immune response and cancer related biological processes. Network analysis identified three hub genes with degrees higher than 15, including CREBBP, ATXN1, and PML. Proteins encoded by CREBBP and PML have been reported to be interact with EBV before. Our findings shed light on expression distinction of EBV positive and negative PTLDs with the hope to offer theoretical support for future therapeutic study.

  4. Gene-gene combination effect and interactions among ABCA1, APOA1, SR-B1, and CETP polymorphisms for serum high-density lipoprotein-cholesterol in the Japanese population.

    Directory of Open Access Journals (Sweden)

    Akihiko Nakamura

    Full Text Available BACKGROUND/OBJECTIVE: Gene-gene interactions in the reverse cholesterol transport system for high-density lipoprotein-cholesterol (HDL-C are poorly understood. The present study observed gene-gene combination effect and interactions between single nucleotide polymorphisms (SNPs in ABCA1, APOA1, SR-B1, and CETP in serum HDL-C from a cross-sectional study in the Japanese population. METHODS: The study population comprised 1,535 men and 1,515 women aged 35-69 years who were enrolled in the Japan Multi-Institutional Collaborative Cohort (J-MICC Study. We selected 13 SNPs in the ABCA1, APOA1, CETP, and SR-B1 genes in the reverse cholesterol transport system. The effects of genetic and environmental factors were assessed using general linear and logistic regression models after adjusting for age, sex, and region. PRINCIPAL FINDINGS: Alcohol consumption and daily activity were positively associated with HDL-C levels, whereas smoking had a negative relationship. The T allele of CETP, rs3764261, was correlated with higher HDL-C levels and had the highest coefficient (2.93 mg/dL/allele among the 13 SNPs, which was statistically significant after applying the Bonferroni correction (p<0.001. Gene-gene combination analysis revealed that CETP rs3764261 was associated with high HDL-C levels with any combination of SNPs from ABCA1, APOA1, and SR-B1, although no gene-gene interaction was apparent. An increasing trend for serum HDL-C was also observed with an increasing number of alleles (p<0.001. CONCLUSIONS: The present study identified a multiplier effect from a polymorphism in CETP with ABCA1, APOA1, and SR-B1, as well as a dose-dependence according to the number of alleles present.

  5. Identifying overrepresented concepts in gene lists from literature: a statistical approach based on Poisson mixture model

    Directory of Open Access Journals (Sweden)

    Zhai Chengxiang

    2010-05-01

    Full Text Available Abstract Background Large-scale genomic studies often identify large gene lists, for example, the genes sharing the same expression patterns. The interpretation of these gene lists is generally achieved by extracting concepts overrepresented in the gene lists. This analysis often depends on manual annotation of genes based on controlled vocabularies, in particular, Gene Ontology (GO. However, the annotation of genes is a labor-intensive process; and the vocabularies are generally incomplete, leaving some important biological domains inadequately covered. Results We propose a statistical method that uses the primary literature, i.e. free-text, as the source to perform overrepresentation analysis. The method is based on a statistical framework of mixture model and addresses the methodological flaws in several existing programs. We implemented this method within a literature mining system, BeeSpace, taking advantage of its analysis environment and added features that facilitate the interactive analysis of gene sets. Through experimentation with several datasets, we showed that our program can effectively summarize the important conceptual themes of large gene sets, even when traditional GO-based analysis does not yield informative results. Conclusions We conclude that the current work will provide biologists with a tool that effectively complements the existing ones for overrepresentation analysis from genomic experiments. Our program, Genelist Analyzer, is freely available at: http://workerbee.igb.uiuc.edu:8080/BeeSpace/Search.jsp

  6. Exploring the role of peptides in polymer-based gene delivery.

    Science.gov (United States)

    Sun, Yanping; Yang, Zhen; Wang, Chunxi; Yang, Tianzhi; Cai, Cuifang; Zhao, Xiaoyun; Yang, Li; Ding, Pingtian

    2017-09-15

    Polymers are widely studied as non-viral gene vectors because of their strong DNA binding ability, capacity to carry large payload, flexibility of chemical modifications, low immunogenicity, and facile processes for manufacturing. However, high cytotoxicity and low transfection efficiency substantially restrict their application in clinical trials. Incorporating functional peptides is a promising approach to address these issues. Peptides demonstrate various functions in polymer-based gene delivery systems, such as targeting to specific cells, breaching membrane barriers, facilitating DNA condensation and release, and lowering cytotoxicity. In this review, we systematically summarize the role of peptides in polymer-based gene delivery, and elaborate how to rationally design polymer-peptide based gene delivery vectors. Polymers are widely studied as non-viral gene vectors, but suffer from high cytotoxicity and low transfection efficiency. Incorporating short, bioactive peptides into polymer-based gene delivery systems can address this issue. Peptides demonstrate various functions in polymer-based gene delivery systems, such as targeting to specific cells, breaching membrane barriers, facilitating DNA condensation and release, and lowering cytotoxicity. In this review, we highlight the peptides' roles in polymer-based gene delivery, and elaborate how to utilize various functional peptides to enhance the transfection efficiency of polymers. The optimized peptide-polymer vectors should be able to alter their structures and functions according to biological microenvironments and utilize inherent intracellular pathways of cells, and consequently overcome the barriers during gene delivery to enhance transfection efficiency. Copyright © 2017 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.

  7. A copula method for modeling directional dependence of genes

    Directory of Open Access Journals (Sweden)

    Park Changyi

    2008-05-01

    Full Text Available Abstract Background Genes interact with each other as basic building blocks of life, forming a complicated network. The relationship between groups of genes with different functions can be represented as gene networks. With the deposition of huge microarray data sets in public domains, study on gene networking is now possible. In recent years, there has been an increasing interest in the reconstruction of gene networks from gene expression data. Recent work includes linear models, Boolean network models, and Bayesian networks. Among them, Bayesian networks seem to be the most effective in constructing gene networks. A major problem with the Bayesian network approach is the excessive computational time. This problem is due to the interactive feature of the method that requires large search space. Since fitting a model by using the copulas does not require iterations, elicitation of the priors, and complicated calculations of posterior distributions, the need for reference to extensive search spaces can be eliminated leading to manageable computational affords. Bayesian network approach produces a discretely expression of conditional probabilities. Discreteness of the characteristics is not required in the copula approach which involves use of uniform representation of the continuous random variables. Our method is able to overcome the limitation of Bayesian network method for gene-gene interaction, i.e. information loss due to binary transformation. Results We analyzed the gene interactions for two gene data sets (one group is eight histone genes and the other group is 19 genes which include DNA polymerases, DNA helicase, type B cyclin genes, DNA primases, radiation sensitive genes, repaire related genes, replication protein A encoding gene, DNA replication initiation factor, securin gene, nucleosome assembly factor, and a subunit of the cohesin complex by adopting a measure of directional dependence based on a copula function. We have compared

  8. Common sources of bias in gene-lifestyle interaction studies of cardiometabolic disease

    DEFF Research Database (Denmark)

    Oskari Kilpeläinen, Tuomas

    2013-01-01

    The role of gene x lifestyle interactions in the development of cardiometabolic diseases is often highlighted, but very few robustly replicated examples of interactions exist in the literature. The slow pace of discoveries may largely be due to interaction effects being generally small in magnitude...

  9. Hox gene function and interaction in the milkweed bug Oncopeltus fasciatus (Hemiptera).

    Science.gov (United States)

    Angelini, David R; Liu, Paul Z; Hughes, Cynthia L; Kaufman, Thomas C

    2005-11-15

    Studies in genetic model organisms such as Drosophila have demonstrated that the homeotic complex (Hox) genes impart segmental identity during embryogenesis. Comparative studies in a wide range of other insect taxa have shown that the Hox genes are expressed in largely conserved domains along the anterior-posterior body axis, but whether they are performing the same functions in different insects is an open question. Most of the Hox genes have been studied functionally in only a few holometabolous insects that undergo metamorphosis. Thus, it is unclear how the Hox genes are functioning in the majority of direct-developing insects and other arthropods. To address this question, we used a combination of RNAi and in situ hybridization to reveal the expression, functions, and regulatory interactions of the Hox genes in the milkweed bug Oncopeltus fasciatus. Our results reveal many similarities and some interesting differences compared to Drosophila. We find that the gene Antennapedia is required for the identity of all three thoracic segments, while Ultrabithorax, abdominal-A and Abdominal-B cooperate to pattern the abdomen. The three abdominal genes exhibit posterior prevalence like in Drosophila, but apparently via some post-transcriptional mechanism. The functions of the head genes proboscipedia, Deformed, and Sex combs reduced were shown previously, and here we find that the complex temporal expression of pb in the labium is like that of other insects, but its regulatory relationship with Scr is unique. Overall, our data reveal that the evolution of insect Hox genes has included many small changes within general conservation of expression and function, and that the milkweed bug provides a useful model for understanding the roles of Hox genes in a direct-developing insect.

  10. Suicide genes or p53 gene and p53 target genes as targets for cancer gene therapy by ionizing radiation

    International Nuclear Information System (INIS)

    Liu Bing; Chinese Academy of Sciences, Beijing; Zhang Hong

    2005-01-01

    Radiotherapy has some disadvantages due to the severe side-effect on the normal tissues at a curative dose of ionizing radiation (IR). Similarly, as a new developing approach, gene therapy also has some disadvantages, such as lack of specificity for tumors, limited expression of therapeutic gene, potential biological risk. To certain extent, above problems would be solved by the suicide genes or p53 gene and its target genes therapies targeted by ionizing radiation. This strategy not only makes up the disadvantage from radiotherapy or gene therapy alone, but also promotes success rate on the base of lower dose. By present, there have been several vectors measuring up to be reaching clinical trials. This review focused on the development of the cancer gene therapy through suicide genes or p53 and its target genes mediated by IR. (authors)

  11. Lack of evidence for intermolecular epistatic interactions between adiponectin and resistin gene polymorphisms in Malaysian male subjects

    Directory of Open Access Journals (Sweden)

    Cia-Hin Lau

    2012-01-01

    Full Text Available Epistasis (gene-gene interaction is a ubiquitous component of the genetic architecture of complex traits such as susceptibility to common human diseases. Given the strong negative correlation between circulating adiponectin and resistin levels, the potential intermolecular epistatic interactions between ADIPOQ (SNP+45T > G, SNP+276G > T, SNP+639T > C and SNP+1212A > G and RETN (SNP-420C > G and SNP+299G > A gene polymorphisms in the genetic risk underlying type 2 diabetes (T2DM and metabolic syndrome (MS were assessed. The potential mutual influence of the ADIPOQ and RETN genes on their adipokine levels was also examined. The rare homozygous genotype (risk alleles of SNP-420C > G at the RETN locus tended to be co-inherited together with the common homozygous genotypes (protective alleles of SNP+639T > C and SNP+1212A > G at the ADIPOQ locus. Despite the close structural relationship between the ADIPOQ and RETN genes, there was no evidence of an intermolecular epistatic interaction between these genes. There was also no reciprocal effect of the ADIPOQ and RETN genes on their adipokine levels, i.e., ADIPOQ did not affect resistin levels nor did RETN affect adiponectin levels. The possible influence of the ADIPOQ gene on RETN expression warrants further investigation.

  12. Exploring the key genes and pathways in enchondromas using a gene expression microarray.

    Science.gov (United States)

    Shi, Zhongju; Zhou, Hengxing; Pan, Bin; Lu, Lu; Kang, Yi; Liu, Lu; Wei, Zhijian; Feng, Shiqing

    2017-07-04

    Enchondromas are the most common primary benign osseous neoplasms that occur in the medullary bone; they can undergo malignant transformation into chondrosarcoma. However, enchondromas are always undetected in patients, and the molecular mechanism is unclear. To identify key genes and pathways associated with the occurrence and development of enchondromas, we downloaded the gene expression dataset GSE22855 and obtained the differentially expressed genes (DEGs) by analyzing high-throughput gene expression in enchondromas. In total, 635 genes were identified as DEGs. Of these, 225 genes (35.43%) were up-regulated, and the remaining 410 genes (64.57%) were down-regulated. We identified the predominant gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that were significantly over-represented in the enchondromas samples compared with the control samples. Subsequently the top 10 core genes were identified from the protein-protein interaction (PPI) network. The enrichment analyses of the genes mainly involved in two significant modules showed that the DEGs were principally related to ribosomes, protein digestion and absorption, ECM-receptor interaction, focal adhesion, amoebiasis and the PI3K-Akt signaling pathway.Together, these data elucidate the molecular mechanisms underlying the occurrence and development of enchondromas and provide promising candidates for therapeutic intervention and prognostic evaluation. However, further experimental studies are needed to confirm these results.

  13. Interaction between Social/Psychosocial Factors and Genetic Variants on Body Mass Index: A Gene-Environment Interaction Analysis in a Longitudinal Setting.

    Science.gov (United States)

    Zhao, Wei; Ware, Erin B; He, Zihuai; Kardia, Sharon L R; Faul, Jessica D; Smith, Jennifer A

    2017-09-29

    Obesity, which develops over time, is one of the leading causes of chronic diseases such as cardiovascular disease. However, hundreds of BMI (body mass index)-associated genetic loci identified through large-scale genome-wide association studies (GWAS) only explain about 2.7% of BMI variation. Most common human traits are believed to be influenced by both genetic and environmental factors. Past studies suggest a variety of environmental features that are associated with obesity, including socioeconomic status and psychosocial factors. This study combines both gene/regions and environmental factors to explore whether social/psychosocial factors (childhood and adult socioeconomic status, social support, anger, chronic burden, stressful life events, and depressive symptoms) modify the effect of sets of genetic variants on BMI in European American and African American participants in the Health and Retirement Study (HRS). In order to incorporate longitudinal phenotype data collected in the HRS and investigate entire sets of single nucleotide polymorphisms (SNPs) within gene/region simultaneously, we applied a novel set-based test for gene-environment interaction in longitudinal studies (LGEWIS). Childhood socioeconomic status (parental education) was found to modify the genetic effect in the gene/region around SNP rs9540493 on BMI in European Americans in the HRS. The most significant SNP (rs9540488) by childhood socioeconomic status interaction within the rs9540493 gene/region was suggestively replicated in the Multi-Ethnic Study of Atherosclerosis (MESA) ( p = 0.07).

  14. Interaction between Social/Psychosocial Factors and Genetic Variants on Body Mass Index: A Gene-Environment Interaction Analysis in a Longitudinal Setting

    Directory of Open Access Journals (Sweden)

    Wei Zhao

    2017-09-01

    Full Text Available Obesity, which develops over time, is one of the leading causes of chronic diseases such as cardiovascular disease. However, hundreds of BMI (body mass index-associated genetic loci identified through large-scale genome-wide association studies (GWAS only explain about 2.7% of BMI variation. Most common human traits are believed to be influenced by both genetic and environmental factors. Past studies suggest a variety of environmental features that are associated with obesity, including socioeconomic status and psychosocial factors. This study combines both gene/regions and environmental factors to explore whether social/psychosocial factors (childhood and adult socioeconomic status, social support, anger, chronic burden, stressful life events, and depressive symptoms modify the effect of sets of genetic variants on BMI in European American and African American participants in the Health and Retirement Study (HRS. In order to incorporate longitudinal phenotype data collected in the HRS and investigate entire sets of single nucleotide polymorphisms (SNPs within gene/region simultaneously, we applied a novel set-based test for gene-environment interaction in longitudinal studies (LGEWIS. Childhood socioeconomic status (parental education was found to modify the genetic effect in the gene/region around SNP rs9540493 on BMI in European Americans in the HRS. The most significant SNP (rs9540488 by childhood socioeconomic status interaction within the rs9540493 gene/region was suggestively replicated in the Multi-Ethnic Study of Atherosclerosis (MESA (p = 0.07.

  15. Gene-Environment Interaction in Parkinson's Disease

    DEFF Research Database (Denmark)

    Chuang, Yu-Hsuan; Lill, Christina M; Lee, Pei-Chen

    2016-01-01

    BACKGROUND AND PURPOSE: Drinking caffeinated coffee has been reported to provide protection against Parkinson's disease (PD). Caffeine is an adenosine A2A receptor (encoded by the gene ADORA2A) antagonist that increases dopaminergic neurotransmission and Cytochrome P450 1A2 (gene: CYP1A2...

  16. Susceptibility Genes in Thyroid Autoimmunity

    Directory of Open Access Journals (Sweden)

    Yoshiyuki Ban

    2005-01-01

    Full Text Available The autoimmune thyroid diseases (AITD are complex diseases which are caused by an interaction between susceptibility genes and environmental triggers. Genetic susceptibility in combination with external factors (e.g. dietary iodine is believed to initiate the autoimmune response to thyroid antigens. Abundant epidemiological data, including family and twin studies, point to a strong genetic influence on the development of AITD. Various techniques have been employed to identify the genes contributing to the etiology of AITD, including candidate gene analysis and whole genome screening. These studies have enabled the identification of several loci (genetic regions that are linked with AITD, and in some of these loci, putative AITD susceptibility genes have been identified. Some of these genes/loci are unique to Graves' disease (GD and Hashimoto's thyroiditis (HT and some are common to both the diseases, indicating that there is a shared genetic susceptibility to GD and HT. The putative GD and HT susceptibility genes include both immune modifying genes (e.g. HLA, CTLA-4 and thyroid specific genes (e.g. TSHR, Tg. Most likely, these loci interact and their interactions may influence disease phenotype and severity.

  17. The influence of nutrigenetics on the lipid profile: interaction between genes and dietary habits.

    Science.gov (United States)

    de Andrade, Fabiana M; Bulhões, Andréa C; Maluf, Sharbel W; Schuch, Jaqueline B; Voigt, Francine; Lucatelli, Juliana F; Barros, Alessandra C; Hutz, Mara H

    2010-04-01

    Nutrigenetics is a new field with few studies in Latin America. Our aim is to investigate the way in which different genes related to the lipid profile influence the response to specific dietary habits. Eight polymorphisms on seven genes were investigated in a sample (n = 567) from Porto Alegre, RS, Brazil. All the volunteers completed a food diary that was then assessed and classified into nine food groups. A number of nutrigenetic interactions were detected primarily related to the apolipoprotein E (apoE) gene. For example, frequent consumption of foods rich in polyunsaturated fat resulted in the beneficial effect of increasing HDL-C only in individuals who were not carriers of the E*4 allele of the APOE gene, whereas variations in eating habits of E*4 carriers did not affect their HDL-C (P = 0.018). Our data demonstrate for the first time nutrigenetic interactions in a Brazilian population.

  18. Utilizing virus-induced gene silencing for the functional characterization of maize genes during infection with the fungal pathogen Ustilago maydis.

    Science.gov (United States)

    van der Linde, Karina; Doehlemann, Gunther

    2013-01-01

    While in dicotyledonous plants virus-induced gene silencing (VIGS) is well established to study plant-pathogen interaction, in monocots only few examples of efficient VIGS have been reported so far. One of the available systems is based on the brome mosaic virus (BMV) which allows gene silencing in different cereals including barley (Hordeum vulgare), wheat (Triticum aestivum), and maize (Zea mays).Infection of maize plants by the corn smut fungus Ustilago maydis leads to the formation of large tumors on stem, leaves, and inflorescences. During this biotrophic interaction, plant defense responses are actively suppressed by the pathogen, and previous transcriptome analyses of infected maize plants showed comprehensive and stage-specific changes in host gene expression during disease progression.To identify maize genes that are functionally involved in the interaction with U. maydis, we adapted a VIGS system based on the Brome mosaic virus (BMV) to maize at conditions that allow successful U. maydis infection of BMV pre-infected maize plants. This setup enables quantification of VIGS and its impact on U. maydis infection using a quantitative real-time PCR (q(RT)-PCR)-based readout.

  19. False positive reduction in protein-protein interaction predictions using gene ontology annotations

    Directory of Open Access Journals (Sweden)

    Lin Yen-Han

    2007-07-01

    Full Text Available Abstract Background Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. Results Gene Ontology (GO annotations were used to reduce false positive protein-protein interactions (PPI pairs resulting from computational predictions. Using experimentally obtained PPI pairs as a training dataset, eight top-ranking keywords were extracted from GO molecular function annotations. The sensitivity of these keywords is 64.21% in the yeast experimental dataset and 80.83% in the worm experimental dataset. The specificities, a measure of recovery power, of these keywords applied to four predicted PPI datasets for each studied organisms, are 48.32% and 46.49% (by average of four datasets in yeast and worm, respectively. Based on eight top-ranking keywords and co-localization of interacting proteins a set of two knowledge rules were deduced and applied to remove false positive protein pairs. The 'strength', a measure of improvement provided by the rules was defined based on the signal-to-noise ratio and implemented to measure the applicability of knowledge rules applying to the predicted PPI datasets. Depending on the employed PPI-predicting methods, the strength varies between two and ten-fold of randomly removing protein pairs from the datasets. Conclusion Gene Ontology annotations along with the deduced knowledge rules could be implemented to partially

  20. Modulation of Colorectal Cancer Risk by Polymorphisms in 51Gln/His, 64Ile/Val, and 148Asp/Glu of APEX Gene; 23Gly/Ala of XPA Gene; and 689Ser/Arg of ERCC4 Gene

    Directory of Open Access Journals (Sweden)

    L. Dziki

    2017-01-01

    Full Text Available Polymorphisms in DNA repair genes may affect the activity of the BER (base excision repair and NER (nucleotide excision repair systems. Using DNA isolated from blood taken from patients (n=312 and a control group (n=320 with CRC, we have analyzed the polymorphisms of selected DNA repair genes and we have demonstrated that genotypes 51Gln/His and 148Asp/Glu of APEX gene and 23Gly/Ala of XPA gene may increase the risk of colorectal cancer. At the same time analyzing the gene-gene interactions, we suggest the thesis that the main factor to be considered when analyzing the impact of polymorphisms on the risk of malignant transformation should be intergenic interactions. Moreover, we are suggesting that some polymorphisms may have impact not only on the malignant transformation but also on the stage of the tumor.

  1. A mechanistic explanation of popularity: genes, rule breaking, and evocative gene-environment correlations.

    Science.gov (United States)

    Burt, Alexandra

    2009-04-01

    Previous work has suggested that the serotonergic system plays a key role in "popularity" or likeability. A polymorphism within the 5HT-sub(2A) serotonin receptor gene (-G1438A) has also been associated with popularity, suggesting that genes may predispose individuals to particular social experiences. However, because genes cannot code directly for others' reactions, any legitimate association should be mediated via the individual's behavior (i.e., genes-->behaviors-->social consequences), a phenomenon referred to as an evocative gene-environment correlation (rGE). The current study aimed to identify one such mediating behavior. The author focused on rule breaking given its prior links to both the serotonergic system and to increased popularity during adolescence. Two samples of previously unacquainted late-adolescent boys completed a peer-based interaction paradigm designed to assess their popularity. Analyses revealed that rule breaking partially mediated the genetic effect on popularity, thereby furthering our understanding of the biological mechanisms that underlie popularity. Moreover, the present results represent the first meaningfully explicated evidence that genes predispose individuals not only to particular behaviors but also to the social consequences of those behaviors. (c) 2009 APA, all rights reserved.

  2. A double-mutant collection targeting MAP kinase related genes in Arabidopsis for studying genetic interactions.

    Science.gov (United States)

    Su, Shih-Heng; Krysan, Patrick J

    2016-12-01

    Mitogen-activated protein kinase cascades are conserved in all eukaryotes. In Arabidopsis thaliana there are approximately 80 genes encoding MAP kinase kinase kinases (MAP3K), 10 genes encoding MAP kinase kinases (MAP2K), and 20 genes encoding MAP kinases (MAPK). Reverse genetic analysis has failed to reveal abnormal phenotypes for a majority of these genes. One strategy for uncovering gene function when single-mutant lines do not produce an informative phenotype is to perform a systematic genetic interaction screen whereby double-mutants are created from a large library of single-mutant lines. Here we describe a new collection of 275 double-mutant lines derived from a library of single-mutants targeting genes related to MAP kinase signaling. To facilitate this study, we developed a high-throughput double-mutant generating pipeline using a system for growing Arabidopsis seedlings in 96-well plates. A quantitative root growth assay was used to screen for evidence of genetic interactions in this double-mutant collection. Our screen revealed four genetic interactions, all of which caused synthetic enhancement of the root growth defects observed in a MAP kinase 4 (MPK4) single-mutant line. Seeds for this double-mutant collection are publicly available through the Arabidopsis Biological Resource Center. Scientists interested in diverse biological processes can now screen this double-mutant collection under a wide range of growth conditions in order to search for additional genetic interactions that may provide new insights into MAP kinase signaling. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  3. SoFoCles: feature filtering for microarray classification based on gene ontology.

    Science.gov (United States)

    Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

    2010-02-01

    Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.

  4. Interactions between SNPs affecting inflammatory response genes are associated with multiple myeloma disease risk and survival

    DEFF Research Database (Denmark)

    Nielsen, Kaspar René; Rodrigo-Domingo, Maria; Steffensen, Rudi

    2017-01-01

    The origin of multiple myeloma depends on interactions with stromal cells in the course of normal B-cell differentiation and evolution of immunity. The concept of the present study is that genes involved in MM pathogenesis, such as immune response genes, can be identified by screening for single......3L1 gene promoters. The occurrence of single polymorphisms, haplotypes and SNP-SNP interactions were statistically analyzed for association with disease risk and outcome following high-dose therapy. Identified genes that carried SNPs or haplotypes that were identified as risk or prognostic factors......= .005). The 'risk genes' were analyzed for expression in normal B-cell subsets (N = 6) from seven healthy donors and we found TNFA and IL-6 expressed both in naïve and in memory B cells when compared to preBI, II, immature and plasma cells. The 'prognosis genes' CHI3L1, IL-6 and IL-10 were differential...

  5. Gene-based Association Approach Identify Genes Across Stress Traits in Fruit Flies

    DEFF Research Database (Denmark)

    Rohde, Palle Duun; Edwards, Stefan McKinnon; Sarup, Pernille Merete

    Identification of genes explaining variation in quantitative traits or genetic risk factors of human diseases requires both good phenotypic- and genotypic data, but also efficient statistical methods. Genome-wide association studies may reveal association between phenotypic variation and variation...... approach grouping variants accordingly to gene position, thus lowering the number of statistical tests performed and increasing the probability of identifying genes with small to moderate effects. Using this approach we identify numerous genes associated with different types of stresses in Drosophila...... melanogaster, but also identify common genes that affects the stress traits....

  6. BEEF CATTLE MUSCULARITY CANDIDATE GENES

    Directory of Open Access Journals (Sweden)

    Irida Novianti

    2010-04-01

    Full Text Available Muscularity is a potential indicator for the selection of more productive cattle. Mapping quantitative trait loci (QTL for traits related to muscularity is useful to identify the genomic regions where the genes affecting muscularity reside. QTL analysis from a Limousin-Jersey double backcross herd was conducted using QTL Express software with cohort and breed as the fixed effects. Nine QTL suggested to have an association with muscularity were identified on cattle chromosomes BTA 1, 2, 3, 4, 5, 8, 12, 14 and 17. The myostatin gene is located at the centromeric end of chromosome 2 and not surprisingly, the Limousin myostatin F94L variant accounted for the QTL on BTA2. However, when the myostatin F94L genotype was included as an additional fixed effect, the QTL on BTA17 was also no longer significant. This result suggests that there may be gene(s that have epistatic effects with myostatin located on cattle chromosome 17. Based on the position of the QTL in base pairs, all the genes that reside in the region were determined using the Ensembl data base (www.ensembl.org. There were two potential candidate genes residing within these QTL regions were selected. They were Smad nuclear interacting protein 1 (SNIP1 and similar to follistatin-like 5 (FSTL5. (JIIPB 2010 Vol 20 No 1: 1-10

  7. SLC6A1 gene involvement in susceptibility to attention-deficit/hyperactivity disorder: A case-control study and gene-environment interaction.

    Science.gov (United States)

    Yuan, Fang-Fen; Gu, Xue; Huang, Xin; Zhong, Yan; Wu, Jing

    2017-07-03

    Attention-deficit/hyperactivity disorder (ADHD) is an early onset childhood neurodevelopmental disorder with an estimated heritability of approximately 76%. We conducted a case-control study to explore the role of the SLC6A1 gene in ADHD. The genotypes of eight variants were determined using Sequenom MassARRAY technology. The participants in the study were 302 children with ADHD and 411 controls. ADHD symptoms were assessed using the Conners Parent Symptom Questionnaire. In our study, rs2944366 was consistently shown to be associated with the ADHD risk in the dominant model (odds ratio [OR]=0.554, 95% confidence interval [CI]=0.404-0.760), and nominally associated with Hyperactive index score (P=0.027). In addition, rs1170695 has been found to be associated with the ADHD risk in the addictive model (OR=1.457, 95%CI=1.173-1.809), while rs9990174 was associated with the Hyperactive index score (P=0.010). Intriguingly, gene-environmental interactions analysis consistently revealed the potential interactions of rs1170695 with blood lead (P mul =0.044) to modify the ADHD risk. Expression quantitative trait loci analysis suggested that these positive single nucleotide polymorphisms (SNPs) may mediate SLC6A1 gene expression. Therefore, our results suggest that selected SLC6A1 gene variants may have a significant effect on the ADHD risk. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Directory of Open Access Journals (Sweden)

    Lemay Danielle G

    2012-09-01

    Full Text Available Abstract Background In previous studies, gene neighborhoods—spatial clusters of co-expressed genes in the genome—have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously. Results Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods. Conclusions Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The

  9. Identification of novel type 1 diabetes candidate genes by integrating genome-wide association data, protein-protein interactions, and human pancreatic islet gene expression

    DEFF Research Database (Denmark)

    Bergholdt, Regine; Brorsson, Caroline; Palleja, Albert

    2012-01-01

    Genome-wide association studies (GWAS) have heralded a new era in susceptibility locus discovery in complex diseases. For type 1 diabetes, >40 susceptibility loci have been discovered. However, GWAS do not inevitably lead to identification of the gene or genes in a given locus associated with dis......-cells. Our results provide novel insight to the mechanisms behind type 1 diabetes pathogenesis and, thus, may provide the basis for the design of novel treatment strategies.......Genome-wide association studies (GWAS) have heralded a new era in susceptibility locus discovery in complex diseases. For type 1 diabetes, >40 susceptibility loci have been discovered. However, GWAS do not inevitably lead to identification of the gene or genes in a given locus associated...... with disease, and they do not typically inform the broader context in which the disease genes operate. Here, we integrated type 1 diabetes GWAS data with protein-protein interactions to construct biological networks of relevance for disease. A total of 17 networks were identified. To prioritize...

  10. Sex-based differences in gene expression in hippocampus following postnatal lead exposure

    International Nuclear Information System (INIS)

    Schneider, J.S.; Anderson, D.W.; Sonnenahalli, H.; Vadigepalli, R.

    2011-01-01

    The influence of sex as an effect modifier of childhood lead poisoning has received little systematic attention. Considering the paucity of information available concerning the interactive effects of lead and sex on the brain, the current study examined the interactive effects of lead and sex on gene expression patterns in the hippocampus, a structure involved in learning and memory. Male or female rats were fed either 1500 ppm lead-containing chow or control chow for 30 days beginning at weaning.Blood lead levels were 26.7 ± 2.1 μg/dl and 27.1 ± 1.7 μg/dl for females and males, respectively. The expression of 175 unique genes was differentially regulated between control male and female rats. A total of 167 unique genes were differentially expressed in response to lead in either males or females. Lead exposure had a significant effect without a significant difference between male and female responses in 77 of these genes. In another set of 71 genes, there were significant differences in male vs. female response. A third set of 30 genes was differentially expressed in opposite directions in males vs. females, with the majority of genes expressed at a lower level in females than in males. Highly differentially expressed genes in males and females following lead exposure were associated with diverse biological pathways and functions. These results show that a brief exposure to lead produced significant changes in expression of a variety of genes in the hippocampus and that the response of the brain to a given lead exposure may vary depending on sex. - Highlights: → Postnatal lead exposure has a significant effect on hippocampal gene expression patterns. → At least one set of genes was affected in opposite directions in males and females. → Differentially expressed genes were associated with diverse biological pathways.

  11. Microarray-based screening of differentially expressed genes in glucocorticoid-induced avascular necrosis

    Science.gov (United States)

    Huang, Gangyong; Wei, Yibing; Zhao, Guanglei; Xia, Jun; Wang, Siqun; Wu, Jianguo; Chen, Feiyan; Chen, Jie; Shi, Jingshen

    2017-01-01

    The underlying mechanisms of glucocorticoid (GC)-induced avascular necrosis of the femoral head (ANFH) have yet to be fully understood, in particular the mechanisms associated with the change of gene expression pattern. The present study aimed to identify key genes with a differential expression pattern in GC-induced ANFH. E-MEXP-2751 microarray data were downloaded from the ArrayExpress database. Differentially expressed genes (DEGs) were identified in 5 femoral head samples of steroid-induced ANFH rats compared with 5 placebo-treated rat samples. Gene Ontology (GO) and pathway enrichment analyses were performed upon these DEGs. A total 93 DEGs (46 upregulated and 47 downregulated genes) were identified in GC-induced ANFH samples. These DEGs were enriched in different GO terms and pathways, including chondrocyte differentiation and detection of chemical stimuli. The enrichment map revealed that skeletal system development was interconnected with several other GO terms by gene overlap. The literature mined network analysis revealed that 5 upregulated genes were associated with femoral necrosis, including parathyroid hormone receptor 1 (PTHR1), vitamin D (1,25-Dihydroxyvitamin D3) receptor (VDR), collagen, type II, α1, proprotein convertase subtilisin/kexin type 6 and zinc finger protein 354C (ZFP354C). In addition, ZFP354C and VDR were identified to transcription factors. Furthermore, PTHR1 was revealed to interact with VDR, and α-2-macroglobulin (A2M) interacted with fibronectin 1 (FN1) in the PPI network. PTHR1 may be involved in GC-induced ANFH via interacting with VDR. A2M may also be involved in the development of GC-induced ANFH through interacting with FN1. An improved understanding of the molecular mechanisms underlying GC-induced ANFH may provide novel targets for diagnostics and therapeutic treatment. PMID:28393228

  12. Microarray‑based screening of differentially expressed genes in glucocorticoid‑induced avascular necrosis.

    Science.gov (United States)

    Huang, Gangyong; Wei, Yibing; Zhao, Guanglei; Xia, Jun; Wang, Siqun; Wu, Jianguo; Chen, Feiyan; Chen, Jie; Shi, Jingshen

    2017-06-01

    The underlying mechanisms of glucocorticoid (GC)‑induced avascular necrosis of the femoral head (ANFH) have yet to be fully understood, in particular the mechanisms associated with the change of gene expression pattern. The present study aimed to identify key genes with a differential expression pattern in GC‑induced ANFH. E‑MEXP‑2751 microarray data were downloaded from the ArrayExpress database. Differentially expressed genes (DEGs) were identified in 5 femoral head samples of steroid‑induced ANFH rats compared with 5 placebo‑treated rat samples. Gene Ontology (GO) and pathway enrichment analyses were performed upon these DEGs. A total 93 DEGs (46 upregulated and 47 downregulated genes) were identified in GC‑induced ANFH samples. These DEGs were enriched in different GO terms and pathways, including chondrocyte differentiation and detection of chemical stimuli. The enrichment map revealed that skeletal system development was interconnected with several other GO terms by gene overlap. The literature mined network analysis revealed that 5 upregulated genes were associated with femoral necrosis, including parathyroid hormone receptor 1 (PTHR1), vitamin D (1,25‑Dihydroxyvitamin D3) receptor (VDR), collagen, type II, α1, proprotein convertase subtilisin/kexin type 6 and zinc finger protein 354C (ZFP354C). In addition, ZFP354C and VDR were identified to transcription factors. Furthermore, PTHR1 was revealed to interact with VDR, and α‑2‑macroglobulin (A2M) interacted with fibronectin 1 (FN1) in the PPI network. PTHR1 may be involved in GC‑induced ANFH via interacting with VDR. A2M may also be involved in the development of GC‑induced ANFH through interacting with FN1. An improved understanding of the molecular mechanisms underlying GC‑induced ANFH may provide novel targets for diagnostics and therapeutic treatment.

  13. A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network

    Directory of Open Access Journals (Sweden)

    Han Kyungsook

    2010-06-01

    Full Text Available Abstract Background Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to the relationship between double concurrent perturbation and various higher level phenotypic changes, e.g. those in cells, tissues or organs. Modifier screens, such as synthetic genetic arrays (SGA can help us to understand the phenotype caused by combined gene mutations. Unfortunately, exhaustive tests on all possible combined mutations in any genome are vulnerable to combinatorial explosion and are infeasible either technically or financially. Therefore, an accurate computational approach to predict genetic interaction is highly desirable, and such methods have the potential of alleviating the bottleneck on experiment design. Results In this work, we introduce a computational systems biology approach for the accurate prediction of pairwise synthetic genetic interactions (SGI. First, a high-coverage and high-precision functional gene network (FGN is constructed by integrating protein-protein interaction (PPI, protein complex and gene expression data; then, a graph-based semi-supervised learning (SSL classifier is utilized to identify SGI, where the topological properties of protein pairs in weighted FGN is used as input features of the classifier. We compare the proposed SSL method with the state-of-the-art supervised classifier, the support vector machines (SVM, on a benchmark dataset in S. cerevisiae to validate our method's ability to distinguish synthetic genetic interactions from non-interaction gene pairs. Experimental results show that the proposed method can accurately predict genetic interactions in S. cerevisiae (with a sensitivity of 92% and specificity of 91%. Noticeably, the SSL method is more efficient than SVM, especially for

  14. Nearest Neighbor Networks: clustering expression data based on gene neighborhoods

    Directory of Open Access Journals (Sweden)

    Olszewski Kellen L

    2007-07-01

    Full Text Available Abstract Background The availability of microarrays measuring thousands of genes simultaneously across hundreds of biological conditions represents an opportunity to understand both individual biological pathways and the integrated workings of the cell. However, translating this amount of data into biological insight remains a daunting task. An important initial step in the analysis of microarray data is clustering of genes with similar behavior. A number of classical techniques are commonly used to perform this task, particularly hierarchical and K-means clustering, and many novel approaches have been suggested recently. While these approaches are useful, they are not without drawbacks; these methods can find clusters in purely random data, and even clusters enriched for biological functions can be skewed towards a small number of processes (e.g. ribosomes. Results We developed Nearest Neighbor Networks (NNN, a graph-based algorithm to generate clusters of genes with similar expression profiles. This method produces clusters based on overlapping cliques within an interaction network generated from mutual nearest neighborhoods. This focus on nearest neighbors rather than on absolute distance measures allows us to capture clusters with high connectivity even when they are spatially separated, and requiring mutual nearest neighbors allows genes with no sufficiently similar partners to remain unclustered. We compared the clusters generated by NNN with those generated by eight other clustering methods. NNN was particularly successful at generating functionally coherent clusters with high precision, and these clusters generally represented a much broader selection of biological processes than those recovered by other methods. Conclusion The Nearest Neighbor Networks algorithm is a valuable clustering method that effectively groups genes that are likely to be functionally related. It is particularly attractive due to its simplicity, its success in the

  15. The integration of weighted human gene association networks based on link prediction.

    Science.gov (United States)

    Yang, Jian; Yang, Tinghong; Wu, Duzhi; Lin, Limei; Yang, Fan; Zhao, Jing

    2017-01-31

    Physical and functional interplays between genes or proteins have important biological meaning for cellular functions. Some efforts have been made to construct weighted gene association meta-networks by integrating multiple biological resources, where the weight indicates the confidence of the interaction. However, it is found that these existing human gene association networks share only quite limited overlapped interactions, suggesting their incompleteness and noise. Here we proposed a workflow to construct a weighted human gene association network using information of six existing networks, including two weighted specific PPI networks and four gene association meta-networks. We applied link prediction algorithm to predict possible missing links of the networks, cross-validation approach to refine each network and finally integrated the refined networks to get the final integrated network. The common information among the refined networks increases notably, suggesting their higher reliability. Our final integrated network owns much more links than most of the original networks, meanwhile its links still keep high functional relevance. Being used as background network in a case study of disease gene prediction, the final integrated network presents good performance, implying its reliability and application significance. Our workflow could be insightful for integrating and refining existing gene association data.

  16. Model-based gene set analysis for Bioconductor.

    Science.gov (United States)

    Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

    2011-07-01

    Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.

  17. Multi-label literature classification based on the Gene Ontology graph

    Directory of Open Access Journals (Sweden)

    Lu Xinghua

    2008-12-01

    Full Text Available Abstract Background The Gene Ontology is a controlled vocabulary for representing knowledge related to genes and proteins in a computable form. The current effort of manually annotating proteins with the Gene Ontology is outpaced by the rate of accumulation of biomedical knowledge in literature, which urges the development of text mining approaches to facilitate the process by automatically extracting the Gene Ontology annotation from literature. The task is usually cast as a text classification problem, and contemporary methods are confronted with unbalanced training data and the difficulties associated with multi-label classification. Results In this research, we investigated the methods of enhancing automatic multi-label classification of biomedical literature by utilizing the structure of the Gene Ontology graph. We have studied three graph-based multi-label classification algorithms, including a novel stochastic algorithm and two top-down hierarchical classification methods for multi-label literature classification. We systematically evaluated and compared these graph-based classification algorithms to a conventional flat multi-label algorithm. The results indicate that, through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods can significantly improve predictions of the Gene Ontology terms implied by the analyzed text. Furthermore, the graph-based multi-label classifiers are capable of suggesting Gene Ontology annotations (to curators that are closely related to the true annotations even if they fail to predict the true ones directly. A software package implementing the studied algorithms is available for the research community. Conclusion Through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods have better potential than the conventional flat multi-label classification approach to facilitate

  18. Radionuclide reporter gene imaging for cardiac gene therapy

    International Nuclear Information System (INIS)

    Inubushi, Masayuki; Tamaki, Nagara

    2007-01-01

    In the field of cardiac gene therapy, angiogenic gene therapy has been most extensively investigated. The first clinical trial of cardiac angiogenic gene therapy was reported in 1998, and at the peak, more than 20 clinical trial protocols were under evaluation. However, most trials have ceased owing to the lack of decisive proof of therapeutic effects and the potential risks of viral vectors. In order to further advance cardiac angiogenic gene therapy, remaining open issues need to be resolved: there needs to be improvement of gene transfer methods, regulation of gene expression, development of much safer vectors and optimisation of therapeutic genes. For these purposes, imaging of gene expression in living organisms is of great importance. In radionuclide reporter gene imaging, ''reporter genes'' transferred into cell nuclei encode for a protein that retains a complementary ''reporter probe'' of a positron or single-photon emitter; thus expression of the reporter genes can be imaged with positron emission tomography or single-photon emission computed tomography. Accordingly, in the setting of gene therapy, the location, magnitude and duration of the therapeutic gene co-expression with the reporter genes can be monitored non-invasively. In the near future, gene therapy may evolve into combination therapy with stem/progenitor cell transplantation, so-called cell-based gene therapy or gene-modified cell therapy. Radionuclide reporter gene imaging is now expected to contribute in providing evidence on the usefulness of this novel therapeutic approach, as well as in investigating the molecular mechanisms underlying neovascularisation and safety issues relevant to further progress in conventional gene therapy. (orig.)

  19. Interaction between LRP5 and periostin gene polymorphisms on serum periostin levels and cortical bone microstructure.

    Science.gov (United States)

    Pepe, J; Bonnet, N; Herrmann, F R; Biver, E; Rizzoli, R; Chevalley, T; Ferrari, S L

    2018-02-01

    We investigated the interaction between periostin SNPs and the SNPs of the genes assumed to modulate serum periostin levels and bone microstructure in a cohort of postmenopausal women. We identified an interaction between LRP5 SNP rs648438 and periostin SNP rs9547970 on serum periostin levels and on radial cortical porosity. The purpose of this study is to investigate the interaction between periostin gene polymorphisms (SNPs) and other genes potentially responsible for modulating serum periostin levels and bone microstructure in a cohort of postmenopausal women. In 648 postmenopausal women from the Geneva Retirees Cohort, we analyzed 6 periostin SNPs and another 149 SNPs in 14 genes, namely BMP2, CTNNB1, ESR1, ESR2, LRP5, LRP6, PTH, SPTBN1, SOST, TGFb1, TNFRSF11A, TNFSF11, TNFRSF11B and WNT16. Volumetric BMD and bone microstructure were measured by high-resolution peripheral quantitative computed tomography at the distal radius and tibia. Serum periostin levels were associated with radial cortical porosity, including after adjustment for age, BMI, and years since menopause (p = 0.036). Sixteen SNPs in the ESR1, LRP5, TNFRSF11A, SOST, SPTBN1, TNFRSF11B and TNFSF11 genes were associated with serum periostin levels (p range 0.03-0.001) whereas 26 SNPs in 9 genes were associated with cortical porosity at the radius and/or at the tibia. WNT 16 was the gene with the highest number of SNPs associated with both trabecular and cortical microstructure. The periostin SNP rs9547970 was also associated with cortical porosity (p = 0.04). In particular, SNPs in LRP5, ESR1 and near the TNFRSF11A gene were associated with both cortical porosity and serum periostin levels. Eventually, we identified an interaction between LRP5 SNP rs648438 and periostin SNP rs9547970 on serum periostin levels (interaction p = 0.01) and on radial cortical porosity (interaction p = 0.005). These results suggest that periostin expression is genetically modulated, particularly by polymorphisms

  20. Oh, Behave! Behavior as an Interaction between Genes & the Environment

    Science.gov (United States)

    Weigel, Emily G.; DeNieu, Michael; Gall, Andrew J.

    2014-01-01

    This lesson is designed to teach students that behavior is a trait shaped by both genes and the environment. Students will read a scientific paper, discuss and generate predictions based on the ideas and data therein, and model the relationships between genes, the environment, and behavior. The lesson is targeted to meet the educational goals of…

  1. A modifier screen for Bazooka/PAR-3 interacting genes in the Drosophila embryo epithelium.

    Directory of Open Access Journals (Sweden)

    Wei Shao

    2010-04-01

    Full Text Available The development and homeostasis of multicellular organisms depends on sheets of epithelial cells. Bazooka (Baz; PAR-3 localizes to the apical circumference of epithelial cells and is a key hub in the protein interaction network regulating epithelial structure. We sought to identify additional proteins that function with Baz to regulate epithelial structure in the Drosophila embryo.The baz zygotic mutant cuticle phenotype could be dominantly enhanced by loss of known interaction partners. To identify additional enhancers, we screened molecularly defined chromosome 2 and 3 deficiencies. 37 deficiencies acted as strong dominant enhancers. Using deficiency mapping, bioinformatics, and available single gene mutations, we identified 17 interacting genes encoding known and predicted polarity, cytoskeletal, transmembrane, trafficking and signaling proteins. For each gene, their loss of function enhanced adherens junction defects in zygotic baz mutants during early embryogenesis. To further evaluate involvement in epithelial polarity, we generated GFP fusion proteins for 15 of the genes which had not been found to localize to the apical domain previously. We found that GFP fusion proteins for Drosophila ASAP, Arf79F, CG11210, Septin 5 and Sds22 could be recruited to the apical circumference of epithelial cells. Nine of the other proteins showed various intracellular distributions, and one was not detected.Our enhancer screen identified 17 genes that function with Baz to regulate epithelial structure in the Drosophila embryo. Our secondary localization screen indicated that some of the proteins may affect epithelial cell polarity by acting at the apical cell cortex while others may act through intracellular processes. For 13 of the 17 genes, this is the first report of a link to baz or the regulation of epithelial structure.

  2. Multiple genetic interaction experiments provide complementary information useful for gene function prediction.

    Directory of Open Access Journals (Sweden)

    Magali Michaut

    Full Text Available Genetic interactions help map biological processes and their functional relationships. A genetic interaction is defined as a deviation from the expected phenotype when combining multiple genetic mutations. In Saccharomyces cerevisiae, most genetic interactions are measured under a single phenotype - growth rate in standard laboratory conditions. Recently genetic interactions have been collected under different phenotypic readouts and experimental conditions. How different are these networks and what can we learn from their differences? We conducted a systematic analysis of quantitative genetic interaction networks in yeast performed under different experimental conditions. We find that networks obtained using different phenotypic readouts, in different conditions and from different laboratories overlap less than expected and provide significant unique information. To exploit this information, we develop a novel method to combine individual genetic interaction data sets and show that the resulting network improves gene function prediction performance, demonstrating that individual networks provide complementary information. Our results support the notion that using diverse phenotypic readouts and experimental conditions will substantially increase the amount of gene function information produced by genetic interaction screens.

  3. Gene expression analysis uncovers novel Hedgehog interacting protein (HHIP) effects in human bronchial epithelial cells

    Science.gov (United States)

    Zhou, Xiaobo; Qiu, Weiliang; Sathirapongsasuti, J. Fah.; Cho, Michael H.; Mancini, John D.; Lao, Taotao; Thibault, Derek M.; Litonjua, Gus; Bakke, Per S.; Gulsvik, Amund; Lomas, David A.; Beaty, Terri H.; Hersh, Craig P.; Anderson, Christopher; Geigenmuller, Ute; Raby, Benjamin A.; Rennard, Stephen I.; Perrella, Mark A.; Choi, Augustine M.K.; Quackenbush, John; Silverman, Edwin K.

    2013-01-01

    Hedgehog Interacting Protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis. PMID:23459001

  4. A sight on the current nanoparticle-based gene delivery vectors

    Science.gov (United States)

    Dizaj, Solmaz Maleki; Jafari, Samira; Khosroushahi, Ahmad Yari

    2014-05-01

    Nowadays, gene delivery for therapeutic objects is considered one of the most promising strategies to cure both the genetic and acquired diseases of human. The design of efficient gene delivery vectors possessing the high transfection efficiencies and low cytotoxicity is considered the major challenge for delivering a target gene to specific tissues or cells. On this base, the investigations on non-viral gene vectors with the ability to overcome physiological barriers are increasing. Among the non-viral vectors, nanoparticles showed remarkable properties regarding gene delivery such as the ability to target the specific tissue or cells, protect target gene against nuclease degradation, improve DNA stability, and increase the transformation efficiency or safety. This review attempts to represent a current nanoparticle based on its lipid, polymer, hybrid, and inorganic properties. Among them, hybrids, as efficient vectors, are utilized in gene delivery in terms of materials (synthetic or natural), design, and in vitro/ in vivo transformation efficiency.

  5. Gene Ontology

    Directory of Open Access Journals (Sweden)

    Gaston K. Mazandu

    2012-01-01

    Full Text Available The wide coverage and biological relevance of the Gene Ontology (GO, confirmed through its successful use in protein function prediction, have led to the growth in its popularity. In order to exploit the extent of biological knowledge that GO offers in describing genes or groups of genes, there is a need for an efficient, scalable similarity measure for GO terms and GO-annotated proteins. While several GO similarity measures exist, none adequately addresses all issues surrounding the design and usage of the ontology. We introduce a new metric for measuring the distance between two GO terms using the intrinsic topology of the GO-DAG, thus enabling the measurement of functional similarities between proteins based on their GO annotations. We assess the performance of this metric using a ROC analysis on human protein-protein interaction datasets and correlation coefficient analysis on the selected set of protein pairs from the CESSM online tool. This metric achieves good performance compared to the existing annotation-based GO measures. We used this new metric to assess functional similarity between orthologues, and show that it is effective at determining whether orthologues are annotated with similar functions and identifying cases where annotation is inconsistent between orthologues.

  6. The Integration of Epistasis Network and Functional Interactions in a GWAS Implicates RXR Pathway Genes in the Immune Response to Smallpox Vaccine.

    Directory of Open Access Journals (Sweden)

    Brett A McKinney

    Full Text Available Although many diseases and traits show large heritability, few genetic variants have been found to strongly separate phenotype groups by genotype. Complex regulatory networks of variants and expression of multiple genes lead to small individual-variant effects and difficulty replicating the effect of any single variant in an affected pathway. Interaction network modeling of GWAS identifies effects ignored by univariate models, but population differences may still cause specific genes to not replicate. Integrative network models may help detect indirect effects of variants in the underlying biological pathway. In this study, we used gene-level functional interaction information from the Integrative Multi-species Prediction (IMP tool to reveal important genes associated with a complex phenotype through evidence from epistasis networks and pathway enrichment. We test this method for augmenting variant-based network analyses with functional interactions by applying it to a smallpox vaccine immune response GWAS. The integrative analysis spotlights the role of genes related to retinoid X receptor alpha (RXRA, which has been implicated in a previous epistasis network analysis of smallpox vaccine.

  7. Cis-regulatory element based targeted gene finding: genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana.

    Science.gov (United States)

    Zhang, Weixiong; Ruan, Jianhua; Ho, Tuan-Hua David; You, Youngsook; Yu, Taotao; Quatrano, Ralph S

    2005-07-15

    A fundamental problem of computational genomics is identifying the genes that respond to certain endogenous cues and environmental stimuli. This problem can be referred to as targeted gene finding. Since gene regulation is mainly determined by the binding of transcription factors and cis-regulatory DNA sequences, most existing gene annotation methods, which exploit the conservation of open reading frames, are not effective in finding target genes. A viable approach to targeted gene finding is to exploit the cis-regulatory elements that are known to be responsible for the transcription of target genes. Given such cis-elements, putative target genes whose promoters contain the elements can be identified. As a case study, we apply the above approach to predict the genes in model plant Arabidopsis thaliana which are inducible by a phytohormone, abscisic acid (ABA), and abiotic stress, such as drought, cold and salinity. We first construct and analyze two ABA specific cis-elements, ABA-responsive element (ABRE) and its coupling element (CE), in A.thaliana, based on their conservation in rice and other cereal plants. We then use the ABRE-CE module to identify putative ABA-responsive genes in A.thaliana. Based on RT-PCR verification and the results from literature, this method has an accuracy rate of 67.5% for the top 40 predictions. The cis-element based targeted gene finding approach is expected to be widely applicable since a large number of cis-elements in many species are available.

  8. Density based pruning for identification of differentially expressed genes from microarray data

    Directory of Open Access Journals (Sweden)

    Xu Jia

    2010-11-01

    Full Text Available Abstract Motivation Identification of differentially expressed genes from microarray datasets is one of the most important analyses for microarray data mining. Popular algorithms such as statistical t-test rank genes based on a single statistics. The false positive rate of these methods can be improved by considering other features of differentially expressed genes. Results We proposed a pattern recognition strategy for identifying differentially expressed genes. Genes are mapped to a two dimension feature space composed of average difference of gene expression and average expression levels. A density based pruning algorithm (DB Pruning is developed to screen out potential differentially expressed genes usually located in the sparse boundary region. Biases of popular algorithms for identifying differentially expressed genes are visually characterized. Experiments on 17 datasets from Gene Omnibus Database (GEO with experimentally verified differentially expressed genes showed that DB pruning can significantly improve the prediction accuracy of popular identification algorithms such as t-test, rank product, and fold change. Conclusions Density based pruning of non-differentially expressed genes is an effective method for enhancing statistical testing based algorithms for identifying differentially expressed genes. It improves t-test, rank product, and fold change by 11% to 50% in the numbers of identified true differentially expressed genes. The source code of DB pruning is freely available on our website http://mleg.cse.sc.edu/degprune

  9. Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

    Science.gov (United States)

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-04-21

    To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease

  10. Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

    Directory of Open Access Journals (Sweden)

    Mixon Mark

    2009-04-01

    Full Text Available Abstract Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene

  11. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.

    Science.gov (United States)

    Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter

    2014-09-24

    Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data

  12. Microsatellite polymorphisms associated with human behavioural and psychological phenotypes including a gene-environment interaction.

    Science.gov (United States)

    Bagshaw, Andrew T M; Horwood, L John; Fergusson, David M; Gemmell, Neil J; Kennedy, Martin A

    2017-02-03

    The genetic and environmental influences on human personality and behaviour are a complex matter of ongoing debate. Accumulating evidence indicates that short tandem repeats (STRs) in regulatory regions are good candidates to explain heritability not accessed by genome-wide association studies. We tested for associations between the genotypes of four selected repeats and 18 traits relating to personality, behaviour, cognitive ability and mental health in a well-studied longitudinal birth cohort (n = 458-589) using one way analysis of variance. The repeats were a highly conserved poly-AC microsatellite in the upstream promoter region of the T-box brain 1 (TBR1) gene and three previously studied STRs in the activating enhancer-binding protein 2-beta (AP2-β) and androgen receptor (AR) genes. Where significance was found we used multiple regression to assess the influence of confounding factors. Carriers of the shorter, most common, allele of the AR gene's GGN microsatellite polymorphism had fewer anxiety-related symptoms, which was consistent with previous studies, but in our study this was not significant following Bonferroni correction. No associations with two repeats in the AP2-β gene withstood this correction. A novel finding was that carriers of the minor allele of the TBR1 AC microsatellite were at higher risk of conduct problems in childhood at age 7-9 (p = 0.0007, which did pass Bonferroni correction). Including maternal smoking during pregnancy (MSDP) in models controlling for potentially confounding influences showed that an interaction between TBR1 genotype and MSDP was a significant predictor of conduct problems in childhood and adolescence (p behaviour up to age 25 years (p ≤ 0.02). This interaction remained significant after controlling for possible confounders including maternal age at birth, socio-economic status and education, and offspring birth weight. The potential functional importance of the TBR1 gene's promoter microsatellite

  13. Clock gene modulates roles of OXTR and AVPR1b genes in prosociality.

    Directory of Open Access Journals (Sweden)

    Haipeng Ci

    Full Text Available BACKGROUND: The arginine vasopressin receptor (AVPR and oxytocin receptor (OXTR genes have been demonstrated to contribute to prosocial behavior. Recent research has focused on the manner by which these simple receptor genes influence prosociality, particularly with regard to the AVP system, which is modulated by the clock gene. The clock gene is responsible for regulating the human biological clock, affecting sleep, emotion and behavior. The current study examined in detail whether the influences of the OXTR and AVPR1b genes on prosociality are dependent on the clock gene. METHODOLOGY/PRINCIPAL FINDINGS: This study assessed interactions between the clock gene (rs1801260, rs6832769 and the OXTR (rs1042778, rs237887 and AVPR1b (rs28373064 genes in association with individual differences in prosociality in healthy male Chinese subjects (n = 436. The Prosocial Tendencies Measure (PTM-R was used to assess prosociality. Participants carrying both the GG/GA variant of AVPR1b rs28373064 and the AA variant of clock rs6832769 showed the highest scores on the Emotional PTM. Carriers of both the T allele of OXTR rs1042778 and the C allele of clock rs1801260 showed the lowest total PTM scores compared with the other groups. CONCLUSIONS: The observed interaction effects provide converging evidence that the clock gene and OXT/AVP systems are intertwined and contribute to human prosociality.

  14. Clock gene modulates roles of OXTR and AVPR1b genes in prosociality.

    Science.gov (United States)

    Ci, Haipeng; Wu, Nan; Su, Yanjie

    2014-01-01

    The arginine vasopressin receptor (AVPR) and oxytocin receptor (OXTR) genes have been demonstrated to contribute to prosocial behavior. Recent research has focused on the manner by which these simple receptor genes influence prosociality, particularly with regard to the AVP system, which is modulated by the clock gene. The clock gene is responsible for regulating the human biological clock, affecting sleep, emotion and behavior. The current study examined in detail whether the influences of the OXTR and AVPR1b genes on prosociality are dependent on the clock gene. This study assessed interactions between the clock gene (rs1801260, rs6832769) and the OXTR (rs1042778, rs237887) and AVPR1b (rs28373064) genes in association with individual differences in prosociality in healthy male Chinese subjects (n = 436). The Prosocial Tendencies Measure (PTM-R) was used to assess prosociality. Participants carrying both the GG/GA variant of AVPR1b rs28373064 and the AA variant of clock rs6832769 showed the highest scores on the Emotional PTM. Carriers of both the T allele of OXTR rs1042778 and the C allele of clock rs1801260 showed the lowest total PTM scores compared with the other groups. The observed interaction effects provide converging evidence that the clock gene and OXT/AVP systems are intertwined and contribute to human prosociality.

  15. A comparison of 100 human genes using an alu element-based instability model.

    Directory of Open Access Journals (Sweden)

    George W Cook

    Full Text Available The human retrotransposon with the highest copy number is the Alu element. The human genome contains over one million Alu elements that collectively account for over ten percent of our DNA. Full-length Alu elements are randomly distributed throughout the genome in both forward and reverse orientations. However, full-length widely spaced Alu pairs having two Alus in the same (direct orientation are statistically more prevalent than Alu pairs having two Alus in the opposite (inverted orientation. The cause of this phenomenon is unknown. It has been hypothesized that this imbalance is the consequence of anomalous inverted Alu pair interactions. One proposed mechanism suggests that inverted Alu pairs can ectopically interact, exposing both ends of each Alu element making up the pair to a potential double-strand break, or "hit". This hypothesized "two-hit" (two double-strand breaks potential per Alu element was used to develop a model for comparing the relative instabilities of human genes. The model incorporates both 1 the two-hit double-strand break potential of Alu elements and 2 the probability of exon-damaging deletions extending from these double-strand breaks. This model was used to compare the relative instabilities of 50 deletion-prone cancer genes and 50 randomly selected genes from the human genome. The output of the Alu element-based genomic instability model developed here is shown to coincide with the observed instability of deletion-prone cancer genes. The 50 cancer genes are collectively estimated to be 58% more unstable than the randomly chosen genes using this model. Seven of the deletion-prone cancer genes, ATM, BRCA1, FANCA, FANCD2, MSH2, NCOR1 and PBRM1, were among the most unstable 10% of the 100 genes analyzed. This algorithm may lay the foundation for comparing genetic risks posed by structural variations that are unique to specific individuals, families and people groups.

  16. Improving functional modules discovery by enriching interaction networks with gene profiles

    KAUST Repository

    Salem, Saeed; Alroobi, Rami; Banitaan, Shadi; Seridi, Loqmane; Aljarah, Ibrahim; Brewer, James

    2013-01-01

    networks. We demonstrate the effectiveness of CLARM on Yeast and Human interaction datasets, and gene expression and molecular function profiles. Experiments on these real datasets show that the CLARM approach is competitive to well established functional

  17. Social Regulation of Gene Expression in Threespine Sticklebacks.

    Directory of Open Access Journals (Sweden)

    Anna K Greenwood

    Full Text Available Identifying genes that are differentially expressed in response to social interactions is informative for understanding the molecular basis of social behavior. To address this question, we described changes in gene expression as a result of differences in the extent of social interactions. We housed threespine stickleback (Gasterosteus aculeatus females in either group conditions or individually for one week, then measured levels of gene expression in three brain regions using RNA-sequencing. We found that numerous genes in the hindbrain/cerebellum had altered expression in response to group or individual housing. However, relatively few genes were differentially expressed in either the diencephalon or telencephalon. The list of genes upregulated in fish from social groups included many genes related to neural development and cell adhesion as well as genes with functions in sensory signaling, stress, and social and reproductive behavior. The list of genes expressed at higher levels in individually-housed fish included several genes previously identified as regulated by social interactions in other animals. The identified genes are interesting targets for future research on the molecular mechanisms of normal social interactions.

  18. GOBO: gene expression-based outcome for breast cancer online.

    Directory of Open Access Journals (Sweden)

    Markus Ringnér

    Full Text Available Microarray-based gene expression analysis holds promise of improving prognostication and treatment decisions for breast cancer patients. However, the heterogeneity of breast cancer emphasizes the need for validation of prognostic gene signatures in larger sample sets stratified into relevant subgroups. Here, we describe a multifunctional user-friendly online tool, GOBO (http://co.bmc.lu.se/gobo, allowing a range of different analyses to be performed in an 1881-sample breast tumor data set, and a 51-sample breast cancer cell line set, both generated on Affymetrix U133A microarrays. GOBO supports a wide range of applications including: 1 rapid assessment of gene expression levels in subgroups of breast tumors and cell lines, 2 identification of co-expressed genes for creation of potential metagenes, 3 association with outcome for gene expression levels of single genes, sets of genes, or gene signatures in multiple subgroups of the 1881-sample breast cancer data set. The design and implementation of GOBO facilitate easy incorporation of additional query functions and applications, as well as additional data sets irrespective of tumor type and array platform.

  19. Gene-environment interaction in Major Depression: focus on experience-dependent biological systems

    Directory of Open Access Journals (Sweden)

    Nicola eLopizzo

    2015-05-01

    Full Text Available Major Depressive Disorder (MDD is a multifactorial and polygenic disorder, where multiple and partially overlapping sets of susceptibility genes interact each other and with the environment, predisposing individuals to the development of the illness. Thus, MDD results from a complex interplay of vulnerability genes and environmental factors that act cumulatively throughout individual's lifetime. Among these environmental factors, stressful life experiences, especially those occurring early in life, have been suggested to exert a crucial impact on brain development, leading to permanent functional changes that may contribute to life long risk for mental health outcomes. In this review we will discuss how genetic variants (polymorphisms, SNPs within genes operating in neurobiological systems that mediate stress response and synaptic plasticity, can impact, by themselves, the vulnerability risk for MDD; we will also consider how this MDD risk can be further modulated when gene X environment interaction is taken into account. Finally, we will discuss the role of epigenetic mechanisms, and in particular of DNA methylation and miRNAs expression changes, in mediating the effect of the stress on the vulnerability risk to develop MDD. Taken together, in this review we aim to underlie the role of genetic and epigenetic processes involved in stress and neuroplasticity related biological systems on development of MDD after exposure to early life stress, thereby building the basis for future research and clinical interventions.

  20. Attachment style and oxytocin receptor gene variation interact in influencing social anxiety.

    Science.gov (United States)

    Notzon, S; Domschke, K; Holitschke, K; Ziegler, C; Arolt, V; Pauli, P; Reif, A; Deckert, J; Zwanzger, P

    2016-01-01

    Social anxiety has been suggested to be promoted by an insecure attachment style. Oxytocin is discussed as a mediator of trust and social bonding as well as a modulator of social anxiety. Applying a gene-environment (G × E) interaction approach, in the present pilot study the main and interactive effects of attachment styles and oxytocin receptor (OXTR) gene variation were probed in a combined risk factor model of social anxiety in healthy probands. Participants (N = 388; 219 females, 169 males; age 24.7 ± 4.7 years) were assessed for anxiety in social situations (Social Phobia and Anxiety Inventory) depending on attachment style (Adult Attachment Scale, AAS) and OXTR rs53576 A/G genotype. A less secure attachment style was significantly associated with higher social anxiety. This association was partly modulated by OXTR genotype, with a stronger negative influence of a less secure attachment style on social anxiety in A allele carriers as compared to GG homozygotes. The present pilot data point to a strong association of less secure attachment and social anxiety as well as to a gene-environment interaction effect of OXTR rs53576 genotype and attachment style on social anxiety possibly constituting a targetable combined risk marker of social anxiety disorder.

  1. A network approach to predict pathogenic genes for Fusarium graminearum.

    Science.gov (United States)

    Liu, Xiaoping; Tang, Wei-Hua; Zhao, Xing-Ming; Chen, Luonan

    2010-10-04

    Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB), which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN) of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other pathogenic fungi, which

  2. A network approach to predict pathogenic genes for Fusarium graminearum.

    Directory of Open Access Journals (Sweden)

    Xiaoping Liu

    Full Text Available Fusarium graminearum is the pathogenic agent of Fusarium head blight (FHB, which is a destructive disease on wheat and barley, thereby causing huge economic loss and health problems to human by contaminating foods. Identifying pathogenic genes can shed light on pathogenesis underlying the interaction between F. graminearum and its plant host. However, it is difficult to detect pathogenic genes for this destructive pathogen by time-consuming and expensive molecular biological experiments in lab. On the other hand, computational methods provide an alternative way to solve this problem. Since pathogenesis is a complicated procedure that involves complex regulations and interactions, the molecular interaction network of F. graminearum can give clues to potential pathogenic genes. Furthermore, the gene expression data of F. graminearum before and after its invasion into plant host can also provide useful information. In this paper, a novel systems biology approach is presented to predict pathogenic genes of F. graminearum based on molecular interaction network and gene expression data. With a small number of known pathogenic genes as seed genes, a subnetwork that consists of potential pathogenic genes is identified from the protein-protein interaction network (PPIN of F. graminearum, where the genes in the subnetwork are further required to be differentially expressed before and after the invasion of the pathogenic fungus. Therefore, the candidate genes in the subnetwork are expected to be involved in the same biological processes as seed genes, which imply that they are potential pathogenic genes. The prediction results show that most of the pathogenic genes of F. graminearum are enriched in two important signal transduction pathways, including G protein coupled receptor pathway and MAPK signaling pathway, which are known related to pathogenesis in other fungi. In addition, several pathogenic genes predicted by our method are verified in other

  3. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E.; Re, Matteo

    2014-01-01

    Objective In the context of “network medicine”, gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. Materials and methods We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. Results The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different “informativeness” embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Conclusions Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further

  4. Identifying novel fruit-related genes in Arabidopsis thaliana based on the random walk with restart algorithm.

    Science.gov (United States)

    Zhang, Yunhua; Dai, Li; Liu, Ying; Zhang, YuHang; Wang, ShaoPeng

    2017-01-01

    Fruit is essential for plant reproduction and is responsible for protection and dispersal of seeds. The development and maturation of fruit is tightly regulated by numerous genetic factors that respond to environmental and internal stimulation. In this study, we attempted to identify novel fruit-related genes in a model organism, Arabidopsis thaliana, using a computational method. Based on validated fruit-related genes, the random walk with restart (RWR) algorithm was applied on a protein-protein interaction (PPI) network using these genes as seeds. The identified genes with high probabilities were filtered by the permutation test and linkage tests. In the permutation test, the genes that were selected due to the structure of the PPI network were discarded. In the linkage tests, the importance of each candidate gene was measured from two aspects: (1) its functional associations with validated genes and (2) its similarity with validated genes on gene ontology (GO) terms and KEGG pathways. Finally, 255 inferred genes were obtained, subsequent extensive analysis of important genes revealed that they mainly contribute to ubiquitination (UBQ9, UBQ8, UBQ11, UBQ10), serine hydroxymethyl transfer (SHM7, SHM5, SHM6) or glycol-metabolism (HXKL2_ARATH, CSY5, GAPCP1), suggesting essential roles during the development and maturation of fruit in Arabidopsis thaliana.

  5. New Genome Similarity Measures based on Conserved Gene Adjacencies.

    Science.gov (United States)

    Doerr, Daniel; Kowada, Luis Antonio B; Araujo, Eloi; Deshpande, Shachi; Dantas, Simone; Moret, Bernard M E; Stoye, Jens

    2017-06-01

    Many important questions in molecular biology, evolution, and biomedicine can be addressed by comparative genomic approaches. One of the basic tasks when comparing genomes is the definition of measures of similarity (or dissimilarity) between two genomes, for example, to elucidate the phylogenetic relationships between species. The power of different genome comparison methods varies with the underlying formal model of a genome. The simplest models impose the strong restriction that each genome under study must contain the same genes, each in exactly one copy. More realistic models allow several copies of a gene in a genome. One speaks of gene families, and comparative genomic methods that allow this kind of input are called gene family-based. The most powerful-but also most complex-models avoid this preprocessing of the input data and instead integrate the family assignment within the comparative analysis. Such methods are called gene family-free. In this article, we study an intermediate approach between family-based and family-free genomic similarity measures. Introducing this simpler model, called gene connections, we focus on the combinatorial aspects of gene family-free genome comparison. While in most cases, the computational costs to the general family-free case are the same, we also find an instance where the gene connections model has lower complexity. Within the gene connections model, we define three variants of genomic similarity measures that have different expression powers. We give polynomial-time algorithms for two of them, while we show NP-hardness for the third, most powerful one. We also generalize the measures and algorithms to make them more robust against recent local disruptions in gene order. Our theoretical findings are supported by experimental results, proving the applicability and performance of our newly defined similarity measures.

  6. Avirulence (AVR) Gene-Based Diagnosis Complements Existing Pathogen Surveillance Tools for Effective Deployment of Resistance (R) Genes Against Rice Blast Disease.

    Science.gov (United States)

    Selisana, S M; Yanoria, M J; Quime, B; Chaipanya, C; Lu, G; Opulencia, R; Wang, G-L; Mitchell, T; Correll, J; Talbot, N J; Leung, H; Zhou, B

    2017-06-01

    Avirulence (AVR) genes in Magnaporthe oryzae, the fungal pathogen that causes the devastating rice blast disease, have been documented to be major targets subject to mutations to avoid recognition by resistance (R) genes. In this study, an AVR-gene-based diagnosis tool for determining the virulence spectrum of a rice blast pathogen population was developed and validated. A set of 77 single-spore field isolates was subjected to pathotype analysis using differential lines, each containing a single R gene, and classified into 20 virulent pathotypes, except for 4 isolates that lost pathogenicity. In all, 10 differential lines showed low frequency (95%), inferring the effectiveness of R genes present in the respective differential lines. In addition, the haplotypes of seven AVR genes were determined by polymerase chain reaction amplification and sequencing, if applicable. The calculated frequency of different AVR genes displayed significant variations in the population. AVRPiz-t and AVR-Pii were detected in 100 and 84.9% of the isolates, respectively. Five AVR genes such as AVR-Pik-D (20.5%) and AVR-Pik-E (1.4%), AVRPiz-t (2.7%), AVR-Pita (0%), AVR-Pia (0%), and AVR1-CO39 (0%) displayed low or even zero frequency. The frequency of AVR genes correlated almost perfectly with the resistance frequency of the cognate R genes in differential lines, except for International Rice Research Institute-bred blast-resistant lines IRBLzt-T, IRBLta-K1, and IRBLkp-K60. Both genetic analysis and molecular marker validation revealed an additional R gene, most likely Pi19 or its allele, in these three differential lines. This can explain the spuriously higher resistance frequency of each target R gene based on conventional pathotyping. This study demonstrates that AVR-gene-based diagnosis provides a precise, R-gene-specific, and differential line-free assessment method that can be used for determining the virulence spectrum of a rice blast pathogen population and for predicting the

  7. Interactions between Bmp-4 and Msx-1 act to restrict gene expression to odontogenic mesenchyme.

    Science.gov (United States)

    Tucker, A S; Al Khamis, A; Sharpe, P T

    1998-08-01

    Tooth development is regulated by a reciprocal series of epithelial-mesenchymal interactions. Bmp4 has been identified as a candidate signalling molecule in these interactions, initially as an epithelial signal and then later at the bud stage as a mesenchymal signal (Vainio et al. [1993] Cell 75:45-58). A target gene for Bmp4 signalling is the homeobox gene Msx-1, identified by the ability of recombinant Bmp4 protein to induce expression in mesenchyme. There is, however, no evidence that Bmp4 is the endogenous inducer of Msx-1 expression. Msx-1 and Bmp-4 show dynamic, interactive patterns of expression in oral epithelium and ectomesenchyme during the early stages of tooth development. In this study, we compare the temporal and spatial expression of these two genes to determine whether the changing expression patterns of these genes are consistent with interactions between the two molecules. We show that changes in Bmp-4 expression precede changes in Msx-1 expression. At embryonic day (E)10.5-E11.0, expression patterns are consistent with BMP4 from the epithelium, inducing or maintaining Msx-1 in underlying mesenchyme. At E11.5, Bmp-4 expression shifts from epithelium to mesenchyme and is rapidly followed by localised up-regulation of Msx-1 expression at the sites of Bmp-4 expression. Using cultured explants of developing mandibles, we confirm that exogenous BMP4 is capable of replacing the endogenous source in epithelium and inducing Msx-1 gene expression in mesenchyme. By using noggin, a BMP inhibitor, we show that endogenous Msx-1 expression can be inhibited at E10.5 and E11.5, providing the first evidence that endogenous Bmp-4 from the epithelium is responsible for regulating the early spatial expression of Msx-1. We also show that the mesenchymal shift in Bmp-4 is responsible for up-regulating Msx-1 specifically at the sites of future tooth formation. Thus, we establish that a reciprocal series of interactions act to restrict expression of both genes to future

  8. The drug target genes show higher evolutionary conservation than non-target genes.

    Science.gov (United States)

    Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie

    2016-01-26

    Although evidence indicates that drug target genes share some common evolutionary features, there have been few studies analyzing evolutionary features of drug targets from an overall level. Therefore, we conducted an analysis which aimed to investigate the evolutionary characteristics of drug target genes. We compared the evolutionary conservation between human drug target genes and non-target genes by combining both the evolutionary features and network topological properties in human protein-protein interaction network. The evolution rate, conservation score and the percentage of orthologous genes of 21 species were included in our study. Meanwhile, four topological features including the average shortest path length, betweenness centrality, clustering coefficient and degree were considered for comparison analysis. Then we got four results as following: compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure including higher degrees, betweenness centrality, clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.

  9. Analyzing the genes related to Alzheimer's disease via a network and pathway-based approach.

    Science.gov (United States)

    Hu, Yan-Shi; Xin, Juncai; Hu, Ying; Zhang, Lei; Wang, Ju

    2017-04-27

    Our understanding of the molecular mechanisms underlying Alzheimer's disease (AD) remains incomplete. Previous studies have revealed that genetic factors provide a significant contribution to the pathogenesis and development of AD. In the past years, numerous genes implicated in this disease have been identified via genetic association studies on candidate genes or at the genome-wide level. However, in many cases, the roles of these genes and their interactions in AD are still unclear. A comprehensive and systematic analysis focusing on the biological function and interactions of these genes in the context of AD will therefore provide valuable insights to understand the molecular features of the disease. In this study, we collected genes potentially associated with AD by screening publications on genetic association studies deposited in PubMed. The major biological themes linked with these genes were then revealed by function and biochemical pathway enrichment analysis, and the relation between the pathways was explored by pathway crosstalk analysis. Furthermore, the network features of these AD-related genes were analyzed in the context of human interactome and an AD-specific network was inferred using the Steiner minimal tree algorithm. We compiled 430 human genes reported to be associated with AD from 823 publications. Biological theme analysis indicated that the biological processes and biochemical pathways related to neurodevelopment, metabolism, cell growth and/or survival, and immunology were enriched in these genes. Pathway crosstalk analysis then revealed that the significantly enriched pathways could be grouped into three interlinked modules-neuronal and metabolic module, cell growth/survival and neuroendocrine pathway module, and immune response-related module-indicating an AD-specific immune-endocrine-neuronal regulatory network. Furthermore, an AD-specific protein network was inferred and novel genes potentially associated with AD were identified. By

  10. PINTA: a web server for network-based gene prioritization from expression data

    DEFF Research Database (Denmark)

    Nitsch, Daniela; Tranchevent, Léon-Charles; Goncalves, Joana P.

    2011-01-01

    PINTA (available at http://www.esat.kuleuven.be/ pinta/; this web site is free and open to all users and there is no login requirement) is a web resource for the prioritization of candidate genes based on the differential expression of their neighborhood in a genome-wide protein–protein interaction...

  11. Zn(II)-dipicolylamine-based metallo-lipids as novel non-viral gene vectors.

    Science.gov (United States)

    Su, Rong-Chuan; Liu, Qiang; Yi, Wen-Jing; Zhao, Zhi-Gang

    2017-08-01

    In this study, a series of Zn(II)-dipicolylamine (Zn-DPA) based cationic lipids bearing different hydrophobic tails (long chains, α-tocopherol, cholesterol or diosgenin) were synthesized. Structure-activity relationship (SAR) of these lipids was studied in detail by investigating the effects of several structural aspects including the type of hydrophobic tails, the chain length and saturation degree. In addition, several assays were used to study their interactions with plasmid DNA, and results reveal that these lipids could condense DNA into nanosized particles with appropriate size and zeta-potentials. MTT-based cell viability assays showed that lipoplexes 5 had low cytotoxicity. The in vitro gene transfection studies showed the hydrophobic tails clearly affected the TE, and hexadecanol-containing lipid 5b gives the best TE, which was 2.2 times higher than bPEI 25k in the presence of 10% serum. The results not only demonstrate that these lipids might be promising non-viral gene vectors, but also afford us clues for further optimization of lipidic gene delivery materials.

  12. Differential gene expression in foxtail millet during incompatible interaction with Uromyces setariae-italicae.

    Directory of Open Access Journals (Sweden)

    Zhi Yong Li

    Full Text Available Foxtail millet (Setaria italica is an important food and fodder grain crop that is grown for human consumption. Production of this species is affected by several plant diseases, such as rust. The cultivar Shilixiang has been identified as resistant to the foxtail millet rust pathogen, Uromyces setariae-italicae. In order to identify signaling pathways and genes related to the plant's defense mechanisms against rust, the Shilixiang cultivar was used to construct a digital gene expression (DGE library during the interaction of foxtail millet with U. setariae-italicae. In this study, we determined the most abundant differentially expressed signaling pathways of up-regulated genes in foxtail millet and identified significantly up-regulated genes. Finally, quantitative real-time polymerase chain reaction (qRT-PCR analysis was used to analyze the expression of nine selected genes, and the patterns observed agreed well with DGE analysis. Expression levels of the genes were also compared between a resistant cultivar Shilixiang and a susceptible cultivar Yugu-1, and the result indicated that expression level of Shilixiang is higher than that of Yugu-1. This study reveals the relatively comprehensive mechanisms of rust-responsive transcription in foxtail millet.

  13. Differential expression and interaction of host factors augment HIV-1 gene expression in neonatal mononuclear cells

    International Nuclear Information System (INIS)

    Sundaravaradan, Vasudha; Mehta, Roshni; Harris, David T.; Zack, Jerome A.; Ahmad, Nafees

    2010-01-01

    We have previously shown a higher level of HIV-1 replication and gene expression in neonatal (cord) blood mononuclear cells (CBMC) compared with adult blood cells (PBMC), which could be due to differential expression of host factors. We performed the gene expression profile of CBMC and PBMC and found that 8013 genes were expressed at higher levels in CBMC than PBMC and 8028 genes in PBMC than CBMC, including 1181 and 1414 genes upregulated after HIV-1 infection in CBMC and PBMC, respectively. Several transcription factors (NF-κB, E2F, HAT-1, TFIIE, Cdk9, Cyclin T1), signal transducers (STAT3, STAT5A) and cytokines (IL-1β, IL-6, IL-10) were upregulated in CBMC than PBMC, which are known to influence HIV-1 replication. In addition, a repressor of HIV-1 transcription, YY1, was down regulated in CBMC than PBMC and several matrix metalloproteinase (MMP-7, -12, -14) were significantly upregulated in HIV-1 infected CBMC than PBMC. Furthermore, we show that CBMC nuclear extracts interacted with a higher extent to HIV-1 LTR cis-acting sequences, including NF-κB, NFAT, AP1 and NF-IL6 compared with PBMC nuclear extracts and retroviral based short hairpin RNA (shRNA) for STAT3 and IL-6 down regulated their own and HIV-1 gene expression, signifying that these factors influenced differential HIV-1 gene expression in CBMC than PBMC.

  14. Targeted delivery of genes to endothelial cells and cell- and gene-based therapy in pulmonary vascular diseases.

    Science.gov (United States)

    Suen, Colin M; Mei, Shirley H J; Kugathasan, Lakshmi; Stewart, Duncan J

    2013-10-01

    Pulmonary arterial hypertension (PAH) is a devastating disease that, despite significant advances in medical therapies over the last several decades, continues to have an extremely poor prognosis. Gene therapy is a method to deliver therapeutic genes to replace defective or mutant genes or supplement existing cellular processes to modify disease. Over the last few decades, several viral and nonviral methods of gene therapy have been developed for preclinical PAH studies with varying degrees of efficacy. However, these gene delivery methods face challenges of immunogenicity, low transduction rates, and nonspecific targeting which have limited their translation to clinical studies. More recently, the emergence of regenerative approaches using stem and progenitor cells such as endothelial progenitor cells (EPCs) and mesenchymal stem cells (MSCs) have offered a new approach to gene therapy. Cell-based gene therapy is an approach that augments the therapeutic potential of EPCs and MSCs and may deliver on the promise of reversal of established PAH. These new regenerative approaches have shown tremendous potential in preclinical studies; however, large, rigorously designed clinical studies will be necessary to evaluate clinical efficacy and safety. © 2013 American Physiological Society. Compr Physiol 3:1749-1779, 2013.

  15. Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Jing; Ma, Zihao; Carr, Steven A.; Mertins, Philipp; Zhang, Hui; Zhang, Zhen; Chan, Daniel W.; Ellis, Matthew J. C.; Townsend, R. Reid; Smith, Richard D.; McDermott, Jason E.; Chen, Xian; Paulovich, Amanda G.; Boja, Emily S.; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Rodland, Karin D.; Liebler, Daniel C.; Zhang, Bing

    2016-11-11

    Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies

  16. Characteristics and Validation Techniques for PCA-Based Gene-Expression Signatures

    Directory of Open Access Journals (Sweden)

    Anders E. Berglund

    2017-01-01

    Full Text Available Background. Many gene-expression signatures exist for describing the biological state of profiled tumors. Principal Component Analysis (PCA can be used to summarize a gene signature into a single score. Our hypothesis is that gene signatures can be validated when applied to new datasets, using inherent properties of PCA. Results. This validation is based on four key concepts. Coherence: elements of a gene signature should be correlated beyond chance. Uniqueness: the general direction of the data being examined can drive most of the observed signal. Robustness: if a gene signature is designed to measure a single biological effect, then this signal should be sufficiently strong and distinct compared to other signals within the signature. Transferability: the derived PCA gene signature score should describe the same biology in the target dataset as it does in the training dataset. Conclusions. The proposed validation procedure ensures that PCA-based gene signatures perform as expected when applied to datasets other than those that the signatures were trained upon. Complex signatures, describing multiple independent biological components, are also easily identified.

  17. Using FlyBase, a Database of Drosophila Genes and Genomes.

    Science.gov (United States)

    Marygold, Steven J; Crosby, Madeline A; Goodman, Joshua L

    2016-01-01

    For nearly 25 years, FlyBase (flybase.org) has provided a freely available online database of biological information about Drosophila species, focusing on the model organism D. melanogaster. The need for a centralized, integrated view of Drosophila research has never been greater as advances in genomic, proteomic, and high-throughput technologies add to the quantity and diversity of available data and resources.FlyBase has taken several approaches to respond to these changes in the research landscape. Novel report pages have been generated for new reagent types and physical interaction data; Drosophila models of human disease are now represented and showcased in dedicated Human Disease Model Reports; other integrated reports have been established that bring together related genes, datasets, or reagents; Gene Reports have been revised to improve access to new data types and to highlight functional data; links to external sites have been organized and expanded; and new tools have been developed to display and interrogate all these data, including improved batch processing and bulk file availability. In addition, several new community initiatives have served to enhance interactions between researchers and FlyBase, resulting in direct user contributions and improved feedback.This chapter provides an overview of the data content, organization, and available tools within FlyBase, focusing on recent improvements. We hope it serves as a guide for our diverse user base, enabling efficient and effective exploration of the database and thereby accelerating research discoveries.

  18. The Omics Dashboard for interactive exploration of gene-expression data.

    Science.gov (United States)

    Paley, Suzanne; Parker, Karen; Spaulding, Aaron; Tomb, Jean-Francois; O'Maille, Paul; Karp, Peter D

    2017-12-01

    The Omics Dashboard is a software tool for interactive exploration and analysis of gene-expression datasets. The Omics Dashboard is organized as a hierarchy of cellular systems. At the highest level of the hierarchy the Dashboard contains graphical panels depicting systems such as biosynthesis, energy metabolism, regulation and central dogma. Each of those panels contains a series of X-Y plots depicting expression levels of subsystems of that panel, e.g. subsystems within the central dogma panel include transcription, translation and protein maturation and folding. The Dashboard presents a visual read-out of the expression status of cellular systems to facilitate a rapid top-down user survey of how all cellular systems are responding to a given stimulus, and to enable the user to quickly view the responses of genes within specific systems of interest. Although the Dashboard is complementary to traditional statistical methods for analysis of gene-expression data, we show how it can detect changes in gene expression that statistical techniques may overlook. We present the capabilities of the Dashboard using two case studies: the analysis of lipid production for the marine alga Thalassiosira pseudonana, and an investigation of a shift from anaerobic to aerobic growth for the bacterium Escherichia coli. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Mapping of Wnt-Frizzled interactions by multiplex CRISPR targeting of receptor gene families.

    Science.gov (United States)

    Voloshanenko, Oksana; Gmach, Philipp; Winter, Jan; Kranz, Dominique; Boutros, Michael

    2017-11-01

    Signaling pathway modules are often encoded by several closely related paralogous genes that can have redundant roles and are therefore difficult to analyze by loss-of-function analysis. A typical example is the Wnt signaling pathway, which in mammals is mediated by 19 Wnt ligands that can bind to 10 Frizzled (FZD) receptors. Although significant progress in understanding Wnt-FZD receptor interactions has been made in recent years, tools to generate systematic interaction maps have been largely lacking. Here we generated cell lines with multiplex mutant alleles of FZD1 , FZD2 , and FZD7 and demonstrate that these cells are unresponsive to canonical Wnt ligands. Subsequently, we performed genetic rescue experiments with combinations of FZDs and canonical Wnts to create a functional ligand-receptor interaction map. These experiments showed that whereas several Wnt ligands, such as Wnt3a, induce signaling through a broad spectrum of FZD receptors, others, such as Wnt8a, act through a restricted set of FZD genes. Together, our results map functional interactions of FZDs and 10 Wnt ligands and demonstrate how multiplex targeting by clustered regularly interspaced short palindromic repeat (CRISPR)/Cas9 can be used to systematically elucidate the functions of multigene families.-Voloshanenko, O., Gmach, P., Winter, J., Kranz, D., Boutros, M. Mapping of Wnt-Frizzled interactions by multiplex CRISPR targeting of receptor gene families. © The Author(s).

  20. Arabidopsis mRNA polyadenylation machinery: comprehensive analysis of protein-protein interactions and gene expression profiling

    Directory of Open Access Journals (Sweden)

    Mo Min

    2008-05-01

    Full Text Available Abstract Background The polyadenylation of mRNA is one of the critical processing steps during expression of almost all eukaryotic genes. It is tightly integrated with transcription, particularly its termination, as well as other RNA processing events, i.e. capping and splicing. The poly(A tail protects the mRNA from unregulated degradation, and it is required for nuclear export and translation initiation. In recent years, it has been demonstrated that the polyadenylation process is also involved in the regulation of gene expression. The polyadenylation process requires two components, the cis-elements on the mRNA and a group of protein factors that recognize the cis-elements and produce the poly(A tail. Here we report a comprehensive pairwise protein-protein interaction mapping and gene expression profiling of the mRNA polyadenylation protein machinery in Arabidopsis. Results By protein sequence homology search using human and yeast polyadenylation factors, we identified 28 proteins that may be components of Arabidopsis polyadenylation machinery. To elucidate the protein network and their functions, we first tested their protein-protein interaction profiles. Out of 320 pair-wise protein-protein interaction assays done using the yeast two-hybrid system, 56 (~17% showed positive interactions. 15 of these interactions were further tested, and all were confirmed by co-immunoprecipitation and/or in vitro co-purification. These interactions organize into three distinct hubs involving the Arabidopsis polyadenylation factors. These hubs are centered around AtCPSF100, AtCLPS, and AtFIPS. The first two are similar to complexes seen in mammals, while the third one stands out as unique to plants. When comparing the gene expression profiles extracted from publicly available microarray datasets, some of the polyadenylation related genes showed tissue-specific expression, suggestive of potential different polyadenylation complex configurations. Conclusion An

  1. Animal model for schizophrenia that reflects gene-environment interactions.

    Science.gov (United States)

    Nagai, Taku; Ibi, Daisuke; Yamada, Kiyofumi

    2011-01-01

    Schizophrenia is a devastating psychiatric disorder that impairs mental and social functioning and affects approximately 1% of the population worldwide. Genetic susceptibility factors for schizophrenia have recently been reported, some of which are known to play a role in neurodevelopment; these include neuregulin-1, dysbindin, and disrupted-in-schizophrenia 1 (DISC1). Moreover, epidemiologic studies suggest that environmental insults, such as prenatal infection and perinatal complication, are involved in the development of schizophrenia. The possible interaction between environment and genetic susceptibility factors, especially during neurodevelopment, is proposed as a promising disease etiology of schizophrenia. Polyriboinosinic-polyribocytidilic acid (polyI : C) is a synthetic analogue of double-stranded RNA that leads to the pronounced but time-limited production of pro-inflammatory cytokines. Maternal immune activation by polyI : C exposure in rodents is known to precipitate a wide spectrum of behavioral, cognitive, and pharmacological abnormalities in adult offspring. Recently, we have reported that neonatal injection of polyI : C in mice results in schizophrenia-like behavioral alterations in adulthood. In this review, we show how gene-environment interactions during neurodevelopment result in phenotypic changes in adulthood by injecting polyI : C into transgenic mice that express a dominant-negative form of human DISC1 (DN-DISC1). Our findings suggest that polyI : C-treated DN-DISC1 mice are a well-validated animal model for schizophrenia that reflects gene-environment interactions.

  2. Differential reconstructed gene interaction networks for deriving toxicity threshold in chemical risk assessment

    OpenAIRE

    Yang, Yi; Maxwell, Andrew; Zhang, Xiaowei; Wang, Nan; Perkins, Edward J; Zhang, Chaoyang; Gong, Ping

    2013-01-01

    Background Pathway alterations reflected as changes in gene expression regulation and gene interaction can result from cellular exposure to toxicants. Such information is often used to elucidate toxicological modes of action. From a risk assessment perspective, alterations in biological pathways are a rich resource for setting toxicant thresholds, which may be more sensitive and mechanism-informed than traditional toxicity endpoints. Here we developed a novel differential networks (DNs) appro...

  3. Hippocampal gene expression patterns in oxytocin male knockout mice are related to impaired social interaction.

    Science.gov (United States)

    Lazzari, Virginia Meneghini; Zimmermann-Peruzatto, Josi Maria; Agnes, Grasiela; Becker, Roberta Oriques; de Moura, Ana Carolina; Almeida, Silvana; Guedes, Renata Padilha; Giovenardi, Marcia

    2017-11-02

    Social interaction between animals is crucial for the survival and life in groups. It is well demonstrated that oxytocin (OT) and vasopressin (AVP) play critical roles in the regulation of social behaviors in mammals, however, other neurotransmitters and hormones are involved in the brain circuitry related to these behaviors. The present study aimed to investigate the gene expression of neurotransmitter receptors in the brain of OT knockout (OTKO) male mice. In this study, we evaluated the expression levels of the OT receptor (Oxtr), AVP receptors 1a and 1b (Avpr1a; Avpr1b), dopamine receptor 2 (Drd2), and the estrogen receptors alpha and beta (Esr1; Esr2) genes in the hippocampus (HPC), olfactory bulb (OB), hypothalamus (HPT) and prefrontal cortex (PFC). AVP gene (Avp) expression was analyzed in the HPT. Gene expression results were discussed regarding to social interaction and sexual behavior findings. Additionally, we analyzed the influence of OT absence on the Avp mRNA expression levels in the HPT. RNA extraction and cDNAs synthesis followed by quantitative polymerase chain reaction were performed for gene expression determination. Results were calculated with the 2 -ΔΔCt method. Our main finding was that HPC is more susceptible to gene expression changes due to the lack of OT. OTKOs exhibited decreased expression of Drd2 and Avpr1b, but increased expression of Oxtr in the HPC. In the PFC, Esr2 was increased. In the HPT, there was a reduced Avp expression in the OTKO group. No differences were detected in the OB and HPT. Despite these changes in gene expression, sexual behavior was not affected. However, OTKO showed higher social investigation and lower aggressive performance than wild-type mice. Our data highlight the importance of OT for proper gene expression of neurotransmitter receptors related to the regulation of social interaction in male mice. Copyright © 2017. Published by Elsevier B.V.

  4. Variable Persister Gene Interactions with (pppGpp for Persister Formation in Escherichia coli

    Directory of Open Access Journals (Sweden)

    Shuang Liu

    2017-09-01

    Full Text Available Persisters comprise a group of phenotypically heterogeneous metabolically quiescent bacteria with multidrug tolerance and contribute to the recalcitrance of chronic infections. Although recent work has shown that toxin-antitoxin (TA system HipAB depends on stringent response effector (pppGppin persister formation, whether other persister pathways are also dependent on stringent response has not been explored. Here we examined the relationship of (pppGpp with 15 common persister genes (dnaK, clpB, rpoS, pspF, tnaA, sucB, ssrA, smpB, recA, umuD, uvrA, hipA, mqsR, relE, dinJ using Escherichia coli as a model. By comparing the persister levels of wild type with their single gene knockout and double knockout mutants with relA, we divided their interactions into five types, namely A “dependent” (dnaK, recA, B “positive reinforcement” (rpoS, pspF, ssrA, recA, C “antagonistic” (clpB, sucB, umuD, uvrA, hipA, mqsR, relE, dinJ, D “epistasis” (clpB, rpoS, tnaA, ssrA, smpB, hipA, and E “irrelevant” (dnaK, clpB, rpoS, tnaA, sucB, smpB, umuD, uvrA, hipA, mqsR, relE, dinJ. We found that the persister gene interactions are intimately dependent on bacterial culture age, cell concentrations (diluted versus undiluted culture, and drug classifications, where the same gene may belong to different groups with varying antibiotics, culture age or cell concentrations. Together, this study represents the first attempt to systematically characterize the intricate relationships among the different mechanisms of persistence and as such provide new insights into the complexity of the persistence phenomenon at the level of persister gene network interactions.

  5. A Shortest-Path-Based Method for the Analysis and Prediction of Fruit-Related Genes in Arabidopsis thaliana.

    Science.gov (United States)

    Zhu, Liucun; Zhang, Yu-Hang; Su, Fangchu; Chen, Lei; Huang, Tao; Cai, Yu-Dong

    2016-01-01

    Biologically, fruits are defined as seed-bearing reproductive structures in angiosperms that develop from the ovary. The fertilization, development and maturation of fruits are crucial for plant reproduction and are precisely regulated by intrinsic genetic regulatory factors. In this study, we used Arabidopsis thaliana as a model organism and attempted to identify novel genes related to fruit-associated biological processes. Specifically, using validated genes, we applied a shortest-path-based method to identify several novel genes in a large network constructed using the protein-protein interactions observed in Arabidopsis thaliana. The described analyses indicate that several of the discovered genes are associated with fruit fertilization, development and maturation in Arabidopsis thaliana.

  6. A relative variation-based method to unraveling gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Yali Wang

    Full Text Available Gene regulatory network (GRN reconstruction is essential in understanding the functioning and pathology of a biological system. Extensive models and algorithms have been developed to unravel a GRN. The DREAM project aims to clarify both advantages and disadvantages of these methods from an application viewpoint. An interesting yet surprising observation is that compared with complicated methods like those based on nonlinear differential equations, etc., methods based on a simple statistics, such as the so-called Z-score, usually perform better. A fundamental problem with the Z-score, however, is that direct and indirect regulations can not be easily distinguished. To overcome this drawback, a relative expression level variation (RELV based GRN inference algorithm is suggested in this paper, which consists of three major steps. Firstly, on the basis of wild type and single gene knockout/knockdown experimental data, the magnitude of RELV of a gene is estimated. Secondly, probability for the existence of a direct regulation from a perturbed gene to a measured gene is estimated, which is further utilized to estimate whether a gene can be regulated by other genes. Finally, the normalized RELVs are modified to make genes with an estimated zero in-degree have smaller RELVs in magnitude than the other genes, which is used afterwards in queuing possibilities of the existence of direct regulations among genes and therefore leads to an estimate on the GRN topology. This method can in principle avoid the so-called cascade errors under certain situations. Computational results with the Size 100 sub-challenges of DREAM3 and DREAM4 show that, compared with the Z-score based method, prediction performances can be substantially improved, especially the AUPR specification. Moreover, it can even outperform the best team of both DREAM3 and DREAM4. Furthermore, the high precision of the obtained most reliable predictions shows that the suggested algorithm may be

  7. Msx homeobox gene family and craniofacial development.

    Science.gov (United States)

    Alappat, Sylvia; Zhang, Zun Yi; Chen, Yi Ping

    2003-12-01

    Vertebrate Msx genes are unlinked, homeobox-containing genes that bear homology to the Drosophila muscle segment homeobox gene. These genes are expressed at multiple sites of tissue-tissue interactions during vertebrate embryonic development. Inductive interactions mediated by the Msx genes are essential for normal craniofacial, limb and ectodermal organ morphogenesis, and are also essential to survival in mice, as manifested by the phenotypic abnormalities shown in knockout mice and in humans. This review summarizes studies on the expression, regulation, and functional analysis of Msx genes that bear relevance to craniofacial development in humans and mice. Key words: Msx genes, craniofacial, tooth, cleft palate, suture, development, transcription factor, signaling molecule.

  8. Gene-Environment Interaction in Parkinson's Disease: Coffee, ADORA2A, and CYP1A2.

    Science.gov (United States)

    Chuang, Yu-Hsuan; Lill, Christina M; Lee, Pei-Chen; Hansen, Johnni; Lassen, Christina F; Bertram, Lars; Greene, Naomi; Sinsheimer, Janet S; Ritz, Beate

    2016-01-01

    Drinking caffeinated coffee has been reported to provide protection against Parkinson's disease (PD). Caffeine is an adenosine A2A receptor (encoded by the gene ADORA2A) antagonist that increases dopaminergic neurotransmission and Cytochrome P450 1A2 (gene: CYP1A2) metabolizes caffeine; thus, gene polymorphisms in ADORA2A and CYP1A2 may influence the effect coffee consumption has on PD risk. In a population-based case-control study (PASIDA) in Denmark (1,556 PD patients and 1,606 birth year- and gender-matched controls), we assessed interactions between lifetime coffee consumption and 3 polymorphisms in ADORA2A and CYP1A2 for all subjects, and incident and prevalent PD cases separately using logistic regression models. We also conducted a meta-analysis combining our results with those from previous studies. We estimated statistically significant interactions for ADORA2A rs5760423 and heavy vs. light coffee consumption in incident (OR interaction = 0.66 [95% CI 0.46-0.94], p = 0.02) but not prevalent PD. We did not observe interactions for CYP1A2 rs762551 and rs2472304 in incident or prevalent PD. In meta-analyses, PD associations with daily coffee consumption were strongest among carriers of variant alleles in both ADORA2A and CYP1A2. We corroborated results from a previous report that described interactions between ADORA2A and CYP1A2 polymorphisms and coffee consumption. Our results also suggest that survivor bias may affect results of studies that enroll prevalent PD cases. © 2017 S. Karger AG, Basel.

  9. Gene-Diet Interaction and Precision Nutrition in Obesity

    Directory of Open Access Journals (Sweden)

    Yoriko Heianza

    2017-04-01

    Full Text Available The rapid rise of obesity during the past decades has coincided with a profound shift of our living environment, including unhealthy dietary patterns, a sedentary lifestyle, and physical inactivity. Genetic predisposition to obesity may have interacted with such an obesogenic environment in determining the obesity epidemic. Growing studies have found that changes in adiposity and metabolic response to low-calorie weight loss diets might be modified by genetic variants related to obesity, metabolic status and preference to nutrients. This review summarized data from recent studies of gene-diet interactions, and discussed integration of research of metabolomics and gut microbiome, as well as potential application of the findings in precision nutrition.

  10. Reranking candidate gene models with cross-species comparison for improved gene prediction

    Directory of Open Access Journals (Sweden)

    Pereira Fernando CN

    2008-10-01

    Full Text Available Abstract Background Most gene finders score candidate gene models with state-based methods, typically HMMs, by combining local properties (coding potential, splice donor and acceptor patterns, etc. Competing models with similar state-based scores may be distinguishable with additional information. In particular, functional and comparative genomics datasets may help to select among competing models of comparable probability by exploiting features likely to be associated with the correct gene models, such as conserved exon/intron structure or protein sequence features. Results We have investigated the utility of a simple post-processing step for selecting among a set of alternative gene models, using global scoring rules to rerank competing models for more accurate prediction. For each gene locus, we first generate the K best candidate gene models using the gene finder Evigan, and then rerank these models using comparisons with putative orthologous genes from closely-related species. Candidate gene models with lower scores in the original gene finder may be selected if they exhibit strong similarity to probable orthologs in coding sequence, splice site location, or signal peptide occurrence. Experiments on Drosophila melanogaster demonstrate that reranking based on cross-species comparison outperforms the best gene models identified by Evigan alone, and also outperforms the comparative gene finders GeneWise and Augustus+. Conclusion Reranking gene models with cross-species comparison improves gene prediction accuracy. This straightforward method can be readily adapted to incorporate additional lines of evidence, as it requires only a ranked source of candidate gene models.

  11. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods.

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo

    2014-06-01

    In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both

  12. In vivo protein-DNA interactions at the β-globin gene locus

    International Nuclear Information System (INIS)

    Tohru Ikuta; Yuet Wai Kan

    1991-01-01

    The authors have investigated in vivo protein-DNA interactions in the β-globin gene locus by dimethyl sulfate (DMS) footprinting in K562 cells, which express var-epsilon- and γ-globin but not β-globin. In the locus control region, hypersensitive site 2 (HS-2) exhibited footprints in several putative protein binding motifs. HS-3 was not footprinted. The β promoter was also not footprinted, while extensive footprints were observed in the promoter of the active γ-globin gene. No footprints were seen in the A γ and β3' enhancers. With several motifs, additional protein interactions and alterations in binding patterns occurred with hemin induction. In HeLa cells, some footprints were observed in some of the motifs in HS-2, compatible with the finding that HS-2 has some enhancer function in HeLa cells, albeit much weaker than its activity in K562 cells. No footprint was seen in B lymphocytes. In vivo footprinting is a useful method for studying relevant protein-DNA interactions in erythroid cells

  13. A study for association and interaction analysis to metabolic syndrome and the ESR1 gene on cardiovascular autonomic neuropathy in a Chinese Han population.

    Science.gov (United States)

    Zeng, Fangfang; Zhou, Linuo; Tang, Zihui

    2016-01-01

    The aim of this study was to investigate the association and interaction of metabolic syndrome (MetS) and estrogen receptor alpha 1 (ESR1) gene polymorphisms on cardiovascular autonomic neuropathy (CAN). A large-scale, population-based study was conducted to analyze the interaction of MetS and ESR1 gene polymorphisms to CAN, including a total of 1977 Chinese subjects. The most common studied single nucleotide polymorphism of ESR1 gene-rs9340799, was genotyped. Multiple logistic regression (MLR) was performed to evaluate the interaction effect of environmental variables and gene polymorphisms. Interaction on an additive scale can be calculated by using the relative excess risk due to interaction (RERI), the proportion attributable to interaction (AP), and the synergy index (S). After controlling potential confounders, MLR showed that significant association between MetS and CAN (p interaction was estimated by using RETI = 0.396 (95 % CI 0.262 to 0.598), AP = 0.216 (95 % CI -0.784 to 1.216) and S = 1.906 (95 % CI 0.905 to 4.015). The present findings suggest that MetS is significantly associated with CAN and provide evidence for the hypothesis that MetS and ESR1 gene polymorphism (rs9340799) have interactive effects on CAN. ClinicalTrials gov Identifier NCT02461342.

  14. An interaction map of circulating metabolites, immune gene networks, and their genetic regulation.

    Science.gov (United States)

    Nath, Artika P; Ritchie, Scott C; Byars, Sean G; Fearnley, Liam G; Havulinna, Aki S; Joensuu, Anni; Kangas, Antti J; Soininen, Pasi; Wennerström, Annika; Milani, Lili; Metspalu, Andres; Männistö, Satu; Würtz, Peter; Kettunen, Johannes; Raitoharju, Emma; Kähönen, Mika; Juonala, Markus; Palotie, Aarno; Ala-Korpela, Mika; Ripatti, Samuli; Lehtimäki, Terho; Abraham, Gad; Raitakari, Olli; Salomaa, Veikko; Perola, Markus; Inouye, Michael

    2017-08-01

    Immunometabolism plays a central role in many cardiometabolic diseases. However, a robust map of immune-related gene networks in circulating human cells, their interactions with metabolites, and their genetic control is still lacking. Here, we integrate blood transcriptomic, metabolomic, and genomic profiles from two population-based cohorts (total N = 2168), including a subset of individuals with matched multi-omic data at 7-year follow-up. We identify topologically replicable gene networks enriched for diverse immune functions including cytotoxicity, viral response, B cell, platelet, neutrophil, and mast cell/basophil activity. These immune gene modules show complex patterns of association with 158 circulating metabolites, including lipoprotein subclasses, lipids, fatty acids, amino acids, small molecules, and CRP. Genome-wide scans for module expression quantitative trait loci (mQTLs) reveal five modules with mQTLs that have both cis and trans effects. The strongest mQTL is in ARHGEF3 (rs1354034) and affects a module enriched for platelet function, independent of platelet counts. Modules of mast cell/basophil and neutrophil function show temporally stable metabolite associations over 7-year follow-up, providing evidence that these modules and their constituent gene products may play central roles in metabolic inflammation. Furthermore, the strongest mQTL in ARHGEF3 also displays clear temporal stability, supporting widespread trans effects at this locus. This study provides a detailed map of natural variation at the blood immunometabolic interface and its genetic basis, and may facilitate subsequent studies to explain inter-individual variation in cardiometabolic disease.

  15. Synergistic interactions between Drosophila orthologues of genes spanned by de novo human CNVs support multiple-hit models of autism.

    Science.gov (United States)

    Grice, Stuart J; Liu, Ji-Long; Webber, Caleb

    2015-03-01

    Autism spectrum disorders (ASDs) are highly heritable and characterised by deficits in social interaction and communication, as well as restricted and repetitive behaviours. Although a number of highly penetrant ASD gene variants have been identified, there is growing evidence to support a causal role for combinatorial effects arising from the contributions of multiple loci. By examining synaptic and circadian neurological phenotypes resulting from the dosage variants of unique human:fly orthologues in Drosophila, we observe numerous synergistic interactions between pairs of informatically-identified candidate genes whose orthologues are jointly affected by large de novo copy number variants (CNVs). These CNVs were found in the genomes of individuals with autism, including a patient carrying a 22q11.2 deletion. We first demonstrate that dosage alterations of the unique Drosophila orthologues of candidate genes from de novo CNVs that harbour only a single candidate gene display neurological defects similar to those previously reported in Drosophila models of ASD-associated variants. We then considered pairwise dosage changes within the set of orthologues of candidate genes that were affected by the same single human de novo CNV. For three of four CNVs with complete orthologous relationships, we observed significant synergistic effects following the simultaneous dosage change of gene pairs drawn from a single CNV. The phenotypic variation observed at the Drosophila synapse that results from these interacting genetic variants supports a concordant phenotypic outcome across all interacting gene pairs following the direction of human gene copy number change. We observe both specificity and transitivity between interactors, both within and between CNV candidate gene sets, supporting shared and distinct genetic aetiologies. We then show that different interactions affect divergent synaptic processes, demonstrating distinct molecular aetiologies. Our study illustrates

  16. The properties of genome conformation and spatial gene interaction and regulation networks of normal and malignant human cell types.

    Directory of Open Access Journals (Sweden)

    Zheng Wang

    Full Text Available The spatial conformation of a genome plays an important role in the long-range regulation of genome-wide gene expression and methylation, but has not been extensively studied due to lack of genome conformation data. The recently developed chromosome conformation capturing techniques such as the Hi-C method empowered by next generation sequencing can generate unbiased, large-scale, high-resolution chromosomal interaction (contact data, providing an unprecedented opportunity to investigate the spatial structure of a genome and its applications in gene regulation, genomics, epigenetics, and cell biology. In this work, we conducted a comprehensive, large-scale computational analysis of this new stream of genome conformation data generated for three different human leukemia cells or cell lines by the Hi-C technique. We developed and applied a set of bioinformatics methods to reliably generate spatial chromosomal contacts from high-throughput sequencing data and to effectively use them to study the properties of the genome structures in one-dimension (1D and two-dimension (2D. Our analysis demonstrates that Hi-C data can be effectively applied to study tissue-specific genome conformation, chromosome-chromosome interaction, chromosomal translocations, and spatial gene-gene interaction and regulation in a three-dimensional genome of primary tumor cells. Particularly, for the first time, we constructed genome-scale spatial gene-gene interaction network, transcription factor binding site (TFBS - TFBS interaction network, and TFBS-gene interaction network from chromosomal contact information. Remarkably, all these networks possess the properties of scale-free modular networks.

  17. Thermo-Regulation of Genes Mediating Motility and Plant Interactions in Pseudomonas syringae

    Science.gov (United States)

    Hockett, Kevin L.; Burch, Adrien Y.; Lindow, Steven E.

    2013-01-01

    Pseudomonas syringae is an important phyllosphere colonist that utilizes flagellum-mediated motility both as a means to explore leaf surfaces, as well as to invade into leaf interiors, where it survives as a pathogen. We found that multiple forms of flagellum-mediated motility are thermo-suppressed, including swarming and swimming motility. Suppression of swarming motility occurs between 28° and 30°C, which coincides with the optimal growth temperature of P. syringae. Both fliC (encoding flagellin) and syfA (encoding a non-ribosomal peptide synthetase involved in syringafactin biosynthesis) were suppressed with increasing temperature. RNA-seq revealed 1440 genes of the P. syringae genome are temperature sensitive in expression. Genes involved in polysaccharide synthesis and regulation, phage and IS elements, type VI secretion, chemosensing and chemotaxis, translation, flagellar synthesis and motility, and phytotoxin synthesis and transport were generally repressed at 30°C, while genes involved in transcriptional regulation, quaternary ammonium compound metabolism and transport, chaperone/heat shock proteins, and hypothetical genes were generally induced at 30°C. Deletion of flgM, a key regulator in the transition from class III to class IV gene expression, led to elevated and constitutive expression of fliC regardless of temperature, but did not affect thermo-regulation of syfA. This work highlights the importance of temperature in the biology of P. syringae, as many genes encoding traits important for plant-microbe interactions were thermo-regulated. PMID:23527276

  18. Gene-environment interaction involving recently identified colorectal cancer susceptibility loci

    Science.gov (United States)

    Kantor, Elizabeth D.; Hutter, Carolyn M.; Minnier, Jessica; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cotterchio, Michelle; Du, Mengmeng; Duggan, David; Fuchs, Charles S.; Giovannucci, Edward L.; Gong, Jian; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Jenkins, Mark A.; Jiao, Shuo; Kolonel, Laurence N.; Le Marchand, Loic; Lemire, Mathieu; Ma, Jing; Newcomb, Polly A.; Ochs-Balcom, Heather M.; Pflugeisen, Bethann M.; Potter, John D.; Rudolph, Anja; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; Stelling, Deanna L.; Thomas, Fridtjof; Thornquist, Mark; Ulrich, Cornelia M.; Warnick, Greg S.; Zanke, Brent W.; Peters, Ulrike; Hsu, Li; White, Emily

    2014-01-01

    BACKGROUND Genome-wide association studies have identified several single nucleotide polymorphisms (SNPs) that are associated with risk of colorectal cancer (CRC). Prior research has evaluated the presence of gene-environment interaction involving the first 10 identified susceptibility loci, but little work has been conducted on interaction involving SNPs at recently identified susceptibility loci, including: rs10911251, rs6691170, rs6687758, rs11903757, rs10936599, rs647161, rs1321311, rs719725, rs1665650, rs3824999, rs7136702, rs11169552, rs59336, rs3217810, rs4925386, and rs2423279. METHODS Data on 9160 cases and 9280 controls from the Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) and Colon Cancer Family Registry (CCFR) were used to evaluate the presence of interaction involving the above-listed SNPs and sex, body mass index (BMI), alcohol consumption, smoking, aspirin use, post-menopausal hormone (PMH) use, as well as intake of dietary calcium, dietary fiber, dietary folate, red meat, processed meat, fruit, and vegetables. Interaction was evaluated using a fixed-effects meta-analysis of an efficient Empirical Bayes estimator, and permutation was used to account for multiple comparisons. RESULTS None of the permutation-adjusted p-values reached statistical significance. CONCLUSIONS The associations between recently identified genetic susceptibility loci and CRC are not strongly modified by sex, BMI, alcohol, smoking, aspirin, PMH use, and various dietary factors. IMPACT Results suggest no evidence of strong gene-environment interactions involving the recently identified 16 susceptibility loci for CRC taken one at a time. PMID:24994789

  19. Genes and Gene Therapy

    Science.gov (United States)

    ... correctly, a child can have a genetic disorder. Gene therapy is an experimental technique that uses genes to ... or prevent disease. The most common form of gene therapy involves inserting a normal gene to replace an ...

  20. A pathway-based network analysis of hypertension-related genes

    Science.gov (United States)

    Wang, Huan; Hu, Jing-Bo; Xu, Chuan-Yun; Zhang, De-Hai; Yan, Qian; Xu, Ming; Cao, Ke-Fei; Zhang, Xu-Sheng

    2016-02-01

    Complex network approach has become an effective way to describe interrelationships among large amounts of biological data, which is especially useful in finding core functions and global behavior of biological systems. Hypertension is a complex disease caused by many reasons including genetic, physiological, psychological and even social factors. In this paper, based on the information of biological pathways, we construct a network model of hypertension-related genes of the salt-sensitive rat to explore the interrelationship between genes. Statistical and topological characteristics show that the network has the small-world but not scale-free property, and exhibits a modular structure, revealing compact and complex connections among these genes. By the threshold of integrated centrality larger than 0.71, seven key hub genes are found: Jun, Rps6kb1, Cycs, Creb312, Cdk4, Actg1 and RT1-Da. These genes should play an important role in hypertension, suggesting that the treatment of hypertension should focus on the combination of drugs on multiple genes.

  1. The impact of gene expression variation on the robustness and evolvability of a developmental gene regulatory network.

    Directory of Open Access Journals (Sweden)

    David A Garfield

    2013-10-01

    Full Text Available Regulatory interactions buffer development against genetic and environmental perturbations, but adaptation requires phenotypes to change. We investigated the relationship between robustness and evolvability within the gene regulatory network underlying development of the larval skeleton in the sea urchin Strongylocentrotus purpuratus. We find extensive variation in gene expression in this network throughout development in a natural population, some of which has a heritable genetic basis. Switch-like regulatory interactions predominate during early development, buffer expression variation, and may promote the accumulation of cryptic genetic variation affecting early stages. Regulatory interactions during later development are typically more sensitive (linear, allowing variation in expression to affect downstream target genes. Variation in skeletal morphology is associated primarily with expression variation of a few, primarily structural, genes at terminal positions within the network. These results indicate that the position and properties of gene interactions within a network can have important evolutionary consequences independent of their immediate regulatory role.

  2. Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization

    Directory of Open Access Journals (Sweden)

    McDonald Karen

    2011-08-01

    Full Text Available Abstract Background Direct gene synthesis is becoming more popular owing to decreases in gene synthesis pricing. Compared with using natural genes, gene synthesis provides a good opportunity to optimize gene sequence for specific applications. In order to facilitate gene optimization, we have developed a stand-alone software called Visual Gene Developer. Results The software not only provides general functions for gene analysis and optimization along with an interactive user-friendly interface, but also includes unique features such as programming capability, dedicated mRNA secondary structure prediction, artificial neural network modeling, network & multi-threaded computing, and user-accessible programming modules. The software allows a user to analyze and optimize a sequence using main menu functions or specialized module windows. Alternatively, gene optimization can be initiated by designing a gene construct and configuring an optimization strategy. A user can choose several predefined or user-defined algorithms to design a complicated strategy. The software provides expandable functionality as platform software supporting module development using popular script languages such as VBScript and JScript in the software programming environment. Conclusion Visual Gene Developer is useful for both researchers who want to quickly analyze and optimize genes, and those who are interested in developing and testing new algorithms in bioinformatics. The software is available for free download at http://www.visualgenedeveloper.net.

  3. GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.

    Science.gov (United States)

    Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin

    2016-12-05

    Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking into account of network topology. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork, whose expression data improved the overall subnetwork score, is recruited. While the GS search calculated the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, using pathway data and protein-protein interaction as network data in order to consider the interaction among significant genes were discussed. Classification was performed to compare the performance of the identified gene subnetworks with three

  4. DNA Array-Based Gene Profiling

    Science.gov (United States)

    Mocellin, Simone; Provenzano, Maurizio; Rossi, Carlo Riccardo; Pilati, Pierluigi; Nitti, Donato; Lise, Mario

    2005-01-01

    Cancer is a heterogeneous disease in most respects, including its cellularity, different genetic alterations, and diverse clinical behaviors. Traditional molecular analyses are reductionist, assessing only 1 or a few genes at a time, thus working with a biologic model too specific and limited to confront a process whose clinical outcome is likely to be governed by the combined influence of many genes. The potential of functional genomics is enormous, because for each experiment, thousands of relevant observations can be made simultaneously. Accordingly, DNA array, like other high-throughput technologies, might catalyze and ultimately accelerate the development of knowledge in tumor cell biology. Although in its infancy, the implementation of DNA array technology in cancer research has already provided investigators with novel data and intriguing new hypotheses on the molecular cascade leading to carcinogenesis, tumor aggressiveness, and sensitivity to antiblastic agents. Given the revolutionary implications that the use of this technology might have in the clinical management of patients with cancer, principles of DNA array-based tumor gene profiling need to be clearly understood for the data to be correctly interpreted and appreciated. In the present work, we discuss the technical features characterizing this powerful laboratory tool and review the applications so far described in the field of oncology. PMID:15621987

  5. Unveiling network-based functional features through integration of gene expression into protein networks.

    Science.gov (United States)

    Jalili, Mahdi; Gebhardt, Tom; Wolkenhauer, Olaf; Salehzadeh-Yazdi, Ali

    2018-06-01

    Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. Copyright © 2018 Elsevier B.V. All rights reserved.

  6. Radionuclide reporter gene imaging

    Energy Technology Data Exchange (ETDEWEB)

    Min, Jung Joon [School of Medicine, Chonnam National Univ., Gwangju (Korea, Republic of)

    2004-04-01

    Recent progress in the development of non-invasive imaging technologies continues to strengthen the role of molecular imaging biological research. These tools have been validated recently in variety of research models, and have been shown to provide continuous quantitative monitoring of the location(s), magnitude, and time-variation of gene expression. This article reviews the principles, characteristics, categories and the use of radionuclide reporter gene imaging technologies as they have been used in imaging cell trafficking, imaging gene therapy, imaging endogenous gene expression and imaging molecular interactions. The studies published to date demonstrate that reporter gene imaging technologies will help to accelerate model validation as well as allow for clinical monitoring of human diseases.

  7. Radionuclide reporter gene imaging

    International Nuclear Information System (INIS)

    Min, Jung Joon

    2004-01-01

    Recent progress in the development of non-invasive imaging technologies continues to strengthen the role of molecular imaging biological research. These tools have been validated recently in variety of research models, and have been shown to provide continuous quantitative monitoring of the location(s), magnitude, and time-variation of gene expression. This article reviews the principles, characteristics, categories and the use of radionuclide reporter gene imaging technologies as they have been used in imaging cell trafficking, imaging gene therapy, imaging endogenous gene expression and imaging molecular interactions. The studies published to date demonstrate that reporter gene imaging technologies will help to accelerate model validation as well as allow for clinical monitoring of human diseases

  8. Statistics on gene-based laser speckles with a small number of scatterers: implications for the detection of polymorphism in the Chlamydia trachomatis omp1 gene

    Science.gov (United States)

    Ulyanov, Sergey S.; Ulianova, Onega V.; Zaytsev, Sergey S.; Saltykov, Yury V.; Feodorova, Valentina A.

    2018-04-01

    The transformation mechanism for a nucleotide sequence of the Chlamydia trachomatis gene into a speckle pattern has been considered. The first and second-order statistics of gene-based speckles have been analyzed. It has been demonstrated that gene-based speckles do not obey Gaussian statistics and belong to the class of speckles with a small number of scatterers. It has been shown that gene polymorphism can be easily detected through analysis of the statistical characteristics of gene-based speckles.

  9. Neuroplasticity of selective attention: Research foundations and preliminary evidence for a gene by intervention interaction

    Science.gov (United States)

    Stevens, Courtney; Pakulak, Eric; Hampton Wray, Amanda; Bell, Theodore A.; Neville, Helen J.

    2017-01-01

    This article reviews the trajectory of our research program on selective attention, which has moved from basic research on the neural processes underlying selective attention to translational studies using selective attention as a neurobiological target for evidence-based interventions. We use this background to present a promising preliminary investigation of how genetic and experiential factors interact during development (i.e., gene × intervention interactions). Our findings provide evidence on how exposure to a family-based training can modify the associations between genotype (5-HTTLPR) and the neural mechanisms of selective attention in preschool children from lower socioeconomic status backgrounds. PMID:28819066

  10. Neuroplasticity of selective attention: Research foundations and preliminary evidence for a gene by intervention interaction.

    Science.gov (United States)

    Isbell, Elif; Stevens, Courtney; Pakulak, Eric; Hampton Wray, Amanda; Bell, Theodore A; Neville, Helen J

    2017-08-29

    This article reviews the trajectory of our research program on selective attention, which has moved from basic research on the neural processes underlying selective attention to translational studies using selective attention as a neurobiological target for evidence-based interventions. We use this background to present a promising preliminary investigation of how genetic and experiential factors interact during development (i.e., gene × intervention interactions). Our findings provide evidence on how exposure to a family-based training can modify the associations between genotype (5-HTTLPR) and the neural mechanisms of selective attention in preschool children from lower socioeconomic status backgrounds.

  11. FiGS: a filter-based gene selection workbench for microarray data

    Directory of Open Access Journals (Sweden)

    Yun Taegyun

    2010-01-01

    Full Text Available Abstract Background The selection of genes that discriminate disease classes from microarray data is widely used for the identification of diagnostic biomarkers. Although various gene selection methods are currently available and some of them have shown excellent performance, no single method can retain the best performance for all types of microarray datasets. It is desirable to use a comparative approach to find the best gene selection result after rigorous test of different methodological strategies for a given microarray dataset. Results FiGS is a web-based workbench that automatically compares various gene selection procedures and provides the optimal gene selection result for an input microarray dataset. FiGS builds up diverse gene selection procedures by aligning different feature selection techniques and classifiers. In addition to the highly reputed techniques, FiGS diversifies the gene selection procedures by incorporating gene clustering options in the feature selection step and different data pre-processing options in classifier training step. All candidate gene selection procedures are evaluated by the .632+ bootstrap errors and listed with their classification accuracies and selected gene sets. FiGS runs on parallelized computing nodes that capacitate heavy computations. FiGS is freely accessible at http://gexp.kaist.ac.kr/figs. Conclusion FiGS is an web-based application that automates an extensive search for the optimized gene selection analysis for a microarray dataset in a parallel computing environment. FiGS will provide both an efficient and comprehensive means of acquiring optimal gene sets that discriminate disease states from microarray datasets.

  12. A case-only genome-wide association study for assessing gene-sex interaction in Allergic rhinitis

    DEFF Research Database (Denmark)

    Mohammadnejad, Afsaneh; Brasch-Andersen, Charlotte; Haagerup, Annette

    -value = 2.8 × 10−5) sits in C5orf66 gene on 5q31. Poster abstract 2 Discussion: Our study was able to detect a significant SNP rs4251459 mapping to IRAK4 gene on 12q12 locus which appeared to increase the risk of AR in females than males. This gene has previously been reported to have a sex dependent effect...... on AR. C5orf66 loci might also be an interesting candidate for AR, but its role warrants further validations. Additionally, pathway analysis from GSEA identified a pathway related to immune system which is biologically meaningful and supportive. In conclusion, our study revealed the gene-sex interaction...

  13. Diet-Gene Interactions and PUFA Metabolism: A Potential Contributor to Health Disparities and Human Diseases

    Directory of Open Access Journals (Sweden)

    Floyd H. Chilton

    2014-05-01

    Full Text Available The “modern western” diet (MWD has increased the onset and progression of chronic human diseases as qualitatively and quantitatively maladaptive dietary components give rise to obesity and destructive gene-diet interactions. There has been a three-fold increase in dietary levels of the omega-6 (n-6 18 carbon (C18, polyunsaturated fatty acid (PUFA linoleic acid (LA; 18:2n-6, with the addition of cooking oils and processed foods to the MWD. Intense debate has emerged regarding the impact of this increase on human health. Recent studies have uncovered population-related genetic variation in the LCPUFA biosynthetic pathway (especially within the fatty acid desaturase gene (FADS cluster that is associated with levels of circulating and tissue PUFAs and several biomarkers and clinical endpoints of cardiovascular disease (CVD. Importantly, populations of African descent have higher frequencies of variants associated with elevated levels of arachidonic acid (ARA, CVD biomarkers and disease endpoints. Additionally, nutrigenomic interactions between dietary n-6 PUFAs and variants in genes that encode for enzymes that mobilize and metabolize ARA to eicosanoids have been identified. These observations raise important questions of whether gene-PUFA interactions are differentially driving the risk of cardiovascular and other diseases in diverse populations, and contributing to health disparities, especially in African American populations.

  14. Dairy Product Consumption Interacts with Glucokinase (GCK Gene Polymorphisms Associated with Insulin Resistance

    Directory of Open Access Journals (Sweden)

    Marine S. Da Silva

    2017-08-01

    Full Text Available Dairy product intake and a person’s genetic background have been reported to be associated with the risk of type 2 diabetes (T2D. The objective of this study was to examine the interaction between dairy products and genes related to T2D on glucose-insulin homeostasis parameters. A validated food frequency questionnaire, fasting blood samples, and glucokinase (GCK genotypes were analyzed in 210 healthy participants. An interaction between rs1799884 in GCK and dairy intake on the homeostasis model assessment of insulin resistance was identified. Secondly, human hepatocellular carcinoma cells (HepG2 were grown in a high-glucose medium and incubated with either 1-dairy proteins: whey, caseins, and a mixture of whey and casein; and 2-four amino acids (AA or mixtures of AA. The expression of GCK-related genes insulin receptor substrate-1 (IRS-1 and fatty acid synthase (FASN was increased with whey protein isolate or hydrolysate. Individually, leucine increased IRS-1 expression, whereas isoleucine and valine decreased FASN expression. A branched-chain AA mixture decreased IRS-1 and FASN expression. In conclusion, carriers of the A allele for rs1799884 in the GCK gene may benefit from a higher intake of dairy products to maintain optimal insulin sensitivity. Moreover, the results show that whey proteins affect the expression of genes related to glucose metabolism.

  15. Interaction between the RGS6 gene and psychosocial stress on obesity-related traits.

    Science.gov (United States)

    Kim, Hyun-Jin; Min, Jin-Young; Min, Kyoung-Bok

    2017-03-31

    Obesity is a major risk factor for chronic diseases and arises from the interactions between environmental factors and multiple genes. Psychosocial stress may affect the risk for obesity, modifying food intake and choice. A recent study suggested regulator of G-protein signaling 6 (RGS6) as a novel candidate gene for obesity in terms of reward-related feeding under stress. In this study, we tried to verify the unidentified connection between RGS6 and human obesity with psychosocial stress in a Korean population. A total of 1,462 adult subjects, who participated in the Korean Association Resource cohort project, were included for this analysis. Obesity-related traits including waist circumference, body mass index, and visceral adipose tissue were recorded. A total of 4 intronic SNPs for the RGS6 gene were used for this study. We found that interactions between SNP rs2239219 and psychosocial stress are significantly associated with abdominal obesity (p = 0.007). As risk allele of this SNP increased, prevalence of abdominal obesity under high-stress conditions gradually increased (p = 0.013). However, we found no SNPs-by-stress interaction effect on other adiposity phenotypes. This study suggests that RGS6 is closely linked to stress-induced abdominal obesity in Korean adults.

  16. A database and tool, IM Browser, for exploring and integrating emerging gene and protein interaction data for Drosophila

    Directory of Open Access Journals (Sweden)

    Parrish Jodi R

    2006-04-01

    Full Text Available Abstract Background Biological processes are mediated by networks of interacting genes and proteins. Efforts to map and understand these networks are resulting in the proliferation of interaction data derived from both experimental and computational techniques for a number of organisms. The volume of this data combined with the variety of specific forms it can take has created a need for comprehensive databases that include all of the available data sets, and for exploration tools to facilitate data integration and analysis. One powerful paradigm for the navigation and analysis of interaction data is an interaction graph or map that represents proteins or genes as nodes linked by interactions. Several programs have been developed for graphical representation and analysis of interaction data, yet there remains a need for alternative programs that can provide casual users with rapid easy access to many existing and emerging data sets. Description Here we describe a comprehensive database of Drosophila gene and protein interactions collected from a variety of sources, including low and high throughput screens, genetic interactions, and computational predictions. We also present a program for exploring multiple interaction data sets and for combining data from different sources. The program, referred to as the Interaction Map (IM Browser, is a web-based application for searching and visualizing interaction data stored in a relational database system. Use of the application requires no downloads and minimal user configuration or training, thereby enabling rapid initial access to interaction data. IM Browser was designed to readily accommodate and integrate new types of interaction data as it becomes available. Moreover, all information associated with interaction measurements or predictions and the genes or proteins involved are accessible to the user. This allows combined searches and analyses based on either common or technique-specific attributes

  17. Random regression models for detection of gene by environment interaction

    Directory of Open Access Journals (Sweden)

    Meuwissen Theo HE

    2007-02-01

    Full Text Available Abstract Two random regression models, where the effect of a putative QTL was regressed on an environmental gradient, are described. The first model estimates the correlation between intercept and slope of the random regression, while the other model restricts this correlation to 1 or -1, which is expected under a bi-allelic QTL model. The random regression models were compared to a model assuming no gene by environment interactions. The comparison was done with regards to the models ability to detect QTL, to position them accurately and to detect possible QTL by environment interactions. A simulation study based on a granddaughter design was conducted, and QTL were assumed, either by assigning an effect independent of the environment or as a linear function of a simulated environmental gradient. It was concluded that the random regression models were suitable for detection of QTL effects, in the presence and absence of interactions with environmental gradients. Fixing the correlation between intercept and slope of the random regression had a positive effect on power when the QTL effects re-ranked between environments.

  18. RNAi-based silencing of genes encoding the vacuolar- ATPase ...

    African Journals Online (AJOL)

    RNAi-based silencing of genes encoding the vacuolar- ATPase subunits a and c in pink bollworm (Pectinophora gossypiella). Ahmed M. A. Mohammed. Abstract. RNA interference is a post- transcriptional gene regulation mechanism that is predominantly found in eukaryotic organisms. RNAi demonstrated a successful ...

  19. Reporter gene imaging: potential impact on therapy

    International Nuclear Information System (INIS)

    Serganova, Inna; Blasberg, Ronald

    2005-01-01

    Positron emission tomography (PET)-based molecular-genetic imaging in living organisms has enjoyed exceptional growth over the past 5 years; this is particularly striking since it has been identified as a new discipline only within the past decade. Positron emission tomography is one of three imaging technologies (nuclear, magnetic resonance and optical) that has begun to incorporate methods that are established in molecular and cell biology research. The convergence of these disciplines and the wider application of multi-modality imaging are at the heart of this success story. Most current molecular-genetic imaging strategies are 'indirect,' coupling a 'reporter gene' with a complimentary 'reporter probe.' Reporter gene constructs can be driven by constitutive promoter elements and used to monitor gene therapy vectors and the efficacy of trans gene targeting and transduction, as well as to monitor adoptive cell-based therapies. Inducible promoters can be used as 'sensors' to regulate the magnitude of reporter gene expression and can be used to provide information about endogenous cell processes. Reporter systems can also be constructed to monitor mRNA stabilization and specific protein-protein interactions. Promoters can be cell specific and restrict transgene expression to certain tissue and organs. The translation of reporter gene imaging to specific clinical applications is discussed. Several examples that have potential for patient imaging studies in the near future include monitoring adenoviral-based gene therapy, oncolytic herpes virus therapy, adoptive cell-based therapies and Salmonella-based tumor-targeted cancer therapy and imaging. The primary translational applications of noninvasive in vivo reporter gene imaging are likely to be (a) quantitative monitoring of the gene therapy vector and the efficacy of transduction in clinical protocols, by imaging the location, extent and duration of transgene expression; (b) monitoring cell trafficking, targeting

  20. Coordinated rates of evolution between interacting plastid and nuclear genes in Geraniaceae.

    Science.gov (United States)

    Zhang, Jin; Ruhlman, Tracey A; Sabir, Jamal; Blazier, J Chris; Jansen, Robert K

    2015-03-01

    Although gene coevolution has been widely observed within individuals and between different organisms, rarely has this phenomenon been investigated within a phylogenetic framework. The Geraniaceae is an attractive system in which to study plastid-nuclear genome coevolution due to the highly elevated evolutionary rates in plastid genomes. In plants, the plastid-encoded RNA polymerase (PEP) is a protein complex composed of subunits encoded by both plastid (rpoA, rpoB, rpoC1, and rpoC2) and nuclear genes (sig1-6). We used transcriptome and genomic data for 27 species of Geraniales in a systematic evaluation of coevolution between genes encoding subunits of the PEP holoenzyme. We detected strong correlations of dN (nonsynonymous substitutions) but not dS (synonymous substitutions) within rpoB/sig1 and rpoC2/sig2, but not for other plastid/nuclear gene pairs, and identified the correlation of dN/dS ratio between rpoB/C1/C2 and sig1/5/6, rpoC1/C2 and sig2, and rpoB/C2 and sig3 genes. Correlated rates between interacting plastid and nuclear sequences across the Geraniales could result from plastid-nuclear genome coevolution. Analyses of coevolved amino acid positions suggest that structurally mediated coevolution is not the major driver of plastid-nuclear coevolution. The detection of strong correlation of evolutionary rates between SIG and RNAP genes suggests a plausible explanation for plastome-genome incompatibility in Geraniaceae. © 2015 American Society of Plant Biologists. All rights reserved.

  1. An integrative approach to inferring biologically meaningful gene modules

    Directory of Open Access Journals (Sweden)

    Wang Kai

    2011-07-01

    Full Text Available Abstract Background The ability to construct biologically meaningful gene networks and modules is critical for contemporary systems biology. Though recent studies have demonstrated the power of using gene modules to shed light on the functioning of complex biological systems, most modules in these networks have shown little association with meaningful biological function. We have devised a method which directly incorporates gene ontology (GO annotation in construction of gene modules in order to gain better functional association. Results We have devised a method, Semantic Similarity-Integrated approach for Modularization (SSIM that integrates various gene-gene pairwise similarity values, including information obtained from gene expression, protein-protein interactions and GO annotations, in the construction of modules using affinity propagation clustering. We demonstrated the performance of the proposed method using data from two complex biological responses: 1. the osmotic shock response in Saccharomyces cerevisiae, and 2. the prion-induced pathogenic mouse model. In comparison with two previously reported algorithms, modules identified by SSIM showed significantly stronger association with biological functions. Conclusions The incorporation of semantic similarity based on GO annotation with gene expression and protein-protein interaction data can greatly enhance the functional relevance of inferred gene modules. In addition, the SSIM approach can also reveal the hierarchical structure of gene modules to gain a broader functional view of the biological system. Hence, the proposed method can facilitate comprehensive and in-depth analysis of high throughput experimental data at the gene network level.

  2. Application of Various Statistical Models to Explore Gene-Gene Interactions in Folate, Xenobiotic, Toll-Like Receptor and STAT4 Pathways that Modulate Susceptibility to Systemic Lupus Erythematosus.

    Science.gov (United States)

    Rupasree, Yedluri; Naushad, Shaik Mohammad; Varshaa, Ravi; Mahalakshmi, Govindaraj Swathika; Kumaraswami, Konda; Rajasekhar, Liza; Kutala, Vijay Kumar

    2016-02-01

    In view of our previous studies showing an independent association of genetic polymorphisms in folate, xenobiotic, and toll-like receptor (TLR) pathways with the risk for systemic lupus erythematosus (SLE), we have developed three statistical models to delineate complex gene-gene interactions between folate, xenobiotic, TLR, and signal transducer and activator of transcription 4 (STAT4) signaling pathways in association with the molecular pathophysiology of SLE. We developed additive, multifactor dimensionality reduction (MDR), and artificial neural network (ANN) models. The additive model, although the simplest, suggested a moderate predictability of 30 polymorphisms of these four pathways (area under the curve [AUC] 0.66). MDR analysis revealed significant gene-gene interactions among glutathione-S-transferase (GST)T1 and STAT4 (rs3821236 and rs7574865) polymorphisms, which account for moderate predictability of SLE. The MDR model for specific auto-antibodies revealed the importance of gene-gene interactions among cytochrome P450, family1, subfamily A, polypeptide 1 (CYP1A1) m1, catechol-O-methyltransferase (COMT) H108L, solute carrier family 19 (folate transporter), member 1 (SLC19A1) G80A, estrogen receptor 1 (ESR1), TLR5, 5-methyltetrahydrofolate-homocysteine methyltransferase reductase (MTRR), thymidylate synthase (TYMS). and STAT4 polymorphisms. The ANN model for disease prediction showed reasonably good predictability of SLE risk with 30 polymorphisms (AUC 0.76). These polymorphisms contribute towards the production of SSB and anti-dsDNA antibodies to the extent of 48 and 40%, respectively, while their contribution for the production of antiRNP, SSA, and anti-cardiolipin antibodies varies between 20 and 30%. The current study highlighted the importance of genetic polymorphisms in folate, xenobiotic, TLR, and STAT4 signaling pathways as moderate predictors of SLE risk and delineates the molecular pathophysiology associated with these single nucleotide

  3. Comprehensive analysis of gene expression patterns of hedgehog-related genes

    Directory of Open Access Journals (Sweden)

    Baillie David

    2006-10-01

    efficacy of our GFP expression effort with EST, OST and SAGE data. Conclusion No bona-fide Hh signaling pathway is present in C. elegans. Given that the hh-related gene products have a predicted signal peptide for secretion, it is possible that they constitute components of the extracellular matrix (ECM. They might be associated with the cuticle or be present in soluble form in the body cavity. They might interact with the Patched or the Patched-related proteins in a manner similar to the interaction of Hedgehog with its receptor Patched.

  4. A reference gene set for sex pheromone biosynthesis and degradation genes from the diamondback moth, Plutella xylostella, based on genome and transcriptome digital gene expression analyses.

    Science.gov (United States)

    He, Peng; Zhang, Yun-Fei; Hong, Duan-Yang; Wang, Jun; Wang, Xing-Liang; Zuo, Ling-Hua; Tang, Xian-Fu; Xu, Wei-Ming; He, Ming

    2017-03-01

    comprehensive gene data set of sex pheromone biosynthesis and degradation enzyme related genes in DBM created by genome- and transcriptome-wide identification, characterization and expression profiling. Our findings provide a basis to better understand the function of genes with tissue enriched expression. The results also provide information on the genes involved in sex pheromone biosynthesis and degradation, and may be useful to identify potential gene targets for pest control strategies by disrupting the insect-insect communication using pheromone-based behavioral antagonists.

  5. Genes and Social Behavior

    OpenAIRE

    Robinson, Gene E.; Fernald, Russell D.; Clayton, David F.

    2008-01-01

    What specific genes and regulatory sequences contribute to the organization and functioning of brain circuits that support social behavior? How does social experience interact with information in the genome to modulate these brain circuits? Here we address these questions by highlighting progress that has been made in identifying and understanding two key “vectors of influence” that link genes, brain, and social behavior: 1) social information alters gene readout in the brain to influence beh...

  6. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks.

    Science.gov (United States)

    Lepoivre, Cyrille; Bergon, Aurélie; Lopez, Fabrice; Perumal, Narayanan B; Nguyen, Catherine; Imbert, Jean; Puthier, Denis

    2012-01-31

    Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users drawing new hypotheses without being overwhelmed by the density of the subsequent graph. We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i) predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices), (ii) potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii) regulatory interactions curated from the literature, (iv) predicted post-transcriptional regulation by micro-RNA, (v) protein kinase-substrate interactions and (vi) physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed In-teractomeBrowser, a graph-based knowledge browser that comes as a plug-in for Transcriptome-Browser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration of heterogeneous biological information

  7. An Entropy-based gene selection method for cancer classification using microarray data

    Directory of Open Access Journals (Sweden)

    Krishnan Arun

    2005-03-01

    Full Text Available Abstract Background Accurate diagnosis of cancer subtypes remains a challenging problem. Building classifiers based on gene expression data is a promising approach; yet the selection of non-redundant but relevant genes is difficult. The selected gene set should be small enough to allow diagnosis even in regular clinical laboratories and ideally identify genes involved in cancer-specific regulatory pathways. Here an entropy-based method is proposed that selects genes related to the different cancer classes while at the same time reducing the redundancy among the genes. Results The present study identifies a subset of features by maximizing the relevance and minimizing the redundancy of the selected genes. A merit called normalized mutual information is employed to measure the relevance and the redundancy of the genes. In order to find a more representative subset of features, an iterative procedure is adopted that incorporates an initial clustering followed by data partitioning and the application of the algorithm to each of the partitions. A leave-one-out approach then selects the most commonly selected genes across all the different runs and the gene selection algorithm is applied again to pare down the list of selected genes until a minimal subset is obtained that gives a satisfactory accuracy of classification. The algorithm was applied to three different data sets and the results obtained were compared to work done by others using the same data sets Conclusion This study presents an entropy-based iterative algorithm for selecting genes from microarray data that are able to classify various cancer sub-types with high accuracy. In addition, the feature set obtained is very compact, that is, the redundancy between genes is reduced to a large extent. This implies that classifiers can be built with a smaller subset of genes.

  8. Inferring nonlinear gene regulatory networks from gene expression data based on distance correlation.

    Directory of Open Access Journals (Sweden)

    Xiaobo Guo

    Full Text Available Nonlinear dependence is general in regulation mechanism of gene regulatory networks (GRNs. It is vital to properly measure or test nonlinear dependence from real data for reconstructing GRNs and understanding the complex regulatory mechanisms within the cellular system. A recently developed measurement called the distance correlation (DC has been shown powerful and computationally effective in nonlinear dependence for many situations. In this work, we incorporate the DC into inferring GRNs from the gene expression data without any underling distribution assumptions. We propose three DC-based GRNs inference algorithms: CLR-DC, MRNET-DC and REL-DC, and then compare them with the mutual information (MI-based algorithms by analyzing two simulated data: benchmark GRNs from the DREAM challenge and GRNs generated by SynTReN network generator, and an experimentally determined SOS DNA repair network in Escherichia coli. According to both the receiver operator characteristic (ROC curve and the precision-recall (PR curve, our proposed algorithms significantly outperform the MI-based algorithms in GRNs inference.

  9. The temporal dynamics of differential gene expression in Aspergillus fumigatus interacting with human immature dendritic cells in vitro.

    LENUS (Irish Health Repository)

    Morton, Charles O

    2011-01-01

    Dendritic cells (DC) are the most important antigen presenting cells and play a pivotal role in host immunity to infectious agents by acting as a bridge between the innate and adaptive immune systems. Monocyte-derived immature DCs (iDC) were infected with viable resting conidia of Aspergillus fumigatus (Af293) for 12 hours at an MOI of 5; cells were sampled every three hours. RNA was extracted from both organisms at each time point and hybridised to microarrays. iDC cell death increased at 6 h in the presence of A. fumigatus which coincided with fungal germ tube emergence; >80% of conidia were associated with iDC. Over the time course A. fumigatus differentially regulated 210 genes, FunCat analysis indicated significant up-regulation of genes involved in fermentation, drug transport, pathogenesis and response to oxidative stress. Genes related to cytotoxicity were differentially regulated but the gliotoxin biosynthesis genes were down regulated over the time course, while Aspf1 was up-regulated at 9 h and 12 h. There was an up-regulation of genes in the subtelomeric regions of the genome as the interaction progressed. The genes up-regulated by iDC in the presence of A. fumigatus indicated that they were producing a pro-inflammatory response which was consistent with previous transcriptome studies of iDC interacting with A. fumigatus germ tubes. This study shows that A. fumigatus adapts to phagocytosis by iDCs by utilising genes that allow it to survive the interaction rather than just up-regulation of specific virulence genes.

  10. The temporal dynamics of differential gene expression in Aspergillus fumigatus interacting with human immature dendritic cells in vitro.

    Directory of Open Access Journals (Sweden)

    Charles O Morton

    2011-01-01

    Full Text Available Dendritic cells (DC are the most important antigen presenting cells and play a pivotal role in host immunity to infectious agents by acting as a bridge between the innate and adaptive immune systems. Monocyte-derived immature DCs (iDC were infected with viable resting conidia of Aspergillus fumigatus (Af293 for 12 hours at an MOI of 5; cells were sampled every three hours. RNA was extracted from both organisms at each time point and hybridised to microarrays. iDC cell death increased at 6 h in the presence of A. fumigatus which coincided with fungal germ tube emergence; >80% of conidia were associated with iDC. Over the time course A. fumigatus differentially regulated 210 genes, FunCat analysis indicated significant up-regulation of genes involved in fermentation, drug transport, pathogenesis and response to oxidative stress. Genes related to cytotoxicity were differentially regulated but the gliotoxin biosynthesis genes were down regulated over the time course, while Aspf1 was up-regulated at 9 h and 12 h. There was an up-regulation of genes in the subtelomeric regions of the genome as the interaction progressed. The genes up-regulated by iDC in the presence of A. fumigatus indicated that they were producing a pro-inflammatory response which was consistent with previous transcriptome studies of iDC interacting with A. fumigatus germ tubes. This study shows that A. fumigatus adapts to phagocytosis by iDCs by utilising genes that allow it to survive the interaction rather than just up-regulation of specific virulence genes.

  11. The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach.

    Science.gov (United States)

    Hindumathi, V; Kranthi, T; Rao, S B; Manimaran, P

    2014-06-01

    With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.

  12. Gene-Environment Interactions of Circadian-Related Genes for Cardiometabolic Traits

    DEFF Research Database (Denmark)

    Dashti, Hassan S; Follis, Jack L; Smith, Caren E

    2015-01-01

    OBJECTIVE: Common circadian-related gene variants associate with increased risk for metabolic alterations including type 2 diabetes. However, little is known about whether diet and sleep could modify associations between circadian-related variants (CLOCK-rs1801260, CRY2-rs11605924, MTNR1B-rs13871...

  13. Gene-environment interactions of circadian-related genes for cardiometabolic traits

    Science.gov (United States)

    Objective: Common circadian-related gene variants associate with increased risk for metabolic alterations including type 2 diabetes. However, little is known about whether diet and sleep could modify associations between circadian-related variants (CLOCK-rs1801260, CRY2-rs11605924, MTNR1B-rs1387153,...

  14. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Cirera Salicio, Susanna; Zhernakova, Daria V.

    2014-01-01

    interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model...... (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. Results WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P ... the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using...

  15. Identification of highly synchronized subnetworks from gene expression data.

    Science.gov (United States)

    Gao, Shouguo; Wang, Xujing

    2013-01-01

    There has been a growing interest in identifying context-specific active protein-protein interaction (PPI) subnetworks through integration of PPI and time course gene expression data. However the interaction dynamics during the biological process under study has not been sufficiently considered previously. Here we propose a topology-phase locking (TopoPL) based scoring metric for identifying active PPI subnetworks from time series expression data. First the temporal coordination in gene expression changes is evaluated through phase locking analysis; The results are subsequently integrated with PPI to define an activity score for each PPI subnetwork, based on individual member expression, as well topological characteristics of the PPI network and of the expression temporal coordination network; Lastly, the subnetworks with the top scores in the whole PPI network are identified through simulated annealing search. Application of TopoPL to simulated data and to the yeast cell cycle data showed that it can more sensitively identify biologically meaningful subnetworks than the method that only utilizes the static PPI topology, or the additive scoring method. Using TopoPL we identified a core subnetwork with 49 genes important to yeast cell cycle. Interestingly, this core contains a protein complex known to be related to arrangement of ribosome subunits that exhibit extremely high gene expression synchronization. Inclusion of interaction dynamics is important to the identification of relevant gene networks.

  16. A Cancer Gene Selection Algorithm Based on the K-S Test and CFS

    Directory of Open Access Journals (Sweden)

    Qiang Su

    2017-01-01

    Full Text Available Background. To address the challenging problem of selecting distinguished genes from cancer gene expression datasets, this paper presents a gene subset selection algorithm based on the Kolmogorov-Smirnov (K-S test and correlation-based feature selection (CFS principles. The algorithm selects distinguished genes first using the K-S test, and then, it uses CFS to select genes from those selected by the K-S test. Results. We adopted support vector machines (SVM as the classification tool and used the criteria of accuracy to evaluate the performance of the classifiers on the selected gene subsets. This approach compared the proposed gene subset selection algorithm with the K-S test, CFS, minimum-redundancy maximum-relevancy (mRMR, and ReliefF algorithms. The average experimental results of the aforementioned gene selection algorithms for 5 gene expression datasets demonstrate that, based on accuracy, the performance of the new K-S and CFS-based algorithm is better than those of the K-S test, CFS, mRMR, and ReliefF algorithms. Conclusions. The experimental results show that the K-S test-CFS gene selection algorithm is a very effective and promising approach compared to the K-S test, CFS, mRMR, and ReliefF algorithms.

  17. Gene-environment interaction in Parkinson’s disease: coffee, ADORA2A, and CYP1A2

    Science.gov (United States)

    Chuang, Yu-Hsuan; Lill, Christina M.; Lee, Pei-Chen; Hansen, Johnni; Lassen, Christina Funch; Bertram, Lars; Greene, Naomi; Sinsheimer, Janet S.; Ritz, Beate

    2017-01-01

    Background and purpose Drinking caffeinated coffee has been reported to protect against Parkinson’s disease (PD). Caffeine is an adenosine A2A receptor (encoded by the gene ADORA2A) antagonist that increases dopaminergic neurotransmission and Cytochrome P450 1A2 (gene: CYP1A2) metabolizes caffeine, thus gene polymorphisms in ADORA2A and CYP1A2 may influence the effect coffee consumption has on PD risk. Methods In a population-based case control study (PASIDA) in Denmark (1,556 PD patients and 1,606 birth year- and sex- matched controls), we assessed interactions between lifetime coffee consumption and three polymorphisms in ADORA2A and CYP1A2 for all subjects and incident and prevalent PD cases separately using logistic regression models. We also conducted a meta-analysis combining our results with those from previous studies. Results We estimated statistically significant interactions for ADORA2A rs5760423 and heavy vs. light coffee consumption in incident (OR interaction=0.66 [0.46–0.94], p=0.02) but not prevalent PD. We did not observe interactions for CYP1A2 rs762551 and rs2472304 in incident or prevalent PD. In meta-analyses, PD associations with daily coffee consumption were strongest among carriers of variant alleles in both ADORA2A and CYP1A2. Conclusion We corroborated results from a previous report that described interactions between ADORA2A and CYP1A2 polymorphisms and coffee consumption. Our results also suggest that survivor bias may affect results of studies that enrol prevalent PD cases. PMID:28135712

  18. Combining random gene fission and rational gene fusion to discover near-infrared fluorescent protein fragments that report on protein-protein interactions.

    Science.gov (United States)

    Pandey, Naresh; Nobles, Christopher L; Zechiedrich, Lynn; Maresso, Anthony W; Silberg, Jonathan J

    2015-05-15

    Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein-protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein-protein interactions within whole animals.

  19. Down-Regulation of Gene Expression by RNA-Induced Gene Silencing

    Science.gov (United States)

    Travella, Silvia; Keller, Beat

    Down-regulation of endogenous genes via post-transcriptional gene silencing (PTGS) is a key to the characterization of gene function in plants. Many RNA-based silencing mechanisms such as post-transcriptional gene silencing, co-suppression, quelling, and RNA interference (RNAi) have been discovered among species of different kingdoms (plants, fungi, and animals). One of the most interesting discoveries was RNAi, a sequence-specific gene-silencing mechanism initiated by the introduction of double-stranded RNA (dsRNA), homologous in sequence to the silenced gene, which triggers degradation of mRNA. Infection of plants with modified viruses can also induce RNA silencing and is referred to as virus-induced gene silencing (VIGS). In contrast to insertional mutagenesis, these emerging new reverse genetic approaches represent a powerful tool for exploring gene function and for manipulating gene expression experimentally in cereal species such as barley and wheat. We examined how RNAi and VIGS have been used to assess gene function in barley and wheat, including molecular mechanisms involved in the process and available methodological elements, such as vectors, inoculation procedures, and analysis of silenced phenotypes.

  20. In search of functional association from time-series microarray data based on the change trend and level of gene expression

    Directory of Open Access Journals (Sweden)

    Zeng An-Ping

    2006-02-01

    Full Text Available Abstract Background The increasing availability of time-series expression data opens up new possibilities to study functional linkages of genes. Present methods used to infer functional linkages between genes from expression data are mainly based on a point-to-point comparison. Change trends between consecutive time points in time-series data have been so far not well explored. Results In this work we present a new method based on extracting main features of the change trend and level of gene expression between consecutive time points. The method, termed as trend correlation (TC, includes two major steps: 1, calculating a maximal local alignment of change trend score by dynamic programming and a change trend correlation coefficient between the maximal matched change levels of each gene pair; 2, inferring relationships of gene pairs based on two statistical extraction procedures. The new method considers time shifts and inverted relationships in a similar way as the local clustering (LC method but the latter is merely based on a point-to-point comparison. The TC method is demonstrated with data from yeast cell cycle and compared with the LC method and the widely used Pearson correlation coefficient (PCC based clustering method. The biological significance of the gene pairs is examined with several large-scale yeast databases. Although the TC method predicts an overall lower number of gene pairs than the other two methods at a same p-value threshold, the additional number of gene pairs inferred by the TC method is considerable: e.g. 20.5% compared with the LC method and 49.6% with the PCC method for a p-value threshold of 2.7E-3. Moreover, the percentage of the inferred gene pairs consistent with databases by our method is generally higher than the LC method and similar to the PCC method. A significant number of the gene pairs only inferred by the TC method are process-identity or function-similarity pairs or have well-documented biological

  1. Protein Annotation from Protein Interaction Networks and Gene Ontology

    OpenAIRE

    Nguyen, Cao D.; Gardiner, Katheleen J.; Cios, Krzysztof J.

    2011-01-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precis...

  2. Evaluation of gene importance in microarray data based upon probability of selection

    Directory of Open Access Journals (Sweden)

    Fu Li M

    2005-03-01

    Full Text Available Abstract Background Microarray devices permit a genome-scale evaluation of gene function. This technology has catalyzed biomedical research and development in recent years. As many important diseases can be traced down to the gene level, a long-standing research problem is to identify specific gene expression patterns linking to metabolic characteristics that contribute to disease development and progression. The microarray approach offers an expedited solution to this problem. However, it has posed a challenging issue to recognize disease-related genes expression patterns embedded in the microarray data. In selecting a small set of biologically significant genes for classifier design, the nature of high data dimensionality inherent in this problem creates substantial amount of uncertainty. Results Here we present a model for probability analysis of selected genes in order to determine their importance. Our contribution is that we show how to derive the P value of each selected gene in multiple gene selection trials based on different combinations of data samples and how to conduct a reliability analysis accordingly. The importance of a gene is indicated by its associated P value in that a smaller value implies higher information content from information theory. On the microarray data concerning the subtype classification of small round blue cell tumors, we demonstrate that the method is capable of finding the smallest set of genes (19 genes with optimal classification performance, compared with results reported in the literature. Conclusion In classifier design based on microarray data, the probability value derived from gene selection based on multiple combinations of data samples enables an effective mechanism for reducing the tendency of fitting local data particularities.

  3. Gene interactions and genetics for yield and its attributes in grass pea

    Indian Academy of Sciences (India)

    [Parihar A. K., Dixit G. P. and Singh D. 2016 Gene interactions and genetics for yield and its attributes .... Biological yield. Seed yield factors. Plant height. Primary branches plant pod ..... indicates that these traits are under the control of several.

  4. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records

    DEFF Research Database (Denmark)

    Jiang, Li; Edwards, Stefan M.; Thomsen, Bo

    2014-01-01

    from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text...

  5. The evolution of milk casein genes from tooth genes before the origin of mammals.

    Science.gov (United States)

    Kawasaki, Kazuhiko; Lafont, Anne-Gaelle; Sire, Jean-Yves

    2011-07-01

    Caseins are among cardinal proteins that evolved in the lineage leading to mammals. In milk, caseins and calcium phosphate (CaP) form a huge complex called casein micelle. By forming the micelle, milk maintains high CaP concentrations, which help altricial mammalian neonates to grow bone and teeth. Two types of caseins are known. Ca-sensitive caseins (α(s)- and β-caseins) bind Ca but precipitate at high Ca concentrations, whereas Ca-insensitive casein (κ-casein) does not usually interact with Ca but instead stabilizes the micelle. Thus, it is thought that these two types of caseins are both necessary for stable micelle formation. Both types of caseins show high substitution rates, which make it difficult to elucidate the evolution of caseins. Yet, recent studies have revealed that all casein genes belong to the secretory calcium-binding phosphoprotein (SCPP) gene family that arose by gene duplication. In the present study, we investigated exon-intron structures and phylogenetic distributions of casein and other SCPP genes, particularly the odontogenic ameloblast-associated (ODAM) gene, the SCPP-Pro-Gln-rich 1 (SCPPPQ1) gene, and the follicular dendritic cell secreted peptide (FDCSP) gene. The results suggest that contemporary Ca-sensitive casein genes arose from a putative common ancestor, which we refer to as CSN1/2. The six putative exons comprising CSN1/2 are all found in SCPPPQ1, although ODAM also shares four of these exons. By contrast, the five exons of the Ca-insensitive casein gene are all reminiscent of FDCSP. The phylogenetic distribution of these genes suggests that both SCPPPQ1 and FDCSP arose from ODAM. We thus argue that all casein genes evolved from ODAM via two different pathways; Ca-sensitive casein genes likely originated directly from SCPPPQ1, whereas the Ca-insensitive casein genes directly differentiated from FDCSP. Further, expression of ODAM, SCPPPQ1, and FDCSP was detected in dental tissues, supporting the idea that both types of caseins

  6. Ranking and characterization of established BMI and lipid associated loci as candidates for gene-environment interactions

    DEFF Research Database (Denmark)

    Shungin, Dmitry; Deng, Wei Q; Varga, Tibor V

    2017-01-01

    Phenotypic variance heterogeneity across genotypes at a single nucleotide polymorphism (SNP) may reflect underlying gene-environment (G×E) or gene-gene interactions. We modeled variance heterogeneity for blood lipids and BMI in up to 44,211 participants and investigated relationships between...... variance effects (Pv), G×E interaction effects (with smoking and physical activity), and marginal genetic effects (Pm). Correlations between Pv and Pm were stronger for SNPs with established marginal effects (Spearman's ρ = 0.401 for triglycerides, and ρ = 0.236 for BMI) compared to all SNPs. When Pv...... and Pm were compared for all pruned SNPs, only BMI was statistically significant (Spearman's ρ = 0.010). Overall, SNPs with established marginal effects were overrepresented in the nominally significant part of the Pv distribution (Pbinomial BMI had...

  7. Molecular genetic gene-environment studies using candidate genes in schizophrenia: a systematic review.

    Science.gov (United States)

    Modinos, Gemma; Iyegbe, Conrad; Prata, Diana; Rivera, Margarita; Kempton, Matthew J; Valmaggia, Lucia R; Sham, Pak C; van Os, Jim; McGuire, Philip

    2013-11-01

    The relatively high heritability of schizophrenia suggests that genetic factors play an important role in the etiology of the disorder. On the other hand, a number of environmental factors significantly influence its incidence. As few direct genetic effects have been demonstrated, and there is considerable inter-individual heterogeneity in the response to the known environmental factors, interactions between genetic and environmental factors may be important in determining whether an individual develops the disorder. To date, a considerable number of studies of gene-environment interactions (G×E) in schizophrenia have employed a hypothesis-based molecular genetic approach using candidate genes, which have led to a range of different findings. This systematic review aims to summarize the results from molecular genetic candidate studies and to review challenges and opportunities of this approach in psychosis research. Finally, we discuss the potential of future prospects, such as new studies that combine hypothesis-based molecular genetic candidate approaches with agnostic genome-wide association studies in determining schizophrenia risk. © 2013 Elsevier B.V. All rights reserved.

  8. HMM-Based Gene Annotation Methods

    Energy Technology Data Exchange (ETDEWEB)

    Haussler, David; Hughey, Richard; Karplus, Keven

    1999-09-20

    Development of new statistical methods and computational tools to identify genes in human genomic DNA, and to provide clues to their functions by identifying features such as transcription factor binding sites, tissue, specific expression and splicing patterns, and remove homologies at the protein level with genes of known function.

  9. The light gene of Drosophila melanogaster encodes a homologue of VPS41, a yeast gene involved in cellular-protein trafficking.

    Science.gov (United States)

    Warner, T S; Sinclair, D A; Fitzpatrick, K A; Singh, M; Devlin, R H; Honda, B M

    1998-04-01

    Mutations in a number of genes affect eye colour in Drosophila melanogaster; some of these "eye-colour" genes have been shown to be involved in various aspects of cellular transport processes. In addition, combinations of viable mutant alleles of some of these genes, such as carnation (car) combined with either light (lt) or deep-orange (dor) mutants, show lethal interactions. Recently, dor was shown to be homologous to the yeast gene PEP3 (VPS18), which is known to be involved in intracellular trafficking. We have undertaken to extend our earlier work on the lt gene, in order to examine in more detail its expression pattern and to characterize its gene product via sequencing of a cloned cDNA. The gene appears to be expressed at relatively high levels in all stages and tissues examined, and shows strong homology to VPS41, a gene involved in cellular-protein trafficking in yeast and higher eukaryotes. Further genetic experiments also point to a role for lt in transport processes: we describe lethal interactions between viable alleles of lt and dor, as well as phenotypic interactions (reductions in eye pigment) between allels of lt and another eye-colour gene, garnet (g), whose gene product has close homology to a subunit of the human adaptor complex, AP-3.

  10. Network-based association of hypoxia-responsive genes with cardiovascular diseases

    International Nuclear Information System (INIS)

    Wang, Rui-Sheng; Oldham, William M; Loscalzo, Joseph

    2014-01-01

    Molecular oxygen is indispensable for cellular viability and function. Hypoxia is a stress condition in which oxygen demand exceeds supply. Low cellular oxygen content induces a number of molecular changes to activate regulatory pathways responsible for increasing the oxygen supply and optimizing cellular metabolism under limited oxygen conditions. Hypoxia plays critical roles in the pathobiology of many diseases, such as cancer, heart failure, myocardial ischemia, stroke, and chronic lung diseases. Although the complicated associations between hypoxia and cardiovascular (and cerebrovascular) diseases (CVD) have been recognized for some time, there are few studies that investigate their biological link from a systems biology perspective. In this study, we integrate hypoxia genes, CVD genes, and the human protein interactome in order to explore the relationship between hypoxia and cardiovascular diseases at a systems level. We show that hypoxia genes are much closer to CVD genes in the human protein interactome than that expected by chance. We also find that hypoxia genes play significant bridging roles in connecting different cardiovascular diseases. We construct a hypoxia-CVD bipartite network and find several interesting hypoxia-CVD modules with significant gene ontology similarity. Finally, we show that hypoxia genes tend to have more CVD interactors in the human interactome than in random networks of matching topology. Based on these observations, we can predict novel genes that may be associated with CVD. This network-based association study gives us a broad view of the relationships between hypoxia and cardiovascular diseases and provides new insights into the role of hypoxia in cardiovascular biology. (paper)

  11. Cross-species global and subset gene expression profiling identifies genes involved in prostate cancer response to selenium

    Directory of Open Access Journals (Sweden)

    Dhir Rajiv

    2004-08-01

    Full Text Available Abstract Background Gene expression technologies have the ability to generate vast amounts of data, yet there often resides only limited resources for subsequent validation studies. This necessitates the ability to perform sorting and prioritization of the output data. Previously described methodologies have used functional pathways or transcriptional regulatory grouping to sort genes for further study. In this paper we demonstrate a comparative genomics based method to leverage data from animal models to prioritize genes for validation. This approach allows one to develop a disease-based focus for the prioritization of gene data, a process that is essential for systems that lack significant functional pathway data yet have defined animal models. This method is made possible through the use of highly controlled spotted cDNA slide production and the use of comparative bioinformatics databases without the use of cross-species slide hybridizations. Results Using gene expression profiling we have demonstrated a similar whole transcriptome gene expression patterns in prostate cancer cells from human and rat prostate cancer cell lines both at baseline expression levels and after treatment with physiologic concentrations of the proposed chemopreventive agent Selenium. Using both the human PC3 and rat PAII prostate cancer cell lines have gone on to identify a subset of one hundred and fifty-four genes that demonstrate a similar level of differential expression to Selenium treatment in both species. Further analysis and data mining for two genes, the Insulin like Growth Factor Binding protein 3, and Retinoic X Receptor alpha, demonstrates an association with prostate cancer, functional pathway links, and protein-protein interactions that make these genes prime candidates for explaining the mechanism of Selenium's chemopreventive effect in prostate cancer. These genes are subsequently validated by western blots showing Selenium based induction and using

  12. The apolipoprotein L family of programmed cell death and immunity genes rapidly evolved in primates at discrete sites of host-pathogen interactions.

    Science.gov (United States)

    Smith, Eric E; Malik, Harmit S

    2009-05-01

    Apolipoprotein L1 (APOL1) is a human protein that confers immunity to Trypanosoma brucei infections but can be countered by a trypanosome-encoded antagonist SRA. APOL1 belongs to a family of programmed cell death genes whose proteins can initiate host apoptosis or autophagic death. We report here that all six members of the APOL gene family (APOL1-6) present in humans have rapidly evolved in simian primates. APOL6, furthermore, shows evidence of an adaptive sweep during recent human evolution. In each APOL gene tested, we found rapidly evolving codons in or adjacent to the SRA-interacting protein domain (SID), which is the domain of APOL1 that interacts with SRA. In APOL6, we also found a rapidly changing 13-amino-acid cluster in the membrane-addressing domain (MAD), which putatively functions as a pH sensor and regulator of cell death. We predict that APOL genes are antagonized by pathogens by at least two distinct mechanisms: SID antagonists, which include SRA, that interact with the SID of various APOL proteins, and MAD antagonists that interact with the MAD hinge base of APOL6. These antagonists either block or prematurely cause APOL-mediated programmed cell death of host cells to benefit the infecting pathogen. These putative interactions must occur inside host cells, in contrast to secreted APOL1 that trafficks to the trypanosome lysosome. Hence, the dynamic APOL gene family appears to be an important link between programmed cell death of host cells and immunity to pathogens.

  13. Prioritization of candidate genes for cattle reproductive traits, based on protein-protein interactions, gene expression, and text-mining

    DEFF Research Database (Denmark)

    Hulsegge, Ina; Woelders, Henri; Smits, Mari

    2013-01-01

    Reproduction is of significant economic importance in dairy cattle. Improved understanding of mechanisms that control estrous behavior and other reproduction traits could help in developing strategies to improve and/or monitor these traits. The objective of this study was to predict and rank gene...

  14. Graph-based semi-supervised learning with genomic data integration using condition-responsive genes applied to phenotype classification.

    Science.gov (United States)

    Doostparast Torshizi, Abolfazl; Petzold, Linda R

    2018-01-01

    Data integration methods that combine data from different molecular levels such as genome, epigenome, transcriptome, etc., have received a great deal of interest in the past few years. It has been demonstrated that the synergistic effects of different biological data types can boost learning capabilities and lead to a better understanding of the underlying interactions among molecular levels. In this paper we present a graph-based semi-supervised classification algorithm that incorporates latent biological knowledge in the form of biological pathways with gene expression and DNA methylation data. The process of graph construction from biological pathways is based on detecting condition-responsive genes, where 3 sets of genes are finally extracted: all condition responsive genes, high-frequency condition-responsive genes, and P-value-filtered genes. The proposed approach is applied to ovarian cancer data downloaded from the Human Genome Atlas. Extensive numerical experiments demonstrate superior performance of the proposed approach compared to other state-of-the-art algorithms, including the latest graph-based classification techniques. Simulation results demonstrate that integrating various data types enhances classification performance and leads to a better understanding of interrelations between diverse omics data types. The proposed approach outperforms many of the state-of-the-art data integration algorithms. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  15. Inferring Gene Regulatory Networks Using Conditional Regulation Pattern to Guide Candidate Genes.

    Directory of Open Access Journals (Sweden)

    Fei Xiao

    Full Text Available Combining path consistency (PC algorithms with conditional mutual information (CMI are widely used in reconstruction of gene regulatory networks. CMI has many advantages over Pearson correlation coefficient in measuring non-linear dependence to infer gene regulatory networks. It can also discriminate the direct regulations from indirect ones. However, it is still a challenge to select the conditional genes in an optimal way, which affects the performance and computation complexity of the PC algorithm. In this study, we develop a novel conditional mutual information-based algorithm, namely RPNI (Regulation Pattern based Network Inference, to infer gene regulatory networks. For conditional gene selection, we define the co-regulation pattern, indirect-regulation pattern and mixture-regulation pattern as three candidate patterns to guide the selection of candidate genes. To demonstrate the potential of our algorithm, we apply it to gene expression data from DREAM challenge. Experimental results show that RPNI outperforms existing conditional mutual information-based methods in both accuracy and time complexity for different sizes of gene samples. Furthermore, the robustness of our algorithm is demonstrated by noisy interference analysis using different types of noise.

  16. Digital Gene Expression Analysis to Screen Disease Resistance-Relevant Genes from Leaves of Herbaceous Peony (Paeonia lactiflora Pall. Infected by Botrytis cinerea.

    Directory of Open Access Journals (Sweden)

    Saijie Gong

    Full Text Available Herbaceous peony (Paeonia lactiflora Pall. is a well-known traditional flower in China and is widely used for landscaping and garden greening due to its high ornamental value. However, disease spots usually appear after the flowering of the plant and may result in the withering of the plant in severe cases. This study examined the disease incidence in an herbaceous peony field in the Yangzhou region, Jiangsu Province. Based on morphological characteristics and molecular data, the disease in this area was identified as a gray mold caused by Botrytis cinerea. Based on previously obtained transcriptome data, eight libraries generated from two herbaceous peony cultivars 'Zifengyu' and 'Dafugui' with different susceptibilities to the disease were then analyzed using digital gene expression profiling (DGE. Thousands of differentially expressed genes (DEGs were screened by comparing the eight samples, and these genes were annotated using the Gene ontology (GO and Kyoto encyclopedia of genes and genomes (KEGG database. The pathways related to plant-pathogen interaction, secondary metabolism synthesis and antioxidant system were concentrated, and 51, 76, and 13 disease resistance-relevant candidate genes were identified, respectively. The expression patterns of these candidate genes differed between the two cultivars: their expression of the disease-resistant cultivar 'Zifengyu' sharply increased during the early stages of infection, while it was relatively subdued in the disease-sensitive cultivar 'Dafugui'. A selection of ten candidate genes was evaluated by quantitative real-time PCR (qRT-PCR to validate the DGE data. These results revealed the transcriptional changes that took place during the interaction of herbaceous peony with B. cinerea, providing insight into the molecular mechanisms of host resistance to gray mold.

  17. Genetic interactions of MAF1 identify a role for Med20 in transcriptional repression of ribosomal protein genes.

    Directory of Open Access Journals (Sweden)

    Ian M Willis

    2008-07-01

    Full Text Available Transcriptional repression of ribosomal components and tRNAs is coordinately regulated in response to a wide variety of environmental stresses. Part of this response involves the convergence of different nutritional and stress signaling pathways on Maf1, a protein that is essential for repressing transcription by RNA polymerase (pol III in Saccharomyces cerevisiae. Here we identify the functions buffering yeast cells that are unable to down-regulate transcription by RNA pol III. MAF1 genetic interactions identified in screens of non-essential gene-deletions and conditionally expressed essential genes reveal a highly interconnected network of 64 genes involved in ribosome biogenesis, RNA pol II transcription, tRNA modification, ubiquitin-dependent proteolysis and other processes. A survey of non-essential MAF1 synthetic sick/lethal (SSL genes identified six gene-deletions that are defective in transcriptional repression of ribosomal protein (RP genes following rapamycin treatment. This subset of MAF1 SSL genes included MED20 which encodes a head module subunit of the RNA pol II Mediator complex. Genetic interactions between MAF1 and subunits in each structural module of Mediator were investigated to examine the functional relationship between these transcriptional regulators. Gene expression profiling identified a prominent and highly selective role for Med20 in the repression of RP gene transcription under multiple conditions. In addition, attenuated repression of RP genes by rapamycin was observed in a strain deleted for the Mediator tail module subunit Med16. The data suggest that Mediator and Maf1 function in parallel pathways to negatively regulate RP mRNA and tRNA synthesis.

  18. Multiple interactions between maternally-activated signalling pathways control Xenopus nodal-related genes.

    Science.gov (United States)

    Rex, Maria; Hilton, Emma; Old, Robert

    2002-03-01

    We have investigated the induction of the six Xenopus nodal-related genes, Xnr1-Xnr6, by maternal determinants. The beta-catenin pathway was modelled by stimulation using Xwnt8, activin-like signalling was modelled by activin, and VegT action was studied by overexpression in animal cap explants. Combinations of factors were examined, and previously unrecognised interactions were revealed in animal caps and whole embryos. For the induction of Xnr5 and Xnr6 in whole embryos, using a beta-catenin antisense morpholino oligonucleotide or a dominant negative XTcf3, we have demonstrated an absolute permissive requirement for the beta-catenin/Tcf pathway, in addition to the requirement for VegT action. In animal caps Xnr5 and Xnr6 are induced in response to VegT overexpression, and this induction is dependent upon the concomitant activation of the beta-catenin pathway that VegT initiates in animal caps. For the induction of Xnr3, VegT interacts negatively so as to inhibit the induction otherwise observed with wnt-signalling alone. The negative effect of VegT is not the result of a general inhibition of wnt-signalling, and does not result from an inhibition of wnt-induced siamois expression. A 294 bp proximal promoter fragment of the Xnr3 gene is sufficient to mediate the negative effect of VegT. Further experiments, employing cycloheximide to examine the dependence of Xnr gene expression upon proteins translated after the mid-blastula stage, demonstrated that Xnrs 4, 5 and 6 are 'primary' Xnr genes whose expression in the late blastula is solely dependent upon factors present before the mid-blastula stage.

  19. Integration of steady-state and temporal gene expression data for the inference of gene regulatory networks.

    Science.gov (United States)

    Wang, Yi Kan; Hurley, Daniel G; Schnell, Santiago; Print, Cristin G; Crampin, Edmund J

    2013-01-01

    We develop a new regression algorithm, cMIKANA, for inference of gene regulatory networks from combinations of steady-state and time-series gene expression data. Using simulated gene expression datasets to assess the accuracy of reconstructing gene regulatory networks, we show that steady-state and time-series data sets can successfully be combined to identify gene regulatory interactions using the new algorithm. Inferring gene networks from combined data sets was found to be advantageous when using noisy measurements collected with either lower sampling rates or a limited number of experimental replicates. We illustrate our method by applying it to a microarray gene expression dataset from human umbilical vein endothelial cells (HUVECs) which combines time series data from treatment with growth factor TNF and steady state data from siRNA knockdown treatments. Our results suggest that the combination of steady-state and time-series datasets may provide better prediction of RNA-to-RNA interactions, and may also reveal biological features that cannot be identified from dynamic or steady state information alone. Finally, we consider the experimental design of genomics experiments for gene regulatory network inference and show that network inference can be improved by incorporating steady-state measurements with time-series data.

  20. A robust approach based on Weibull distribution for clustering gene expression data

    Directory of Open Access Journals (Sweden)

    Gong Binsheng

    2011-05-01

    Full Text Available Abstract Background Clustering is a widely used technique for analysis of gene expression data. Most clustering methods group genes based on the distances, while few methods group genes according to the similarities of the distributions of the gene expression levels. Furthermore, as the biological annotation resources accumulated, an increasing number of genes have been annotated into functional categories. As a result, evaluating the performance of clustering methods in terms of the functional consistency of the resulting clusters is of great interest. Results In this paper, we proposed the WDCM (Weibull Distribution-based Clustering Method, a robust approach for clustering gene expression data, in which the gene expressions of individual genes are considered as the random variables following unique Weibull distributions. Our WDCM is based on the concept that the genes with similar expression profiles have similar distribution parameters, and thus the genes are clustered via the Weibull distribution parameters. We used the WDCM to cluster three cancer gene expression data sets from the lung cancer, B-cell follicular lymphoma and bladder carcinoma and obtained well-clustered results. We compared the performance of WDCM with k-means and Self Organizing Map (SOM using functional annotation information given by the Gene Ontology (GO. The results showed that the functional annotation ratios of WDCM are higher than those of the other methods. We also utilized the external measure Adjusted Rand Index to validate the performance of the WDCM. The comparative results demonstrate that the WDCM provides the better clustering performance compared to k-means and SOM algorithms. The merit of the proposed WDCM is that it can be applied to cluster incomplete gene expression data without imputing the missing values. Moreover, the robustness of WDCM is also evaluated on the incomplete data sets. Conclusions The results demonstrate that our WDCM produces clusters

  1. Gene X Environment Interactions in Autism Spectrum Disorders: Role of Epigenetic Mechanisms

    Directory of Open Access Journals (Sweden)

    Sylvie eTordjman

    2014-08-01

    Full Text Available Several studies support currently the hypothesis that autism etiology is based on a polygenic and epistatic model. However, despite advances in epidemiological, molecular and clinical genetics, the genetic risk factors remain difficult to identify, with the exception of a few chromosomal disorders and several single gene disorders associated with an increased risk for autism. Furthermore, several studies suggest a role of environmental factors in autism spectrum disorders (ASD. First, arguments for a genetic contribution to autism, based on updated family and twin studies, are examined. Second, a review of possible prenatal, perinatal and postnatal environmental risk factors for ASD are presented. Then, the hypotheses are discussed concerning the underlying mechanisms related to a role of environmental factors in the development of ASD in association with genetic factors. In particular, epigenetics as a candidate biological mechanism for gene X environment interactions is considered and the possible role of epigenetic mechanisms reported in genetic disorders associated with ASD is discussed. Furthermore, the example of in utero exposure to valproate provides a good illustration of epigenetic mechanisms involved in ASD and innovative therapeutic strategies. Epigenetic remodeling by environmental factors opens new perspectives for a better understanding, prevention and early therapeutic intervention of ASD.

  2. A new measure for functional similarity of gene products based on Gene Ontology

    Directory of Open Access Journals (Sweden)

    Lengauer Thomas

    2006-06-01

    Full Text Available Abstract Background Gene Ontology (GO is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role. Results We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures; simRel and funSim. One measure (simRel is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families. Conclusion The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.

  3. Tensor decomposition-based unsupervised feature extraction identifies candidate genes that induce post-traumatic stress disorder-mediated heart diseases.

    Science.gov (United States)

    Taguchi, Y-H

    2017-12-21

    Although post-traumatic stress disorder (PTSD) is primarily a mental disorder, it can cause additional symptoms that do not seem to be directly related to the central nervous system, which PTSD is assumed to directly affect. PTSD-mediated heart diseases are some of such secondary disorders. In spite of the significant correlations between PTSD and heart diseases, spatial separation between the heart and brain (where PTSD is primarily active) prevents researchers from elucidating the mechanisms that bridge the two disorders. Our purpose was to identify genes linking PTSD and heart diseases. In this study, gene expression profiles of various murine tissues observed under various types of stress or without stress were analyzed in an integrated manner using tensor decomposition (TD). Based upon the obtained features, ∼ 400 genes were identified as candidate genes that may mediate heart diseases associated with PTSD. Various gene enrichment analyses supported biological reliability of the identified genes. Ten genes encoding protein-, DNA-, or mRNA-interacting proteins-ILF2, ILF3, ESR1, ESR2, RAD21, HTT, ATF2, NR3C1, TP53, and TP63-were found to be likely to regulate expression of most of these ∼ 400 genes and therefore are candidate primary genes that cause PTSD-mediated heart diseases. Approximately 400 genes in the heart were also found to be strongly affected by various drugs whose known adverse effects are related to heart diseases and/or fear memory conditioning; these data support the reliability of our findings. TD-based unsupervised feature extraction turned out to be a useful method for gene selection and successfully identified possible genes causing PTSD-mediated heart diseases.

  4. GO(vis), a gene ontology visualization tool based on multi-dimensional values.

    Science.gov (United States)

    Ning, Zi; Jiang, Zhenran

    2010-05-01

    Most of gene product similarity measurements concentrate on the information content of Gene Ontology (GO) terms or use a path-based similarity between GO terms, which may ignore other important information contained in the structure of the ontology. In our study, we integrate different GO similarity measure approaches to analyze the functional relationship of genes and gene products with a new triangle-based visualization tool called GO(Vis). The purpose of this tool is to demonstrate the effect of three important information factors when measuring the similarity between gene products. One advantage of this tool is that its important ratio can be adjusted to meet different measuring requirements according to the biological knowledge of each factor. The experimental results demonstrate that GO(Vis) can display diagrams of the functional relationship for gene products effectively.

  5. cDNA-AFLP analysis reveals differential gene expression in compatible interaction of wheat challenged with Puccinia striiformis f. sp. tritici

    Directory of Open Access Journals (Sweden)

    Huang Lili

    2009-06-01

    Full Text Available Abstract Background Puccinia striiformis f. sp. tritici is a fungal pathogen causing stripe rust, one of the most important wheat diseases worldwide. The fungus is strictly biotrophic and thus, completely dependent on living host cells for its reproduction, which makes it difficult to study genes of the pathogen. In spite of its economic importance, little is known about the molecular basis of compatible interaction between the pathogen and wheat host. In this study, we identified wheat and P. striiformis genes associated with the infection process by conducting a large-scale transcriptomic analysis using cDNA-AFLP. Results Of the total 54,912 transcript derived fragments (TDFs obtained using cDNA-AFLP with 64 primer pairs, 2,306 (4.2% displayed altered expression patterns after inoculation, of which 966 showed up-regulated and 1,340 down-regulated. 186 TDFs produced reliable sequences after sequencing of 208 TDFs selected, of which 74 (40% had known functions through BLAST searching the GenBank database. Majority of the latter group had predicted gene products involved in energy (13%, signal transduction (5.4%, disease/defence (5.9% and metabolism (5% of the sequenced TDFs. BLAST searching of the wheat stem rust fungus genome database identified 18 TDFs possibly from the stripe rust pathogen, of which 9 were validated of the pathogen origin using PCR-based assays followed by sequencing confirmation. Of the 186 reliable TDFs, 29 homologous to genes known to play a role in disease/defense, signal transduction or uncharacterized genes were further selected for validation of cDNA-AFLP expression patterns using qRT-PCR analyses. Results confirmed the altered expression patterns of 28 (96.5% genes revealed by the cDNA-AFLP technique. Conclusion The results show that cDNA-AFLP is a reliable technique for studying expression patterns of genes involved in the wheat-stripe rust interactions. Genes involved in compatible interactions between wheat and the

  6. Patenting human genes: Chinese academic articles' portrayal of gene patents.

    Science.gov (United States)

    Du, Li

    2018-04-24

    The patenting of human genes has been the subject of debate for decades. While China has gradually come to play an important role in the global genomics-based testing and treatment market, little is known about Chinese scholars' perspectives on patent protection for human genes. A content analysis of academic literature was conducted to identify Chinese scholars' concerns regarding gene patents, including benefits and risks of patenting human genes, attitudes that researchers hold towards gene patenting, and any legal and policy recommendations offered for the gene patent regime in China. 57.2% of articles were written by law professors, but scholars from health sciences, liberal arts, and ethics also participated in discussions on gene patent issues. While discussions of benefits and risks were relatively balanced in the articles, 63.5% of the articles favored gene patenting in general and, of the articles (n = 41) that explored gene patents in the Chinese context, 90.2% supported patent protections for human genes in China. The patentability of human genes was discussed in 33 articles, and 75.8% of these articles reached the conclusion that human genes are patentable. Chinese scholars view the patent regime as an important legal tool to protect the interests of inventors and inventions as well as the genetic resources of China. As such, many scholars support a gene patent system in China. These attitudes towards gene patents remain unchanged following the court ruling in the Myriad case in 2013, but arguments have been raised about the scope of gene patents, in particular that the increasing numbers of gene patents may negatively impact public health in China.

  7. Canonical correlation analysis for gene-based pleiotropy discovery.

    Directory of Open Access Journals (Sweden)

    Jose A Seoane

    2014-10-01

    Full Text Available Genome-wide association studies have identified a wealth of genetic variants involved in complex traits and multifactorial diseases. There is now considerable interest in testing variants for association with multiple phenotypes (pleiotropy and for testing multiple variants for association with a single phenotype (gene-based association tests. Such approaches can increase statistical power by combining evidence for association over multiple phenotypes or genetic variants respectively. Canonical Correlation Analysis (CCA measures the correlation between two sets of multidimensional variables, and thus offers the potential to combine these two approaches. To apply CCA, we must restrict the number of attributes relative to the number of samples. Hence we consider modules of genetic variation that can comprise a gene, a pathway or another biologically relevant grouping, and/or a set of phenotypes. In order to do this, we use an attribute selection strategy based on a binary genetic algorithm. Applied to a UK-based prospective cohort study of 4286 women (the British Women's Heart and Health Study, we find improved statistical power in the detection of previously reported genetic associations, and identify a number of novel pleiotropic associations between genetic variants and phenotypes. New discoveries include gene-based association of NSF with triglyceride levels and several genes (ACSM3, ERI2, IL18RAP, IL23RAP and NRG1 with left ventricular hypertrophy phenotypes. In multiple-phenotype analyses we find association of NRG1 with left ventricular hypertrophy phenotypes, fibrinogen and urea and pleiotropic relationships of F7 and F10 with Factor VII, Factor IX and cholesterol levels.

  8. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean) Cattle.

    Science.gov (United States)

    Lim, Dajeong; Lee, Seung-Hwan; Kim, Nam-Kuk; Cho, Yong-Min; Chai, Han-Ha; Seong, Hwan-Hoo; Kim, Heebal

    2013-01-01

    Marbling (intramuscular fat) is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the 'marbling score' trait and systemically analyzed the network topology in Hanwoo (Korean cattle). As a result, we determined 3 modules (gene groups) that showed statistically significant results for marbling score. In particular, one module (denoted as red) has a statistically significant result for marbling score (p = 0.008) and intramuscular fat (p = 0.02) and water capacity (p = 0.006). From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA) have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  9. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean Cattle

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2013-01-01

    Full Text Available Marbling (intramuscular fat is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the ‘marbling score’ trait and systemically analyzed the network topology in Hanwoo (Korean cattle. As a result, we determined 3 modules (gene groups that showed statistically significant results for marbling score. In particular, one module (denoted as red has a statistically significant result for marbling score (p = 0.008 and intramuscular fat (p = 0.02 and water capacity (p = 0.006. From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  10. Assessing SNP-SNP interactions among DNA repair, modification and metabolism related pathway genes in breast cancer susceptibility.

    Directory of Open Access Journals (Sweden)

    Yadav Sapkota

    Full Text Available Genome-wide association studies (GWASs have identified low-penetrance common variants (i.e., single nucleotide polymorphisms, SNPs associated with breast cancer susceptibility. Although GWASs are primarily focused on single-locus effects, gene-gene interactions (i.e., epistasis are also assumed to contribute to the genetic risks for complex diseases including breast cancer. While it has been hypothesized that moderately ranked (P value based weak single-locus effects in GWASs could potentially harbor valuable information for evaluating epistasis, we lack systematic efforts to investigate SNPs showing consistent associations with weak statistical significance across independent discovery and replication stages. The objectives of this study were i to select SNPs showing single-locus effects with weak statistical significance for breast cancer in a GWAS and/or candidate-gene studies; ii to replicate these SNPs in an independent set of breast cancer cases and controls; and iii to explore their potential SNP-SNP interactions contributing to breast cancer susceptibility. A total of 17 SNPs related to DNA repair, modification and metabolism pathway genes were selected since these pathways offer a priori knowledge for potential epistatic interactions and an overall role in breast carcinogenesis. The study design included predominantly Caucasian women (2,795 cases and 4,505 controls from Alberta, Canada. We observed two two-way SNP-SNP interactions (APEX1-rs1130409 and RPAP1-rs2297381; MLH1-rs1799977 and MDM2-rs769412 in logistic regression that conferred elevated risks for breast cancer (P(interaction<7.3 × 10(-3. Logic regression identified an interaction involving four SNPs (MBD2-rs4041245, MLH1-rs1799977, MDM2-rs769412, BRCA2-rs1799943 (P(permutation = 2.4 × 10(-3. SNPs involved in SNP-SNP interactions also showed single-locus effects with weak statistical significance, while BRCA2-rs1799943 showed stronger statistical significance (P

  11. Cancer Outlier Analysis Based on Mixture Modeling of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Keita Mori

    2013-01-01

    Full Text Available Molecular heterogeneity of cancer, partially caused by various chromosomal aberrations or gene mutations, can yield substantial heterogeneity in gene expression profile in cancer samples. To detect cancer-related genes which are active only in a subset of cancer samples or cancer outliers, several methods have been proposed in the context of multiple testing. Such cancer outlier analyses will generally suffer from a serious lack of power, compared with the standard multiple testing setting where common activation of genes across all cancer samples is supposed. In this paper, we consider information sharing across genes and cancer samples, via a parametric normal mixture modeling of gene expression levels of cancer samples across genes after a standardization using the reference, normal sample data. A gene-based statistic for gene selection is developed on the basis of a posterior probability of cancer outlier for each cancer sample. Some efficiency improvement by using our method was demonstrated, even under settings with misspecified, heavy-tailed t-distributions. An application to a real dataset from hematologic malignancies is provided.

  12. A hybrid computational method for the discovery of novel reproduction-related genes.

    Science.gov (United States)

    Chen, Lei; Chu, Chen; Kong, Xiangyin; Huang, Guohua; Huang, Tao; Cai, Yu-Dong

    2015-01-01

    Uncovering the molecular mechanisms underlying reproduction is of great importance to infertility treatment and to the generation of healthy offspring. In this study, we discovered novel reproduction-related genes with a hybrid computational method, integrating three different types of method, which offered new clues for further reproduction research. This method was first executed on a weighted graph, constructed based on known protein-protein interactions, to search the shortest paths connecting any two known reproduction-related genes. Genes occurring in these paths were deemed to have a special relationship with reproduction. These newly discovered genes were filtered with a randomization test. Then, the remaining genes were further selected according to their associations with known reproduction-related genes measured by protein-protein interaction score and alignment score obtained by BLAST. The in-depth analysis of the high confidence novel reproduction genes revealed hidden mechanisms of reproduction and provided guidelines for further experimental validations.

  13. Principles for the organization of gene-sets.

    Science.gov (United States)

    Li, Wentian; Freudenberg, Jan; Oswald, Michaela

    2015-12-01

    A gene-set, an important concept in microarray expression analysis and systems biology, is a collection of genes and/or their products (i.e. proteins) that have some features in common. There are many different ways to construct gene-sets, but a systematic organization of these ways is lacking. Gene-sets are mainly organized ad hoc in current public-domain databases, with group header names often determined by practical reasons (such as the types of technology in obtaining the gene-sets or a balanced number of gene-sets under a header). Here we aim at providing a gene-set organization principle according to the level at which genes are connected: homology, physical map proximity, chemical interaction, biological, and phenotypic-medical levels. We also distinguish two types of connections between genes: actual connection versus sharing of a label. Actual connections denote direct biological interactions, whereas shared label connection denotes shared membership in a group. Some extensions of the framework are also addressed such as overlapping of gene-sets, modules, and the incorporation of other non-protein-coding entities such as microRNAs. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. Investigating Gene Function in Cereal Rust Fungi by Plant-Mediated Virus-Induced Gene Silencing.

    Science.gov (United States)

    Panwar, Vinay; Bakkeren, Guus

    2017-01-01

    Cereal rust fungi are destructive pathogens, threatening grain production worldwide. Targeted breeding for resistance utilizing host resistance genes has been effective. However, breakdown of resistance occurs frequently and continued efforts are needed to understand how these fungi overcome resistance and to expand the range of available resistance genes. Whole genome sequencing, transcriptomic and proteomic studies followed by genome-wide computational and comparative analyses have identified large repertoire of genes in rust fungi among which are candidates predicted to code for pathogenicity and virulence factors. Some of these genes represent defence triggering avirulence effectors. However, functions of most genes still needs to be assessed to understand the biology of these obligate biotrophic pathogens. Since genetic manipulations such as gene deletion and genetic transformation are not yet feasible in rust fungi, performing functional gene studies is challenging. Recently, Host-induced gene silencing (HIGS) has emerged as a useful tool to characterize gene function in rust fungi while infecting and growing in host plants. We utilized Barley stripe mosaic virus-mediated virus induced gene silencing (BSMV-VIGS) to induce HIGS of candidate rust fungal genes in the wheat host to determine their role in plant-fungal interactions. Here, we describe the methods for using BSMV-VIGS in wheat for functional genomics study in cereal rust fungi.

  15. Pharmacogenetics of drug-drug interaction and drug-drug-gene interaction: a systematic review on CYP2C9, CYP2C19 and CYP2D6.

    Science.gov (United States)

    Bahar, Muh Akbar; Setiawan, Didik; Hak, Eelko; Wilffert, Bob

    2017-05-01

    Currently, most guidelines on drug-drug interaction (DDI) neither consider the potential effect of genetic polymorphism in the strength of the interaction nor do they account for the complex interaction caused by the combination of DDI and drug-gene interaction (DGI) where there are multiple biotransformation pathways, which is referred to as drug-drug-gene interaction (DDGI). In this systematic review, we report the impact of pharmacogenetics on DDI and DDGI in which three major drug-metabolizing enzymes - CYP2C9, CYP2C19 and CYP2D6 - are central. We observed that several DDI and DDGI are highly gene-dependent, leading to a different magnitude of interaction. Precision drug therapy should take pharmacogenetics into account when drug interactions in clinical practice are expected.

  16. Mutational robustness of gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Aalt D J van Dijk

    Full Text Available Mutational robustness of gene regulatory networks refers to their ability to generate constant biological output upon mutations that change network structure. Such networks contain regulatory interactions (transcription factor-target gene interactions but often also protein-protein interactions between transcription factors. Using computational modeling, we study factors that influence robustness and we infer several network properties governing it. These include the type of mutation, i.e. whether a regulatory interaction or a protein-protein interaction is mutated, and in the case of mutation of a regulatory interaction, the sign of the interaction (activating vs. repressive. In addition, we analyze the effect of combinations of mutations and we compare networks containing monomeric with those containing dimeric transcription factors. Our results are consistent with available data on biological networks, for example based on evolutionary conservation of network features. As a novel and remarkable property, we predict that networks are more robust against mutations in monomer than in dimer transcription factors, a prediction for which analysis of conservation of DNA binding residues in monomeric vs. dimeric transcription factors provides indirect evidence.

  17. Mining Gene Regulatory Networks by Neural Modeling of Expression Time-Series.

    Science.gov (United States)

    Rubiolo, Mariano; Milone, Diego H; Stegmayer, Georgina

    2015-01-01

    Discovering gene regulatory networks from data is one of the most studied topics in recent years. Neural networks can be successfully used to infer an underlying gene network by modeling expression profiles as times series. This work proposes a novel method based on a pool of neural networks for obtaining a gene regulatory network from a gene expression dataset. They are used for modeling each possible interaction between pairs of genes in the dataset, and a set of mining rules is applied to accurately detect the subjacent relations among genes. The results obtained on artificial and real datasets confirm the method effectiveness for discovering regulatory networks from a proper modeling of the temporal dynamics of gene expression profiles.

  18. Beyond the single gene: How epistasis and gene-by-environment effects influence crop domestication.

    Science.gov (United States)

    Doust, Andrew N; Lukens, Lewis; Olsen, Kenneth M; Mauro-Herrera, Margarita; Meyer, Ann; Rogers, Kimberly

    2014-04-29

    Domestication is a multifaceted evolutionary process, involving changes in individual genes, genetic interactions, and emergent phenotypes. There has been extensive discussion of the phenotypic characteristics of plant domestication, and recent research has started to identify the specific genes and mutational mechanisms that control domestication traits. However, there is an apparent disconnect between the simple genetic architecture described for many crop domestication traits, which should facilitate rapid phenotypic change under selection, and the slow rate of change reported from the archeobotanical record. A possible explanation involves the middle ground between individual genetic changes and their expression during development, where gene-by-gene (epistatic) and gene-by-environment interactions can modify the expression of phenotypes and opportunities for selection. These aspects of genetic architecture have the potential to significantly slow the speed of phenotypic evolution during crop domestication and improvement. Here we examine whether epistatic and gene-by-environment interactions have shaped how domestication traits have evolved. We review available evidence from the literature, and we analyze two domestication-related traits, shattering and flowering time, in a mapping population derived from a cross between domesticated foxtail millet and its wild progenitor. We find that compared with wild progenitor alleles, those favored during domestication often have large phenotypic effects and are relatively insensitive to genetic background and environmental effects. Consistent selection should thus be able to rapidly change traits during domestication. We conclude that if phenotypic evolution was slow during crop domestication, this is more likely due to cultural or historical factors than epistatic or environmental constraints.

  19. Predictive networks: a flexible, open source, web application for integration and analysis of human gene networks.

    Science.gov (United States)

    Haibe-Kains, Benjamin; Olsen, Catharina; Djebbari, Amira; Bontempi, Gianluca; Correll, Mick; Bouton, Christopher; Quackenbush, John

    2012-01-01

    Genomics provided us with an unprecedented quantity of data on the genes that are activated or repressed in a wide range of phenotypes. We have increasingly come to recognize that defining the networks and pathways underlying these phenotypes requires both the integration of multiple data types and the development of advanced computational methods to infer relationships between the genes and to estimate the predictive power of the networks through which they interact. To address these issues we have developed Predictive Networks (PN), a flexible, open-source, web-based application and data services framework that enables the integration, navigation, visualization and analysis of gene interaction networks. The primary goal of PN is to allow biomedical researchers to evaluate experimentally derived gene lists in the context of large-scale gene interaction networks. The PN analytical pipeline involves two key steps. The first is the collection of a comprehensive set of known gene interactions derived from a variety of publicly available sources. The second is to use these 'known' interactions together with gene expression data to infer robust gene networks. The PN web application is accessible from http://predictivenetworks.org. The PN code base is freely available at https://sourceforge.net/projects/predictivenets/.

  20. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Lepoivre Cyrille

    2012-01-01

    Full Text Available Abstract Background Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users drawing new hypotheses without being overwhelmed by the density of the subsequent graph. Results We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices, (ii potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii regulatory interactions curated from the literature, (iv predicted post-transcriptional regulation by micro-RNA, (v protein kinase-substrate interactions and (vi physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed In-teractomeBrowser, a graph-based knowledge browser that comes as a plug-in for Transcriptome-Browser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration

  1. Interactions between the FTO and GNB3 genes contribute to varied clinical phenotypes in hypertension.

    Directory of Open Access Journals (Sweden)

    Rahul Kumar

    Full Text Available The genes FTO and GNB3 are implicated in essential hypertension but their interaction remains to be explored. This study investigates the role of interaction between the two genes in the pathophysiology of essential hypertension.In a case-control study comprising 750 controls and 550 patients, interaction between the polymorphisms of FTO and GNB3 was examined using multifactor dimensionality reduction (MDR. The influence of interaction on clinical phenotypes like systolic and diastolic blood pressure, mean arterial pressure and body mass index was also investigated. The 3-locus MDR model comprising FTO rs8050136C/A and GNB3 rs1129649T/C and rs5443C/T emerged as the best disease conferring model. Moreover, the interacted-genotypes having either 1, 2, 3, 4 or 5 risk alleles correlated with linearly increasing odds ratios of 1.91 (P = 0.027; 3.93 (P = 2.08E-06; 4.51 (P = 7.63E-07; 7.44 (P = 3.66E-08 and 11.57 (P = 1.18E-05, respectively, when compared with interacted-genotypes devoid of risk alleles. Furthermore, interactions among haplotypes of FTO (H1-9 and GNB3 (Ha-d differed by >1.5-fold for protective-haplotypes, CTGGC+TC [H2+Ha] and CTGAC+TC [H4+Ha] (OR = 0.39, P = 0.003; OR = 0.22, P = 6.86E-05, respectively and risk-haplotypes, AAAGC+CT [H3+Hc] and AAAGC+TT [H3+Hd] (OR = 2.91, P = 9.98E-06; OR = 2.50, P = 0.004, respectively compared to individual haplotypes. Moreover, the effectiveness of gene-gene interaction was further corroborated with a 1.29-, 1.25- and 1.38-fold higher SBP, MAP and BMI, respectively, in patients having risk interacted-haplotype H3+Hc and 2.48-fold higher SBP having risk interacted-haplotype H3+Hd compared to individual haplotypes.Interactions between genetic variants of FTO and GNB3 influence clinical parameters to augment hypertension.

  2. Systematic study of association of four GABAergic genes: glutamic acid decarboxylase 1 gene, glutamic acid decarboxylase 2 gene, GABA(B) receptor 1 gene and GABA(A) receptor subunit beta2 gene, with schizophrenia using a universal DNA microarray.

    Science.gov (United States)

    Zhao, Xu; Qin, Shengying; Shi, Yongyong; Zhang, Aiping; Zhang, Jing; Bian, Li; Wan, Chunling; Feng, Guoyin; Gu, Niufan; Zhang, Guangqi; He, Guang; He, Lin

    2007-07-01

    Several studies have suggested the dysfunction of the GABAergic system as a risk factor in the pathogenesis of schizophrenia. In the present study, case-control association analysis was conducted in four GABAergic genes: two glutamic acid decarboxylase genes (GAD1 and GAD2), a GABA(A) receptor subunit beta2 gene (GABRB2) and a GABA(B) receptor 1 gene (GABBR1). Using a universal DNA microarray procedure we genotyped a total of 20 SNPs on the above four genes in a study involving 292 patients and 286 controls of Chinese descent. Statistically significant differences were observed in the allelic frequencies of the rs187269C/T polymorphism in the GABRB2 gene (P=0.0450, chi(2)=12.40, OR=1.65) and the -292A/C polymorphism in the GAD1 gene (P=0.0450, chi(2)=14.64 OR=1.77). In addition, using an electrophoretic mobility shift assay (EMSA), we discovered differences in the U251 nuclear protein binding to oligonucleotides representing the -292 SNP on the GAD1 gene, which suggests that the -292C allele has reduced transcription factor binding efficiency compared with the 292A allele. Using the multifactor-dimensionality reduction method (MDR), we found that the interactions among the rs187269C/T polymorphism in the GABRB2 gene, the -243A/G polymorphism in the GAD2 gene and the 27379C/T and 661C/T polymorphisms in the GAD1 gene revealed a significant association with schizophrenia (Pschizophrenia in the Chinese population.

  3. Tumor targeted gene therapy

    International Nuclear Information System (INIS)

    Kang, Joo Hyun

    2006-01-01

    Knowledge of molecular mechanisms governing malignant transformation brings new opportunities for therapeutic intervention against cancer using novel approaches. One of them is gene therapy based on the transfer of genetic material to an organism with the aim of correcting a disease. The application of gene therapy to the cancer treatment had led to the development of new experimental approaches such as suicidal gene therapy, inhibition of oncogenes and restoration of tumor-suppressor genes. Suicidal gene therapy is based on the expression in tumor cells of a gene encoding an enzyme that converts a prodrug into a toxic product. Representative suicidal genes are Herpes simplex virus type 1 thymidine kinase (HSV1-tk) and cytosine deaminase (CD). Especially, physicians and scientists of nuclear medicine field take an interest in suicidal gene therapy because they can monitor the location and magnitude, and duration of expression of HSV1-tk and CD by PET scanner

  4. Efficient strategy for detecting gene × gene joint action and its application in schizophrenia.

    Science.gov (United States)

    Won, Sungho; Kwon, Min-Seok; Mattheisen, Manuel; Park, Suyeon; Park, Changsoon; Kihara, Daisuke; Cichon, Sven; Ophoff, Roel; Nöthen, Markus M; Rietschel, Marcella; Baur, Max; Uitterlinden, Andre G; Hofmann, A; Lange, Christoph

    2014-01-01

    We propose a new approach to detect gene × gene joint action in genome-wide association studies (GWASs) for case-control designs. This approach offers an exhaustive search for all two-way joint action (including, as a special case, single gene action) that is computationally feasible at the genome-wide level and has reasonable statistical power under most genetic models. We found that the presence of any gene × gene joint action may imply differences in three types of genetic components: the minor allele frequencies and the amounts of Hardy-Weinberg disequilibrium may differ between cases and controls, and between the two genetic loci the degree of linkage disequilibrium may differ between cases and controls. Using Fisher's method, it is possible to combine the different sources of genetic information in an overall test for detecting gene × gene joint action. The proposed statistical analysis is efficient and its simplicity makes it applicable to GWASs. In the current study, we applied the proposed approach to a GWAS on schizophrenia and found several potential gene × gene interactions. Our application illustrates the practical advantage of the proposed method. © 2013 WILEY PERIODICALS, INC.

  5. [Correlation of angiotensin-converting enzyme 2 gene polymorphism with antihypertensive effects of benazepril].

    Science.gov (United States)

    Chen, Qing; Tang, Xun; Yu, Can-qing; Chen, Da-fang; Tian, Jun; Cao, Yang; Fan, Wen-yi; Cao, Wei-hua; Zhan, Si-yan; Lv, Jun; Guo, Xiao-xia; Li, Li-ming; Hu, Yong-hua

    2010-06-18

    To explore the correlation of rs2106809 from angiotensin-converting enzyme 2 gene with antihypertensive effects of benazepril, as well as its interactions with polymorphisms of angiotensinogen(AGT) and angiotensin II type 1 receptor(AGTR1) gene. Correlation between rs2106809 and blood pressure reduction was estimated based on a field trail with 1 831 hypertensive patients using benazepril for 2 weeks. Generalized multifactor dimensionality reduction (GMDR) was used to explore the interactions of rs2106809 and 8 single nucleotide polymorphisms (SNPs) of AGTR1 gene and 3 SNPs of AGT gene. rs2106809 was found to be associated with reduction in systolic blood pressure and pulse pressure in women, as well as pulse pressure reduction in men. T allele carriers presented more blood pressure reduction (1.4, 1.3 and 0.9 mmHg/T allele respectively). Gene-gene interactions involving rs2106809 were found in systolic blood pressure reduction of men, and the response to benazepril of non-sensitive genotypes carriers was 8.2 (95% confidence interval: 6.6-9.7) mmHg, lower than that of sensitive genotypes carriers. rs2106809 might act as an independent influencing factor or component of gene-gene interaction in blood pressure reducing effects of benazepril.

  6. Ortholog-based screening and identification of genes related to intracellular survival.

    Science.gov (United States)

    Yang, Xiaowen; Wang, Jiawei; Bing, Guoxia; Bie, Pengfei; De, Yanyan; Lyu, Yanli; Wu, Qingmin

    2018-04-20

    Bioinformatics and comparative genomics analysis methods were used to predict unknown pathogen genes based on homology with identified or functionally clustered genes. In this study, the genes of common pathogens were analyzed to screen and identify genes associated with intracellular survival through sequence similarity, phylogenetic tree analysis and the λ-Red recombination system test method. The total 38,952 protein-coding genes of common pathogens were divided into 19,775 clusters. As demonstrated through a COG analysis, information storage and processing genes might play an important role intracellular survival. Only 19 clusters were present in facultative intracellular pathogens, and not all were present in extracellular pathogens. Construction of a phylogenetic tree selected 18 of these 19 clusters. Comparisons with the DEG database and previous research revealed that seven other clusters are considered essential gene clusters and that seven other clusters are associated with intracellular survival. Moreover, this study confirmed that clusters screened by orthologs with similar function could be replaced with an approved uvrY gene and its orthologs, and the results revealed that the usg gene is associated with intracellular survival. The study improves the current understanding of intracellular pathogens characteristics and allows further exploration of the intracellular survival-related gene modules in these pathogens. Copyright © 2018. Published by Elsevier B.V.

  7. The dopamine D2 receptor gene, perceived parental support, and adolescent loneliness : longitudinal evidence for gene-environment interactions

    NARCIS (Netherlands)

    van Roekel, Eeske; Goossens, Luc; Scholte, Ron H. J.; Engels, Rutger C. M. E.; Verhagen, Maaike

    2011-01-01

    Background: Loneliness is a common problem in adolescence. Earlier research focused on genes within the serotonin and oxytocin systems, but no studies have examined the role of dopamine-related genes in loneliness. In the present study, we focused on the dopamine D2 receptor gene (DRD2). Methods:

  8. MyGeneFriends: A Social Network Linking Genes, Genetic Diseases, and Researchers.

    Science.gov (United States)

    Allot, Alexis; Chennen, Kirsley; Nevers, Yannis; Poidevin, Laetitia; Kress, Arnaud; Ripp, Raymond; Thompson, Julie Dawn; Poch, Olivier; Lecompte, Odile

    2017-06-16

    The constant and massive increase of biological data offers unprecedented opportunities to decipher the function and evolution of genes and their roles in human diseases. However, the multiplicity of sources and flow of data mean that efficient access to useful information and knowledge production has become a major challenge. This challenge can be addressed by taking inspiration from Web 2.0 and particularly social networks, which are at the forefront of big data exploration and human-data interaction. MyGeneFriends is a Web platform inspired by social networks, devoted to genetic disease analysis, and organized around three types of proactive agents: genes, humans, and genetic diseases. The aim of this study was to improve exploration and exploitation of biological, postgenomic era big data. MyGeneFriends leverages conventions popularized by top social networks (Facebook, LinkedIn, etc), such as networks of friends, profile pages, friendship recommendations, affinity scores, news feeds, content recommendation, and data visualization. MyGeneFriends provides simple and intuitive interactions with data through evaluation and visualization of connections (friendships) between genes, humans, and diseases. The platform suggests new friends and publications and allows agents to follow the activity of their friends. It dynamically personalizes information depending on the user's specific interests and provides an efficient way to share information with collaborators. Furthermore, the user's behavior itself generates new information that constitutes an added value integrated in the network, which can be used to discover new connections between biological agents. We have developed MyGeneFriends, a Web platform leveraging conventions from popular social networks to redefine the relationship between humans and biological big data and improve human processing of biomedical data. MyGeneFriends is available at lbgi.fr/mygenefriends. ©Alexis Allot, Kirsley Chennen, Yannis

  9. Measuring the genetic influence on human life span: gene-environment interaction and sex-specific genetic effects

    DEFF Research Database (Denmark)

    Tan, Qihua; De Benedictis, G; Yashin, Annatoli

    2001-01-01

    New approaches are needed to explore the different ways in which genes affect the human life span. One needs to assess the genetic effects themselves, as well as gene–environment interactions and sex dependency. In this paper, we present a new model that combines both genotypic and demographicinf......New approaches are needed to explore the different ways in which genes affect the human life span. One needs to assess the genetic effects themselves, as well as gene–environment interactions and sex dependency. In this paper, we present a new model that combines both genotypic...

  10. Overexpression of erg1 gene in Trichoderma harzianum CECT 2413: effect on the induction of tomato defence-related genes.

    Science.gov (United States)

    Cardoza, R E; Malmierca, M G; Gutiérrez, S

    2014-09-01

    To investigate the effect of the overexpression of erg1 gene of Trichoderma harzianum CECT 2413 (T34) on the Trichoderma-plant interactions and in the biocontrol ability of this fungus. Transformants of T34 strain overexpressing erg1 gene did not show effect on the ergosterol level, although a drastic decrease in the squalene level was observed in the transformants at 96 h of growth. During interaction with plants, the erg1 overexpression resulted in a reduction of the priming ability of several tomato defence-related genes belonging to the salicylate pathway, and also of the TomLoxA gene, which is related to the jasmonate pathway. Interestingly, other jasmonate-related genes, such as PINI and PINII, were slightly induced. The erg1 overexpressed transformants also showed a reduced ability to colonize tomato roots. The ergosterol biosynthetic pathway might play an important role in regulating Trichoderma-plant interactions, although this role does not seem to be restricted to the final product; instead, other intermediates such as squalene, whose role in the Trichoderma-plant interaction has not been characterized, would also play an important role. The functional analysis of genes involved in the synthesis of ergosterol could provide additional strategies to improve the ability of biocontrol of the Trichoderma strains and their interaction with plants. © 2014 The Society for Applied Microbiology.

  11. A methodology to establish a database to study gene environment interactions for childhood asthma

    Directory of Open Access Journals (Sweden)

    McCormick Jonathan

    2010-12-01

    Full Text Available Abstract Background Gene-environment interactions are likely to explain some of the heterogeneity in childhood asthma. Here, we describe the methodology and experiences in establishing a database for childhood asthma designed to study gene-environment interactions (PAGES - Paediatric Asthma Gene Environment Study. Methods Children with asthma and under the care of a respiratory paediatrician are being recruited from 15 hospitals between 2008 and 2011. An asthma questionnaire is completed and returned by post. At a routine clinic visit saliva is collected for DNA extraction. Detailed phenotyping in a proportion of children includes spirometry, bronchodilator response (BDR, skin prick reactivity, exhaled nitric oxide and salivary cotinine. Dietary and quality of life questionnaires are completed. Data are entered onto a purpose-built database. Results To date 1045 children have been invited to participate and data collected in 501 (48%. The mean age (SD of participants is 8.6 (3.9 years, 57% male. DNA has been collected in 436 children. Spirometry has been obtained in 172 children, mean % predicted (SD FEV1 97% (15 and median (IQR BDR is 5% (2, 9. There were differences in age, socioeconomic status, severity and %FEV1 between the different centres (p≤0.024. Reasons for non-participation included parents not having time to take part, children not attending clinics and, in a small proportion, refusal to take part. Conclusions It is feasible to establish a national database to study gene-environment interactions within an asthmatic paediatric population; there are barriers to participation and some different characteristics in individuals recruited from different centres. Recruitment to our study continues and is anticipated to extend current understanding of asthma heterogeneity.

  12. Newer Gene Editing Technologies toward HIV Gene Therapy

    Directory of Open Access Journals (Sweden)

    Premlata Shankar

    2013-11-01

    Full Text Available Despite the great success of highly active antiretroviral therapy (HAART in ameliorating the course of HIV infection, alternative therapeutic approaches are being pursued because of practical problems associated with life-long therapy. The eradication of HIV in the so-called “Berlin patient” who received a bone marrow transplant from a CCR5-negative donor has rekindled interest in genome engineering strategies to achieve the same effect. Precise gene editing within the cells is now a realistic possibility with recent advances in understanding the DNA repair mechanisms, DNA interaction with transcription factors and bacterial defense mechanisms. Within the past few years, four novel technologies have emerged that can be engineered for recognition of specific DNA target sequences to enable site-specific gene editing: Homing Endonuclease, ZFN, TALEN, and CRISPR/Cas9 system. The most recent CRISPR/Cas9 system uses a short stretch of complementary RNA bound to Cas9 nuclease to recognize and cleave target DNA, as opposed to the previous technologies that use DNA binding motifs of either zinc finger proteins or transcription activator-like effector molecules fused to an endonuclease to mediate sequence-specific DNA cleavage. Unlike RNA interference, which requires the continued presence of effector moieties to maintain gene silencing, the newer technologies allow permanent disruption of the targeted gene after a single treatment. Here, we review the applications, limitations and future prospects of novel gene-editing strategies for use as HIV therapy.

  13. Environment-Gene interaction in common complex diseases: New approaches

    Directory of Open Access Journals (Sweden)

    William A. Toscano, Jr.

    2014-10-01

    Full Text Available Approximately 100,000 different environmental chemicals that are in use as high production volume chemicals confront us in our daily lives. Many of the chemicals we encounter are persistent and have long half-lives in the environment and our bodies. These compounds are referred to as Persistent Organic Pollutants, or POPS. The total environment however is broader than just toxic pollutants. It includes social capital, social economic status, and other factors that are not commonly considered in traditional approaches to studying environment-human interactions. The mechanism of action of environmental agents in altering the human phenotype from health to disease is more complex than once thought. The focus in public health has shifted away from the study of single-gene rare diseases and has given way to the study of multifactorial complex diseases that are common in the population. To understand common complex diseases, we need teams of scientists from different fields working together with common aims. We review some approaches for studying the action of the environment by discussing use-inspired research, and transdisciplinary research approaches. The Genomic era has yielded new tools for study of gene-environment interactions, including genomics, epigenomics, and systems biology. We use environmentally-driven diabetes mellitus type two as an example of environmental epigenomics and disease. The aim of this review is to start the conversation of how the application of advances in biomedical science can be used to advance public health.

  14. Weighted functional linear regression models for gene-based association analysis.

    Science.gov (United States)

    Belonogova, Nadezhda M; Svishcheva, Gulnara R; Wilson, James F; Campbell, Harry; Axenovich, Tatiana I

    2018-01-01

    Functional linear regression models are effectively used in gene-based association analysis of complex traits. These models combine information about individual genetic variants, taking into account their positions and reducing the influence of noise and/or observation errors. To increase the power of methods, where several differently informative components are combined, weights are introduced to give the advantage to more informative components. Allele-specific weights have been introduced to collapsing and kernel-based approaches to gene-based association analysis. Here we have for the first time introduced weights to functional linear regression models adapted for both independent and family samples. Using data simulated on the basis of GAW17 genotypes and weights defined by allele frequencies via the beta distribution, we demonstrated that type I errors correspond to declared values and that increasing the weights of causal variants allows the power of functional linear models to be increased. We applied the new method to real data on blood pressure from the ORCADES sample. Five of the six known genes with P models. Moreover, we found an association between diastolic blood pressure and the VMP1 gene (P = 8.18×10-6), when we used a weighted functional model. For this gene, the unweighted functional and weighted kernel-based models had P = 0.004 and 0.006, respectively. The new method has been implemented in the program package FREGAT, which is freely available at https://cran.r-project.org/web/packages/FREGAT/index.html.

  15. Interactions Between Variation in Candidate Genes and Environmental Factors in the Etiology of Schizophrenia and Bipolar Disorder: a Systematic Review.

    Science.gov (United States)

    Misiak, Błażej; Stramecki, Filip; Gawęda, Łukasz; Prochwicz, Katarzyna; Sąsiadek, Maria M; Moustafa, Ahmed A; Frydecka, Dorota

    2017-08-18

    Schizophrenia and bipolar disorder (BD) are complex and multidimensional disorders with high heritability rates. The contribution of genetic factors to the etiology of these disorders is increasingly being recognized as the action of multiple risk variants with small effect sizes, which might explain only a minor part of susceptibility. On the other site, numerous environmental factors have been found to play an important role in their causality. Therefore, in recent years, several studies focused on gene × environment interactions that are believed to bridge the gap between genetic underpinnings and environmental insults. In this article, we performed a systematic review of studies investigating gene × environment interactions in BD and schizophrenia spectrum phenotypes. In the majority of studies from this field, interacting effects of variation in genes encoding catechol-O-methyltransferase (COMT), brain-derived neurotrophic factor (BDNF), and FK506-binding protein 5 (FKBP5) have been explored. Almost consistently, these studies revealed that polymorphisms in COMT, BDNF, and FKBP5 genes might interact with early life stress and cannabis abuse or dependence, influencing various outcomes of schizophrenia spectrum disorders and BD. Other interactions still require further replication in larger clinical and non-clinical samples. In addition, future studies should address the direction of causality and potential mechanisms of the relationship between gene × environment interactions and various categories of outcomes in schizophrenia and BD.

  16. Melanopsin gene variations interact with season to predict sleep onset and chronotype.

    Science.gov (United States)

    Roecklein, Kathryn A; Wong, Patricia M; Franzen, Peter L; Hasler, Brant P; Wood-Vasey, W Michael; Nimgaonkar, Vishwajit L; Miller, Megan A; Kepreos, Kyle M; Ferrell, Robert E; Manuck, Stephen B

    2012-10-01

    The human melanopsin gene has been reported to mediate risk for seasonal affective disorder (SAD), which is hypothesized to be caused by decreased photic input during winter when light levels fall below threshold, resulting in differences in circadian phase and/or sleep. However, it is unclear if melanopsin increases risk of SAD by causing differences in sleep or circadian phase, or if those differences are symptoms of the mood disorder. To determine if melanopsin sequence variations are associated with differences in sleep-wake behavior among those not suffering from a mood disorder, the authors tested associations between melanopsin gene polymorphisms and self-reported sleep timing (sleep onset and wake time) in a community sample (N = 234) of non-Hispanic Caucasian participants (age 30-54 yrs) with no history of psychological, neurological, or sleep disorders. The authors also tested the effect of melanopsin variations on differences in preferred sleep and activity timing (i.e., chronotype), which may reflect differences in circadian phase, sleep homeostasis, or both. Daylength on the day of assessment was measured and included in analyses. DNA samples were genotyped for melanopsin gene polymorphisms using fluorescence polarization. P10L genotype interacted with daylength to predict self-reported sleep onset (interaction p sleep onset among those with the TT genotype was later in the day when individuals were assessed on longer days and earlier in the day on shorter days, whereas individuals in the other genotype groups (i.e., CC and CT) did not show this interaction effect. P10L genotype also interacted in an analogous way with daylength to predict self-reported morningness (interaction p sleep onset and chronotype as a function of daylength, whereas other genotypes at P10L do not seem to have effects that vary by daylength. A better understanding of how melanopsin confers heightened responsivity to daylength may improve our understanding of a broad range of

  17. Actionable gene-based classification toward precision medicine in gastric cancer

    Directory of Open Access Journals (Sweden)

    Hiroshi Ichikawa

    2017-10-01

    Full Text Available Abstract Background Intertumoral heterogeneity represents a significant hurdle to identifying optimized targeted therapies in gastric cancer (GC. To realize precision medicine for GC patients, an actionable gene alteration-based molecular classification that directly associates GCs with targeted therapies is needed. Methods A total of 207 Japanese patients with GC were included in this study. Formalin-fixed, paraffin-embedded (FFPE tumor tissues were obtained from surgical or biopsy specimens and were subjected to DNA extraction. We generated comprehensive genomic profiling data using a 435-gene panel including 69 actionable genes paired with US Food and Drug Administration-approved targeted therapies, and the evaluation of Epstein-Barr virus (EBV infection and microsatellite instability (MSI status. Results Comprehensive genomic sequencing detected at least one alteration of 435 cancer-related genes in 194 GCs (93.7% and of 69 actionable genes in 141 GCs (68.1%. We classified the 207 GCs into four The Cancer Genome Atlas (TCGA subtypes using the genomic profiling data; EBV (N = 9, MSI (N = 17, chromosomal instability (N = 119, and genomically stable subtype (N = 62. Actionable gene alterations were not specific and were widely observed throughout all TCGA subtypes. To discover a novel classification which more precisely selects candidates for targeted therapies, 207 GCs were classified using hypermutated phenotype and the mutation profile of 69 actionable genes. We identified a hypermutated group (N = 32, while the others (N = 175 were sub-divided into six clusters including five with actionable gene alterations: ERBB2 (N = 25, CDKN2A, and CDKN2B (N = 10, KRAS (N = 10, BRCA2 (N = 9, and ATM cluster (N = 12. The clinical utility of this classification was demonstrated by a case of unresectable GC with a remarkable response to anti-HER2 therapy in the ERBB2 cluster. Conclusions This actionable gene-based

  18. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    Science.gov (United States)

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  19. Empirical study of supervised gene screening

    Directory of Open Access Journals (Sweden)

    Ma Shuangge

    2006-12-01

    Full Text Available Abstract Background Microarray studies provide a way of linking variations of phenotypes with their genetic causations. Constructing predictive models using high dimensional microarray measurements usually consists of three steps: (1 unsupervised gene screening; (2 supervised gene screening; and (3 statistical model building. Supervised gene screening based on marginal gene ranking is commonly used to reduce the number of genes in the model building. Various simple statistics, such as t-statistic or signal to noise ratio, have been used to rank genes in the supervised screening. Despite of its extensive usage, statistical study of supervised gene screening remains scarce. Our study is partly motivated by the differences in gene discovery results caused by using different supervised gene screening methods. Results We investigate concordance and reproducibility of supervised gene screening based on eight commonly used marginal statistics. Concordance is assessed by the relative fractions of overlaps between top ranked genes screened using different marginal statistics. We propose a Bootstrap Reproducibility Index, which measures reproducibility of individual genes under the supervised screening. Empirical studies are based on four public microarray data. We consider the cases where the top 20%, 40% and 60% genes are screened. Conclusion From a gene discovery point of view, the effect of supervised gene screening based on different marginal statistics cannot be ignored. Empirical studies show that (1 genes passed different supervised screenings may be considerably different; (2 concordance may vary, depending on the underlying data structure and percentage of selected genes; (3 evaluated with the Bootstrap Reproducibility Index, genes passed supervised screenings are only moderately reproducible; and (4 concordance cannot be improved by supervised screening based on reproducibility.

  20. Meta-analysis of Cancer Gene Profiling Data.

    Science.gov (United States)

    Roy, Janine; Winter, Christof; Schroeder, Michael

    2016-01-01

    The simultaneous measurement of thousands of genes gives the opportunity to personalize and improve cancer therapy. In addition, the integration of meta-data such as protein-protein interaction (PPI) information into the analyses helps in the identification and prioritization of genes from these screens. Here, we describe a computational approach that identifies genes prognostic for outcome by combining gene profiling data from any source with a network of known relationships between genes.

  1. Prioritizing genes associated with prostate cancer development

    International Nuclear Information System (INIS)

    Gorlov, Ivan P; Logothetis, Christopher J; Sircar, Kanishka; Zhao, Hongya; Maity, Sankar N; Navone, Nora M; Gorlova, Olga Y; Troncoso, Patricia; Pettaway, Curtis A; Byun, Jin Young

    2010-01-01

    The genetic control of prostate cancer development is poorly understood. Large numbers of gene-expression datasets on different aspects of prostate tumorigenesis are available. We used these data to identify and prioritize candidate genes associated with the development of prostate cancer and bone metastases. Our working hypothesis was that combining meta-analyses on different but overlapping steps of prostate tumorigenesis will improve identification of genes associated with prostate cancer development. A Z score-based meta-analysis of gene-expression data was used to identify candidate genes associated with prostate cancer development. To put together different datasets, we conducted a meta-analysis on 3 levels that follow the natural history of prostate cancer development. For experimental verification of candidates, we used in silico validation as well as in-house gene-expression data. Genes with experimental evidence of an association with prostate cancer development were overrepresented among our top candidates. The meta-analysis also identified a considerable number of novel candidate genes with no published evidence of a role in prostate cancer development. Functional annotation identified cytoskeleton, cell adhesion, extracellular matrix, and cell motility as the top functions associated with prostate cancer development. We identified 10 genes--CDC2, CCNA2, IGF1, EGR1, SRF, CTGF, CCL2, CAV1, SMAD4, and AURKA--that form hubs of the interaction network and therefore are likely to be primary drivers of prostate cancer development. By using this large 3-level meta-analysis of the gene-expression data to identify candidate genes associated with prostate cancer development, we have generated a list of candidate genes that may be a useful resource for researchers studying the molecular mechanisms underlying prostate cancer development

  2. Study of miRNA Based Gene Regulation, Involved in Solid Cancer, by the Assistance of Argonaute Protein

    Directory of Open Access Journals (Sweden)

    Surya Narayan Rath

    2016-09-01

    Full Text Available Solid tumor is generally observed in tissues of epithelial or endothelial cells of lung, breast, prostate, pancreases, colorectal, stomach, and bladder, where several genes transcription is regulated by the microRNAs (miRNAs. Argonaute (AGO protein is a family of protein which assists in miRNAs to bind with mRNAs of the target genes. Hence, study of the binding mechanism between AGO protein and miRNAs, and also with miRNAs-mRNAs duplex is crucial for understanding the RNA silencing mechanism. In the current work, 64 genes and 23 miRNAs have been selected from literatures, whose deregulation is well established in seven types of solid cancer like lung, breast, prostate, pancreases, colorectal, stomach, and bladder cancer. In silico study reveals, miRNAs namely, miR-106a, miR-21, and miR-29b-2 have a strong binding affinity towards PTEN, TGFBR2, and VEGFA genes, respectively, suggested as important factors in RNA silencing mechanism. Furthermore, interaction between AGO protein (PDB ID-3F73, chain A with selected miRNAs and with miRNAs-mRNAs duplex were studied computationally to understand their binding at molecular level. The residual interaction and hydrogen bonding are inspected in Discovery Studio 3.5 suites. The current investigation throws light on understanding miRNAs based gene silencing mechanism in solid cancer.

  3. Gene regulation is governed by a core network in hepatocellular carcinoma.

    Science.gov (United States)

    Gu, Zuguang; Zhang, Chenyu; Wang, Jin

    2012-05-01

    Hepatocellular carcinoma (HCC) is one of the most lethal cancers worldwide, and the mechanisms that lead to the disease are still relatively unclear. However, with the development of high-throughput technologies it is possible to gain a systematic view of biological systems to enhance the understanding of the roles of genes associated with HCC. Thus, analysis of the mechanism of molecule interactions in the context of gene regulatory networks can reveal specific sub-networks that lead to the development of HCC. In this study, we aimed to identify the most important gene regulations that are dysfunctional in HCC generation. Our method for constructing gene regulatory network is based on predicted target interactions, experimentally-supported interactions, and co-expression model. Regulators in the network included both transcription factors and microRNAs to provide a complete view of gene regulation. Analysis of gene regulatory network revealed that gene regulation in HCC is highly modular, in which different sets of regulators take charge of specific biological processes. We found that microRNAs mainly control biological functions related to mitochondria and oxidative reduction, while transcription factors control immune responses, extracellular activity and the cell cycle. On the higher level of gene regulation, there exists a core network that organizes regulations between different modules and maintains the robustness of the whole network. There is direct experimental evidence for most of the regulators in the core gene regulatory network relating to HCC. We infer it is the central controller of gene regulation. Finally, we explored the influence of the core gene regulatory network on biological pathways. Our analysis provides insights into the mechanism of transcriptional and post-transcriptional control in HCC. In particular, we highlight the importance of the core gene regulatory network; we propose that it is highly related to HCC and we believe further

  4. Systems-level analysis of risk genes reveals the modular nature of schizophrenia.

    Science.gov (United States)

    Liu, Jiewei; Li, Ming; Luo, Xiong-Jian; Su, Bing

    2018-05-19

    Schizophrenia (SCZ) is a complex mental disorder with high heritability. Genetic studies (especially recent genome-wide association studies) have identified many risk genes for schizophrenia. However, the physical interactions among the proteins encoded by schizophrenia risk genes remain elusive and it is not known whether the identified risk genes converge on common molecular networks or pathways. Here we systematically investigated the network characteristics of schizophrenia risk genes using the high-confidence protein-protein interactions (PPI) from the human interactome. We found that schizophrenia risk genes encode a densely interconnected PPI network (P = 4.15 × 10 -31 ). Compared with the background genes, the schizophrenia risk genes in the interactome have significantly higher degree (P = 5.39 × 10 -11 ), closeness centrality (P = 7.56 × 10 -11 ), betweeness centrality (P = 1.29 × 10 -11 ), clustering coefficient (P = 2.22 × 10 -2 ), and shorter average shortest path length (P = 7.56 × 10 -11 ). Based on the densely interconnected PPI network, we identified 48 hub genes and 4 modules formed by highly interconnected schizophrenia genes. We showed that the proteins encoded by schizophrenia hub genes have significantly more direct physical interactions. Gene ontology (GO) analysis revealed that cell adhesion, cell cycle, immune system response, and GABR-receptor complex categories were enriched in the modules formed by highly interconnected schizophrenia risk genes. Our study reveals that schizophrenia risk genes encode a densely interconnected molecular network and demonstrates the modular nature of schizophrenia. Copyright © 2018 Elsevier B.V. All rights reserved.

  5. Using imputed genotype data in the joint score tests for genetic association and gene-environment interactions in case-control studies.

    Science.gov (United States)

    Song, Minsun; Wheeler, William; Caporaso, Neil E; Landi, Maria Teresa; Chatterjee, Nilanjan

    2018-03-01

    Genome-wide association studies (GWAS) are now routinely imputed for untyped single nucleotide polymorphisms (SNPs) based on various powerful statistical algorithms for imputation trained on reference datasets. The use of predicted allele counts for imputed SNPs as the dosage variable is known to produce valid score test for genetic association. In this paper, we investigate how to best handle imputed SNPs in various modern complex tests for genetic associations incorporating gene-environment interactions. We focus on case-control association studies where inference for an underlying logistic regression model can be performed using alternative methods that rely on varying degree on an assumption of gene-environment independence in the underlying population. As increasingly large-scale GWAS are being performed through consortia effort where it is preferable to share only summary-level information across studies, we also describe simple mechanisms for implementing score tests based on standard meta-analysis of "one-step" maximum-likelihood estimates across studies. Applications of the methods in simulation studies and a dataset from GWAS of lung cancer illustrate ability of the proposed methods to maintain type-I error rates for the underlying testing procedures. For analysis of imputed SNPs, similar to typed SNPs, the retrospective methods can lead to considerable efficiency gain for modeling of gene-environment interactions under the assumption of gene-environment independence. Methods are made available for public use through CGEN R software package. © 2017 WILEY PERIODICALS, INC.

  6. Prediction of regulatory gene pairs using dynamic time warping and gene ontology.

    Science.gov (United States)

    Yang, Andy C; Hsu, Hui-Huang; Lu, Ming-Da; Tseng, Vincent S; Shih, Timothy K

    2014-01-01

    Selecting informative genes is the most important task for data analysis on microarray gene expression data. In this work, we aim at identifying regulatory gene pairs from microarray gene expression data. However, microarray data often contain multiple missing expression values. Missing value imputation is thus needed before further processing for regulatory gene pairs becomes possible. We develop a novel approach to first impute missing values in microarray time series data by combining k-Nearest Neighbour (KNN), Dynamic Time Warping (DTW) and Gene Ontology (GO). After missing values are imputed, we then perform gene regulation prediction based on our proposed DTW-GO distance measurement of gene pairs. Experimental results show that our approach is more accurate when compared with existing missing value imputation methods on real microarray data sets. Furthermore, our approach can also discover more regulatory gene pairs that are known in the literature than other methods.

  7. Banana ethylene response factors are involved in fruit ripening through their interactions with ethylene biosynthesis genes.

    Science.gov (United States)

    Xiao, Yun-yi; Chen, Jian-ye; Kuang, Jiang-fei; Shan, Wei; Xie, Hui; Jiang, Yue-ming; Lu, Wang-jin

    2013-05-01

    The involvement of ethylene response factor (ERF) transcription factor (TF) in the transcriptional regulation of ethylene biosynthesis genes during fruit ripening remains largely unclear. In this study, 15 ERF genes, designated as MaERF1-MaERF15, were isolated and characterized from banana fruit. These MaERFs were classified into seven of the 12 known ERF families. Subcellular localization showed that MaERF proteins of five different subfamilies preferentially localized to the nucleus. The 15 MaERF genes displayed differential expression patterns and levels in peel and pulp of banana fruit, in association with four different ripening treatments caused by natural, ethylene-induced, 1-methylcyclopropene (1-MCP)-delayed, and combined 1-MCP and ethylene treatments. MaERF9 was upregulated while MaERF11 was downregulated in peel and pulp of banana fruit during ripening or after treatment with ethylene. Furthermore, yeast-one hybrid (Y1H) and transient expression assays showed that the potential repressor MaERF11 bound to MaACS1 and MaACO1 promoters to suppress their activities and that MaERF9 activated MaACO1 promoter activity. Interestingly, protein-protein interaction analysis revealed that MaERF9 and -11 physically interacted with MaACO1. Taken together, these results suggest that MaERFs are involved in banana fruit ripening via transcriptional regulation of or interaction with ethylene biosynthesis genes.

  8. Identification of T1D susceptibility genes within the MHC region by combining protein interaction networks and SNP genotyping data

    DEFF Research Database (Denmark)

    Brorsson, C.; Hansen, Niclas Tue; Hansen, Kasper Lage

    2009-01-01

    genes. We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein-protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC......To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1...... region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein...

  9. Using rule-based machine learning for candidate disease gene prioritization and sample classification of cancer gene expression data.

    Science.gov (United States)

    Glaab, Enrico; Bacardit, Jaume; Garibaldi, Jonathan M; Krasnogor, Natalio

    2012-01-01

    Microarray data analysis has been shown to provide an effective tool for studying cancer and genetic diseases. Although classical machine learning techniques have successfully been applied to find informative genes and to predict class labels for new samples, common restrictions of microarray analysis such as small sample sizes, a large attribute space and high noise levels still limit its scientific and clinical applications. Increasing the interpretability of prediction models while retaining a high accuracy would help to exploit the information content in microarray data more effectively. For this purpose, we evaluate our rule-based evolutionary machine learning systems, BioHEL and GAssist, on three public microarray cancer datasets, obtaining simple rule-based models for sample classification. A comparison with other benchmark microarray sample classifiers based on three diverse feature selection algorithms suggests that these evolutionary learning techniques can compete with state-of-the-art methods like support vector machines. The obtained models reach accuracies above 90% in two-level external cross-validation, with the added value of facilitating interpretation by using only combinations of simple if-then-else rules. As a further benefit, a literature mining analysis reveals that prioritizations of informative genes extracted from BioHEL's classification rule sets can outperform gene rankings obtained from a conventional ensemble feature selection in terms of the pointwise mutual information between relevant disease terms and the standardized names of top-ranked genes.

  10. Dichotomy of major genes and polygenes

    International Nuclear Information System (INIS)

    Jain, S.

    1989-01-01

    In order to facilitate domestication and breeding of new or underexploited crop species, the genetic basis of many traits must be critically investigated, and both naturally occurring and induced mutations should be utilized. Classically, most breeding procedures have invoked the dichotomy of major genes versus polygenes (or discrete versus continuously varying traits) which is briefly reviewed here from several viewpoints. Clearly, the evidence for two distinct classes of genes (or gene effects on phenotype) and traits is largely a product of different forms of genetic analyses and their primary objectives as well as of researchers' expectations. Superimposed on the simplest Mendelian ratios and genome maps are numerous sources of molecular variation and gene expression at many levels of phenotypic description. Many attempts to delineate developmental pathways and to identify genes controlling discrete vs. quantitative phenotypic variation have resulted in emphasis on multigenic models with specific gene effects at mappable loci but nonetheless modified by small effects. Thus, quantitative genetic variation may arise from multi-genic and multi-allelic systems of both structural and regulatory gene action and gene interactions which, from an empirical breeding perspective, might be adequately described by the biometrical and evolutionary models. Polygenic analyses were conceptually based on genetic parameters in these models (as caricatures of reality) but efforts to modify or reject them by identifying and mapping sources of phenotypic variation through newer genetic methods are likely to enrich and not displace biometrical methods. Domestication programmes, in particular, should employ the entire array of genetic discoveries and methodologies. (author). 71 refs, 1 fig., 1 tab

  11. Association between NINJ2 gene polymorphisms and ischemic stroke: a family-based case-control study.

    Science.gov (United States)

    Zhu, Yanping; Liu, Kuo; Tang, Xun; Wang, Jinwei; Yu, Zhiping; Wu, Yiqun; Chen, Dafang; Wang, Xueyin; Fang, Kai; Li, Na; Huang, Shaoping; Hu, Yonghua

    2014-11-01

    Novel susceptibility genes related to ischemic stroke (IS) are proposed in recent literatures. Population-based replicate studies would cause false positive results due to population stratification. 229 recruit IS patients and their 229 non-IS siblings were used in this study to avoid population stratification. The family-based study was conducted in Beijing from June 2005 to June 2012. Association between SNPs and IS was found in the sibship discordant tests, and the conditional logistic regression was performed to identify effect size and explore gene-environment interactions. Significant allelic association was identified between NINJ2 gene rs11833579 (P = 0.008), protein kinase C η gene rs2230501 (P = 0.039) and IS. The AA genotype of rs11833579 increased 1.51-fold risk (95% CI 1.04-3.46; P = 0.043) of IS, and it conferred susceptibility to IS only in a dominant model (OR 2.69; 95% CI 1.06-6.78; P = 0.036]. Risk of IS was higher (HR 3.58; 95% CI 1.54-8.31; P = 0.003) especially when the carriers of rs11833579 AA genotype were smokers. The present study suggests A allele of rs11833579 may play a role in mediating susceptibility to IS and it may increase the risk of IS together with smoking.

  12. The Dopamine D2 Receptor Gene, Perceived Parental Support, and Adolescent Loneliness: Longitudinal Evidence for Gene-Environment Interactions

    Science.gov (United States)

    van Roekel, Eeske; Goossens, Luc; Scholte, Ron H. J.; Engels, Rutger C. M. E.; Verhagen, Maaike

    2011-01-01

    Background: Loneliness is a common problem in adolescence. Earlier research focused on genes within the serotonin and oxytocin systems, but no studies have examined the role of dopamine-related genes in loneliness. In the present study, we focused on the dopamine D2 receptor gene (DRD2). Methods: Associations among the DRD2, sex, parental support,…

  13. Bayesian median regression for temporal gene expression data

    Science.gov (United States)

    Yu, Keming; Vinciotti, Veronica; Liu, Xiaohui; 't Hoen, Peter A. C.

    2007-09-01

    Most of the existing methods for the identification of biologically interesting genes in a temporal expression profiling dataset do not fully exploit the temporal ordering in the dataset and are based on normality assumptions for the gene expression. In this paper, we introduce a Bayesian median regression model to detect genes whose temporal profile is significantly different across a number of biological conditions. The regression model is defined by a polynomial function where both time and condition effects as well as interactions between the two are included. MCMC-based inference returns the posterior distribution of the polynomial coefficients. From this a simple Bayes factor test is proposed to test for significance. The estimation of the median rather than the mean, and within a Bayesian framework, increases the robustness of the method compared to a Hotelling T2-test previously suggested. This is shown on simulated data and on muscular dystrophy gene expression data.

  14. RNAi-Based Identification of Gene-Specific Nuclear Cofactor Networks Regulating Interleukin-1 Target Genes

    Directory of Open Access Journals (Sweden)

    Johanna Meier-Soelch

    2018-04-01

    Full Text Available The potent proinflammatory cytokine interleukin (IL-1 triggers gene expression through the NF-κB signaling pathway. Here, we investigated the cofactor requirements of strongly regulated IL-1 target genes whose expression is impaired in p65 NF-κB-deficient murine embryonic fibroblasts. By two independent small-hairpin (shRNA screens, we examined 170 genes annotated to encode nuclear cofactors for their role in Cxcl2 mRNA expression and identified 22 factors that modulated basal or IL-1-inducible Cxcl2 levels. The functions of 16 of these factors were validated for Cxcl2 and further analyzed for their role in regulation of 10 additional IL-1 target genes by RT-qPCR. These data reveal that each inducible gene has its own (quantitative requirement of cofactors to maintain basal levels and to respond to IL-1. Twelve factors (Epc1, H2afz, Kdm2b, Kdm6a, Mbd3, Mta2, Phf21a, Ruvbl1, Sin3b, Suv420h1, Taf1, and Ube3a have not been previously implicated in inflammatory cytokine functions. Bioinformatics analysis indicates that they are components of complex nuclear protein networks that regulate chromatin functions and gene transcription. Collectively, these data suggest that downstream from the essential NF-κB signal each cytokine-inducible target gene has further subtle requirements for individual sets of nuclear cofactors that shape its transcriptional activation profile.

  15. Genome-Wide Gene-Environment Study Identifies Glutamate Receptor Gene GRIN2A as a Parkinson's Disease Modifier Gene via Interaction with Coffee

    Science.gov (United States)

    Hamza, Taye H.; Chen, Honglei; Hill-Burns, Erin M.; Rhodes, Shannon L.; Montimurro, Jennifer; Kay, Denise M.; Tenesa, Albert; Kusel, Victoria I.; Sheehan, Patricia; Eaaswarkhanth, Muthukrishnan; Yearout, Dora; Samii, Ali; Roberts, John W.; Agarwal, Pinky; Bordelon, Yvette; Park, Yikyung; Wang, Liyong; Gao, Jianjun; Vance, Jeffery M.; Kendler, Kenneth S.; Bacanu, Silviu-Alin; Scott, William K.; Ritz, Beate; Nutt, John; Factor, Stewart A.; Zabetian, Cyrus P.; Payami, Haydeh

    2011-01-01

    Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS) in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication) were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P2df = 10−6, GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10−7) but not in light coffee-drinkers. The a priori Replication hypothesis that “Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers” was confirmed: ORReplication = 0.59, PReplication = 10−3; ORPooled = 0.51, PPooled = 7×10−8. Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10−3), whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10−13). Imputation revealed a block of SNPs that achieved P2dfcoffee-drinkers. This study is proof of concept that inclusion of environmental factors can help identify genes that

  16. Genome-wide gene-environment study identifies glutamate receptor gene GRIN2A as a Parkinson's disease modifier gene via interaction with coffee.

    Science.gov (United States)

    Hamza, Taye H; Chen, Honglei; Hill-Burns, Erin M; Rhodes, Shannon L; Montimurro, Jennifer; Kay, Denise M; Tenesa, Albert; Kusel, Victoria I; Sheehan, Patricia; Eaaswarkhanth, Muthukrishnan; Yearout, Dora; Samii, Ali; Roberts, John W; Agarwal, Pinky; Bordelon, Yvette; Park, Yikyung; Wang, Liyong; Gao, Jianjun; Vance, Jeffery M; Kendler, Kenneth S; Bacanu, Silviu-Alin; Scott, William K; Ritz, Beate; Nutt, John; Factor, Stewart A; Zabetian, Cyrus P; Payami, Haydeh

    2011-08-01

    Our aim was to identify genes that influence the inverse association of coffee with the risk of developing Parkinson's disease (PD). We used genome-wide genotype data and lifetime caffeinated-coffee-consumption data on 1,458 persons with PD and 931 without PD from the NeuroGenetics Research Consortium (NGRC), and we performed a genome-wide association and interaction study (GWAIS), testing each SNP's main-effect plus its interaction with coffee, adjusting for sex, age, and two principal components. We then stratified subjects as heavy or light coffee-drinkers and performed genome-wide association study (GWAS) in each group. We replicated the most significant SNP. Finally, we imputed the NGRC dataset, increasing genomic coverage to examine the region of interest in detail. The primary analyses (GWAIS, GWAS, Replication) were performed using genotyped data. In GWAIS, the most significant signal came from rs4998386 and the neighboring SNPs in GRIN2A. GRIN2A encodes an NMDA-glutamate-receptor subunit and regulates excitatory neurotransmission in the brain. Achieving P(2df) = 10(-6), GRIN2A surpassed all known PD susceptibility genes in significance in the GWAIS. In stratified GWAS, the GRIN2A signal was present in heavy coffee-drinkers (OR = 0.43; P = 6×10(-7)) but not in light coffee-drinkers. The a priori Replication hypothesis that "Among heavy coffee-drinkers, rs4998386_T carriers have lower PD risk than rs4998386_CC carriers" was confirmed: OR(Replication) = 0.59, P(Replication) = 10(-3); OR(Pooled) = 0.51, P(Pooled) = 7×10(-8). Compared to light coffee-drinkers with rs4998386_CC genotype, heavy coffee-drinkers with rs4998386_CC genotype had 18% lower risk (P = 3×10(-3)), whereas heavy coffee-drinkers with rs4998386_TC genotype had 59% lower risk (P = 6×10(-13)). Imputation revealed a block of SNPs that achieved P(2df)coffee-drinkers. This study is proof of concept that inclusion of environmental factors can help identify

  17. Structuring osteosarcoma knowledge: an osteosarcoma-gene association database based on literature mining and manual annotation.

    Science.gov (United States)

    Poos, Kathrin; Smida, Jan; Nathrath, Michaela; Maugg, Doris; Baumhoer, Daniel; Neumann, Anna; Korsching, Eberhard

    2014-01-01

    Osteosarcoma (OS) is the most common primary bone cancer exhibiting high genomic instability. This genomic instability affects multiple genes and microRNAs to a varying extent depending on patient and tumor subtype. Massive research is ongoing to identify genes including their gene products and microRNAs that correlate with disease progression and might be used as biomarkers for OS. However, the genomic complexity hampers the identification of reliable biomarkers. Up to now, clinico-pathological factors are the key determinants to guide prognosis and therapeutic treatments. Each day, new studies about OS are published and complicate the acquisition of information to support biomarker discovery and therapeutic improvements. Thus, it is necessary to provide a structured and annotated view on the current OS knowledge that is quick and easily accessible to researchers of the field. Therefore, we developed a publicly available database and Web interface that serves as resource for OS-associated genes and microRNAs. Genes and microRNAs were collected using an automated dictionary-based gene recognition procedure followed by manual review and annotation by experts of the field. In total, 911 genes and 81 microRNAs related to 1331 PubMed abstracts were collected (last update: 29 October 2013). Users can evaluate genes and microRNAs according to their potential prognostic and therapeutic impact, the experimental procedures, the sample types, the biological contexts and microRNA target gene interactions. Additionally, a pathway enrichment analysis of the collected genes highlights different aspects of OS progression. OS requires pathways commonly deregulated in cancer but also features OS-specific alterations like deregulated osteoclast differentiation. To our knowledge, this is the first effort of an OS database containing manual reviewed and annotated up-to-date OS knowledge. It might be a useful resource especially for the bone tumor research community, as specific

  18. Genetic susceptibility loci, environmental exposures, and Parkinson's disease: a case-control study of gene-environment interactions.

    Science.gov (United States)

    Chung, Sun Ju; Armasu, Sebastian M; Anderson, Kari J; Biernacka, Joanna M; Lesnick, Timothy G; Rider, David N; Cunningham, Julie M; Ahlskog, J Eric; Frigerio, Roberta; Maraganore, Demetrius M

    2013-06-01

    Prior studies causally linked mutations in SNCA, MAPT, and LRRK2 genes with familial Parkinsonism. Genome-wide association studies have demonstrated association of single nucleotide polymorphisms (SNPs) in those three genes with sporadic Parkinson's disease (PD) susceptibility worldwide. Here we investigated the interactions between SNPs in those three susceptibility genes and environmental exposures (pesticides application, tobacco smoking, coffee drinking, and alcohol drinking) also associated with PD susceptibility. Pairwise interactions between environmental exposures and 18 variants (16 SNPs and two variable number tandem repeats, or "VNTRs") in SNCA, MAPT and LRRK2, were investigated using data from 1098 PD cases from the upper Midwest, USA and 1098 matched controls. Environmental exposures were assessed using a validated telephone interview script. Five pairwise interactions had uncorrected P-values coffee drinking × MAPT H1/H2 haplotype or MAPT rs16940806, and alcohol drinking × MAPT rs2435211. None of these interactions remained significant after Bonferroni correction. Secondary analyses in strata defined by type of control (sibling or unrelated), sex, or age at onset of the case also did not identify significant interactions after Bonferroni correction. This study documented limited pairwise interactions between established genetic and environmental risk factors for PD; however, the associations were not significant after correction for multiple testing. Copyright © 2013 Elsevier Ltd. All rights reserved.

  19. Sequence-based model of gap gene regulatory network.

    Science.gov (United States)

    Kozlov, Konstantin; Gursky, Vitaly; Kulakovskiy, Ivan; Samsonova, Maria

    2014-01-01

    The detailed analysis of transcriptional regulation is crucially important for understanding biological processes. The gap gene network in Drosophila attracts large interest among researches studying mechanisms of transcriptional regulation. It implements the most upstream regulatory layer of the segmentation gene network. The knowledge of molecular mechanisms involved in gap gene regulation is far less complete than that of genetics of the system. Mathematical modeling goes beyond insights gained by genetics and molecular approaches. It allows us to reconstruct wild-type gene expression patterns in silico, infer underlying regulatory mechanism and prove its sufficiency. We developed a new model that provides a dynamical description of gap gene regulatory systems, using detailed DNA-based information, as well as spatial transcription factor concentration data at varying time points. We showed that this model correctly reproduces gap gene expression patterns in wild type embryos and is able to predict gap expression patterns in Kr mutants and four reporter constructs. We used four-fold cross validation test and fitting to random dataset to validate the model and proof its sufficiency in data description. The identifiability analysis showed that most model parameters are well identifiable. We reconstructed the gap gene network topology and studied the impact of individual transcription factor binding sites on the model output. We measured this impact by calculating the site regulatory weight as a normalized difference between the residual sum of squares error for the set of all annotated sites and for the set with the site of interest excluded. The reconstructed topology of the gap gene network is in agreement with previous modeling results and data from literature. We showed that 1) the regulatory weights of transcription factor binding sites show very weak correlation with their PWM score; 2) sites with low regulatory weight are important for the model output; 3

  20. Genes with minimal phylogenetic information are problematic for coalescent analyses when gene tree estimation is biased.

    Science.gov (United States)

    Xi, Zhenxiang; Liu, Liang; Davis, Charles C

    2015-11-01

    The development and application of coalescent methods are undergoing rapid changes. One little explored area that bears on the application of gene-tree-based coalescent methods to species tree estimation is gene informativeness. Here, we investigate the accuracy of these coalescent methods when genes have minimal phylogenetic information, including the implementation of the multilocus bootstrap approach. Using simulated DNA sequences, we demonstrate that genes with minimal phylogenetic information can produce unreliable gene trees (i.e., high error in gene tree estimation), which may in turn reduce the accuracy of species tree estimation using gene-tree-based coalescent methods. We demonstrate that this problem can be alleviated by sampling more genes, as is commonly done in large-scale phylogenomic analyses. This applies even when these genes are minimally informative. If gene tree estimation is biased, however, gene-tree-based coalescent analyses will produce inconsistent results, which cannot be remedied by increasing the number of genes. In this case, it is not the gene-tree-based coalescent methods that are flawed, but rather the input data (i.e., estimated gene trees). Along these lines, the commonly used program PhyML has a tendency to infer one particular bifurcating topology even though it is best represented as a polytomy. We additionally corroborate these findings by analyzing the 183-locus mammal data set assembled by McCormack et al. (2012) using ultra-conserved elements (UCEs) and flanking DNA. Lastly, we demonstrate that when employing the multilocus bootstrap approach on this 183-locus data set, there is no strong conflict between species trees estimated from concatenation and gene-tree-based coalescent analyses, as has been previously suggested by Gatesy and Springer (2014). Copyright © 2015 Elsevier Inc. All rights reserved.