WorldWideScience

Sample records for regulatory gene network

  1. Current approaches to gene regulatory network modelling

    Directory of Open Access Journals (Sweden)

    Brazma Alvis

    2007-09-01

    Full Text Available Abstract Many different approaches have been developed to model and simulate gene regulatory networks. We proposed the following categories for gene regulatory network models: network parts lists, network topology models, network control logic models, and dynamic models. Here we will describe some examples for each of these categories. We will study the topology of gene regulatory networks in yeast in more detail, comparing a direct network derived from transcription factor binding data and an indirect network derived from genome-wide expression data in mutants. Regarding the network dynamics we briefly describe discrete and continuous approaches to network modelling, then describe a hybrid model called Finite State Linear Model and demonstrate that some simple network dynamics can be simulated in this model.

  2. Modeling of hysteresis in gene regulatory networks.

    Science.gov (United States)

    Hu, J; Qin, K R; Xiang, C; Lee, T H

    2012-08-01

    Hysteresis, observed in many gene regulatory networks, has a pivotal impact on biological systems, which enhances the robustness of cell functions. In this paper, a general model is proposed to describe the hysteretic gene regulatory network by combining the hysteresis component and the transient dynamics. The Bouc-Wen hysteresis model is modified to describe the hysteresis component in the mammalian gene regulatory networks. Rigorous mathematical analysis on the dynamical properties of the model is presented to ensure the bounded-input-bounded-output (BIBO) stability and demonstrates that the original Bouc-Wen model can only generate a clockwise hysteresis loop while the modified model can describe both clockwise and counter clockwise hysteresis loops. Simulation studies have shown that the hysteresis loops from our model are consistent with the experimental observations in three mammalian gene regulatory networks and two E.coli gene regulatory networks, which demonstrate the ability and accuracy of the mathematical model to emulate natural gene expression behavior with hysteresis. A comparison study has also been conducted to show that this model fits the experiment data significantly better than previous ones in the literature. The successful modeling of the hysteresis in all the five hysteretic gene regulatory networks suggests that the new model has the potential to be a unified framework for modeling hysteresis in gene regulatory networks and provide better understanding of the general mechanism that drives the hysteretic function.

  3. Inferring latent gene regulatory network kinetics

    NARCIS (Netherlands)

    González, Javier; Vujačić, Ivan; Wit, Ernst

    2013-01-01

    Regulatory networks consist of genes encoding transcription factors (TFs) and the genes they activate or repress. Various types of systems of ordinary differential equations (ODE) have been proposed to model these networks, ranging from linear to Michaelis-Menten approaches. In practice, a serious d

  4. Evolution of evolvability in gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Anton Crombach

    Full Text Available Gene regulatory networks are perhaps the most important organizational level in the cell where signals from the cell state and the outside environment are integrated in terms of activation and inhibition of genes. For the last decade, the study of such networks has been fueled by large-scale experiments and renewed attention from the theoretical field. Different models have been proposed to, for instance, investigate expression dynamics, explain the network topology we observe in bacteria and yeast, and for the analysis of evolvability and robustness of such networks. Yet how these gene regulatory networks evolve and become evolvable remains an open question. An individual-oriented evolutionary model is used to shed light on this matter. Each individual has a genome from which its gene regulatory network is derived. Mutations, such as gene duplications and deletions, alter the genome, while the resulting network determines the gene expression pattern and hence fitness. With this protocol we let a population of individuals evolve under Darwinian selection in an environment that changes through time. Our work demonstrates that long-term evolution of complex gene regulatory networks in a changing environment can lead to a striking increase in the efficiency of generating beneficial mutations. We show that the population evolves towards genotype-phenotype mappings that allow for an orchestrated network-wide change in the gene expression pattern, requiring only a few specific gene indels. The genes involved are hubs of the networks, or directly influencing the hubs. Moreover, throughout the evolutionary trajectory the networks maintain their mutational robustness. In other words, evolution in an alternating environment leads to a network that is sensitive to a small class of beneficial mutations, while the majority of mutations remain neutral: an example of evolution of evolvability.

  5. Mutational robustness of gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Aalt D J van Dijk

    Full Text Available Mutational robustness of gene regulatory networks refers to their ability to generate constant biological output upon mutations that change network structure. Such networks contain regulatory interactions (transcription factor-target gene interactions but often also protein-protein interactions between transcription factors. Using computational modeling, we study factors that influence robustness and we infer several network properties governing it. These include the type of mutation, i.e. whether a regulatory interaction or a protein-protein interaction is mutated, and in the case of mutation of a regulatory interaction, the sign of the interaction (activating vs. repressive. In addition, we analyze the effect of combinations of mutations and we compare networks containing monomeric with those containing dimeric transcription factors. Our results are consistent with available data on biological networks, for example based on evolutionary conservation of network features. As a novel and remarkable property, we predict that networks are more robust against mutations in monomer than in dimer transcription factors, a prediction for which analysis of conservation of DNA binding residues in monomeric vs. dimeric transcription factors provides indirect evidence.

  6. Gene regulatory networks governing pancreas development.

    Science.gov (United States)

    Arda, H Efsun; Benitez, Cecil M; Kim, Seung K

    2013-04-15

    Elucidation of cellular and gene regulatory networks (GRNs) governing organ development will accelerate progress toward tissue replacement. Here, we have compiled reference GRNs underlying pancreas development from data mining that integrates multiple approaches, including mutant analysis, lineage tracing, cell purification, gene expression and enhancer analysis, and biochemical studies of gene regulation. Using established computational tools, we integrated and represented these networks in frameworks that should enhance understanding of the surging output of genomic-scale genetic and epigenetic studies of pancreas development and diseases such as diabetes and pancreatic cancer. We envision similar approaches would be useful for understanding the development of other organs.

  7. Compressed Adjacency Matrices: Untangling Gene Regulatory Networks.

    Science.gov (United States)

    Dinkla, K; Westenberg, M A; van Wijk, J J

    2012-12-01

    We present a novel technique-Compressed Adjacency Matrices-for visualizing gene regulatory networks. These directed networks have strong structural characteristics: out-degrees with a scale-free distribution, in-degrees bound by a low maximum, and few and small cycles. Standard visualization techniques, such as node-link diagrams and adjacency matrices, are impeded by these network characteristics. The scale-free distribution of out-degrees causes a high number of intersecting edges in node-link diagrams. Adjacency matrices become space-inefficient due to the low in-degrees and the resulting sparse network. Compressed adjacency matrices, however, exploit these structural characteristics. By cutting open and rearranging an adjacency matrix, we achieve a compact and neatly-arranged visualization. Compressed adjacency matrices allow for easy detection of subnetworks with a specific structure, so-called motifs, which provide important knowledge about gene regulatory networks to domain experts. We summarize motifs commonly referred to in the literature, and relate them to network analysis tasks common to the visualization domain. We show that a user can easily find the important motifs in compressed adjacency matrices, and that this is hard in standard adjacency matrix and node-link diagrams. We also demonstrate that interaction techniques for standard adjacency matrices can be used for our compressed variant. These techniques include rearrangement clustering, highlighting, and filtering.

  8. Inference of Gene Regulatory Network Based on Local Bayesian Networks.

    Science.gov (United States)

    Liu, Fei; Zhang, Shao-Wu; Guo, Wei-Feng; Wei, Ze-Gang; Chen, Luonan

    2016-08-01

    The inference of gene regulatory networks (GRNs) from expression data can mine the direct regulations among genes and gain deep insights into biological processes at a network level. During past decades, numerous computational approaches have been introduced for inferring the GRNs. However, many of them still suffer from various problems, e.g., Bayesian network (BN) methods cannot handle large-scale networks due to their high computational complexity, while information theory-based methods cannot identify the directions of regulatory interactions and also suffer from false positive/negative problems. To overcome the limitations, in this work we present a novel algorithm, namely local Bayesian network (LBN), to infer GRNs from gene expression data by using the network decomposition strategy and false-positive edge elimination scheme. Specifically, LBN algorithm first uses conditional mutual information (CMI) to construct an initial network or GRN, which is decomposed into a number of local networks or GRNs. Then, BN method is employed to generate a series of local BNs by selecting the k-nearest neighbors of each gene as its candidate regulatory genes, which significantly reduces the exponential search space from all possible GRN structures. Integrating these local BNs forms a tentative network or GRN by performing CMI, which reduces redundant regulations in the GRN and thus alleviates the false positive problem. The final network or GRN can be obtained by iteratively performing CMI and local BN on the tentative network. In the iterative process, the false or redundant regulations are gradually removed. When tested on the benchmark GRN datasets from DREAM challenge as well as the SOS DNA repair network in E.coli, our results suggest that LBN outperforms other state-of-the-art methods (ARACNE, GENIE3 and NARROMI) significantly, with more accurate and robust performance. In particular, the decomposition strategy with local Bayesian networks not only effectively reduce

  9. Inferring slowly-changing dynamic gene-regulatory networks

    NARCIS (Netherlands)

    Wit, Ernst C.; Abbruzzo, Antonino

    2015-01-01

    Dynamic gene-regulatory networks are complex since the interaction patterns between their components mean that it is impossible to study parts of the network in separation. This holistic character of gene-regulatory networks poses a real challenge to any type of modelling. Graphical models are a cla

  10. Research of Gene Regulatory Network with Multi-Time Delay Based on Bayesian Network

    Institute of Scientific and Technical Information of China (English)

    LIU Bei; MENG Fanjiang; LI Yong; LIU Liyan

    2008-01-01

    The gene regulatory network was reconstructed according to time-series microarray data getting from hybridization at different time between gene chips to analyze coordination and restriction between genes. An algorithm for controlling the gene expression regulatory network of the whole cell was designed using Bayesian network which provides an effective aided analysis for gene regulatory network.

  11. Chaotic motifs in gene regulatory networks.

    Science.gov (United States)

    Zhang, Zhaoyang; Ye, Weiming; Qian, Yu; Zheng, Zhigang; Huang, Xuhui; Hu, Gang

    2012-01-01

    Chaos should occur often in gene regulatory networks (GRNs) which have been widely described by nonlinear coupled ordinary differential equations, if their dimensions are no less than 3. It is therefore puzzling that chaos has never been reported in GRNs in nature and is also extremely rare in models of GRNs. On the other hand, the topic of motifs has attracted great attention in studying biological networks, and network motifs are suggested to be elementary building blocks that carry out some key functions in the network. In this paper, chaotic motifs (subnetworks with chaos) in GRNs are systematically investigated. The conclusion is that: (i) chaos can only appear through competitions between different oscillatory modes with rivaling intensities. Conditions required for chaotic GRNs are found to be very strict, which make chaotic GRNs extremely rare. (ii) Chaotic motifs are explored as the simplest few-node structures capable of producing chaos, and serve as the intrinsic source of chaos of random few-node GRNs. Several optimal motifs causing chaos with atypically high probability are figured out. (iii) Moreover, we discovered that a number of special oscillators can never produce chaos. These structures bring some advantages on rhythmic functions and may help us understand the robustness of diverse biological rhythms. (iv) The methods of dominant phase-advanced driving (DPAD) and DPAD time fraction are proposed to quantitatively identify chaotic motifs and to explain the origin of chaotic behaviors in GRNs.

  12. Inference of Gene Regulatory Network Based on Local Bayesian Networks.

    Directory of Open Access Journals (Sweden)

    Fei Liu

    2016-08-01

    Full Text Available The inference of gene regulatory networks (GRNs from expression data can mine the direct regulations among genes and gain deep insights into biological processes at a network level. During past decades, numerous computational approaches have been introduced for inferring the GRNs. However, many of them still suffer from various problems, e.g., Bayesian network (BN methods cannot handle large-scale networks due to their high computational complexity, while information theory-based methods cannot identify the directions of regulatory interactions and also suffer from false positive/negative problems. To overcome the limitations, in this work we present a novel algorithm, namely local Bayesian network (LBN, to infer GRNs from gene expression data by using the network decomposition strategy and false-positive edge elimination scheme. Specifically, LBN algorithm first uses conditional mutual information (CMI to construct an initial network or GRN, which is decomposed into a number of local networks or GRNs. Then, BN method is employed to generate a series of local BNs by selecting the k-nearest neighbors of each gene as its candidate regulatory genes, which significantly reduces the exponential search space from all possible GRN structures. Integrating these local BNs forms a tentative network or GRN by performing CMI, which reduces redundant regulations in the GRN and thus alleviates the false positive problem. The final network or GRN can be obtained by iteratively performing CMI and local BN on the tentative network. In the iterative process, the false or redundant regulations are gradually removed. When tested on the benchmark GRN datasets from DREAM challenge as well as the SOS DNA repair network in E.coli, our results suggest that LBN outperforms other state-of-the-art methods (ARACNE, GENIE3 and NARROMI significantly, with more accurate and robust performance. In particular, the decomposition strategy with local Bayesian networks not only

  13. Modeling gene regulatory networks: A network simplification algorithm

    Science.gov (United States)

    Ferreira, Luiz Henrique O.; de Castro, Maria Clicia S.; da Silva, Fabricio A. B.

    2016-12-01

    Boolean networks have been used for some time to model Gene Regulatory Networks (GRNs), which describe cell functions. Those models can help biologists to make predictions, prognosis and even specialized treatment when some disturb on the GRN lead to a sick condition. However, the amount of information related to a GRN can be huge, making the task of inferring its boolean network representation quite a challenge. The method shown here takes into account information about the interactome to build a network, where each node represents a protein, and uses the entropy of each node as a key to reduce the size of the network, allowing the further inferring process to focus only on the main protein hubs, the ones with most potential to interfere in overall network behavior.

  14. Overview of methods of reverse engineering of gene regulatory networks: Boolean and Bayesian networks

    OpenAIRE

    Frolova A. O.

    2012-01-01

    Reverse engineering of gene regulatory networks is an intensively studied topic in Systems Biology as it reconstructs regulatory interactions between all genes in the genome in the most complete form. The extreme computational complexity of this problem and lack of thorough reviews on reconstruction methods of gene regulatory network is a significant obstacle to further development of this area. In this article the two most common methods for modeling gene regulatory networks are surveyed: Bo...

  15. Automated Identification of Core Regulatory Genes in Human Gene Regulatory Networks.

    Directory of Open Access Journals (Sweden)

    Vipin Narang

    Full Text Available Human gene regulatory networks (GRN can be difficult to interpret due to a tangle of edges interconnecting thousands of genes. We constructed a general human GRN from extensive transcription factor and microRNA target data obtained from public databases. In a subnetwork of this GRN that is active during estrogen stimulation of MCF-7 breast cancer cells, we benchmarked automated algorithms for identifying core regulatory genes (transcription factors and microRNAs. Among these algorithms, we identified K-core decomposition, pagerank and betweenness centrality algorithms as the most effective for discovering core regulatory genes in the network evaluated based on previously known roles of these genes in MCF-7 biology as well as in their ability to explain the up or down expression status of up to 70% of the remaining genes. Finally, we validated the use of K-core algorithm for organizing the GRN in an easier to interpret layered hierarchy where more influential regulatory genes percolate towards the inner layers. The integrated human gene and miRNA network and software used in this study are provided as supplementary materials (S1 Data accompanying this manuscript.

  16. Glucocorticoid receptor-dependent gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Phillip Phuc Le

    2005-08-01

    Full Text Available While the molecular mechanisms of glucocorticoid regulation of transcription have been studied in detail, the global networks regulated by the glucocorticoid receptor (GR remain unknown. To address this question, we performed an orthogonal analysis to identify direct targets of the GR. First, we analyzed the expression profile of mouse livers in the presence or absence of exogenous glucocorticoid, resulting in over 1,300 differentially expressed genes. We then executed genome-wide location analysis on chromatin from the same livers, identifying more than 300 promoters that are bound by the GR. Intersecting the two lists yielded 53 genes whose expression is functionally dependent upon the ligand-bound GR. Further network and sequence analysis of the functional targets enabled us to suggest interactions between the GR and other transcription factors at specific target genes. Together, our results further our understanding of the GR and its targets, and provide the basis for more targeted glucocorticoid therapies.

  17. Discovering Study-Specific Gene Regulatory Networks

    OpenAIRE

    2014-01-01

    This article has been made available through the Brunel Open Access Publishing Fund. This article has been made available through the Brunel Open Access Publishing Fund. Microarrays are commonly used in biology because of their ability to simultaneously measure thousands of genes under different conditions. Due to their structure, typically containing a high amount of variables but far fewer samples, scalable network analysis techniques are often employed. In particular, consensus appro...

  18. Improving gene regulatory network inference using network topology information.

    Science.gov (United States)

    Nair, Ajay; Chetty, Madhu; Wangikar, Pramod P

    2015-09-01

    Inferring the gene regulatory network (GRN) structure from data is an important problem in computational biology. However, it is a computationally complex problem and approximate methods such as heuristic search techniques, restriction of the maximum-number-of-parents (maxP) for a gene, or an optimal search under special conditions are required. The limitations of a heuristic search are well known but literature on the detailed analysis of the widely used maxP technique is lacking. The optimal search methods require large computational time. We report the theoretical analysis and experimental results of the strengths and limitations of the maxP technique. Further, using an optimal search method, we combine the strengths of the maxP technique and the known GRN topology to propose two novel algorithms. These algorithms are implemented in a Bayesian network framework and tested on biological, realistic, and in silico networks of different sizes and topologies. They overcome the limitations of the maxP technique and show superior computational speed when compared to the current optimal search algorithms.

  19. Gene regulatory networks elucidating huanglongbing disease mechanisms.

    Science.gov (United States)

    Martinelli, Federico; Reagan, Russell L; Uratsu, Sandra L; Phu, My L; Albrecht, Ute; Zhao, Weixiang; Davis, Cristina E; Bowman, Kim D; Dandekar, Abhaya M

    2013-01-01

    Next-generation sequencing was exploited to gain deeper insight into the response to infection by Candidatus liberibacter asiaticus (CaLas), especially the immune disregulation and metabolic dysfunction caused by source-sink disruption. Previous fruit transcriptome data were compared with additional RNA-Seq data in three tissues: immature fruit, and young and mature leaves. Four categories of orchard trees were studied: symptomatic, asymptomatic, apparently healthy, and healthy. Principal component analysis found distinct expression patterns between immature and mature fruits and leaf samples for all four categories of trees. A predicted protein - protein interaction network identified HLB-regulated genes for sugar transporters playing key roles in the overall plant responses. Gene set and pathway enrichment analyses highlight the role of sucrose and starch metabolism in disease symptom development in all tissues. HLB-regulated genes (glucose-phosphate-transporter, invertase, starch-related genes) would likely determine the source-sink relationship disruption. In infected leaves, transcriptomic changes were observed for light reactions genes (downregulation), sucrose metabolism (upregulation), and starch biosynthesis (upregulation). In parallel, symptomatic fruits over-expressed genes involved in photosynthesis, sucrose and raffinose metabolism, and downregulated starch biosynthesis. We visualized gene networks between tissues inducing a source-sink shift. CaLas alters the hormone crosstalk, resulting in weak and ineffective tissue-specific plant immune responses necessary for bacterial clearance. Accordingly, expression of WRKYs (including WRKY70) was higher in fruits than in leaves. Systemic acquired responses were inadequately activated in young leaves, generally considered the sites where most new infections occur.

  20. Inferring slowly-changing dynamic gene-regulatory networks.

    Science.gov (United States)

    Wit, Ernst C; Abbruzzo, Antonino

    2015-01-01

    Dynamic gene-regulatory networks are complex since the interaction patterns between their components mean that it is impossible to study parts of the network in separation. This holistic character of gene-regulatory networks poses a real challenge to any type of modelling. Graphical models are a class of models that connect the network with a conditional independence relationships between random variables. By interpreting these random variables as gene activities and the conditional independence relationships as functional non-relatedness, graphical models have been used to describe gene-regulatory networks. Whereas the literature has been focused on static networks, most time-course experiments are designed in order to tease out temporal changes in the underlying network. It is typically reasonable to assume that changes in genomic networks are few, because biological systems tend to be stable. We introduce a new model for estimating slow changes in dynamic gene-regulatory networks, which is suitable for high-dimensional data, e.g. time-course microarray data. Our aim is to estimate a dynamically changing genomic network based on temporal activity measurements of the genes in the network. Our method is based on the penalized likelihood with l1-norm, that penalizes conditional dependencies between genes as well as differences between conditional independence elements across time points. We also present a heuristic search strategy to find optimal tuning parameters. We re-write the penalized maximum likelihood problem into a standard convex optimization problem subject to linear equality constraints. We show that our method performs well in simulation studies. Finally, we apply the proposed model to a time-course T-cell dataset.

  1. Gene regulatory network inference using out of equilibrium statistical mechanics.

    Science.gov (United States)

    Benecke, Arndt

    2008-08-01

    Spatiotemporal control of gene expression is fundamental to multicellular life. Despite prodigious efforts, the encoding of gene expression regulation in eukaryotes is not understood. Gene expression analyses nourish the hope to reverse engineer effector-target gene networks using inference techniques. Inference from noisy and circumstantial data relies on using robust models with few parameters for the underlying mechanisms. However, a systematic path to gene regulatory network reverse engineering from functional genomics data is still impeded by fundamental problems. Recently, Johannes Berg from the Theoretical Physics Institute of Cologne University has made two remarkable contributions that significantly advance the gene regulatory network inference problem. Berg, who uses gene expression data from yeast, has demonstrated a nonequilibrium regime for mRNA concentration dynamics and was able to map the gene regulatory process upon simple stochastic systems driven out of equilibrium. The impact of his demonstration is twofold, affecting both the understanding of the operational constraints under which transcription occurs and the capacity to extract relevant information from highly time-resolved expression data. Berg has used his observation to predict target genes of selected transcription factors, and thereby, in principle, demonstrated applicability of his out of equilibrium statistical mechanics approach to the gene network inference problem.

  2. Toward an orofacial gene regulatory network.

    Science.gov (United States)

    Kousa, Youssef A; Schutte, Brian C

    2016-03-01

    Orofacial clefting is a common birth defect with significant morbidity. A panoply of candidate genes have been discovered through synergy of animal models and human genetics. Among these, variants in interferon regulatory factor 6 (IRF6) cause syndromic orofacial clefting and contribute risk toward isolated cleft lip and palate (1/700 live births). Rare variants in IRF6 can lead to Van der Woude syndrome (1/35,000 live births) and popliteal pterygium syndrome (1/300,000 live births). Furthermore, IRF6 regulates GRHL3 and rare variants in this downstream target can also lead to Van der Woude syndrome. In addition, a common variant (rs642961) in the IRF6 locus is found in 30% of the world's population and contributes risk for isolated orofacial clefting. Biochemical studies revealed that rs642961 abrogates one of four AP-2alpha binding sites. Like IRF6 and GRHL3, rare variants in TFAP2A can also lead to syndromic orofacial clefting with lip pits (branchio-oculo-facial syndrome). The literature suggests that AP-2alpha, IRF6 and GRHL3 are part of a pathway that is essential for lip and palate development. In addition to updating the pathways, players and pursuits, this review will highlight some of the current questions in the study of orofacial clefting.

  3. Gene Regulatory Network Reconstruction Using Conditional Mutual Information

    Directory of Open Access Journals (Sweden)

    Xiaodong Wang

    2008-06-01

    Full Text Available The inference of gene regulatory network from expression data is an important area of research that provides insight to the inner workings of a biological system. The relevance-network-based approaches provide a simple and easily-scalable solution to the understanding of interaction between genes. Up until now, most works based on relevance network focus on the discovery of direct regulation using correlation coefficient or mutual information. However, some of the more complicated interactions such as interactive regulation and coregulation are not easily detected. In this work, we propose a relevance network model for gene regulatory network inference which employs both mutual information and conditional mutual information to determine the interactions between genes. For this purpose, we propose a conditional mutual information estimator based on adaptive partitioning which allows us to condition on both discrete and continuous random variables. We provide experimental results that demonstrate that the proposed regulatory network inference algorithm can provide better performance when the target network contains coregulated and interactively regulated genes.

  4. Optimal finite horizon control in gene regulatory networks

    Science.gov (United States)

    Liu, Qiuli

    2013-06-01

    As a paradigm for modeling gene regulatory networks, probabilistic Boolean networks (PBNs) form a subclass of Markov genetic regulatory networks. To date, many different stochastic optimal control approaches have been developed to find therapeutic intervention strategies for PBNs. A PBN is essentially a collection of constituent Boolean networks via a probability structure. Most of the existing works assume that the probability structure for Boolean networks selection is known. Such an assumption cannot be satisfied in practice since the presence of noise prevents the probability structure from being accurately determined. In this paper, we treat a case in which we lack the governing probability structure for Boolean network selection. Specifically, in the framework of PBNs, the theory of finite horizon Markov decision process is employed to find optimal constituent Boolean networks with respect to the defined objective functions. In order to illustrate the validity of our proposed approach, an example is also displayed.

  5. The incorporation of epigenetics in artificial gene regulatory networks.

    Science.gov (United States)

    Turner, Alexander P; Lones, Michael A; Fuente, Luis A; Stepney, Susan; Caves, Leo S D; Tyrrell, Andy M

    2013-05-01

    Artificial gene regulatory networks are computational models that draw inspiration from biological networks of gene regulation. Since their inception they have been used to infer knowledge about gene regulation and as methods of computation. These computational models have been shown to possess properties typically found in the biological world, such as robustness and self organisation. Recently, it has become apparent that epigenetic mechanisms play an important role in gene regulation. This paper describes a new model, the Artificial Epigenetic Regulatory Network (AERN) which builds upon existing models by adding an epigenetic control layer. Our results demonstrate that AERNs are more adept at controlling multiple opposing trajectories when applied to a chaos control task within a conservative dynamical system, suggesting that AERNs are an interesting area for further investigation.

  6. Stable Gene Regulatory Network Modeling From Steady-State Data

    Directory of Open Access Journals (Sweden)

    Joy Edward Larvie

    2016-04-01

    Full Text Available Gene regulatory networks represent an abstract mapping of gene regulations in living cells. They aim to capture dependencies among molecular entities such as transcription factors, proteins and metabolites. In most applications, the regulatory network structure is unknown, and has to be reverse engineered from experimental data consisting of expression levels of the genes usually measured as messenger RNA concentrations in microarray experiments. Steady-state gene expression data are obtained from measurements of the variations in expression activity following the application of small perturbations to equilibrium states in genetic perturbation experiments. In this paper, the least absolute shrinkage and selection operator-vector autoregressive (LASSO-VAR originally proposed for the analysis of economic time series data is adapted to include a stability constraint for the recovery of a sparse and stable regulatory network that describes data obtained from noisy perturbation experiments. The approach is applied to real experimental data obtained for the SOS pathway in Escherichia coli and the cell cycle pathway for yeast Saccharomyces cerevisiae. Significant features of this method are the ability to recover networks without inputting prior knowledge of the network topology, and the ability to be efficiently applied to large scale networks due to the convex nature of the method.

  7. Noise Control in Gene Regulatory Networks with Negative Feedback.

    Science.gov (United States)

    Hinczewski, Michael; Thirumalai, D

    2016-07-01

    Genes and proteins regulate cellular functions through complex circuits of biochemical reactions. Fluctuations in the components of these regulatory networks result in noise that invariably corrupts the signal, possibly compromising function. Here, we create a practical formalism based on ideas introduced by Wiener and Kolmogorov (WK) for filtering noise in engineered communications systems to quantitatively assess the extent to which noise can be controlled in biological processes involving negative feedback. Application of the theory, which reproduces the previously proven scaling of the lower bound for noise suppression in terms of the number of signaling events, shows that a tetracycline repressor-based negative-regulatory gene circuit behaves as a WK filter. For the class of Hill-like nonlinear regulatory functions, this type of filter provides the optimal reduction in noise. Our theoretical approach can be readily combined with experimental measurements of response functions in a wide variety of genetic circuits, to elucidate the general principles by which biological networks minimize noise.

  8. A gene regulatory network armature for T-lymphocyte specification

    Energy Technology Data Exchange (ETDEWEB)

    Fung, Elizabeth-sharon [Los Alamos National Laboratory

    2008-01-01

    Choice of a T-lymphoid fate by hematopoietic progenitor cells depends on sustained Notch-Delta signaling combined with tightly-regulated activities of multiple transcription factors. To dissect the regulatory network connections that mediate this process, we have used high-resolution analysis of regulatory gene expression trajectories from the beginning to the end of specification; tests of the short-term Notchdependence of these gene expression changes; and perturbation analyses of the effects of overexpression of two essential transcription factors, namely PU.l and GATA-3. Quantitative expression measurements of >50 transcription factor and marker genes have been used to derive the principal components of regulatory change through which T-cell precursors progress from primitive multipotency to T-lineage commitment. Distinct parts of the path reveal separate contributions of Notch signaling, GATA-3 activity, and downregulation of PU.l. Using BioTapestry, the results have been assembled into a draft gene regulatory network for the specification of T-cell precursors and the choice of T as opposed to myeloid dendritic or mast-cell fates. This network also accommodates effects of E proteins and mutual repression circuits of Gfil against Egr-2 and of TCF-l against PU.l as proposed elsewhere, but requires additional functions that remain unidentified. Distinctive features of this network structure include the intense dose-dependence of GATA-3 effects; the gene-specific modulation of PU.l activity based on Notch activity; the lack of direct opposition between PU.l and GATA-3; and the need for a distinct, late-acting repressive function or functions to extinguish stem and progenitor-derived regulatory gene expression.

  9. Propagation of genetic variation in gene regulatory networks.

    Science.gov (United States)

    Plahte, Erik; Gjuvsland, Arne B; Omholt, Stig W

    2013-08-01

    A future quantitative genetics theory should link genetic variation to phenotypic variation in a causally cohesive way based on how genes actually work and interact. We provide a theoretical framework for predicting and understanding the manifestation of genetic variation in haploid and diploid regulatory networks with arbitrary feedback structures and intra-locus and inter-locus functional dependencies. Using results from network and graph theory, we define propagation functions describing how genetic variation in a locus is propagated through the network, and show how their derivatives are related to the network's feedback structure. Similarly, feedback functions describe the effect of genotypic variation of a locus on itself, either directly or mediated by the network. A simple sign rule relates the sign of the derivative of the feedback function of any locus to the feedback loops involving that particular locus. We show that the sign of the phenotypically manifested interaction between alleles at a diploid locus is equal to the sign of the dominant feedback loop involving that particular locus, in accordance with recent results for a single locus system. Our results provide tools by which one can use observable equilibrium concentrations of gene products to disclose structural properties of the network architecture. Our work is a step towards a theory capable of explaining the pleiotropy and epistasis features of genetic variation in complex regulatory networks as functions of regulatory anatomy and functional location of the genetic variation.

  10. Stability Depends on Positive Autoregulation in Boolean Gene Regulatory Networks

    Science.gov (United States)

    Pinho, Ricardo; Garcia, Victor; Irimia, Manuel; Feldman, Marcus W.

    2014-01-01

    Network motifs have been identified as building blocks of regulatory networks, including gene regulatory networks (GRNs). The most basic motif, autoregulation, has been associated with bistability (when positive) and with homeostasis and robustness to noise (when negative), but its general importance in network behavior is poorly understood. Moreover, how specific autoregulatory motifs are selected during evolution and how this relates to robustness is largely unknown. Here, we used a class of GRN models, Boolean networks, to investigate the relationship between autoregulation and network stability and robustness under various conditions. We ran evolutionary simulation experiments for different models of selection, including mutation and recombination. Each generation simulated the development of a population of organisms modeled by GRNs. We found that stability and robustness positively correlate with autoregulation; in all investigated scenarios, stable networks had mostly positive autoregulation. Assuming biological networks correspond to stable networks, these results suggest that biological networks should often be dominated by positive autoregulatory loops. This seems to be the case for most studied eukaryotic transcription factor networks, including those in yeast, flies and mammals. PMID:25375153

  11. Stability depends on positive autoregulation in Boolean gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Ricardo Pinho

    2014-11-01

    Full Text Available Network motifs have been identified as building blocks of regulatory networks, including gene regulatory networks (GRNs. The most basic motif, autoregulation, has been associated with bistability (when positive and with homeostasis and robustness to noise (when negative, but its general importance in network behavior is poorly understood. Moreover, how specific autoregulatory motifs are selected during evolution and how this relates to robustness is largely unknown. Here, we used a class of GRN models, Boolean networks, to investigate the relationship between autoregulation and network stability and robustness under various conditions. We ran evolutionary simulation experiments for different models of selection, including mutation and recombination. Each generation simulated the development of a population of organisms modeled by GRNs. We found that stability and robustness positively correlate with autoregulation; in all investigated scenarios, stable networks had mostly positive autoregulation. Assuming biological networks correspond to stable networks, these results suggest that biological networks should often be dominated by positive autoregulatory loops. This seems to be the case for most studied eukaryotic transcription factor networks, including those in yeast, flies and mammals.

  12. Identifying gene regulatory network rewiring using latent differential graphical models.

    Science.gov (United States)

    Tian, Dechao; Gu, Quanquan; Ma, Jian

    2016-09-30

    Gene regulatory networks (GRNs) are highly dynamic among different tissue types. Identifying tissue-specific gene regulation is critically important to understand gene function in a particular cellular context. Graphical models have been used to estimate GRN from gene expression data to distinguish direct interactions from indirect associations. However, most existing methods estimate GRN for a specific cell/tissue type or in a tissue-naive way, or do not specifically focus on network rewiring between different tissues. Here, we describe a new method called Latent Differential Graphical Model (LDGM). The motivation of our method is to estimate the differential network between two tissue types directly without inferring the network for individual tissues, which has the advantage of utilizing much smaller sample size to achieve reliable differential network estimation. Our simulation results demonstrated that LDGM consistently outperforms other Gaussian graphical model based methods. We further evaluated LDGM by applying to the brain and blood gene expression data from the GTEx consortium. We also applied LDGM to identify network rewiring between cancer subtypes using the TCGA breast cancer samples. Our results suggest that LDGM is an effective method to infer differential network using high-throughput gene expression data to identify GRN dynamics among different cellular conditions.

  13. Validation of Gene Regulatory Network Inference Based on Controllability

    Directory of Open Access Journals (Sweden)

    Edward eDougherty

    2013-12-01

    Full Text Available There are two distinct issues regarding network validation: (1 Does an inferred network provide good predictions relative to experimental data? (2 Does a network inference algorithm applied within a certain network model framework yield networks that are accurate relative to some criterion of goodness? The first issue concerns scientific validation and the second concerns algorithm validation. In this paper we consider inferential validation relative to controllability; that is, if an inference procedure is applied to synthetic data generated from a gene regulatory network and an intervention procedure is designed on the inferred network, how well does it perform on the true network? The reasoning behind such a criterion is that, if our purpose is to use gene regulatory networks to design therapeutic intervention strategies, then we are not concerned with network fidelity, per se, but only with our ability to design effective interventions based on the inferred network. We will consider the problem from the perspectives of stationary control, which involves designing a control policy to be applied over time based on the current state of the network, with the decision procedure itself being time independent. {The objective of a control policy is to optimally reduce the total steady-state probability mass of the undesirable states (phenotypes, which is equivalent to optimally increasing the total steady-state mass of the desirable states. Based on this criterion we compare several proposed network inference procedures. We will see that inference procedure psi may perform poorer than inference procedure xi relative to inferring the full network structure but perform better than xi relative to controllability. Hence, when one is aiming at a specific application, it may be wise to use an objective-based measure of inference validity.

  14. Ground rules of the pluripotency gene regulatory network.

    KAUST Repository

    Li, Mo

    2017-01-03

    Pluripotency is a state that exists transiently in the early embryo and, remarkably, can be recapitulated in vitro by deriving embryonic stem cells or by reprogramming somatic cells to become induced pluripotent stem cells. The state of pluripotency, which is stabilized by an interconnected network of pluripotency-associated genes, integrates external signals and exerts control over the decision between self-renewal and differentiation at the transcriptional, post-transcriptional and epigenetic levels. Recent evidence of alternative pluripotency states indicates the regulatory flexibility of this network. Insights into the underlying principles of the pluripotency network may provide unprecedented opportunities for studying development and for regenerative medicine.

  15. Overview of methods of reverse engineering of gene regulatory networks: Boolean and Bayesian networks

    Directory of Open Access Journals (Sweden)

    Frolova A. O.

    2012-06-01

    Full Text Available Reverse engineering of gene regulatory networks is an intensively studied topic in Systems Biology as it reconstructs regulatory interactions between all genes in the genome in the most complete form. The extreme computational complexity of this problem and lack of thorough reviews on reconstruction methods of gene regulatory network is a significant obstacle to further development of this area. In this article the two most common methods for modeling gene regulatory networks are surveyed: Boolean and Bayesian networks. The mathematical description of each method is given, as well as several algorithmic approaches to modeling gene networks using these methods; the complexity of algorithms and the problems that arise during its implementation are also noted.

  16. Topological effects of data incompleteness of gene regulatory networks

    CERN Document Server

    Sanz, J; Borge-Holthoefer, J; Moreno, Y

    2012-01-01

    The topological analysis of biological networks has been a prolific topic in network science during the last decade. A persistent problem with this approach is the inherent uncertainty and noisy nature of the data. One of the cases in which this situation is more marked is that of transcriptional regulatory networks (TRNs) in bacteria. The datasets are incomplete because regulatory pathways associated to a relevant fraction of bacterial genes remain unknown. Furthermore, direction, strengths and signs of the links are sometimes unknown or simply overlooked. Finally, the experimental approaches to infer the regulations are highly heterogeneous, in a way that induces the appearance of systematic experimental-topological correlations. And yet, the quality of the available data increases constantly. In this work we capitalize on these advances to point out the influence of data (in)completeness and quality on some classical results on topological analysis of TRNs, specially regarding modularity at different level...

  17. Dose response relationship in anti-stress gene regulatory networks.

    OpenAIRE

    Qiang Zhang; Andersen, Melvin E.

    2007-01-01

    To maintain a stable intracellular environment, cells utilize complex and specialized defense systems against a variety of external perturbations, such as electrophilic stress, heat shock, and hypoxia, etc. Irrespective of the type of stress, many adaptive mechanisms contributing to cellular homeostasis appear to operate through gene regulatory networks that are organized into negative feedback loops. In general, the degree of deviation of the controlled variables, such as electrophiles, misf...

  18. Phase transitions in the evolution of gene regulatory networks

    Science.gov (United States)

    Skanata, Antun; Kussell, Edo

    The role of gene regulatory networks is to respond to environmental conditions and optimize growth of the cell. A typical example is found in bacteria, where metabolic genes are activated in response to nutrient availability, and are subsequently turned off to conserve energy when their specific substrates are depleted. However, in fluctuating environmental conditions, regulatory networks could experience strong evolutionary pressures not only to turn the right genes on and off, but also to respond optimally under a wide spectrum of fluctuation timescales. The outcome of evolution is predicted by the long-term growth rate, which differentiates between optimal strategies. Here we present an analytic computation of the long-term growth rate in randomly fluctuating environments, by using mean-field and higher order expansion in the environmental history. We find that optimal strategies correspond to distinct regions in the phase space of fluctuations, separated by first and second order phase transitions. The statistics of environmental randomness are shown to dictate the possible evolutionary modes, which either change the structure of the regulatory network abruptly, or gradually modify and tune the interactions between its components.

  19. Dynamic Gene Regulatory Networks Drive Hematopoietic Specification and Differentiation

    Science.gov (United States)

    Goode, Debbie K.; Obier, Nadine; Vijayabaskar, M.S.; Lie-A-Ling, Michael; Lilly, Andrew J.; Hannah, Rebecca; Lichtinger, Monika; Batta, Kiran; Florkowska, Magdalena; Patel, Rahima; Challinor, Mairi; Wallace, Kirstie; Gilmour, Jane; Assi, Salam A.; Cauchy, Pierre; Hoogenkamp, Maarten; Westhead, David R.; Lacaud, Georges; Kouskoff, Valerie; Göttgens, Berthold; Bonifer, Constanze

    2016-01-01

    Summary Metazoan development involves the successive activation and silencing of specific gene expression programs and is driven by tissue-specific transcription factors programming the chromatin landscape. To understand how this process executes an entire developmental pathway, we generated global gene expression, chromatin accessibility, histone modification, and transcription factor binding data from purified embryonic stem cell-derived cells representing six sequential stages of hematopoietic specification and differentiation. Our data reveal the nature of regulatory elements driving differential gene expression and inform how transcription factor binding impacts on promoter activity. We present a dynamic core regulatory network model for hematopoietic specification and demonstrate its utility for the design of reprogramming experiments. Functional studies motivated by our genome-wide data uncovered a stage-specific role for TEAD/YAP factors in mammalian hematopoietic specification. Our study presents a powerful resource for studying hematopoiesis and demonstrates how such data advance our understanding of mammalian development. PMID:26923725

  20. How difficult is inference of mammalian causal gene regulatory networks?

    Directory of Open Access Journals (Sweden)

    Djordje Djordjevic

    Full Text Available Gene regulatory networks (GRNs play a central role in systems biology, especially in the study of mammalian organ development. One key question remains largely unanswered: Is it possible to infer mammalian causal GRNs using observable gene co-expression patterns alone? We assembled two mouse GRN datasets (embryonic tooth and heart and matching microarray gene expression profiles to systematically investigate the difficulties of mammalian causal GRN inference. The GRNs were assembled based on > 2,000 pieces of experimental genetic perturbation evidence from manually reading > 150 primary research articles. Each piece of perturbation evidence records the qualitative change of the expression of one gene following knock-down or over-expression of another gene. Our data have thorough annotation of tissue types and embryonic stages, as well as the type of regulation (activation, inhibition and no effect, which uniquely allows us to estimate both sensitivity and specificity of the inference of tissue specific causal GRN edges. Using these unprecedented datasets, we found that gene co-expression does not reliably distinguish true positive from false positive interactions, making inference of GRN in mammalian development very difficult. Nonetheless, if we have expression profiling data from genetic or molecular perturbation experiments, such as gene knock-out or signalling stimulation, it is possible to use the set of differentially expressed genes to recover causal regulatory relationships with good sensitivity and specificity. Our result supports the importance of using perturbation experimental data in causal network reconstruction. Furthermore, we showed that causal gene regulatory relationship can be highly cell type or developmental stage specific, suggesting the importance of employing expression profiles from homogeneous cell populations. This study provides essential datasets and empirical evidence to guide the development of new GRN inference

  1. How difficult is inference of mammalian causal gene regulatory networks?

    Science.gov (United States)

    Djordjevic, Djordje; Yang, Andrian; Zadoorian, Armella; Rungrugeecharoen, Kevin; Ho, Joshua W K

    2014-01-01

    Gene regulatory networks (GRNs) play a central role in systems biology, especially in the study of mammalian organ development. One key question remains largely unanswered: Is it possible to infer mammalian causal GRNs using observable gene co-expression patterns alone? We assembled two mouse GRN datasets (embryonic tooth and heart) and matching microarray gene expression profiles to systematically investigate the difficulties of mammalian causal GRN inference. The GRNs were assembled based on > 2,000 pieces of experimental genetic perturbation evidence from manually reading > 150 primary research articles. Each piece of perturbation evidence records the qualitative change of the expression of one gene following knock-down or over-expression of another gene. Our data have thorough annotation of tissue types and embryonic stages, as well as the type of regulation (activation, inhibition and no effect), which uniquely allows us to estimate both sensitivity and specificity of the inference of tissue specific causal GRN edges. Using these unprecedented datasets, we found that gene co-expression does not reliably distinguish true positive from false positive interactions, making inference of GRN in mammalian development very difficult. Nonetheless, if we have expression profiling data from genetic or molecular perturbation experiments, such as gene knock-out or signalling stimulation, it is possible to use the set of differentially expressed genes to recover causal regulatory relationships with good sensitivity and specificity. Our result supports the importance of using perturbation experimental data in causal network reconstruction. Furthermore, we showed that causal gene regulatory relationship can be highly cell type or developmental stage specific, suggesting the importance of employing expression profiles from homogeneous cell populations. This study provides essential datasets and empirical evidence to guide the development of new GRN inference methods for

  2. Evolution of the mammalian embryonic pluripotency gene regulatory network

    Science.gov (United States)

    Fernandez-Tresguerres, Beatriz; Cañon, Susana; Rayon, Teresa; Pernaute, Barbara; Crespo, Miguel; Torroja, Carlos; Manzanares, Miguel

    2010-01-01

    Embryonic pluripotency in the mouse is established and maintained by a gene-regulatory network under the control of a core set of transcription factors that include octamer-binding protein 4 (Oct4; official name POU domain, class 5, transcription factor 1, Pou5f1), sex-determining region Y (SRY)-box containing gene 2 (Sox2), and homeobox protein Nanog. Although this network is largely conserved in eutherian mammals, very little information is available regarding its evolutionary conservation in other vertebrates. We have compared the embryonic pluripotency networks in mouse and chick by means of expression analysis in the pregastrulation chicken embryo, genomic comparisons, and functional assays of pluripotency-related regulatory elements in ES cells and blastocysts. We find that multiple components of the network are either novel to mammals or have acquired novel expression domains in early developmental stages of the mouse. We also find that the downstream action of the mouse core pluripotency factors is mediated largely by genomic sequence elements nonconserved with chick. In the case of Sox2 and Fgf4, we find that elements driving expression in embryonic pluripotent cells have evolved by a small number of nucleotide changes that create novel binding sites for core factors. Our results show that the network in charge of embryonic pluripotency is an evolutionary novelty of mammals that is related to the comparatively extended period during which mammalian embryonic cells need to be maintained in an undetermined state before engaging in early differentiation events. PMID:21048080

  3. Autonomous Boolean modelling of developmental gene regulatory networks

    Science.gov (United States)

    Cheng, Xianrui; Sun, Mengyang; Socolar, Joshua E. S.

    2013-01-01

    During early embryonic development, a network of regulatory interactions among genes dynamically determines a pattern of differentiated tissues. We show that important timing information associated with the interactions can be faithfully represented in autonomous Boolean models in which binary variables representing expression levels are updated in continuous time, and that such models can provide a direct insight into features that are difficult to extract from ordinary differential equation (ODE) models. As an application, we model the experimentally well-studied network controlling fly body segmentation. The Boolean model successfully generates the patterns formed in normal and genetically perturbed fly embryos, permits the derivation of constraints on the time delay parameters, clarifies the logic associated with different ODE parameter sets and provides a platform for studying connectivity and robustness in parameter space. By elucidating the role of regulatory time delays in pattern formation, the results suggest new types of experimental measurements in early embryonic development. PMID:23034351

  4. Comparison of evolutionary algorithms in gene regulatory network model inference.

    LENUS (Irish Health Repository)

    2010-01-01

    ABSTRACT: BACKGROUND: The evolution of high throughput technologies that measure gene expression levels has created a data base for inferring GRNs (a process also known as reverse engineering of GRNs). However, the nature of these data has made this process very difficult. At the moment, several methods of discovering qualitative causal relationships between genes with high accuracy from microarray data exist, but large scale quantitative analysis on real biological datasets cannot be performed, to date, as existing approaches are not suitable for real microarray data which are noisy and insufficient. RESULTS: This paper performs an analysis of several existing evolutionary algorithms for quantitative gene regulatory network modelling. The aim is to present the techniques used and offer a comprehensive comparison of approaches, under a common framework. Algorithms are applied to both synthetic and real gene expression data from DNA microarrays, and ability to reproduce biological behaviour, scalability and robustness to noise are assessed and compared. CONCLUSIONS: Presented is a comparison framework for assessment of evolutionary algorithms, used to infer gene regulatory networks. Promising methods are identified and a platform for development of appropriate model formalisms is established.

  5. An algebra-based method for inferring gene regulatory networks.

    Science.gov (United States)

    Vera-Licona, Paola; Jarrah, Abdul; Garcia-Puente, Luis David; McGee, John; Laubenbacher, Reinhard

    2014-03-26

    The inference of gene regulatory networks (GRNs) from experimental observations is at the heart of systems biology. This includes the inference of both the network topology and its dynamics. While there are many algorithms available to infer the network topology from experimental data, less emphasis has been placed on methods that infer network dynamics. Furthermore, since the network inference problem is typically underdetermined, it is essential to have the option of incorporating into the inference process, prior knowledge about the network, along with an effective description of the search space of dynamic models. Finally, it is also important to have an understanding of how a given inference method is affected by experimental and other noise in the data used. This paper contains a novel inference algorithm using the algebraic framework of Boolean polynomial dynamical systems (BPDS), meeting all these requirements. The algorithm takes as input time series data, including those from network perturbations, such as knock-out mutant strains and RNAi experiments. It allows for the incorporation of prior biological knowledge while being robust to significant levels of noise in the data used for inference. It uses an evolutionary algorithm for local optimization with an encoding of the mathematical models as BPDS. The BPDS framework allows an effective representation of the search space for algebraic dynamic models that improves computational performance. The algorithm is validated with both simulated and experimental microarray expression profile data. Robustness to noise is tested using a published mathematical model of the segment polarity gene network in Drosophila melanogaster. Benchmarking of the algorithm is done by comparison with a spectrum of state-of-the-art network inference methods on data from the synthetic IRMA network to demonstrate that our method has good precision and recall for the network reconstruction task, while also predicting several of the

  6. Optimal Constrained Stationary Intervention in Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Golnaz Vahedi

    2008-05-01

    Full Text Available A key objective of gene network modeling is to develop intervention strategies to alter regulatory dynamics in such a way as to reduce the likelihood of undesirable phenotypes. Optimal stationary intervention policies have been developed for gene regulation in the framework of probabilistic Boolean networks in a number of settings. To mitigate the possibility of detrimental side effects, for instance, in the treatment of cancer, it may be desirable to limit the expected number of treatments beneath some bound. This paper formulates a general constraint approach for optimal therapeutic intervention by suitably adapting the reward function and then applies this formulation to bound the expected number of treatments. A mutated mammalian cell cycle is considered as a case study.

  7. Optimal Constrained Stationary Intervention in Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Faryabi Babak

    2008-01-01

    Full Text Available A key objective of gene network modeling is to develop intervention strategies to alter regulatory dynamics in such a way as to reduce the likelihood of undesirable phenotypes. Optimal stationary intervention policies have been developed for gene regulation in the framework of probabilistic Boolean networks in a number of settings. To mitigate the possibility of detrimental side effects, for instance, in the treatment of cancer, it may be desirable to limit the expected number of treatments beneath some bound. This paper formulates a general constraint approach for optimal therapeutic intervention by suitably adapting the reward function and then applies this formulation to bound the expected number of treatments. A mutated mammalian cell cycle is considered as a case study.

  8. Graphlet Based Metrics for the Comparison of Gene Regulatory Networks

    Science.gov (United States)

    Martin, Alberto J. M.; Dominguez, Calixto; Contreras-Riquelme, Sebastián; Holmes, David S.; Perez-Acle, Tomas

    2016-01-01

    Understanding the control of gene expression remains one of the main challenges in the post-genomic era. Accordingly, a plethora of methods exists to identify variations in gene expression levels. These variations underlay almost all relevant biological phenomena, including disease and adaptation to environmental conditions. However, computational tools to identify how regulation changes are scarce. Regulation of gene expression is usually depicted in the form of a gene regulatory network (GRN). Structural changes in a GRN over time and conditions represent variations in the regulation of gene expression. Like other biological networks, GRNs are composed of basic building blocks called graphlets. As a consequence, two new metrics based on graphlets are proposed in this work: REConstruction Rate (REC) and REC Graphlet Degree (RGD). REC determines the rate of graphlet similarity between different states of a network and RGD identifies the subset of nodes with the highest topological variation. In other words, RGD discerns how th GRN was rewired. REC and RGD were used to compare the local structure of nodes in condition-specific GRNs obtained from gene expression data of Escherichia coli, forming biofilms and cultured in suspension. According to our results, most of the network local structure remains unaltered in the two compared conditions. Nevertheless, changes reported by RGD necessarily imply that a different cohort of regulators (i.e. transcription factors (TFs)) appear on the scene, shedding light on how the regulation of gene expression occurs when E. coli transits from suspension to biofilm. Consequently, we propose that both metrics REC and RGD should be adopted as a quantitative approach to conduct differential analyses of GRNs. A tool that implements both metrics is available as an on-line web server (http://dlab.cl/loto). PMID:27695050

  9. Reconstruction of Gene Regulatory Networks Based on Two-Stage Bayesian Network Structure Learning Algorithm

    Institute of Scientific and Technical Information of China (English)

    Gui-xia Liu; Wei Feng; Han Wang; Lei Liu; Chun-guang Zhou

    2009-01-01

    In the post-genomic biology era, the reconstruction of gene regulatory networks from microarray gene expression data is very important to understand the underlying biological system, and it has been a challenging task in bioinformatics. The Bayesian network model has been used in reconstructing the gene regulatory network for its advantages, but how to determine the network structure and parameters is still important to be explored. This paper proposes a two-stage structure learning algorithm which integrates immune evolution algorithm to build a Bayesian network .The new algorithm is evaluated with the use of both simulated and yeast cell cycle data. The experimental results indicate that the proposed algorithm can find many of the known real regulatory relationships from literature and predict the others unknown with high validity and accuracy.

  10. Combination of Neuro-Fuzzy Network Models with Biological Knowledge for Reconstructing Gene Regulatory Networks

    Institute of Scientific and Technical Information of China (English)

    Guixia Liu; Lei Liu; Chunyu Liu; Ming Zheng; Lanying Su; Chunguang Zhou

    2011-01-01

    Inferring gene regulatory networks from large-scale expression data is an important topic in both cellular systems and computational biology. The inference of regulators might be the core factor for understanding actual regulatory conditions in gene regulatory networks, especially when strong regulators do work significantly, in this paper, we propose a novel approach based on combining neuro-fuzzy network models with biological knowledge to infer strong regulators and interrelated fuzzy rules. The hybrid neuro-fuzzy architecture can not only infer the fuzzy rules, which are suitable for describing the regulatory conditions in regulatory networks, but also explain the meaning of nodes and weight value in the neural network. It can get useful rules automatically without factitious judgments. At the same time, it does not add recursive layers to the model, and the model can also strengthen the relationships among genes and reduce calculation. We use the proposed approach to reconstruct a partial gene regulatory network of yeast. The results show that this approach can work effectively.

  11. Using gene expression programming to infer gene regulatory networks from time-series data.

    Science.gov (United States)

    Zhang, Yongqing; Pu, Yifei; Zhang, Haisen; Su, Yabo; Zhang, Lifang; Zhou, Jiliu

    2013-12-01

    Gene regulatory networks inference is currently a topic under heavy research in the systems biology field. In this paper, gene regulatory networks are inferred via evolutionary model based on time-series microarray data. A non-linear differential equation model is adopted. Gene expression programming (GEP) is applied to identify the structure of the model and least mean square (LMS) is used to optimize the parameters in ordinary differential equations (ODEs). The proposed work has been first verified by synthetic data with noise-free and noisy time-series data, respectively, and then its effectiveness is confirmed by three real time-series expression datasets. Finally, a gene regulatory network was constructed with 12 Yeast genes. Experimental results demonstrate that our model can improve the prediction accuracy of microarray time-series data effectively. Copyright © 2013 Elsevier Ltd. All rights reserved.

  12. The role of master regulators in gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Enrique Hernández Lemus

    2015-05-01

    Full Text Available Gene regulatory networks present a wide variety of dynamical responses to intrinsic and extrinsic perturbations. Arguably, one of the most important of such coordinated responses is the one of amplification cascades, in which activation of a few key-responsive transcription factors (termed master regulators, MRs lead to a large series of transcriptional activation events. This is so since master regulators are transcription factors controlling the expression of other transcription factor molecules and so on. MRs hold a central position related to transcriptional dynamics and control of gene regulatory networks and are often involved in complex feedback and feedforward loops inducing non-trivial dynamics. Recent studies have pointed out to the myocyte enhancing factor 2C (MEF2C, also known as MADS box transcription enhancer factor 2, polypeptide C as being one of such master regulators involved in the pathogenesis of primary breast cancer. In this work, we perform an integrative genomic analysis of the transcriptional regulation activity of MEF2C and its target genes to evaluate to what extent are these molecules inducing collective responses leading to gene expression deregulation and carcinogenesis. We also analyzed a number of induced dynamic responses, in particular those associated with transcriptional bursts, and nonlinear cascading to evaluate the influence they may have in malignant phenotypes and cancer. Received: 20 Novembre 2014, Accepted: 24 June 2015; Edited by: C. A. Condat, G. J. Sibona; DOI: http://dx.doi.org/10.4279/PIP.070011 Cite as: E Hernández-Lemus, K Baca-López, R Lemus, R García-Herrera, Papers in Physics 7, 070011 (2015

  13. Indeterminacy of reverse engineering of Gene Regulatory Networks: the curse of gene elasticity.

    Directory of Open Access Journals (Sweden)

    Arun Krishnan

    Full Text Available BACKGROUND: Gene Regulatory Networks (GRNs have become a major focus of interest in recent years. A number of reverse engineering approaches have been developed to help uncover the regulatory networks giving rise to the observed gene expression profiles. However, this is an overspecified problem due to the fact that more than one genotype (network wiring can give rise to the same phenotype. We refer to this phenomenon as "gene elasticity." In this work, we study the effect of this particular problem on the pure, data-driven inference of gene regulatory networks. METHODOLOGY: We simulated a four-gene network in order to produce "data" (protein levels that we use in lieu of real experimental data. We then optimized the network connections between the four genes with a view to obtain the original network that gave rise to the data. We did this for two different cases: one in which only the network connections were optimized and the other in which both the network connections as well as the kinetic parameters (given as reaction probabilities in our case were estimated. We observed that multiple genotypes gave rise to very similar protein levels. Statistical experimentation indicates that it is impossible to differentiate between the different networks on the basis of both equilibrium as well as dynamic data. CONCLUSIONS: We show explicitly that reverse engineering of GRNs from pure expression data is an indeterminate problem. Our results suggest the unsuitability of an inferential, purely data-driven approach for the reverse engineering transcriptional networks in the case of gene regulatory networks displaying a certain level of complexity.

  14. Data Integration for Microarrays: Enhanced Inference for Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Alina Sîrbu

    2015-05-01

    Full Text Available Microarray technologies have been the basis of numerous important findings regarding gene expression in the few last decades. Studies have generated large amounts of data describing various processes, which, due to the existence of public databases, are widely available for further analysis. Given their lower cost and higher maturity compared to newer sequencing technologies, these data continue to be produced, even though data quality has been the subject of some debate. However, given the large volume of data generated, integration can help overcome some issues related, e.g., to noise or reduced time resolution, while providing additional insight on features not directly addressed by sequencing methods. Here, we present an integration test case based on public Drosophila melanogaster datasets (gene expression, binding site affinities, known interactions. Using an evolutionary computation framework, we show how integration can enhance the ability to recover transcriptional gene regulatory networks from these data, as well as indicating which data types are more important for quantitative and qualitative network inference. Our results show a clear improvement in performance when multiple datasets are integrated, indicating that microarray data will remain a valuable and viable resource for some time to come.

  15. Gene regulatory network interactions in sea urchin endomesoderm induction.

    Directory of Open Access Journals (Sweden)

    Aditya J Sethi

    2009-02-01

    Full Text Available A major goal of contemporary studies of embryonic development is to understand large sets of regulatory changes that accompany the phenomenon of embryonic induction. The highly resolved sea urchin pregastrular endomesoderm-gene regulatory network (EM-GRN provides a unique framework to study the global regulatory interactions underlying endomesoderm induction. Vegetal micromeres of the sea urchin embryo constitute a classic endomesoderm signaling center, whose potential to induce archenteron formation from presumptive ectoderm was demonstrated almost a century ago. In this work, we ectopically activate the primary mesenchyme cell-GRN (PMC-GRN that operates in micromere progeny by misexpressing the micromere determinant Pmar1 and identify the responding EM-GRN that is induced in animal blastomeres. Using localized loss-of -function analyses in conjunction with expression of endo16, the molecular definition of micromere-dependent endomesoderm specification, we show that the TGFbeta cytokine, ActivinB, is an essential component of this induction in blastomeres that emit this signal, as well as in cells that respond to it. We report that normal pregastrular endomesoderm specification requires activation of the Pmar1-inducible subset of the EM-GRN by the same cytokine, strongly suggesting that early micromere-mediated endomesoderm specification, which regulates timely gastrulation in the sea urchin embryo, is also ActivinB dependent. This study unexpectedly uncovers the existence of an additional uncharacterized micromere signal to endomesoderm progenitors, significantly revising existing models. In one of the first network-level characterizations of an intercellular inductive phenomenon, we describe an important in vivo model of the requirement of ActivinB signaling in the earliest steps of embryonic endomesoderm progenitor specification.

  16. Analysis of deterministic cyclic gene regulatory network models with delays

    CERN Document Server

    Ahsen, Mehmet Eren; Niculescu, Silviu-Iulian

    2015-01-01

    This brief examines a deterministic, ODE-based model for gene regulatory networks (GRN) that incorporates nonlinearities and time-delayed feedback. An introductory chapter provides some insights into molecular biology and GRNs. The mathematical tools necessary for studying the GRN model are then reviewed, in particular Hill functions and Schwarzian derivatives. One chapter is devoted to the analysis of GRNs under negative feedback with time delays and a special case of a homogenous GRN is considered. Asymptotic stability analysis of GRNs under positive feedback is then considered in a separate chapter, in which conditions leading to bi-stability are derived. Graduate and advanced undergraduate students and researchers in control engineering, applied mathematics, systems biology and synthetic biology will find this brief to be a clear and concise introduction to the modeling and analysis of GRNs.

  17. Identifying time-delayed gene regulatory networks via an evolvable hierarchical recurrent neural network.

    Science.gov (United States)

    Kordmahalleh, Mina Moradi; Sefidmazgi, Mohammad Gorji; Harrison, Scott H; Homaifar, Abdollah

    2017-01-01

    The modeling of genetic interactions within a cell is crucial for a basic understanding of physiology and for applied areas such as drug design. Interactions in gene regulatory networks (GRNs) include effects of transcription factors, repressors, small metabolites, and microRNA species. In addition, the effects of regulatory interactions are not always simultaneous, but can occur after a finite time delay, or as a combined outcome of simultaneous and time delayed interactions. Powerful biotechnologies have been rapidly and successfully measuring levels of genetic expression to illuminate different states of biological systems. This has led to an ensuing challenge to improve the identification of specific regulatory mechanisms through regulatory network reconstructions. Solutions to this challenge will ultimately help to spur forward efforts based on the usage of regulatory network reconstructions in systems biology applications. We have developed a hierarchical recurrent neural network (HRNN) that identifies time-delayed gene interactions using time-course data. A customized genetic algorithm (GA) was used to optimize hierarchical connectivity of regulatory genes and a target gene. The proposed design provides a non-fully connected network with the flexibility of using recurrent connections inside the network. These features and the non-linearity of the HRNN facilitate the process of identifying temporal patterns of a GRN. Our HRNN method was implemented with the Python language. It was first evaluated on simulated data representing linear and nonlinear time-delayed gene-gene interaction models across a range of network sizes and variances of noise. We then further demonstrated the capability of our method in reconstructing GRNs of the Saccharomyces cerevisiae synthetic network for in vivo benchmarking of reverse-engineering and modeling approaches (IRMA). We compared the performance of our method to TD-ARACNE, HCC-CLINDE, TSNI and ebdbNet across different network

  18. Automated large-scale control of gene regulatory networks.

    Science.gov (United States)

    Tan, Mehmet; Alhajj, Reda; Polat, Faruk

    2010-04-01

    Controlling gene regulatory networks (GRNs) is an important and hard problem. As it is the case in all control problems, the curse of dimensionality is the main issue in real applications. It is possible that hundreds of genes may regulate one biological activity in an organism; this implies a huge state space, even in the case of Boolean models. This is also evident in the literature that shows that only models of small portions of the genome could be used in control applications. In this paper, we empower our framework for controlling GRNs by eliminating the need for expert knowledge to specify some crucial threshold that is necessary for producing effective results. Our framework is characterized by applying the factored Markov decision problem (FMDP) method to the control problem of GRNs. The FMDP is a suitable framework for large state spaces as it represents the probability distribution of state transitions using compact models so that more space and time efficient algorithms could be devised for solving control problems. We successfully mapped the GRN control problem to an FMDP and propose a model reduction algorithm that helps find approximate solutions for large networks by using existing FMDP solvers. The test results reported in this paper demonstrate the efficiency and effectiveness of the proposed approach.

  19. Learning Gene Regulatory Networks Computationally from Gene Expression Data Using Weighted Consensus

    KAUST Repository

    Fujii, Chisato

    2015-04-16

    Gene regulatory networks analyze the relationships between genes allowing us to un- derstand the gene regulatory interactions in systems biology. Gene expression data from the microarray experiments is used to obtain the gene regulatory networks. How- ever, the microarray data is discrete, noisy and non-linear which makes learning the networks a challenging problem and existing gene network inference methods do not give consistent results. Current state-of-the-art study uses the average-ranking-based consensus method to combine and average the ranked predictions from individual methods. However each individual method has an equal contribution to the consen- sus prediction. We have developed a linear programming-based consensus approach which uses learned weights from linear programming among individual methods such that the methods have di↵erent weights depending on their performance. Our result reveals that assigning di↵erent weights to individual methods rather than giving them equal weights improves the performance of the consensus. The linear programming- based consensus method is evaluated and it had the best performance on in silico and Saccharomyces cerevisiae networks, and the second best on the Escherichia coli network outperformed by Inferelator Pipeline method which gives inconsistent results across a wide range of microarray data sets.

  20. Gene Regulatory Networks from Multifactorial Perturbations Using Graphical Lasso: Application to the DREAM4 Challenge

    NARCIS (Netherlands)

    Menéndez, P.; Kourmpetis, Y.I.A.; Braak, ter C.J.F.; Eeuwijk, van F.A.

    2010-01-01

    A major challenge in the field of systems biology consists of predicting gene regulatory networks based on different training data. Within the DREAM4 initiative, we took part in the multifactorial sub-challenge that aimed to predict gene regulatory networks of size 100 from training data consisting

  1. Learning gene regulatory networks from gene expression data using weighted consensus

    KAUST Repository

    Fujii, Chisato

    2016-08-25

    An accurate determination of the network structure of gene regulatory systems from high-throughput gene expression data is an essential yet challenging step in studying how the expression of endogenous genes is controlled through a complex interaction of gene products and DNA. While numerous methods have been proposed to infer the structure of gene regulatory networks, none of them seem to work consistently over different data sets with high accuracy. A recent study to compare gene network inference methods showed that an average-ranking-based consensus method consistently performs well under various settings. Here, we propose a linear programming-based consensus method for the inference of gene regulatory networks. Unlike the average-ranking-based one, which treats the contribution of each individual method equally, our new consensus method assigns a weight to each method based on its credibility. As a case study, we applied the proposed consensus method on synthetic and real microarray data sets, and compared its performance to that of the average-ranking-based consensus and individual inference methods. Our results show that our weighted consensus method achieves superior performance over the unweighted one, suggesting that assigning weights to different individual methods rather than giving them equal weights improves the accuracy. © 2016 Elsevier B.V.

  2. Modular reorganization of the global network of gene regulatory interactions during perinatal human brain development

    OpenAIRE

    Monzón-Sandoval, Jimena; Castillo-Morales, Atahualpa; Urrutia, Araxi O.; Gutierrez, Humberto

    2016-01-01

    Background During early development of the nervous system, gene expression patterns are known to vary widely depending on the specific developmental trajectories of different structures. Observable changes in gene expression profiles throughout development are determined by an underlying network of precise regulatory interactions between individual genes. Elucidating the organizing principles that shape this gene regulatory network is one of the central goals of developmental biology. Whether...

  3. Boolean networks using the chi-square test for inferring large-scale gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Lee Jae K

    2007-02-01

    Full Text Available Abstract Background Boolean network (BN modeling is a commonly used method for constructing gene regulatory networks from time series microarray data. However, its major drawback is that its computation time is very high or often impractical to construct large-scale gene networks. We propose a variable selection method that are not only reduces BN computation times significantly but also obtains optimal network constructions by using chi-square statistics for testing the independence in contingency tables. Results Both the computation time and accuracy of the network structures estimated by the proposed method are compared with those of the original BN methods on simulated and real yeast cell cycle microarray gene expression data sets. Our results reveal that the proposed chi-square testing (CST-based BN method significantly improves the computation time, while its ability to identify all the true network mechanisms was effectively the same as that of full-search BN methods. The proposed BN algorithm is approximately 70.8 and 7.6 times faster than the original BN algorithm when the error sizes of the Best-Fit Extension problem are 0 and 1, respectively. Further, the false positive error rate of the proposed CST-based BN algorithm tends to be less than that of the original BN. Conclusion The CST-based BN method dramatically improves the computation time of the original BN algorithm. Therefore, it can efficiently infer large-scale gene regulatory network mechanisms.

  4. The underlying molecular and network level mechanisms in the evolution of robustness in gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Mario Pujato

    Full Text Available Gene regulatory networks show robustness to perturbations. Previous works identified robustness as an emergent property of gene network evolution but the underlying molecular mechanisms are poorly understood. We used a multi-tier modeling approach that integrates molecular sequence and structure information with network architecture and population dynamics. Structural models of transcription factor-DNA complexes are used to estimate relative binding specificities. In this model, mutations in the DNA cause changes on two levels: (a at the sequence level in individual binding sites (modulating binding specificity, and (b at the network level (creating and destroying binding sites. We used this model to dissect the underlying mechanisms responsible for the evolution of robustness in gene regulatory networks. Results suggest that in sparse architectures (represented by short promoters, a mixture of local-sequence and network-architecture level changes are exploited. At the local-sequence level, robustness evolves by decreasing the probabilities of both the destruction of existent and generation of new binding sites. Meanwhile, in highly interconnected architectures (represented by long promoters, robustness evolves almost entirely via network level changes, deleting and creating binding sites that modify the network architecture.

  5. An efficient approach of attractor calculation for large-scale Boolean gene regulatory networks.

    Science.gov (United States)

    He, Qinbin; Xia, Zhile; Lin, Bin

    2016-11-07

    Boolean network models provide an efficient way for studying gene regulatory networks. The main dynamics of a Boolean network is determined by its attractors. Attractor calculation plays a key role for analyzing Boolean gene regulatory networks. An approach of attractor calculation was proposed in this study, which improved the predecessor-based approach. Furthermore, the proposed approach combined with the identification of constant nodes and simplified Boolean networks to accelerate attractor calculation. The proposed algorithm is effective to calculate all attractors for large-scale Boolean gene regulatory networks. If the average degree of the network is not too large, the algorithm can get all attractors of a Boolean network with dozens or even hundreds of nodes.

  6. A provisional regulatory gene network for specification of endomesoderm in the sea urchin embryo

    Science.gov (United States)

    Davidson, Eric H.; Rast, Jonathan P.; Oliveri, Paola; Ransick, Andrew; Calestani, Cristina; Yuh, Chiou-Hwa; Minokawa, Takuya; Amore, Gabriele; Hinman, Veronica; Arenas-Mena, Cesar; Otim, Ochan; Brown, C. Titus; Livi, Carolina B.; Lee, Pei Yun; Revilla, Roger; Schilstra, Maria J.; Clarke, Peter J C.; Rust, Alistair G.; Pan, Zhengjun; Arnone, Maria I.; Rowen, Lee; Cameron, R. Andrew; McClay, David R.; Hood, Leroy; Bolouri, Hamid

    2002-01-01

    We present the current form of a provisional DNA sequence-based regulatory gene network that explains in outline how endomesodermal specification in the sea urchin embryo is controlled. The model of the network is in a continuous process of revision and growth as new genes are added and new experimental results become available; see http://www.its.caltech.edu/mirsky/endomeso.htm (End-mes Gene Network Update) for the latest version. The network contains over 40 genes at present, many newly uncovered in the course of this work, and most encoding DNA-binding transcriptional regulatory factors. The architecture of the network was approached initially by construction of a logic model that integrated the extensive experimental evidence now available on endomesoderm specification. The internal linkages between genes in the network have been determined functionally, by measurement of the effects of regulatory perturbations on the expression of all relevant genes in the network. Five kinds of perturbation have been applied: (1) use of morpholino antisense oligonucleotides targeted to many of the key regulatory genes in the network; (2) transformation of other regulatory factors into dominant repressors by construction of Engrailed repressor domain fusions; (3) ectopic expression of given regulatory factors, from genetic expression constructs and from injected mRNAs; (4) blockade of the beta-catenin/Tcf pathway by introduction of mRNA encoding the intracellular domain of cadherin; and (5) blockade of the Notch signaling pathway by introduction of mRNA encoding the extracellular domain of the Notch receptor. The network model predicts the cis-regulatory inputs that link each gene into the network. Therefore, its architecture is testable by cis-regulatory analysis. Strongylocentrotus purpuratus and Lytechinus variegatus genomic BAC recombinants that include a large number of the genes in the network have been sequenced and annotated. Tests of the cis-regulatory predictions of

  7. Inference of Cancer-specific Gene Regulatory Networks Using Soft Computing Rules

    Directory of Open Access Journals (Sweden)

    Xiaosheng Wang

    2010-03-01

    Full Text Available Perturbations of gene regulatory networks are essentially responsible for oncogenesis. Therefore, inferring the gene regulatory networks is a key step to overcoming cancer. In this work, we propose a method for inferring directed gene regulatory networks based on soft computing rules, which can identify important cause-effect regulatory relations of gene expression. First, we identify important genes associated with a specific cancer (colon cancer using a supervised learning approach. Next, we reconstruct the gene regulatory networks by inferring the regulatory relations among the identified genes, and their regulated relations by other genes within the genome. We obtain two meaningful findings. One is that upregulated genes are regulated by more genes than downregulated ones, while downregulated genes regulate more genes than upregulated ones. The other one is that tumor suppressors suppress tumor activators and activate other tumor suppressors strongly, while tumor activators activate other tumor activators and suppress tumor suppressors weakly, indicating the robustness of biological systems. These findings provide valuable insights into the pathogenesis of cancer.

  8. Optimal Control of Gene Regulatory Networks with Effectiveness of Multiple Drugs: A Boolean Network Approach

    Science.gov (United States)

    Kobayashi, Koichi; Hiraishi, Kunihiko

    2013-01-01

    Developing control theory of gene regulatory networks is one of the significant topics in the field of systems biology, and it is expected to apply the obtained results to gene therapy technologies in the future. In this paper, a control method using a Boolean network (BN) is studied. A BN is widely used as a model of gene regulatory networks, and gene expression is expressed by a binary value (0 or 1). In the control problem, we assume that the concentration level of a part of genes is arbitrarily determined as the control input. However, there are cases that no gene satisfying this assumption exists, and it is important to consider structural control via external stimuli. Furthermore, these controls are realized by multiple drugs, and it is also important to consider multiple effects such as duration of effect and side effects. In this paper, we propose a BN model with two types of the control inputs and an optimal control method with duration of drug effectiveness. First, a BN model and duration of drug effectiveness are discussed. Next, the optimal control problem is formulated and is reduced to an integer linear programming problem. Finally, numerical simulations are shown. PMID:24058904

  9. Experimental design for parameter estimation of gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Bernhard Steiert

    Full Text Available Systems biology aims for building quantitative models to address unresolved issues in molecular biology. In order to describe the behavior of biological cells adequately, gene regulatory networks (GRNs are intensively investigated. As the validity of models built for GRNs depends crucially on the kinetic rates, various methods have been developed to estimate these parameters from experimental data. For this purpose, it is favorable to choose the experimental conditions yielding maximal information. However, existing experimental design principles often rely on unfulfilled mathematical assumptions or become computationally demanding with growing model complexity. To solve this problem, we combined advanced methods for parameter and uncertainty estimation with experimental design considerations. As a showcase, we optimized three simulated GRNs in one of the challenges from the Dialogue for Reverse Engineering Assessment and Methods (DREAM. This article presents our approach, which was awarded the best performing procedure at the DREAM6 Estimation of Model Parameters challenge. For fast and reliable parameter estimation, local deterministic optimization of the likelihood was applied. We analyzed identifiability and precision of the estimates by calculating the profile likelihood. Furthermore, the profiles provided a way to uncover a selection of most informative experiments, from which the optimal one was chosen using additional criteria at every step of the design process. In conclusion, we provide a strategy for optimal experimental design and show its successful application on three highly nonlinear dynamic models. Although presented in the context of the GRNs to be inferred for the DREAM6 challenge, the approach is generic and applicable to most types of quantitative models in systems biology and other disciplines.

  10. Learning a Markov Logic network for supervised gene regulatory network inference.

    Science.gov (United States)

    Brouard, Céline; Vrain, Christel; Dubois, Julie; Castel, David; Debily, Marie-Anne; d'Alché-Buc, Florence

    2013-09-12

    Gene regulatory network inference remains a challenging problem in systems biology despite the numerous approaches that have been proposed. When substantial knowledge on a gene regulatory network is already available, supervised network inference is appropriate. Such a method builds a binary classifier able to assign a class (Regulation/No regulation) to an ordered pair of genes. Once learnt, the pairwise classifier can be used to predict new regulations. In this work, we explore the framework of Markov Logic Networks (MLN) that combine features of probabilistic graphical models with the expressivity of first-order logic rules. We propose to learn a Markov Logic network, e.g. a set of weighted rules that conclude on the predicate "regulates", starting from a known gene regulatory network involved in the switch proliferation/differentiation of keratinocyte cells, a set of experimental transcriptomic data and various descriptions of genes all encoded into first-order logic. As training data are unbalanced, we use asymmetric bagging to learn a set of MLNs. The prediction of a new regulation can then be obtained by averaging predictions of individual MLNs. As a side contribution, we propose three in silico tests to assess the performance of any pairwise classifier in various network inference tasks on real datasets. A first test consists of measuring the average performance on balanced edge prediction problem; a second one deals with the ability of the classifier, once enhanced by asymmetric bagging, to update a given network. Finally our main result concerns a third test that measures the ability of the method to predict regulations with a new set of genes. As expected, MLN, when provided with only numerical discretized gene expression data, does not perform as well as a pairwise SVM in terms of AUPR. However, when a more complete description of gene properties is provided by heterogeneous sources, MLN achieves the same performance as a black-box model such as a

  11. The impact of gene expression variation on the robustness and evolvability of a developmental gene regulatory network.

    Directory of Open Access Journals (Sweden)

    David A Garfield

    2013-10-01

    Full Text Available Regulatory interactions buffer development against genetic and environmental perturbations, but adaptation requires phenotypes to change. We investigated the relationship between robustness and evolvability within the gene regulatory network underlying development of the larval skeleton in the sea urchin Strongylocentrotus purpuratus. We find extensive variation in gene expression in this network throughout development in a natural population, some of which has a heritable genetic basis. Switch-like regulatory interactions predominate during early development, buffer expression variation, and may promote the accumulation of cryptic genetic variation affecting early stages. Regulatory interactions during later development are typically more sensitive (linear, allowing variation in expression to affect downstream target genes. Variation in skeletal morphology is associated primarily with expression variation of a few, primarily structural, genes at terminal positions within the network. These results indicate that the position and properties of gene interactions within a network can have important evolutionary consequences independent of their immediate regulatory role.

  12. Gene regulatory network modeling via global optimization of high-order dynamic Bayesian network

    Directory of Open Access Journals (Sweden)

    Xuan Nguyen

    2012-06-01

    Full Text Available Abstract Background Dynamic Bayesian network (DBN is among the mainstream approaches for modeling various biological networks, including the gene regulatory network (GRN. Most current methods for learning DBN employ either local search such as hill-climbing, or a meta stochastic global optimization framework such as genetic algorithm or simulated annealing, which are only able to locate sub-optimal solutions. Further, current DBN applications have essentially been limited to small sized networks. Results To overcome the above difficulties, we introduce here a deterministic global optimization based DBN approach for reverse engineering genetic networks from time course gene expression data. For such DBN models that consist only of inter time slice arcs, we show that there exists a polynomial time algorithm for learning the globally optimal network structure. The proposed approach, named GlobalMIT+, employs the recently proposed information theoretic scoring metric named mutual information test (MIT. GlobalMIT+ is able to learn high-order time delayed genetic interactions, which are common to most biological systems. Evaluation of the approach using both synthetic and real data sets, including a 733 cyanobacterial gene expression data set, shows significantly improved performance over other techniques. Conclusions Our studies demonstrate that deterministic global optimization approaches can infer large scale genetic networks.

  13. CMIP: a software package capable of reconstructing genome-wide regulatory networks using gene expression data.

    Science.gov (United States)

    Zheng, Guangyong; Xu, Yaochen; Zhang, Xiujun; Liu, Zhi-Ping; Wang, Zhuo; Chen, Luonan; Zhu, Xin-Guang

    2016-12-23

    A gene regulatory network (GRN) represents interactions of genes inside a cell or tissue, in which vertexes and edges stand for genes and their regulatory interactions respectively. Reconstruction of gene regulatory networks, in particular, genome-scale networks, is essential for comparative exploration of different species and mechanistic investigation of biological processes. Currently, most of network inference methods are computationally intensive, which are usually effective for small-scale tasks (e.g., networks with a few hundred genes), but are difficult to construct GRNs at genome-scale. Here, we present a software package for gene regulatory network reconstruction at a genomic level, in which gene interaction is measured by the conditional mutual information measurement using a parallel computing framework (so the package is named CMIP). The package is a greatly improved implementation of our previous PCA-CMI algorithm. In CMIP, we provide not only an automatic threshold determination method but also an effective parallel computing framework for network inference. Performance tests on benchmark datasets show that the accuracy of CMIP is comparable to most current network inference methods. Moreover, running tests on synthetic datasets demonstrate that CMIP can handle large datasets especially genome-wide datasets within an acceptable time period. In addition, successful application on a real genomic dataset confirms its practical applicability of the package. This new software package provides a powerful tool for genomic network reconstruction to biological community. The software can be accessed at http://www.picb.ac.cn/CMIP/ .

  14. Inferring gene regulatory networks via nonlinear state-space models and exploiting sparsity.

    Science.gov (United States)

    Noor, Amina; Serpedin, Erchin; Nounou, Mohamed; Nounou, Hazem N

    2012-01-01

    This paper considers the problem of learning the structure of gene regulatory networks from gene expression time series data. A more realistic scenario when the state space model representing a gene network evolves nonlinearly is considered while a linear model is assumed for the microarray data. To capture the nonlinearity, a particle filter-based state estimation algorithm is considered instead of the contemporary linear approximation-based approaches. The parameters characterizing the regulatory relations among various genes are estimated online using a Kalman filter. Since a particular gene interacts with a few other genes only, the parameter vector is expected to be sparse. The state estimates delivered by the particle filter and the observed microarray data are then subjected to a LASSO-based least squares regression operation which yields a parsimonious and efficient description of the regulatory network by setting the irrelevant coefficients to zero. The performance of the aforementioned algorithm is compared with the extended Kalman filter (EKF) and Unscented Kalman Filter (UKF) employing the Mean Square Error (MSE) as the fidelity criterion in recovering the parameters of gene regulatory networks from synthetic data and real biological data. Extensive computer simulations illustrate that the proposed particle filter-based network inference algorithm outperforms EKF and UKF, and therefore, it can serve as a natural framework for modeling gene regulatory networks with nonlinear and sparse structure.

  15. Stochastic Boolean networks: An efficient approach to modeling gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Liang Jinghang

    2012-08-01

    Full Text Available Abstract Background Various computational models have been of interest due to their use in the modelling of gene regulatory networks (GRNs. As a logical model, probabilistic Boolean networks (PBNs consider molecular and genetic noise, so the study of PBNs provides significant insights into the understanding of the dynamics of GRNs. This will ultimately lead to advances in developing therapeutic methods that intervene in the process of disease development and progression. The applications of PBNs, however, are hindered by the complexities involved in the computation of the state transition matrix and the steady-state distribution of a PBN. For a PBN with n genes and N Boolean networks, the complexity to compute the state transition matrix is O(nN22n or O(nN2n for a sparse matrix. Results This paper presents a novel implementation of PBNs based on the notions of stochastic logic and stochastic computation. This stochastic implementation of a PBN is referred to as a stochastic Boolean network (SBN. An SBN provides an accurate and efficient simulation of a PBN without and with random gene perturbation. The state transition matrix is computed in an SBN with a complexity of O(nL2n, where L is a factor related to the stochastic sequence length. Since the minimum sequence length required for obtaining an evaluation accuracy approximately increases in a polynomial order with the number of genes, n, and the number of Boolean networks, N, usually increases exponentially with n, L is typically smaller than N, especially in a network with a large number of genes. Hence, the computational efficiency of an SBN is primarily limited by the number of genes, but not directly by the total possible number of Boolean networks. Furthermore, a time-frame expanded SBN enables an efficient analysis of the steady-state distribution of a PBN. These findings are supported by the simulation results of a simplified p53 network, several randomly generated networks and a

  16. Gene regulatory network inference using fused LASSO on multiple data sets.

    Science.gov (United States)

    Omranian, Nooshin; Eloundou-Mbebi, Jeanne M O; Mueller-Roeber, Bernd; Nikoloski, Zoran

    2016-02-11

    Devising computational methods to accurately reconstruct gene regulatory networks given gene expression data is key to systems biology applications. Here we propose a method for reconstructing gene regulatory networks by simultaneous consideration of data sets from different perturbation experiments and corresponding controls. The method imposes three biologically meaningful constraints: (1) expression levels of each gene should be explained by the expression levels of a small number of transcription factor coding genes, (2) networks inferred from different data sets should be similar with respect to the type and number of regulatory interactions, and (3) relationships between genes which exhibit similar differential behavior over the considered perturbations should be favored. We demonstrate that these constraints can be transformed in a fused LASSO formulation for the proposed method. The comparative analysis on transcriptomics time-series data from prokaryotic species, Escherichia coli and Mycobacterium tuberculosis, as well as a eukaryotic species, mouse, demonstrated that the proposed method has the advantages of the most recent approaches for regulatory network inference, while obtaining better performance and assigning higher scores to the true regulatory links. The study indicates that the combination of sparse regression techniques with other biologically meaningful constraints is a promising framework for gene regulatory network reconstructions.

  17. A method for developing regulatory gene set networks to characterize complex biological systems.

    Science.gov (United States)

    Suphavilai, Chayaporn; Zhu, Liugen; Chen, Jake Y

    2015-01-01

    Traditional approaches to studying molecular networks are based on linking genes or proteins. Higher-level networks linking gene sets or pathways have been proposed recently. Several types of gene set networks have been used to study complex molecular networks such as co-membership gene set networks (M-GSNs) and co-enrichment gene set networks (E-GSNs). Gene set networks are useful for studying biological mechanism of diseases and drug perturbations. In this study, we proposed a new approach for constructing directed, regulatory gene set networks (R-GSNs) to reveal novel relationships among gene sets or pathways. We collected several gene set collections and high-quality gene regulation data in order to construct R-GSNs in a comparative study with co-membership gene set networks (M-GSNs). We described a method for constructing both global and disease-specific R-GSNs and determining their significance. To demonstrate the potential applications to disease biology studies, we constructed and analysed an R-GSN specifically built for Alzheimer's disease. R-GSNs can provide new biological insights complementary to those derived at the protein regulatory network level or M-GSNs. When integrated properly to functional genomics data, R-GSNs can help enable future research on systems biology and translational bioinformatics.

  18. Reconstruction of the Regulatory Network for Bacillus subtilis and Reconciliation with Gene Expression Data

    Science.gov (United States)

    Faria, José P.; Overbeek, Ross; Taylor, Ronald C.; Conrad, Neal; Vonstein, Veronika; Goelzer, Anne; Fromion, Vincent; Rocha, Miguel; Rocha, Isabel; Henry, Christopher S.

    2016-01-01

    We introduce a manually constructed and curated regulatory network model that describes the current state of knowledge of transcriptional regulation of Bacillus subtilis. The model corresponds to an updated and enlarged version of the regulatory model of central metabolism originally proposed in 2008. We extended the original network to the whole genome by integration of information from DBTBS, a compendium of regulatory data that includes promoters, transcription factors (TFs), binding sites, motifs, and regulated operons. Additionally, we consolidated our network with all the information on regulation included in the SporeWeb and Subtiwiki community-curated resources on B. subtilis. Finally, we reconciled our network with data from RegPrecise, which recently released their own less comprehensive reconstruction of the regulatory network for B. subtilis. Our model describes 275 regulators and their target genes, representing 30 different mechanisms of regulation such as TFs, RNA switches, Riboswitches, and small regulatory RNAs. Overall, regulatory information is included in the model for ∼2500 of the ∼4200 genes in B. subtilis 168. In an effort to further expand our knowledge of B. subtilis regulation, we reconciled our model with expression data. For this process, we reconstructed the Atomic Regulons (ARs) for B. subtilis, which are the sets of genes that share the same “ON” and “OFF” gene expression profiles across multiple samples of experimental data. We show how ARs for B. subtilis are able to capture many sets of genes corresponding to regulated operons in our manually curated network. Additionally, we demonstrate how ARs can be used to help expand or validate the knowledge of the regulatory networks by looking at highly correlated genes in the ARs for which regulatory information is lacking. During this process, we were also able to infer novel stimuli for hypothetical genes by exploring the genome expression metadata relating to experimental

  19. Reconstruction of the Regulatory Network for Bacillus subtilis and Reconciliation with Gene Expression Data

    Energy Technology Data Exchange (ETDEWEB)

    Faria, José P.; Overbeek, Ross; Taylor, Ronald C.; Conrad, Neal; Vonstein, Veronika; Goelzer, Anne; Fromion, Vincent; Rocha, Miguel; Rocha, Isabel; Henry, Christopher S.

    2016-03-18

    We introduce a manually constructed and curated regulatory network model that describes the current state of knowledge of transcriptional regulation of B. subtilis. The model corresponds to an updated and enlarged version of the regulatory model of central metabolism originally proposed in 2008. We extended the original network to the whole genome by integration of information from DBTBS, a compendium of regulatory data that includes promoters, transcription factors (TFs), binding sites, motifs and regulated operons. Additionally, we consolidated our network with all the information on regulation included in the SporeWeb and Subtiwiki community-curated resources on B. subtilis. Finally, we reconciled our network with data from RegPrecise, which recently released their own less comprehensive reconstruction of the regulatory network for B. subtilis. Our model describes 275 regulators and their target genes, representing 30 different mechanisms of regulation such as TFs, RNA switches, Riboswitches and small regulatory RNAs. Overall, regulatory information is included in the model for approximately 2500 of the ~4200 genes in B. subtilis 168. In an effort to further expand our knowledge of B. subtilis regulation, we reconciled our model with expression data. For this process, we reconstructed the Atomic Regulons (ARs) for B. subtilis, which are the sets of genes that share the same “ON” and “OFF” gene expression profiles across multiple samples of experimental data. We show how atomic regulons for B. subtilis are able to capture many sets of genes corresponding to regulated operons in our manually curated network. Additionally, we demonstrate how atomic regulons can be used to help expand or validate the knowledge of the regulatory networks by looking at highly correlated genes in the ARs for which regulatory information is lacking. During this process, we were also able to infer novel stimuli for hypothetical genes by exploring the genome expression metadata

  20. On the Interplay between Entropy and Robustness of Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Bor-Sen Chen

    2010-05-01

    Full Text Available The interplay between entropy and robustness of gene network is a core mechanism of systems biology. The entropy is a measure of randomness or disorder of a physical system due to random parameter fluctuation and environmental noises in gene regulatory networks. The robustness of a gene regulatory network, which can be measured as the ability to tolerate the random parameter fluctuation and to attenuate the effect of environmental noise, will be discussed from the robust H∞ stabilization and filtering perspective. In this review, we will also discuss their balancing roles in evolution and potential applications in systems and synthetic biology.

  1. CoryneRegNet 4.0 – A reference database for corynebacterial gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Baumbach Jan

    2007-11-01

    Full Text Available Abstract Background Detailed information on DNA-binding transcription factors (the key players in the regulation of gene expression and on transcriptional regulatory interactions of microorganisms deduced from literature-derived knowledge, computer predictions and global DNA microarray hybridization experiments, has opened the way for the genome-wide analysis of transcriptional regulatory networks. The large-scale reconstruction of these networks allows the in silico analysis of cell behavior in response to changing environmental conditions. We previously published CoryneRegNet, an ontology-based data warehouse of corynebacterial transcription factors and regulatory networks. Initially, it was designed to provide methods for the analysis and visualization of the gene regulatory network of Corynebacterium glutamicum. Results Now we introduce CoryneRegNet release 4.0, which integrates data on the gene regulatory networks of 4 corynebacteria, 2 mycobacteria and the model organism Escherichia coli K12. As the previous versions, CoryneRegNet provides a web-based user interface to access the database content, to allow various queries, and to support the reconstruction, analysis and visualization of regulatory networks at different hierarchical levels. In this article, we present the further improved database content of CoryneRegNet along with novel analysis features. The network visualization feature GraphVis now allows the inter-species comparisons of reconstructed gene regulatory networks and the projection of gene expression levels onto that networks. Therefore, we added stimulon data directly into the database, but also provide Web Service access to the DNA microarray analysis platform EMMA. Additionally, CoryneRegNet now provides a SOAP based Web Service server, which can easily be consumed by other bioinformatics software systems. Stimulons (imported from the database, or uploaded by the user can be analyzed in the context of known

  2. Large-scale modeling of condition-specific gene regulatory networks by information integration and inference.

    Science.gov (United States)

    Ellwanger, Daniel Christian; Leonhardt, Jörn Florian; Mewes, Hans-Werner

    2014-12-01

    Understanding how regulatory networks globally coordinate the response of a cell to changing conditions, such as perturbations by shifting environments, is an elementary challenge in systems biology which has yet to be met. Genome-wide gene expression measurements are high dimensional as these are reflecting the condition-specific interplay of thousands of cellular components. The integration of prior biological knowledge into the modeling process of systems-wide gene regulation enables the large-scale interpretation of gene expression signals in the context of known regulatory relations. We developed COGERE (http://mips.helmholtz-muenchen.de/cogere), a method for the inference of condition-specific gene regulatory networks in human and mouse. We integrated existing knowledge of regulatory interactions from multiple sources to a comprehensive model of prior information. COGERE infers condition-specific regulation by evaluating the mutual dependency between regulator (transcription factor or miRNA) and target gene expression using prior information. This dependency is scored by the non-parametric, nonlinear correlation coefficient η(2) (eta squared) that is derived by a two-way analysis of variance. We show that COGERE significantly outperforms alternative methods in predicting condition-specific gene regulatory networks on simulated data sets. Furthermore, by inferring the cancer-specific gene regulatory network from the NCI-60 expression study, we demonstrate the utility of COGERE to promote hypothesis-driven clinical research.

  3. iSLIM: a comprehensive approach to mapping and characterizing gene regulatory networks.

    Science.gov (United States)

    Rockel, Sylvie; Geertz, Marcel; Hens, Korneel; Deplancke, Bart; Maerkl, Sebastian J

    2013-02-01

    Mapping gene regulatory networks is a significant challenge in systems biology, yet only a few methods are currently capable of systems-level identification of transcription factors (TFs) that bind a specific regulatory element. We developed a microfluidic method for integrated systems-level interaction mapping of TF-DNA interactions, generating and interrogating an array of 423 full-length Drosophila TFs. With integrated systems-level interaction mapping, it is now possible to rapidly and quantitatively map gene regulatory networks of higher eukaryotes.

  4. A boolean model of the cardiac gene regulatory network determining first and second heart field identity.

    Directory of Open Access Journals (Sweden)

    Franziska Herrmann

    Full Text Available Two types of distinct cardiac progenitor cell populations can be identified during early heart development: the first heart field (FHF and second heart field (SHF lineage that later form the mature heart. They can be characterized by differential expression of transcription and signaling factors. These regulatory factors influence each other forming a gene regulatory network. Here, we present a core gene regulatory network for early cardiac development based on published temporal and spatial expression data of genes and their interactions. This gene regulatory network was implemented in a Boolean computational model. Simulations reveal stable states within the network model, which correspond to the regulatory states of the FHF and the SHF lineages. Furthermore, we are able to reproduce the expected temporal expression patterns of early cardiac factors mimicking developmental progression. Additionally, simulations of knock-down experiments within our model resemble published phenotypes of mutant mice. Consequently, this gene regulatory network retraces the early steps and requirements of cardiogenic mesoderm determination in a way appropriate to enhance the understanding of heart development.

  5. A Boolean Model of the Cardiac Gene Regulatory Network Determining First and Second Heart Field Identity

    Science.gov (United States)

    Zhou, Dao; Kestler, Hans A.; Kühl, Michael

    2012-01-01

    Two types of distinct cardiac progenitor cell populations can be identified during early heart development: the first heart field (FHF) and second heart field (SHF) lineage that later form the mature heart. They can be characterized by differential expression of transcription and signaling factors. These regulatory factors influence each other forming a gene regulatory network. Here, we present a core gene regulatory network for early cardiac development based on published temporal and spatial expression data of genes and their interactions. This gene regulatory network was implemented in a Boolean computational model. Simulations reveal stable states within the network model, which correspond to the regulatory states of the FHF and the SHF lineages. Furthermore, we are able to reproduce the expected temporal expression patterns of early cardiac factors mimicking developmental progression. Additionally, simulations of knock-down experiments within our model resemble published phenotypes of mutant mice. Consequently, this gene regulatory network retraces the early steps and requirements of cardiogenic mesoderm determination in a way appropriate to enhance the understanding of heart development. PMID:23056457

  6. Structures and Boolean Dynamics in Gene Regulatory Networks

    Science.gov (United States)

    Szedlak, Anthony

    This dissertation discusses the topological and dynamical properties of GRNs in cancer, and is divided into four main chapters. First, the basic tools of modern complex network theory are introduced. These traditional tools as well as those developed by myself (set efficiency, interset efficiency, and nested communities) are crucial for understanding the intricate topological properties of GRNs, and later chapters recall these concepts. Second, the biology of gene regulation is discussed, and a method for disease-specific GRN reconstruction developed by our collaboration is presented. This complements the traditional exhaustive experimental approach of building GRNs edge-by-edge by quickly inferring the existence of as of yet undiscovered edges using correlations across sets of gene expression data. This method also provides insight into the distribution of common mutations across GRNs. Third, I demonstrate that the structures present in these reconstructed networks are strongly related to the evolutionary histories of their constituent genes. Investigation of how the forces of evolution shaped the topology of GRNs in multicellular organisms by growing outward from a core of ancient, conserved genes can shed light upon the ''reverse evolution'' of normal cells into unicellular-like cancer states. Next, I simulate the dynamics of the GRNs of cancer cells using the Hopfield model, an infinite range spin-glass model designed with the ability to encode Boolean data as attractor states. This attractor-driven approach facilitates the integration of gene expression data into predictive mathematical models. Perturbations representing therapeutic interventions are applied to sets of genes, and the resulting deviations from their attractor states are recorded, suggesting new potential drug targets for experimentation. Finally, I extend the Hopfield model to modular networks, cyclic attractors, and complex attractors, and apply these concepts to simulations of the cell cycle

  7. Reconstruction of large-scale gene regulatory networks using Bayesian model averaging.

    Science.gov (United States)

    Kim, Haseong; Gelenbe, Erol

    2012-09-01

    Gene regulatory networks provide the systematic view of molecular interactions in a complex living system. However, constructing large-scale gene regulatory networks is one of the most challenging problems in systems biology. Also large burst sets of biological data require a proper integration technique for reliable gene regulatory network construction. Here we present a new reverse engineering approach based on Bayesian model averaging which attempts to combine all the appropriate models describing interactions among genes. This Bayesian approach with a prior based on the Gibbs distribution provides an efficient means to integrate multiple sources of biological data. In a simulation study with maximum of 2000 genes, our method shows better sensitivity than previous elastic-net and Gaussian graphical models, with a fixed specificity of 0.99. The study also shows that the proposed method outperforms the other standard methods for a DREAM dataset generated by nonlinear stochastic models. In brain tumor data analysis, three large-scale networks consisting of 4422 genes were built using the gene expression of non-tumor, low and high grade tumor mRNA expression samples, along with DNA-protein binding affinity information. We found that genes having a large variation of degree distribution among the three tumor networks are the ones that see most involved in regulatory and developmental processes, which possibly gives a novel insight concerning conventional differentially expressed gene analysis.

  8. Discovery of time-delayed gene regulatory networks based on temporal gene expression profiling

    Directory of Open Access Journals (Sweden)

    Guo Zheng

    2006-01-01

    Full Text Available Abstract Background It is one of the ultimate goals for modern biological research to fully elucidate the intricate interplays and the regulations of the molecular determinants that propel and characterize the progression of versatile life phenomena, to name a few, cell cycling, developmental biology, aging, and the progressive and recurrent pathogenesis of complex diseases. The vast amount of large-scale and genome-wide time-resolved data is becoming increasing available, which provides the golden opportunity to unravel the challenging reverse-engineering problem of time-delayed gene regulatory networks. Results In particular, this methodological paper aims to reconstruct regulatory networks from temporal gene expression data by using delayed correlations between genes, i.e., pairwise overlaps of expression levels shifted in time relative each other. We have thus developed a novel model-free computational toolbox termed TdGRN (Time-delayed Gene Regulatory Network to address the underlying regulations of genes that can span any unit(s of time intervals. This bioinformatics toolbox has provided a unified approach to uncovering time trends of gene regulations through decision analysis of the newly designed time-delayed gene expression matrix. We have applied the proposed method to yeast cell cycling and human HeLa cell cycling and have discovered most of the underlying time-delayed regulations that are supported by multiple lines of experimental evidence and that are remarkably consistent with the current knowledge on phase characteristics for the cell cyclings. Conclusion We established a usable and powerful model-free approach to dissecting high-order dynamic trends of gene-gene interactions. We have carefully validated the proposed algorithm by applying it to two publicly available cell cycling datasets. In addition to uncovering the time trends of gene regulations for cell cycling, this unified approach can also be used to study the complex

  9. Construction of Gene Regulatory Networks Using Recurrent Neural Networks and Swarm Intelligence.

    Science.gov (United States)

    Khan, Abhinandan; Mandal, Sudip; Pal, Rajat Kumar; Saha, Goutam

    2016-01-01

    We have proposed a methodology for the reverse engineering of biologically plausible gene regulatory networks from temporal genetic expression data. We have used established information and the fundamental mathematical theory for this purpose. We have employed the Recurrent Neural Network formalism to extract the underlying dynamics present in the time series expression data accurately. We have introduced a new hybrid swarm intelligence framework for the accurate training of the model parameters. The proposed methodology has been first applied to a small artificial network, and the results obtained suggest that it can produce the best results available in the contemporary literature, to the best of our knowledge. Subsequently, we have implemented our proposed framework on experimental (in vivo) datasets. Finally, we have investigated two medium sized genetic networks (in silico) extracted from GeneNetWeaver, to understand how the proposed algorithm scales up with network size. Additionally, we have implemented our proposed algorithm with half the number of time points. The results indicate that a reduction of 50% in the number of time points does not have an effect on the accuracy of the proposed methodology significantly, with a maximum of just over 15% deterioration in the worst case.

  10. Drivers of structural features in gene regulatory networks: From biophysical constraints to biological function

    Science.gov (United States)

    Martin, O. C.; Krzywicki, A.; Zagorski, M.

    2016-07-01

    Living cells can maintain their internal states, react to changing environments, grow, differentiate, divide, etc. All these processes are tightly controlled by what can be called a regulatory program. The logic of the underlying control can sometimes be guessed at by examining the network of influences amongst genetic components. Some associated gene regulatory networks have been studied in prokaryotes and eukaryotes, unveiling various structural features ranging from broad distributions of out-degrees to recurrent "motifs", that is small subgraphs having a specific pattern of interactions. To understand what factors may be driving such structuring, a number of groups have introduced frameworks to model the dynamics of gene regulatory networks. In that context, we review here such in silico approaches and show how selection for phenotypes, i.e., network function, can shape network structure.

  11. Refining Dynamics of Gene Regulatory Networks in a Stochastic π-Calculus Framework

    OpenAIRE

    Paulevé, Loïc; Magnin, Morgan; Roux, Olivier

    2011-01-01

    International audience; In this paper, we introduce a framework allowing to model and analyse efficiently Gene Regulatory Networks in their temporal and stochastic aspects. The analysis of stable states and inference of René Thomas' discrete parameters derives from this logical formalism. We offer a compositional approach which comes with a natural translation to the Stochastic π-Calculus. The method we propose consists in successive refinements of generalized dynamics of Gene Regulatory Netw...

  12. Inference of gene regulatory networks with sparse structural equation models exploiting genetic perturbations.

    Directory of Open Access Journals (Sweden)

    Xiaodong Cai

    Full Text Available Integrating genetic perturbations with gene expression data not only improves accuracy of regulatory network topology inference, but also enables learning of causal regulatory relations between genes. Although a number of methods have been developed to integrate both types of data, the desiderata of efficient and powerful algorithms still remains. In this paper, sparse structural equation models (SEMs are employed to integrate both gene expression data and cis-expression quantitative trait loci (cis-eQTL, for modeling gene regulatory networks in accordance with biological evidence about genes regulating or being regulated by a small number of genes. A systematic inference method named sparsity-aware maximum likelihood (SML is developed for SEM estimation. Using simulated directed acyclic or cyclic networks, the SML performance is compared with that of two state-of-the-art algorithms: the adaptive Lasso (AL based scheme, and the QTL-directed dependency graph (QDG method. Computer simulations demonstrate that the novel SML algorithm offers significantly better performance than the AL-based and QDG algorithms across all sample sizes from 100 to 1,000, in terms of detection power and false discovery rate, in all the cases tested that include acyclic or cyclic networks of 10, 30 and 300 genes. The SML method is further applied to infer a network of 39 human genes that are related to the immune function and are chosen to have a reliable eQTL per gene. The resulting network consists of 9 genes and 13 edges. Most of the edges represent interactions reasonably expected from experimental evidence, while the remaining may just indicate the emergence of new interactions. The sparse SEM and efficient SML algorithm provide an effective means of exploiting both gene expression and perturbation data to infer gene regulatory networks. An open-source computer program implementing the SML algorithm is freely available upon request.

  13. Harnessing diversity towards the reconstructing of large scale gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Takeshi Hase

    Full Text Available Elucidating gene regulatory network (GRN from large scale experimental data remains a central challenge in systems biology. Recently, numerous techniques, particularly consensus driven approaches combining different algorithms, have become a potentially promising strategy to infer accurate GRNs. Here, we develop a novel consensus inference algorithm, TopkNet that can integrate multiple algorithms to infer GRNs. Comprehensive performance benchmarking on a cloud computing framework demonstrated that (i a simple strategy to combine many algorithms does not always lead to performance improvement compared to the cost of consensus and (ii TopkNet integrating only high-performance algorithms provide significant performance improvement compared to the best individual algorithms and community prediction. These results suggest that a priori determination of high-performance algorithms is a key to reconstruct an unknown regulatory network. Similarity among gene-expression datasets can be useful to determine potential optimal algorithms for reconstruction of unknown regulatory networks, i.e., if expression-data associated with known regulatory network is similar to that with unknown regulatory network, optimal algorithms determined for the known regulatory network can be repurposed to infer the unknown regulatory network. Based on this observation, we developed a quantitative measure of similarity among gene-expression datasets and demonstrated that, if similarity between the two expression datasets is high, TopkNet integrating algorithms that are optimal for known dataset perform well on the unknown dataset. The consensus framework, TopkNet, together with the similarity measure proposed in this study provides a powerful strategy towards harnessing the wisdom of the crowds in reconstruction of unknown regulatory networks.

  14. From Gene Regulation to Gene Function: Regulatory Networks in Bacillus Subtilis

    Directory of Open Access Journals (Sweden)

    Ivan Moszer

    2006-04-01

    Full Text Available Bacillus subtilis is a sporulating Gram-positive bacterium that lives primarily in the soil and associated water sources. The publication of the B. subtilis genome sequence and subsequent systematic functional analysis and gene regulation programmes, together with an extensive understanding of its biochemistry and physiology, makes this micro-organism a prime candidate in which to model regulatory networks in silico. In this paper we discuss combined molecular biological and bioinformatical approaches that are being developed to model this organism’s responses to changes in its environment.

  15. Computational studies of gene regulatory networks: in numero molecular biology.

    Science.gov (United States)

    Hasty, J; McMillen, D; Isaacs, F; Collins, J J

    2001-04-01

    Remarkable progress in genomic research is leading to a complete map of the building blocks of biology. Knowledge of this map is, in turn, setting the stage for a fundamental description of cellular function at the DNA level. Such a description will entail an understanding of gene regulation, in which proteins often regulate their own production or that of other proteins in a complex web of interactions. The implications of the underlying logic of genetic networks are difficult to deduce through experimental techniques alone, and successful approaches will probably involve the union of new experiments and computational modelling techniques.

  16. A consensus network of gene regulatory factors in the human frontal lobe

    Directory of Open Access Journals (Sweden)

    Stefano eBerto

    2016-03-01

    Full Text Available Cognitive abilities, such as memory, learning, language, problem solving, and planning, involve the frontal lobe and other brain areas. Not much is known yet about the molecular basis of cognitive abilities, but it seems clear that cognitive abilities are determined by the interplay of many genes. One approach for analyzing the genetic networks involved in cognitive functions is to study the coexpression networks of genes with known importance for proper cognitive functions, such as genes that have been associated with cognitive disorders like intellectual disability (ID or autism spectrum disorders (ASD. Because many of these genes are gene regulatory factors (GRFs we aimed to provide insights into the gene regulatory networks active in the human frontal lobe. Using genome wide human frontal lobe expression data from 10 independent data sets, we first derived 10 individual coexpression networks for all GRFs including their potential target genes. We observed a high level of variability among these 10 independently derived networks, pointing out that relying on results from a single study can only provide limited biological insights. To instead focus on the most confident information from these 10 networks we developed a method for integrating such independently derived networks into a consensus network. This consensus network revealed robust GRF interactions that are conserved across the frontal lobes of different healthy human individuals. Within this network, we detected a strong central module that is enriched for 166 GRFs known to be involved in brain development and/or cognitive disorders. Interestingly, several hubs of the consensus network encode for GRFs that have not yet been associated with brain functions. Their central role in the network suggests them as excellent new candidates for playing an essential role in the regulatory network of the human frontal lobe, which should be investigated in future studies.

  17. A Consensus Network of Gene Regulatory Factors in the Human Frontal Lobe.

    Science.gov (United States)

    Berto, Stefano; Perdomo-Sabogal, Alvaro; Gerighausen, Daniel; Qin, Jing; Nowick, Katja

    2016-01-01

    Cognitive abilities, such as memory, learning, language, problem solving, and planning, involve the frontal lobe and other brain areas. Not much is known yet about the molecular basis of cognitive abilities, but it seems clear that cognitive abilities are determined by the interplay of many genes. One approach for analyzing the genetic networks involved in cognitive functions is to study the coexpression networks of genes with known importance for proper cognitive functions, such as genes that have been associated with cognitive disorders like intellectual disability (ID) or autism spectrum disorders (ASD). Because many of these genes are gene regulatory factors (GRFs) we aimed to provide insights into the gene regulatory networks active in the human frontal lobe. Using genome wide human frontal lobe expression data from 10 independent data sets, we first derived 10 individual coexpression networks for all GRFs including their potential target genes. We observed a high level of variability among these 10 independently derived networks, pointing out that relying on results from a single study can only provide limited biological insights. To instead focus on the most confident information from these 10 networks we developed a method for integrating such independently derived networks into a consensus network. This consensus network revealed robust GRF interactions that are conserved across the frontal lobes of different healthy human individuals. Within this network, we detected a strong central module that is enriched for 166 GRFs known to be involved in brain development and/or cognitive disorders. Interestingly, several hubs of the consensus network encode for GRFs that have not yet been associated with brain functions. Their central role in the network suggests them as excellent new candidates for playing an essential role in the regulatory network of the human frontal lobe, which should be investigated in future studies.

  18. Statistical inference and reverse engineering of gene regulatory networks from observational expression data.

    Science.gov (United States)

    Emmert-Streib, Frank; Glazko, Galina V; Altay, Gökmen; de Matos Simoes, Ricardo

    2012-01-01

    In this paper, we present a systematic and conceptual overview of methods for inferring gene regulatory networks from observational gene expression data. Further, we discuss two classic approaches to infer causal structures and compare them with contemporary methods by providing a conceptual categorization thereof. We complement the above by surveying global and local evaluation measures for assessing the performance of inference algorithms.

  19. Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data

    Science.gov (United States)

    2013-01-01

    Background High-throughput RNA sequencing (RNA-Seq) is a revolutionary technique to study the transcriptome of a cell under various conditions at a systems level. Despite the wide application of RNA-Seq techniques to generate experimental data in the last few years, few computational methods are available to analyze this huge amount of transcription data. The computational methods for constructing gene regulatory networks from RNA-Seq expression data of hundreds or even thousands of genes are particularly lacking and urgently needed. Results We developed an automated bioinformatics method to predict gene regulatory networks from the quantitative expression values of differentially expressed genes based on RNA-Seq transcriptome data of a cell in different stages and conditions, integrating transcriptional, genomic and gene function data. We applied the method to the RNA-Seq transcriptome data generated for soybean root hair cells in three different development stages of nodulation after rhizobium infection. The method predicted a soybean nodulation-related gene regulatory network consisting of 10 regulatory modules common for all three stages, and 24, 49 and 70 modules separately for the first, second and third stage, each containing both a group of co-expressed genes and several transcription factors collaboratively controlling their expression under different conditions. 8 of 10 common regulatory modules were validated by at least two kinds of validations, such as independent DNA binding motif analysis, gene function enrichment test, and previous experimental data in the literature. Conclusions We developed a computational method to reliably reconstruct gene regulatory networks from RNA-Seq transcriptome data. The method can generate valuable hypotheses for interpreting biological data and designing biological experiments such as ChIP-Seq, RNA interference, and yeast two hybrid experiments. PMID:24053776

  20. PyPanda: a Python package for gene regulatory network reconstruction

    OpenAIRE

    van IJzendoorn, David G. P.; Glass, Kimberly; Quackenbush, John; Kuijjer, Marieke L

    2016-01-01

    Summary: PANDA (Passing Attributes between Networks for Data Assimilation) is a gene regulatory network inference method that uses message-passing to integrate multiple sources of ‘omics data. PANDA was originally coded in C ++. In this application note we describe PyPanda, the Python version of PANDA. PyPanda runs considerably faster than the C ++ version and includes additional features for network analysis. Availability and implementation: The open source PyPanda Python package is freely a...

  1. Fractal gene regulatory networks for robust locomotion control of modular robots

    DEFF Research Database (Denmark)

    Zahadat, Payam; Christensen, David Johan; Schultz, Ulrik Pagh;

    2010-01-01

    Designing controllers for modular robots is difficult due to the distributed and dynamic nature of the robots. In this paper fractal gene regulatory networks are evolved to control modular robots in a distributed way. Experiments with different morphologies of modular robot are performed and the ......Designing controllers for modular robots is difficult due to the distributed and dynamic nature of the robots. In this paper fractal gene regulatory networks are evolved to control modular robots in a distributed way. Experiments with different morphologies of modular robot are performed...

  2. Sensor-coupled fractal gene regulatory networks for locomotion control of a modular snake robot

    DEFF Research Database (Denmark)

    Zahadat, Payam; Christensen, David Johan; Katebi, Serajeddin

    2013-01-01

    In this paper we study fractal gene regulatory network (FGRN) controllers based on sensory information. The FGRN controllers are evolved to control a snake robot consisting of seven simulated ATRON modules. Each module contains three tilt sensors which represent the direction of gravity in the co......In this paper we study fractal gene regulatory network (FGRN) controllers based on sensory information. The FGRN controllers are evolved to control a snake robot consisting of seven simulated ATRON modules. Each module contains three tilt sensors which represent the direction of gravity...

  3. Identifying Gene Regulatory Networks in Arabidopsis by In Silico Prediction, Yeast-1-Hybrid, and Inducible Gene Profiling Assays.

    Science.gov (United States)

    Sparks, Erin E; Benfey, Philip N

    2016-01-01

    A system-wide understanding of gene regulation will provide deep insights into plant development and physiology. In this chapter we describe a threefold approach to identify the gene regulatory networks in Arabidopsis thaliana that function in a specific tissue or biological process. Since no single method is sufficient to establish comprehensive and high-confidence gene regulatory networks, we focus on the integration of three approaches. First, we describe an in silico prediction method of transcription factor-DNA binding, then an in vivo assay of transcription factor-DNA binding by yeast-1-hybrid and lastly the identification of co-expression clusters by transcription factor induction in planta. Each of these methods provides a unique tool to advance our understanding of gene regulation, and together provide a robust model for the generation of gene regulatory networks.

  4. Refining ensembles of predicted gene regulatory networks based on characteristic interaction sets.

    Directory of Open Access Journals (Sweden)

    Lukas Windhager

    Full Text Available Different ensemble voting approaches have been successfully applied for reverse-engineering of gene regulatory networks. They are based on the assumption that a good approximation of true network structure can be derived by considering the frequencies of individual interactions in a large number of predicted networks. Such approximations are typically superior in terms of prediction quality and robustness as compared to considering a single best scoring network only. Nevertheless, ensemble approaches only work well if the predicted gene regulatory networks are sufficiently similar to each other. If the topologies of predicted networks are considerably different, an ensemble of all networks obscures interesting individual characteristics. Instead, networks should be grouped according to local topological similarities and ensemble voting performed for each group separately. We argue that the presence of sets of co-occurring interactions is a suitable indicator for grouping predicted networks. A stepwise bottom-up procedure is proposed, where first mutual dependencies between pairs of interactions are derived from predicted networks. Pairs of co-occurring interactions are subsequently extended to derive characteristic interaction sets that distinguish groups of networks. Finally, ensemble voting is applied separately to the resulting topologically similar groups of networks to create distinct group-ensembles. Ensembles of topologically similar networks constitute distinct hypotheses about the reference network structure. Such group-ensembles are easier to interpret as their characteristic topology becomes clear and dependencies between interactions are known. The availability of distinct hypotheses facilitates the design of further experiments to distinguish between plausible network structures. The proposed procedure is a reasonable refinement step for non-deterministic reverse-engineering applications that produce a large number of candidate

  5. Inference of gene regulatory networks with the strong-inhibition Boolean model

    Energy Technology Data Exchange (ETDEWEB)

    Xia Qinzhi; Liu Lulu; Ye Weiming; Hu Gang, E-mail: ganghu@bnu.edu.cn [Department of Physics, Beijing Normal University, Beijing 100875 (China)

    2011-08-15

    The inference of gene regulatory networks (GRNs) is an important topic in biology. In this paper, a logic-based algorithm that infers the strong-inhibition Boolean genetic regulatory networks (where regulation by any single repressor can definitely suppress the expression of the gene regulated) from time series is discussed. By properly ordering various computation steps, we derive for the first time explicit formulae for the probabilities at which different interactions can be inferred given a certain number of data. With the formulae, we can predict the precision of reconstructions of regulation networks when the data are insufficient. Numerical simulations coincide well with the analytical results. The method and results are expected to be applicable to a wide range of general dynamic networks, where logic algorithms play essential roles in the network dynamics and the probabilities of various logics can be estimated well.

  6. Improved reconstruction of in silico gene regulatory networks by integrating knockout and perturbation data.

    Directory of Open Access Journals (Sweden)

    Kevin Y Yip

    Full Text Available We performed computational reconstruction of the in silico gene regulatory networks in the DREAM3 Challenges. Our task was to learn the networks from two types of data, namely gene expression profiles in deletion strains (the 'deletion data' and time series trajectories of gene expression after some initial perturbation (the 'perturbation data'. In the course of developing the prediction method, we observed that the two types of data contained different and complementary information about the underlying network. In particular, deletion data allow for the detection of direct regulatory activities with strong responses upon the deletion of the regulator while perturbation data provide richer information for the identification of weaker and more complex types of regulation. We applied different techniques to learn the regulation from the two types of data. For deletion data, we learned a noise model to distinguish real signals from random fluctuations using an iterative method. For perturbation data, we used differential equations to model the change of expression levels of a gene along the trajectories due to the regulation of other genes. We tried different models, and combined their predictions. The final predictions were obtained by merging the results from the two types of data. A comparison with the actual regulatory networks suggests that our approach is effective for networks with a range of different sizes. The success of the approach demonstrates the importance of integrating heterogeneous data in network reconstruction.

  7. An integer optimization algorithm for robust identification of non-linear gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Chemmangattuvalappil Nishanth

    2012-09-01

    Full Text Available Abstract Background Reverse engineering gene networks and identifying regulatory interactions are integral to understanding cellular decision making processes. Advancement in high throughput experimental techniques has initiated innovative data driven analysis of gene regulatory networks. However, inherent noise associated with biological systems requires numerous experimental replicates for reliable conclusions. Furthermore, evidence of robust algorithms directly exploiting basic biological traits are few. Such algorithms are expected to be efficient in their performance and robust in their prediction. Results We have developed a network identification algorithm to accurately infer both the topology and strength of regulatory interactions from time series gene expression data in the presence of significant experimental noise and non-linear behavior. In this novel formulism, we have addressed data variability in biological systems by integrating network identification with the bootstrap resampling technique, hence predicting robust interactions from limited experimental replicates subjected to noise. Furthermore, we have incorporated non-linearity in gene dynamics using the S-system formulation. The basic network identification formulation exploits the trait of sparsity of biological interactions. Towards that, the identification algorithm is formulated as an integer-programming problem by introducing binary variables for each network component. The objective function is targeted to minimize the network connections subjected to the constraint of maximal agreement between the experimental and predicted gene dynamics. The developed algorithm is validated using both in silico and experimental data-sets. These studies show that the algorithm can accurately predict the topology and connection strength of the in silico networks, as quantified by high precision and recall, and small discrepancy between the actual and predicted kinetic parameters

  8. Analysis of regulatory networks constructed based on gene coexpression in pituitary adenoma.

    Science.gov (United States)

    Gong, Jie; Diao, Bo; Yao, Guo Jie; Liu, Ying; Xu, Guo Zheng

    2013-12-01

    Gene coexpression patterns can reveal gene collections with functional consistency. This study systematically constructs regulatory networks for pituitary tumours by integrating gene coexpression, transcriptional and posttranscriptional regulation. Through network analysis, we elaborate the incidence mechanism of pituitary adenoma. The Pearson's correlation coefficient was utilized to calculate the level of gene coexpression. By comparing pituitary adenoma samples with normal samples, pituitary adenoma-specific gene coexpression patterns were identified. For pituitary adenoma-specific coexpressed genes, we integrated transcription factor (TF) and microRNA (miRNA) regulation to construct a complex regulatory network from the transcriptional and posttranscriptional perspectives. Network module analysis identified the synergistic regulation of genes by miRNAs and TFs in pituitary adenoma. We identified 142 pituitary adenoma-specific active genes, including 43 TFs and 99 target genes of TFs. Functional enrichment of these 142 genes revealed that the occurrence of pituitary adenoma induced abnormalities in intracellular metabolism and angiogenesis process. These 142 genes were also significantly enriched in adenoma pathway. Module analysis of the systematic regulatory network found that three modules contained elements that were closely related to pituitary adenoma, such as FGF2 and SP1, as well as transcription factors and miRNAs involved in the tumourigenesis. These results show that in the occurrence of pituitary adenoma, miRNA, TF and genes interact with each other. Based on gene expression, the proposed method integrates interaction information from different levels and systematically explains the occurrence of pituitary tumours. It facilitates the tracing of the origin of the disease and can provide basis for early diagnosis of complex diseases or cancer without obvious symptoms.

  9. Analysis of regulatory networks constructed based on gene coexpression in pituitary adenoma

    Indian Academy of Sciences (India)

    Jie Gong; Bo Diao; Guo Jie Yao; Ying Liu; Guo Zheng Xu

    2013-12-01

    Gene coexpression patterns can reveal gene collections with functional consistency. This study systematically constructs regulatory networks for pituitary tumours by integrating gene coexpression, transcriptional and posttranscriptional regulation. Through network analysis, we elaborate the incidence mechanism of pituitary adenoma. The Pearson’s correlation coefficient was utilized to calculate the level of gene coexpression. By comparing pituitary adenoma samples with normal samples, pituitary adenoma-specific gene coexpression patterns were identified. For pituitary adenoma-specific coexpressed genes, we integrated transcription factor (TF) and microRNA (miRNA) regulation to construct a complex regulatory network from the transcriptional and posttranscriptional perspectives. Network module analysis identified the synergistic regulation of genes by miRNAs and TFs in pituitary adenoma. We identified 142 pituitary adenoma-specific active genes, including 43 TFs and 99 target genes of TFs. Functional enrichment of these 142 genes revealed that the occurrence of pituitary adenoma induced abnormalities in intracellular metabolism and angiogenesis process. These 142 genes were also significantly enriched in adenoma pathway. Module analysis of the systematic regulatory network found that three modules contained elements that were closely related to pituitary adenoma, such as FGF2 and SP1, as well as transcription factors and miRNAs involved in the tumourigenesis. These results show that in the occurrence of pituitary adenoma, miRNA, TF and genes interact with each other. Based on gene expression, the proposed method integrates interaction information from different levels and systematically explains the occurrence of pituitary tumours. It facilitates the tracing of the origin of the disease and can provide basis for early diagnosis of complex diseases or cancer without obvious symptoms.

  10. Regulatory network analysis of microRNAs and genes in imatinib-resistant chronic myeloid leukemia.

    Science.gov (United States)

    Soltani, Ismael; Gharbi, Hanen; Hassine, Islem Ben; Bouguerra, Ghada; Douzi, Kais; Teber, Mouheb; Abbes, Salem; Menif, Samia

    2016-09-16

    Targeted therapy in the form of selective breakpoint cluster region-abelson (BCR/ABL) tyrosine kinase inhibitor (imatinib mesylate) has successfully been introduced in the treatment of the chronic myeloid leukemia (CML). However, acquired resistance against imatinib mesylate (IM) has been reported in nearly half of patients and has been recognized as major issue in clinical practice. Multiple resistance genes and microRNAs (miRNAs) are thought to be involved in the IM resistance process. These resistance genes and miRNAs tend to interact with each other through a regulatory network. Therefore, it is crucial to study the impact of these interactions in the IM resistance process. The present study focused on miRNA and gene network analysis in order to elucidate the role of interacting elements and to understand their functional contribution in therapeutic failure. Unlike previous studies which were centered only on genes or miRNAs, the prime focus of the present study was on relationships. To this end, three regulatory networks including differentially expressed, related, and global networks were constructed and analyzed in search of similarities and differences. Regulatory associations between miRNAs and their target genes, transcription factors and miRNAs, as well as miRNAs and their host genes were also macroscopically investigated. Certain key pathways in the three networks, especially in the differentially expressed network, were featured. The differentially expressed network emerged as a fault map of IM-resistant CML. Theoretically, the IM resistance process could be prevented by correcting the included errors. The present network-based approach to study resistance miRNAs and genes might help in understanding the molecular mechanisms of IM resistance in CML as well as in the improvement of CML therapy.

  11. Bottom-up GGM algorithm for constructing multiple layered hierarchical gene regulatory networks

    Science.gov (United States)

    Multilayered hierarchical gene regulatory networks (ML-hGRNs) are very important for understanding genetics regulation of biological pathways. However, there are currently no computational algorithms available for directly building ML-hGRNs that regulate biological pathways. A bottom-up graphic Gaus...

  12. Regulatory Network Construction in Arabidopsis using genome-wide gene expression QTLs

    NARCIS (Netherlands)

    Keurentjes, J.J.B.; Fu, J.J.; Terpstra, I.R.; Garcia, J.M.; van den Ackerveken, G.; Snoek, L.B.; Peeters, A.J.M.; Vreugdenhil, D.; Koornreef, M.; Jansen, R.C.

    2007-01-01

    Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci.Keurentjes JJ, Fu J, Terpstra IR, Garcia JM, van den Ackerveken G, Snoek LB, Peeters AJ, Vreugdenhil D, Koornneef M, Jansen RC. Laboratory of Genetics, Wageningen University, Arboretumlaan 4,

  13. NetDiff - Bayesian model selection for differential gene regulatory network inference.

    Science.gov (United States)

    Thorne, Thomas

    2016-12-16

    Differential networks allow us to better understand the changes in cellular processes that are exhibited in conditions of interest, identifying variations in gene regulation or protein interaction between, for example, cases and controls, or in response to external stimuli. Here we present a novel methodology for the inference of differential gene regulatory networks from gene expression microarray data. Specifically we apply a Bayesian model selection approach to compare models of conserved and varying network structure, and use Gaussian graphical models to represent the network structures. We apply a variational inference approach to the learning of Gaussian graphical models of gene regulatory networks, that enables us to perform Bayesian model selection that is significantly more computationally efficient than Markov Chain Monte Carlo approaches. Our method is demonstrated to be more robust than independent analysis of data from multiple conditions when applied to synthetic network data, generating fewer false positive predictions of differential edges. We demonstrate the utility of our approach on real world gene expression microarray data by applying it to existing data from amyotrophic lateral sclerosis cases with and without mutations in C9orf72, and controls, where we are able to identify differential network interactions for further investigation.

  14. A gene regulatory network for root epidermis cell differentiation in Arabidopsis.

    Directory of Open Access Journals (Sweden)

    Angela Bruex

    2012-01-01

    Full Text Available The root epidermis of Arabidopsis provides an exceptional model for studying the molecular basis of cell fate and differentiation. To obtain a systems-level view of root epidermal cell differentiation, we used a genome-wide transcriptome approach to define and organize a large set of genes into a transcriptional regulatory network. Using cell fate mutants that produce only one of the two epidermal cell types, together with fluorescence-activated cell-sorting to preferentially analyze the root epidermis transcriptome, we identified 1,582 genes differentially expressed in the root-hair or non-hair cell types, including a set of 208 "core" root epidermal genes. The organization of the core genes into a network was accomplished by using 17 distinct root epidermis mutants and 2 hormone treatments to perturb the system and assess the effects on each gene's transcript accumulation. In addition, temporal gene expression information from a developmental time series dataset and predicted gene associations derived from a Bayesian modeling approach were used to aid the positioning of genes within the network. Further, a detailed functional analysis of likely bHLH regulatory genes within the network, including MYC1, bHLH54, bHLH66, and bHLH82, showed that three distinct subfamilies of bHLH proteins participate in root epidermis development in a stage-specific manner. The integration of genetic, genomic, and computational analyses provides a new view of the composition, architecture, and logic of the root epidermal transcriptional network, and it demonstrates the utility of a comprehensive systems approach for dissecting a complex regulatory network.

  15. Understanding the Role of Housekeeping and Stress-Related Genes in Transcription-Regulatory Networks

    Science.gov (United States)

    Heath, Allison; Kavraki, Lydia; Balázsi, Gábor

    2008-03-01

    Despite the increasing number of completely sequenced genomes, much remains to be learned about how living cells process environmental information and respond to changes in their surroundings. Accumulating evidence indicates that eukaryotic and prokaryotic genes can be classified in two distinct categories that we will call class I and class II. Class I genes are housekeeping genes, often characterized by stable, noise resistant expression levels. In contrast, class II genes are stress-related genes and often have noisy, unstable expression levels. In this work we analyze the large scale transcription-regulatory networks (TRN) of E. coli and S. cerevisiae and preliminary data on H. sapien. We find that stable, housekeeping genes (class I) are preferentially utilized as transcriptional inputs while stress related, unstable genes (class II) are utilized as transcriptional integrators. This might be the result of convergent evolution that placed the appropriate genes in the appropriate locations within transcriptional networks according to some fundamental principles that govern cellular information processing.

  16. Influence of the experimental design of gene expression studies on the inference of gene regulatory networks: environmental factors

    Directory of Open Access Journals (Sweden)

    Frank Emmert-Streib

    2013-02-01

    Full Text Available The inference of gene regulatory networks gained within recent years a considerable interest in the biology and biomedical community. The purpose of this paper is to investigate the influence that environmental conditions can exhibit on the inference performance of network inference algorithms. Specifically, we study five network inference methods, Aracne, BC3NET, CLR, C3NET and MRNET, and compare the results for three different conditions: (I observational gene expression data: normal environmental condition, (II interventional gene expression data: growth in rich media, (III interventional gene expression data: normal environmental condition interrupted by a positive spike-in stimulation. Overall, we find that different statistical inference methods lead to comparable, but condition-specific results. Further, our results suggest that non-steady-state data enhance the inferability of regulatory networks.

  17. Medusa structure of the gene regulatory network: dominance of transcription factors in cancer subtype classification.

    Science.gov (United States)

    Guo, Yuchun; Feng, Ying; Trivedi, Niraj S; Huang, Sui

    2011-05-01

    Gene expression profiles consisting of ten thousands of transcripts are used for clustering of tissue, such as tumors, into subtypes, often without considering the underlying reason that the distinct patterns of expression arise because of constraints in the realization of gene expression profiles imposed by the gene regulatory network. The topology of this network has been suggested to consist of a regulatory core of genes represented most prominently by transcription factors (TFs) and microRNAs, that influence the expression of other genes, and of a periphery of 'enslaved' effector genes that are regulated but not regulating. This 'medusa' architecture implies that the core genes are much stronger determinants of the realized gene expression profiles. To test this hypothesis, we examined the clustering of gene expression profiles into known tumor types to quantitatively demonstrate that TFs, and even more pronounced, microRNAs, are much stronger discriminators of tumor type specific gene expression patterns than a same number of randomly selected or metabolic genes. These findings lend support to the hypothesis of a medusa architecture and of the canalizing nature of regulation by microRNAs. They also reveal the degree of freedom for the expression of peripheral genes that are less stringently associated with a tissue type specific global gene expression profile.

  18. A study on the regulatory network with promoter analysis for Arabidopsis DREB-genes

    Science.gov (United States)

    Sazegari, Sima; Niazi, Ali; Ahmadi, Farajolah Shahriary

    2015-01-01

    Dehydration response element binding factors (DREBs) are one of the principal plant transcription factor subfamilies that regulate the expression of many abiotic stress-inducible genes. This sub-family belongs to AP2 transcription factor family and plays a considerable role in improving abiotic stresses tolerance in plants. Therefore, it is of interest to identify critical cis-acting elements involved in abiotic stress responses. In this study, we survey promoter cis-elements for ATDREBs genes (Arabidopsis thaliana DREBs). Regulatory networks based on ATDREB candidate genes were also generated to find other genes that are functionally similar to DREBs. The study was conducted on all 20 Arabidopsis thaliana non redundant DREB genes stored in RefSeq database. Promoter analysis and regulatory network prediction was accomplished by use of Plant CARE program and GeneMANIA web tool, respectively. The results indicated that among all genes, DREB1A, DREB1C, DREB2C, DREB2G and DEAR3 have the most type of diverse motifs involved in abiotic stress responses. It is implied that co-operation of abscisic acid, ethylene, salicylic acid and methyl jasmonate signaling is crucial for the regulation of the expression of drought and cold responses through DREB transcription factors. Gene network analysis showed different co-expressed but functionally similar genes that had physical and functional interactions with candidate DREB genes. PMID:25848171

  19. PRODIGEN: visualizing the probability landscape of stochastic gene regulatory networks in state and time space.

    Science.gov (United States)

    Ma, Chihua; Luciani, Timothy; Terebus, Anna; Liang, Jie; Marai, G Elisabeta

    2017-02-15

    Visualizing the complex probability landscape of stochastic gene regulatory networks can further biologists' understanding of phenotypic behavior associated with specific genes. We present PRODIGEN (PRObability DIstribution of GEne Networks), a web-based visual analysis tool for the systematic exploration of probability distributions over simulation time and state space in such networks. PRODIGEN was designed in collaboration with bioinformaticians who research stochastic gene networks. The analysis tool combines in a novel way existing, expanded, and new visual encodings to capture the time-varying characteristics of probability distributions: spaghetti plots over one dimensional projection, heatmaps of distributions over 2D projections, enhanced with overlaid time curves to display temporal changes, and novel individual glyphs of state information corresponding to particular peaks. We demonstrate the effectiveness of the tool through two case studies on the computed probabilistic landscape of a gene regulatory network and of a toggle-switch network. Domain expert feedback indicates that our visual approach can help biologists: 1) visualize probabilities of stable states, 2) explore the temporal probability distributions, and 3) discover small peaks in the probability landscape that have potential relation to specific diseases.

  20. Comparative analysis of the transcription-factor gene regulatory networks of E. coli and S. cerevisiae

    Directory of Open Access Journals (Sweden)

    Santillán Moisés

    2008-01-01

    Full Text Available Abstract Background The regulatory interactions between transcription factors (TF and regulated genes (RG in a species genome can be lumped together in a single directed graph. The TF's and RG's conform the nodes of this graph, while links are drawn whenever a transcription factor regulates a gene's expression. Projections onto TF nodes can be constructed by linking every two nodes regulating a common gene. Similarly, projections onto RG nodes can be made by linking every two regulated genes sharing at least one common regulator. Recent studies of the connectivity pattern in the transcription-factor regulatory network of many organisms have revealed some interesting properties. However, the differences between TF and RG nodes have not been widely explored. Results After analysing the RG and TF projections of the transcription-factor gene regulatory networks of Escherichia coli and Saccharomyces cerevisiae, we found several common characteristic as well as some noticeable differences. To better understand these differences, we compared the properties of the E. coli and S. cerevisiae RG- and TF-projected networks with those of the corresponding projections built from randomized versions of the original bipartite networks. These last results indicate that the observed differences are mostly due to the very different ratios of TF to RG counts of the E. coli and S. cerevisiae bipartite networks, rather than to their having different connectivity patterns. Conclusion Since E. coli is a prokaryotic organism while S. cerevisiae is eukaryotic, there are important differences between them concerning processing of mRNA before translation, DNA packing, amount of junk DNA, and gene regulation. From the results in this paper we conclude that the most important effect such differences have had on the development of the corresponding transcription-factor gene regulatory networks is their very different ratios of TF to RG numbers. This ratio is more than three

  1. Hidden Markov induced Dynamic Bayesian Network for recovering time evolving gene regulatory networks

    Science.gov (United States)

    Zhu, Shijia; Wang, Yadong

    2015-12-01

    Dynamic Bayesian Networks (DBN) have been widely used to recover gene regulatory relationships from time-series data in computational systems biology. Its standard assumption is ‘stationarity’, and therefore, several research efforts have been recently proposed to relax this restriction. However, those methods suffer from three challenges: long running time, low accuracy and reliance on parameter settings. To address these problems, we propose a novel non-stationary DBN model by extending each hidden node of Hidden Markov Model into a DBN (called HMDBN), which properly handles the underlying time-evolving networks. Correspondingly, an improved structural EM algorithm is proposed to learn the HMDBN. It dramatically reduces searching space, thereby substantially improving computational efficiency. Additionally, we derived a novel generalized Bayesian Information Criterion under the non-stationary assumption (called BWBIC), which can help significantly improve the reconstruction accuracy and largely reduce over-fitting. Moreover, the re-estimation formulas for all parameters of our model are derived, enabling us to avoid reliance on parameter settings. Compared to the state-of-the-art methods, the experimental evaluation of our proposed method on both synthetic and real biological data demonstrates more stably high prediction accuracy and significantly improved computation efficiency, even with no prior knowledge and parameter settings.

  2. Hidden Markov induced Dynamic Bayesian Network for recovering time evolving gene regulatory networks.

    Science.gov (United States)

    Zhu, Shijia; Wang, Yadong

    2015-12-18

    Dynamic Bayesian Networks (DBN) have been widely used to recover gene regulatory relationships from time-series data in computational systems biology. Its standard assumption is 'stationarity', and therefore, several research efforts have been recently proposed to relax this restriction. However, those methods suffer from three challenges: long running time, low accuracy and reliance on parameter settings. To address these problems, we propose a novel non-stationary DBN model by extending each hidden node of Hidden Markov Model into a DBN (called HMDBN), which properly handles the underlying time-evolving networks. Correspondingly, an improved structural EM algorithm is proposed to learn the HMDBN. It dramatically reduces searching space, thereby substantially improving computational efficiency. Additionally, we derived a novel generalized Bayesian Information Criterion under the non-stationary assumption (called BWBIC), which can help significantly improve the reconstruction accuracy and largely reduce over-fitting. Moreover, the re-estimation formulas for all parameters of our model are derived, enabling us to avoid reliance on parameter settings. Compared to the state-of-the-art methods, the experimental evaluation of our proposed method on both synthetic and real biological data demonstrates more stably high prediction accuracy and significantly improved computation efficiency, even with no prior knowledge and parameter settings.

  3. A scalable algorithm for structure identification of complex gene regulatory network from temporal expression data.

    Science.gov (United States)

    Gui, Shupeng; Rice, Andrew P; Chen, Rui; Wu, Liang; Liu, Ji; Miao, Hongyu

    2017-01-31

    Gene regulatory interactions are of fundamental importance to various biological functions and processes. However, only a few previous computational studies have claimed success in revealing genome-wide regulatory landscapes from temporal gene expression data, especially for complex eukaryotes like human. Moreover, recent work suggests that these methods still suffer from the curse of dimensionality if a network size increases to 100 or higher. Here we present a novel scalable algorithm for identifying genome-wide gene regulatory network (GRN) structures, and we have verified the algorithm performances by extensive simulation studies based on the DREAM challenge benchmark data. The highlight of our method is that its superior performance does not degenerate even for a network size on the order of 10(4), and is thus readily applicable to large-scale complex networks. Such a breakthrough is achieved by considering both prior biological knowledge and multiple topological properties (i.e., sparsity and hub gene structure) of complex networks in the regularized formulation. We also validate and illustrate the application of our algorithm in practice using the time-course gene expression data from a study on human respiratory epithelial cells in response to influenza A virus (IAV) infection, as well as the CHIP-seq data from ENCODE on transcription factor (TF) and target gene interactions. An interesting finding, owing to the proposed algorithm, is that the biggest hub structures (e.g., top ten) in the GRN all center at some transcription factors in the context of epithelial cell infection by IAV. The proposed algorithm is the first scalable method for large complex network structure identification. The GRN structure identified by our algorithm could reveal possible biological links and help researchers to choose which gene functions to investigate in a biological event. The algorithm described in this article is implemented in MATLAB (Ⓡ) , and the source code is

  4. Reliable transfer of transcriptional gene regulatory networks between taxonomically related organisms

    Directory of Open Access Journals (Sweden)

    Tauch Andreas

    2009-01-01

    Full Text Available Abstract Background Transcriptional regulation of gene activity is essential for any living organism. Transcription factors therefore recognize specific binding sites within the DNA to regulate the expression of particular target genes. The genome-scale reconstruction of the emerging regulatory networks is important for biotechnology and human medicine but cost-intensive, time-consuming, and impossible to perform for any species separately. By using bioinformatics methods one can partially transfer networks from well-studied model organisms to closely related species. However, the prediction quality is limited by the low level of evolutionary conservation of the transcription factor binding sites, even within organisms of the same genus. Results Here we present an integrated bioinformatics workflow that assures the reliability of transferred gene regulatory networks. Our approach combines three methods that can be applied on a large-scale: re-assessment of annotated binding sites, subsequent binding site prediction, and homology detection. A gene regulatory interaction is considered to be conserved if (1 the transcription factor, (2 the adjusted binding site, and (3 the target gene are conserved. The power of the approach is demonstrated by transferring gene regulations from the model organism Corynebacterium glutamicum to the human pathogens C. diphtheriae, C. jeikeium, and the biotechnologically relevant C. efficiens. For these three organisms we identified reliable transcriptional regulations for ~40% of the common transcription factors, compared to ~5% for which knowledge was available before. Conclusion Our results suggest that trustworthy genome-scale transfer of gene regulatory networks between organisms is feasible in general but still limited by the level of evolutionary conservation.

  5. Information theory in systems biology. Part I: Gene regulatory and metabolic networks.

    Science.gov (United States)

    Mousavian, Zaynab; Kavousi, Kaveh; Masoudi-Nejad, Ali

    2016-03-01

    "A Mathematical Theory of Communication", was published in 1948 by Claude Shannon to establish a framework that is now known as information theory. In recent decades, information theory has gained much attention in the area of systems biology. The aim of this paper is to provide a systematic review of those contributions that have applied information theory in inferring or understanding of biological systems. Based on the type of system components and the interactions between them, we classify the biological systems into 4 main classes: gene regulatory, metabolic, protein-protein interaction and signaling networks. In the first part of this review, we attempt to introduce most of the existing studies on two types of biological networks, including gene regulatory and metabolic networks, which are founded on the concepts of information theory. Copyright © 2015 Elsevier Ltd. All rights reserved.

  6. Regulatory gene networks that shape the development of adaptive phenotypic plasticity in a cichlid fish.

    Science.gov (United States)

    Schneider, Ralf F; Li, Yuanhao; Meyer, Axel; Gunter, Helen M

    2014-09-01

    Phenotypic plasticity is the ability of organisms with a given genotype to develop different phenotypes according to environmental stimuli, resulting in individuals that are better adapted to local conditions. In spite of their ecological importance, the developmental regulatory networks underlying plastic phenotypes often remain uncharacterized. We examined the regulatory basis of diet-induced plasticity in the lower pharyngeal jaw (LPJ) of the cichlid fish Astatoreochromis alluaudi, a model species in the study of adaptive plasticity. Through raising juvenile A. alluaudi on either a hard or soft diet (hard-shelled or pulverized snails) for between 1 and 8 months, we gained insight into the temporal regulation of 19 previously identified candidate genes during the early stages of plasticity development. Plasticity in LPJ morphology was first detected between 3 and 5 months of diet treatment. The candidate genes, belonging to various functional categories, displayed dynamic expression patterns that consistently preceded the onset of morphological divergence and putatively contribute to the initiation of the plastic phenotypes. Within functional categories, we observed striking co-expression, and transcription factor binding site analysis was used to examine the prospective basis of their coregulation. We propose a regulatory network of LPJ plasticity in cichlids, presenting evidence for regulatory crosstalk between bone and muscle tissues, which putatively facilitates the development of this highly integrated trait. Through incorporating a developmental time-course into a phenotypic plasticity study, we have identified an interconnected, environmentally responsive regulatory network that shapes the development of plasticity in a key innovation of East African cichlids.

  7. NIMEFI: gene regulatory network inference using multiple ensemble feature importance algorithms.

    Science.gov (United States)

    Ruyssinck, Joeri; Huynh-Thu, Vân Anh; Geurts, Pierre; Dhaene, Tom; Demeester, Piet; Saeys, Yvan

    2014-01-01

    One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available.

  8. NIMEFI: gene regulatory network inference using multiple ensemble feature importance algorithms.

    Directory of Open Access Journals (Sweden)

    Joeri Ruyssinck

    Full Text Available One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made

  9. Comparison of gene regulatory networks of benign and malignant breast cancer samples with normal samples.

    Science.gov (United States)

    Chen, D B; Yang, H J

    2014-11-11

    The aim of this study was to explain the pathogenesis and deterioration process of breast cancer. Breast cancer expression profile data GSE27567 was downloaded from the Gene Expression Omnibus (GEO) database, and breast cancer-related genes were extracted from databases, including Cancer-Resource and Online Mendelian Inheritance In Man (OMIM). Next, h17 transcription factor data were obtained from the University of California, Santa Cruz. Database for Annotation, Visualization, and Integrated Discovery (DAVID)-enrichment analysis was applied and gene-regulatory networks were constructed by double-two-way t-tests in 3 states, including normal, benign, and malignant. Furthermore, network topological properties were compared between 2 states, and breast cancer-related bub genes were ranked according to their different degrees between each of the two states. A total of 2380 breast cancer-related genes and 215 transcription factors were screened by exploring databases; the genes were mainly enriched in their functions, such as cell apoptosis and proliferation, and pathways, such as p53 signaling and apoptosis, which were related with carcinogenesis. In addition, gene-regulatory networks in the 3 conditions were constructed. By comparing their network topological properties, we found that there is a larger transition of differences between malignant and benign breast cancer. Moreover, 8 hub genes (YBX1, ZFP36, YY1, XRCC5, XRCC4, ZFHX3, ZMAT3, and XPC) were identified in the top 10 genes ranked by different degrees. Through comparative analysis of gene-regulation networks, we identified the link between related genes and the pathogenesis of breast cancer. However, further experiments are needed to confirm our results.

  10. Statistical identification of gene association by CID in application of constructing ER regulatory network

    Directory of Open Access Journals (Sweden)

    Lien Huang-Chun

    2009-03-01

    Full Text Available Abstract Background A variety of high-throughput techniques are now available for constructing comprehensive gene regulatory networks in systems biology. In this study, we report a new statistical approach for facilitating in silico inference of regulatory network structure. The new measure of association, coefficient of intrinsic dependence (CID, is model-free and can be applied to both continuous and categorical distributions. When given two variables X and Y, CID answers whether Y is dependent on X by examining the conditional distribution of Y given X. In this paper, we apply CID to analyze the regulatory relationships between transcription factors (TFs (X and their downstream genes (Y based on clinical data. More specifically, we use estrogen receptor α (ERα as the variable X, and the analyses are based on 48 clinical breast cancer gene expression arrays (48A. Results The analytical utility of CID was evaluated in comparison with four commonly used statistical methods, Galton-Pearson's correlation coefficient (GPCC, Student's t-test (STT, coefficient of determination (CoD, and mutual information (MI. When being compared to GPCC, CoD, and MI, CID reveals its preferential ability to discover the regulatory association where distribution of the mRNA expression levels on X and Y does not fit linear models. On the other hand, when CID is used to measure the association of a continuous variable (Y against a discrete variable (X, it shows similar performance as compared to STT, and appears to outperform CoD and MI. In addition, this study established a two-layer transcriptional regulatory network to exemplify the usage of CID, in combination with GPCC, in deciphering gene networks based on gene expression profiles from patient arrays. Conclusion CID is shown to provide useful information for identifying associations between genes and transcription factors of interest in patient arrays. When coupled with the relationships detected by GPCC, the

  11. Statistical identification of gene association by CID in application of constructing ER regulatory network.

    Science.gov (United States)

    Liu, Li-Yu D; Chen, Chien-Yu; Chen, Mei-Ju M; Tsai, Ming-Shian; Lee, Cho-Han S; Phang, Tzu L; Chang, Li-Yun; Kuo, Wen-Hung; Hwa, Hsiao-Lin; Lien, Huang-Chun; Jung, Shih-Ming; Lin, Yi-Shing; Chang, King-Jen; Hsieh, Fon-Jou

    2009-03-17

    A variety of high-throughput techniques are now available for constructing comprehensive gene regulatory networks in systems biology. In this study, we report a new statistical approach for facilitating in silico inference of regulatory network structure. The new measure of association, coefficient of intrinsic dependence (CID), is model-free and can be applied to both continuous and categorical distributions. When given two variables X and Y, CID answers whether Y is dependent on X by examining the conditional distribution of Y given X. In this paper, we apply CID to analyze the regulatory relationships between transcription factors (TFs) (X) and their downstream genes (Y) based on clinical data. More specifically, we use estrogen receptor alpha (ERalpha) as the variable X, and the analyses are based on 48 clinical breast cancer gene expression arrays (48A). The analytical utility of CID was evaluated in comparison with four commonly used statistical methods, Galton-Pearson's correlation coefficient (GPCC), Student's t-test (STT), coefficient of determination (CoD), and mutual information (MI). When being compared to GPCC, CoD, and MI, CID reveals its preferential ability to discover the regulatory association where distribution of the mRNA expression levels on X and Y does not fit linear models. On the other hand, when CID is used to measure the association of a continuous variable (Y) against a discrete variable (X), it shows similar performance as compared to STT, and appears to outperform CoD and MI. In addition, this study established a two-layer transcriptional regulatory network to exemplify the usage of CID, in combination with GPCC, in deciphering gene networks based on gene expression profiles from patient arrays. CID is shown to provide useful information for identifying associations between genes and transcription factors of interest in patient arrays. When coupled with the relationships detected by GPCC, the association predicted by CID are applicable

  12. Reverse engineering gene regulatory networks related to quorum sensing in the plant pathogen Pectobacterium atrosepticum.

    Science.gov (United States)

    Lin, Kuang; Husmeier, Dirk; Dondelinger, Frank; Mayer, Claus D; Liu, Hui; Prichard, Leighton; Salmond, George P C; Toth, Ian K; Birch, Paul R J

    2010-01-01

    The objective of the project reported in the present chapter was the reverse engineering of gene regulatory networks related to quorum sensing in the plant pathogen Pectobacterium atrosepticum from micorarray gene expression profiles, obtained from the wild-type and eight knockout strains. To this end, we have applied various recent methods from multivariate statistics and machine learning: graphical Gaussian models, sparse Bayesian regression, LASSO (least absolute shrinkage and selection operator), Bayesian networks, and nested effects models. We have investigated the degree of similarity between the predictions obtained with the different approaches, and we have assessed the consistency of the reconstructed networks in terms of global topological network properties, based on the node degree distribution. The chapter concludes with a biological evaluation of the predicted network structures.

  13. PyPanda: a Python package for gene regulatory network reconstruction

    Science.gov (United States)

    van IJzendoorn, David G.P.; Glass, Kimberly; Quackenbush, John; Kuijjer, Marieke L.

    2016-01-01

    Summary: PANDA (Passing Attributes between Networks for Data Assimilation) is a gene regulatory network inference method that uses message-passing to integrate multiple sources of ‘omics data. PANDA was originally coded in C ++. In this application note we describe PyPanda, the Python version of PANDA. PyPanda runs considerably faster than the C ++ version and includes additional features for network analysis. Availability and implementation: The open source PyPanda Python package is freely available at http://github.com/davidvi/pypanda. Contact: mkuijjer@jimmy.harvard.edu or d.g.p.van_ijzendoorn@lumc.nl PMID:27402905

  14. Conservation and diversification of an ancestral chordate gene regulatory network for dorsoventral patterning.

    Directory of Open Access Journals (Sweden)

    Iryna Kozmikova

    Full Text Available Formation of a dorsoventral axis is a key event in the early development of most animal embryos. It is well established that bone morphogenetic proteins (Bmps and Wnts are key mediators of dorsoventral patterning in vertebrates. In the cephalochordate amphioxus, genes encoding Bmps and transcription factors downstream of Bmp signaling such as Vent are expressed in patterns reminiscent of those of their vertebrate orthologues. However, the key question is whether the conservation of expression patterns of network constituents implies conservation of functional network interactions, and if so, how an increased functional complexity can evolve. Using heterologous systems, namely by reporter gene assays in mammalian cell lines and by transgenesis in medaka fish, we have compared the gene regulatory network implicated in dorsoventral patterning of the basal chordate amphioxus and vertebrates. We found that Bmp but not canonical Wnt signaling regulates promoters of genes encoding homeodomain proteins AmphiVent1 and AmphiVent2. Furthermore, AmphiVent1 and AmphiVent2 promoters appear to be correctly regulated in the context of a vertebrate embryo. Finally, we show that AmphiVent1 is able to directly repress promoters of AmphiGoosecoid and AmphiChordin genes. Repression of genes encoding dorsal-specific signaling molecule Chordin and transcription factor Goosecoid by Xenopus and zebrafish Vent genes represents a key regulatory interaction during vertebrate axis formation. Our data indicate high evolutionary conservation of a core Bmp-triggered gene regulatory network for dorsoventral patterning in chordates and suggest that co-option of the canonical Wnt signaling pathway for dorsoventral patterning in vertebrates represents one of the innovations through which an increased morphological complexity of vertebrate embryo is achieved.

  15. Mosaic gene network modelling identified new regulatory mechanisms in HCV infection.

    Science.gov (United States)

    Popik, Olga V; Petrovskiy, Evgeny D; Mishchenko, Elena L; Lavrik, Inna N; Ivanisenko, Vladimir A

    2016-06-15

    Modelling of gene networks is widely used in systems biology to study the functioning of complex biological systems. Most of the existing mathematical modelling techniques are useful for analysis of well-studied biological processes, for which information on rates of reactions is available. However, complex biological processes such as those determining the phenotypic traits of organisms or pathological disease processes, including pathogen-host interactions, involve complicated cross-talk between interacting networks. Furthermore, the intrinsic details of the interactions between these networks are often missing. In this study, we developed an approach, which we call mosaic network modelling, that allows the combination of independent mathematical models of gene regulatory networks and, thereby, description of complex biological systems. The advantage of this approach is that it allows us to generate the integrated model despite the fact that information on molecular interactions between parts of the model (so-called mosaic fragments) might be missing. To generate a mosaic mathematical model, we used control theory and mathematical models, written in the form of a system of ordinary differential equations (ODEs). In the present study, we investigated the efficiency of this method in modelling the dynamics of more than 10,000 simulated mosaic regulatory networks consisting of two pieces. Analysis revealed that this approach was highly efficient, as the mean deviation of the dynamics of mosaic network elements from the behaviour of the initial parts of the model was less than 10%. It turned out that for construction of the control functional, data on perturbation of one or two vertices of the mosaic piece are sufficient. Further, we used the developed method to construct a mosaic gene regulatory network including hepatitis C virus (HCV) as the first piece and the tumour necrosis factor (TNF)-induced apoptosis and NF-κB induction pathways as the second piece. Thus

  16. Computational design and designability of gene regulatory networks

    OpenAIRE

    Rodrigo Tarrega, Guillermo

    2012-01-01

    Nuestro conocimiento de las interacciones moleculares nos ha conducido hoy hacia una perspectiva ingenieril, donde diseños e implementaciones de sistemas artificiales de regulación intentan proporcionar instrucciones fundamentales para la reprogramación celular. Nosotros aquí abordamos el diseño de redes de genes como una forma de profundizar en la comprensión de las regulaciones naturales. También abordamos el problema de la diseñabilidad dada una genoteca de elementos compatibles. Con este ...

  17. Large-scale genetic perturbations reveal regulatory networks and an abundance of gene-specific repressors.

    Science.gov (United States)

    Kemmeren, Patrick; Sameith, Katrin; van de Pasch, Loes A L; Benschop, Joris J; Lenstra, Tineke L; Margaritis, Thanasis; O'Duibhir, Eoghan; Apweiler, Eva; van Wageningen, Sake; Ko, Cheuk W; van Heesch, Sebastiaan; Kashani, Mehdi M; Ampatziadis-Michailidis, Giannis; Brok, Mariel O; Brabers, Nathalie A C H; Miles, Anthony J; Bouwmeester, Diane; van Hooff, Sander R; van Bakel, Harm; Sluiters, Erik; Bakker, Linda V; Snel, Berend; Lijnzaad, Philip; van Leenen, Dik; Groot Koerkamp, Marian J A; Holstege, Frank C P

    2014-04-24

    To understand regulatory systems, it would be useful to uniformly determine how different components contribute to the expression of all other genes. We therefore monitored mRNA expression genome-wide, for individual deletions of one-quarter of yeast genes, focusing on (putative) regulators. The resulting genetic perturbation signatures reflect many different properties. These include the architecture of protein complexes and pathways, identification of expression changes compatible with viability, and the varying responsiveness to genetic perturbation. The data are assembled into a genetic perturbation network that shows different connectivities for different classes of regulators. Four feed-forward loop (FFL) types are overrepresented, including incoherent type 2 FFLs that likely represent feedback. Systematic transcription factor classification shows a surprisingly high abundance of gene-specific repressors, suggesting that yeast chromatin is not as generally restrictive to transcription as is often assumed. The data set is useful for studying individual genes and for discovering properties of an entire regulatory system.

  18. Intrinsic noise and deviations from criticality in Boolean gene-regulatory networks

    Science.gov (United States)

    Villegas, Pablo; Ruiz-Franco, José; Hidalgo, Jorge; Muñoz, Miguel A.

    2016-10-01

    Gene regulatory networks can be successfully modeled as Boolean networks. A much discussed hypothesis says that such model networks reproduce empirical findings the best if they are tuned to operate at criticality, i.e. at the borderline between their ordered and disordered phases. Critical networks have been argued to lead to a number of functional advantages such as maximal dynamical range, maximal sensitivity to environmental changes, as well as to an excellent tradeoff between stability and flexibility. Here, we study the effect of noise within the context of Boolean networks trained to learn complex tasks under supervision. We verify that quasi-critical networks are the ones learning in the fastest possible way –even for asynchronous updating rules– and that the larger the task complexity the smaller the distance to criticality. On the other hand, when additional sources of intrinsic noise in the network states and/or in its wiring pattern are introduced, the optimally performing networks become clearly subcritical. These results suggest that in order to compensate for inherent stochasticity, regulatory and other type of biological networks might become subcritical rather than being critical, all the most if the task to be performed has limited complexity.

  19. Intrinsic noise and deviations from criticality in Boolean gene-regulatory networks

    Science.gov (United States)

    Villegas, Pablo; Ruiz-Franco, José; Hidalgo, Jorge; Muñoz, Miguel A.

    2016-01-01

    Gene regulatory networks can be successfully modeled as Boolean networks. A much discussed hypothesis says that such model networks reproduce empirical findings the best if they are tuned to operate at criticality, i.e. at the borderline between their ordered and disordered phases. Critical networks have been argued to lead to a number of functional advantages such as maximal dynamical range, maximal sensitivity to environmental changes, as well as to an excellent tradeoff between stability and flexibility. Here, we study the effect of noise within the context of Boolean networks trained to learn complex tasks under supervision. We verify that quasi-critical networks are the ones learning in the fastest possible way –even for asynchronous updating rules– and that the larger the task complexity the smaller the distance to criticality. On the other hand, when additional sources of intrinsic noise in the network states and/or in its wiring pattern are introduced, the optimally performing networks become clearly subcritical. These results suggest that in order to compensate for inherent stochasticity, regulatory and other type of biological networks might become subcritical rather than being critical, all the most if the task to be performed has limited complexity. PMID:27713479

  20. Deconvoluting lung evolution: from phenotypes to gene regulatory networks

    DEFF Research Database (Denmark)

    Torday, J.S.; Rehan, V.K.; Hicks, J.W.

    2007-01-01

    Speakers in this symposium presented examples of respiratory regulation that broadly illustrate principles of evolution from whole organ to genes. The swim bladder and lungs of aquatic and terrestrial organisms arose independently from a common primordial "respiratory pharynx" but not from each...... other. Pathways of lung evolution are similar between crocodiles and birds but a low compliance of mammalian lung may have driven the development of the diaphragm to permit lung inflation during inspiration. To meet the high oxygen demands of flight, bird lungs have evolved separate gas exchange...... diffusing capacities than required by their oxygen consumption. The "primitive" central admixture of oxygenated and deoxygenated blood in the incompletely divided reptilian heart is actually co-regulated with other autonomic cardiopulmonary responses to provide flexible control of arterial oxygen tension...

  1. Reverse Engineering Sparse Gene Regulatory Networks Using Cubature Kalman Filter and Compressed Sensing

    Directory of Open Access Journals (Sweden)

    Amina Noor

    2013-01-01

    Full Text Available This paper proposes a novel algorithm for inferring gene regulatory networks which makes use of cubature Kalman filter (CKF and Kalman filter (KF techniques in conjunction with compressed sensing methods. The gene network is described using a state-space model. A nonlinear model for the evolution of gene expression is considered, while the gene expression data is assumed to follow a linear Gaussian model. The hidden states are estimated using CKF. The system parameters are modeled as a Gauss-Markov process and are estimated using compressed sensing-based KF. These parameters provide insight into the regulatory relations among the genes. The Cramér-Rao lower bound of the parameter estimates is calculated for the system model and used as a benchmark to assess the estimation accuracy. The proposed algorithm is evaluated rigorously using synthetic data in different scenarios which include different number of genes and varying number of sample points. In addition, the algorithm is tested on the DREAM4 in silico data sets as well as the in vivo data sets from IRMA network. The proposed algorithm shows superior performance in terms of accuracy, robustness, and scalability.

  2. Genome-wide analyses for dissecting gene regulatory networks in the shoot apical meristem.

    Science.gov (United States)

    Bustamante, Mariana; Matus, José Tomás; Riechmann, José Luis

    2016-03-01

    Shoot apical meristem activity is controlled by complex regulatory networks in which components such as transcription factors, miRNAs, small peptides, hormones, enzymes and epigenetic marks all participate. Many key genes that determine the inherent characteristics of the shoot apical meristem have been identified through genetic approaches. Recent advances in genome-wide studies generating extensive transcriptomic and DNA-binding datasets have increased our understanding of the interactions within the regulatory networks that control the activity of the meristem, identifying new regulators and uncovering connections between previously unlinked network components. In this review, we focus on recent studies that illustrate the contribution of whole genome analyses to understand meristem function. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  3. Developmental gene regulatory network architecture across 500 million years of echinoderm evolution

    Science.gov (United States)

    Hinman, Veronica F.; Nguyen, Albert T.; Cameron, R. Andrew; Davidson, Eric H.

    2003-01-01

    Evolutionary change in morphological features must depend on architectural reorganization of developmental gene regulatory networks (GRNs), just as true conservation of morphological features must imply retention of ancestral developmental GRN features. Key elements of the provisional GRN for embryonic endomesoderm development in the sea urchin are here compared with those operating in embryos of a distantly related echinoderm, a starfish. These animals diverged from their common ancestor 520-480 million years ago. Their endomesodermal fate maps are similar, except that sea urchins generate a skeletogenic cell lineage that produces a prominent skeleton lacking entirely in starfish larvae. A relevant set of regulatory genes was isolated from the starfish Asterina miniata, their expression patterns determined, and effects on the other genes of perturbing the expression of each were demonstrated. A three-gene feedback loop that is a fundamental feature of the sea urchin GRN for endoderm specification is found in almost identical form in the starfish: a detailed element of GRN architecture has been retained since the Cambrian Period in both echinoderm lineages. The significance of this retention is highlighted by the observation of numerous specific differences in the GRN connections as well. A regulatory gene used to drive skeletogenesis in the sea urchin is used entirely differently in the starfish, where it responds to endomesodermal inputs that do not affect it in the sea urchin embryo. Evolutionary changes in the GRNs since divergence are limited sharply to certain cis-regulatory elements, whereas others have persisted unaltered.

  4. Construction of an integrated gene regulatory network link to stress-related immune system in cattle.

    Science.gov (United States)

    Behdani, Elham; Bakhtiarizadeh, Mohammad Reza

    2017-08-20

    The immune system is an important biological system that is negatively impacted by stress. This study constructed an integrated regulatory network to enhance our understanding of the regulatory gene network used in the stress-related immune system. Module inference was used to construct modules of co-expressed genes with bovine leukocyte RNA-Seq data. Transcription factors (TFs) were then assigned to these modules using Lemon-Tree algorithms. In addition, the TFs assigned to each module were confirmed using the promoter analysis and protein-protein interactions data. Therefore, our integrated method identified three TFs which include one TF that is previously known to be involved in immune response (MYBL2) and two TFs (E2F8 and FOXS1) that had not been recognized previously and were identified for the first time in this study as novel regulatory candidates in immune response. This study provides valuable insights on the regulatory programs of genes involved in the stress-related immune system.

  5. Developmental evolution in social insects: regulatory networks from genes to societies.

    Science.gov (United States)

    Linksvayer, Timothy A; Fewell, Jennifer H; Gadau, Jürgen; Laubichler, Manfred D

    2012-05-01

    The evolution and development of complex phenotypes in social insect colonies, such as queen-worker dimorphism or division of labor, can, in our opinion, only be fully understood within an expanded mechanistic framework of Developmental Evolution. Conversely, social insects offer a fertile research area in which fundamental questions of Developmental Evolution can be addressed empirically. We review the concept of gene regulatory networks (GRNs) that aims to fully describe the battery of interacting genomic modules that are differentially expressed during the development of individual organisms. We discuss how distinct types of network models have been used to study different levels of biological organization in social insects, from GRNs to social networks. We propose that these hierarchical networks spanning different organizational levels from genes to societies should be integrated and incorporated into full GRN models to elucidate the evolutionary and developmental mechanisms underlying social insect phenotypes. Finally, we discuss prospects and approaches to achieve such an integration.

  6. Increasing galactose consumption by Saccharomyces cerevisiae through metabolic engineering of the GAL gene regulatory network

    DEFF Research Database (Denmark)

    Østergaard, Simon; Olsson, Lisbeth; Johnston, M.

    2000-01-01

    in the pathway, and ultimately, increasing metabolic flux through the pathway of interest, By manipulating the GAL gene regulatory network of Saccharomyces cerevisiae, which is a tightly regulated system, we produced prototroph mutant strains, which increased the flux through the galactose utilization pathway...... media. The improved galactose consumption of the gal mutants did not favor biomass formation, but rather caused excessive respiro-fermentative metabolism, with the ethanol production rate increasing linearly with glycolytic flux....

  7. Methods for Characterizing the Epigenetic Attractors Landscape Associated with Boolean Gene Regulatory Networks

    OpenAIRE

    Davila-Velderrain, Jose; Juarez-Ramiro, Luis; Martinez-Garcia, Juan C.; Alvarez-Buylla, Elena R

    2015-01-01

    Gene regulatory network (GRN) modeling is a well-established theoretical framework for the study of cell-fate specification during developmental processes. Recently, dynamical models of GRNs have been taken as a basis for formalizing the metaphorical model of Waddington's epigenetic landscape, providing a natural extension for the general protocol of GRN modeling. In this contribution we present in a coherent framework a novel implementation of two previously proposed general frameworks for m...

  8. A Systems’ Biology Approach to Study MicroRNA-Mediated Gene Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Xin Lai

    2013-01-01

    Full Text Available MicroRNAs (miRNAs are potent effectors in gene regulatory networks where aberrant miRNA expression can contribute to human diseases such as cancer. For a better understanding of the regulatory role of miRNAs in coordinating gene expression, we here present a systems biology approach combining data-driven modeling and model-driven experiments. Such an approach is characterized by an iterative process, including biological data acquisition and integration, network construction, mathematical modeling and experimental validation. To demonstrate the application of this approach, we adopt it to investigate mechanisms of collective repression on p21 by multiple miRNAs. We first construct a p21 regulatory network based on data from the literature and further expand it using algorithms that predict molecular interactions. Based on the network structure, a detailed mechanistic model is established and its parameter values are determined using data. Finally, the calibrated model is used to study the effect of different miRNA expression profiles and cooperative target regulation on p21 expression levels in different biological contexts.

  9. Multi-target trapping in constrained environments using gene regulatory network-based pattern formation

    Directory of Open Access Journals (Sweden)

    Xingguang Peng

    2016-10-01

    Full Text Available Inspired by the morphogenesis of biological organisms, gene regulatory network-based methods have been used in complex pattern formation of swarm robotic systems. In this article, obstacle information was embedded into the gene regulatory network model to make the robots trap targets with an expected pattern while avoiding obstacles in a distributed manner. Based on the modified gene regulatory network model, an implicit function method was adopted to represent the expected pattern which is easily adjusted by adding extra feature points. Considering environmental constraints (e.g. tunnels or gaps in which robots must adjust their pattern to conduct trapping task, a pattern adaptation strategy was proposed for the pattern modeler to adaptively adjust the expected pattern. Also to trap multiple targets, a splitting pattern adaptation strategy was proposed for diffusively moving targets so that the robots can trap each target separately with split sub-patterns. The proposed model and strategies were verified through a set of simulation with complex environmental constraints and non-consensus movements of targets.

  10. Inference of gene regulatory networks from genetic perturbations with linear regression model.

    Directory of Open Access Journals (Sweden)

    Zijian Dong

    Full Text Available It is an effective strategy to use both genetic perturbation data and gene expression data to infer regulatory networks that aims to improve the detection accuracy of the regulatory relationships among genes. Based on both types of data, the genetic regulatory networks can be accurately modeled by Structural Equation Modeling (SEM. In this paper, a linear regression (LR model is formulated based on the SEM, and a novel iterative scheme using Bayesian inference is proposed to estimate the parameters of the LR model (LRBI. Comparative evaluations of LRBI with other two algorithms, the Adaptive Lasso (AL-Based and the Sparsity-aware Maximum Likelihood (SML, are also presented. Simulations show that LRBI has significantly better performance than AL-Based, and overperforms SML in terms of power of detection. Applying the LRBI algorithm to experimental data, we inferred the interactions in a network of 35 yeast genes. An open-source program of the LRBI algorithm is freely available upon request.

  11. A parallel implementation of the network identification by multiple regression (NIR algorithm to reverse-engineer regulatory gene networks.

    Directory of Open Access Journals (Sweden)

    Francesco Gregoretti

    Full Text Available The reverse engineering of gene regulatory networks using gene expression profile data has become crucial to gain novel biological knowledge. Large amounts of data that need to be analyzed are currently being produced due to advances in microarray technologies. Using current reverse engineering algorithms to analyze large data sets can be very computational-intensive. These emerging computational requirements can be met using parallel computing techniques. It has been shown that the Network Identification by multiple Regression (NIR algorithm performs better than the other ready-to-use reverse engineering software. However it cannot be used with large networks with thousands of nodes--as is the case in biological networks--due to the high time and space complexity. In this work we overcome this limitation by designing and developing a parallel version of the NIR algorithm. The new implementation of the algorithm reaches a very good accuracy even for large gene networks, improving our understanding of the gene regulatory networks that is crucial for a wide range of biomedical applications.

  12. iRegulon: from a gene list to a gene regulatory network using large motif and track collections.

    Directory of Open Access Journals (Sweden)

    Rekin's Janky

    2014-07-01

    Full Text Available Identifying master regulators of biological processes and mapping their downstream gene networks are key challenges in systems biology. We developed a computational method, called iRegulon, to reverse-engineer the transcriptional regulatory network underlying a co-expressed gene set using cis-regulatory sequence analysis. iRegulon implements a genome-wide ranking-and-recovery approach to detect enriched transcription factor motifs and their optimal sets of direct targets. We increase the accuracy of network inference by using very large motif collections of up to ten thousand position weight matrices collected from various species, and linking these to candidate human TFs via a motif2TF procedure. We validate iRegulon on gene sets derived from ENCODE ChIP-seq data with increasing levels of noise, and we compare iRegulon with existing motif discovery methods. Next, we use iRegulon on more challenging types of gene lists, including microRNA target sets, protein-protein interaction networks, and genetic perturbation data. In particular, we over-activate p53 in breast cancer cells, followed by RNA-seq and ChIP-seq, and could identify an extensive up-regulated network controlled directly by p53. Similarly we map a repressive network with no indication of direct p53 regulation but rather an indirect effect via E2F and NFY. Finally, we generalize our computational framework to include regulatory tracks such as ChIP-seq data and show how motif and track discovery can be combined to map functional regulatory interactions among co-expressed genes. iRegulon is available as a Cytoscape plugin from http://iregulon.aertslab.org.

  13. iRegulon: from a gene list to a gene regulatory network using large motif and track collections.

    Directory of Open Access Journals (Sweden)

    Rekin's Janky

    2014-07-01

    Full Text Available Identifying master regulators of biological processes and mapping their downstream gene networks are key challenges in systems biology. We developed a computational method, called iRegulon, to reverse-engineer the transcriptional regulatory network underlying a co-expressed gene set using cis-regulatory sequence analysis. iRegulon implements a genome-wide ranking-and-recovery approach to detect enriched transcription factor motifs and their optimal sets of direct targets. We increase the accuracy of network inference by using very large motif collections of up to ten thousand position weight matrices collected from various species, and linking these to candidate human TFs via a motif2TF procedure. We validate iRegulon on gene sets derived from ENCODE ChIP-seq data with increasing levels of noise, and we compare iRegulon with existing motif discovery methods. Next, we use iRegulon on more challenging types of gene lists, including microRNA target sets, protein-protein interaction networks, and genetic perturbation data. In particular, we over-activate p53 in breast cancer cells, followed by RNA-seq and ChIP-seq, and could identify an extensive up-regulated network controlled directly by p53. Similarly we map a repressive network with no indication of direct p53 regulation but rather an indirect effect via E2F and NFY. Finally, we generalize our computational framework to include regulatory tracks such as ChIP-seq data and show how motif and track discovery can be combined to map functional regulatory interactions among co-expressed genes. iRegulon is available as a Cytoscape plugin from http://iregulon.aertslab.org.

  14. iRegulon: from a gene list to a gene regulatory network using large motif and track collections.

    Science.gov (United States)

    Janky, Rekin's; Verfaillie, Annelien; Imrichová, Hana; Van de Sande, Bram; Standaert, Laura; Christiaens, Valerie; Hulselmans, Gert; Herten, Koen; Naval Sanchez, Marina; Potier, Delphine; Svetlichnyy, Dmitry; Kalender Atak, Zeynep; Fiers, Mark; Marine, Jean-Christophe; Aerts, Stein

    2014-07-01

    Identifying master regulators of biological processes and mapping their downstream gene networks are key challenges in systems biology. We developed a computational method, called iRegulon, to reverse-engineer the transcriptional regulatory network underlying a co-expressed gene set using cis-regulatory sequence analysis. iRegulon implements a genome-wide ranking-and-recovery approach to detect enriched transcription factor motifs and their optimal sets of direct targets. We increase the accuracy of network inference by using very large motif collections of up to ten thousand position weight matrices collected from various species, and linking these to candidate human TFs via a motif2TF procedure. We validate iRegulon on gene sets derived from ENCODE ChIP-seq data with increasing levels of noise, and we compare iRegulon with existing motif discovery methods. Next, we use iRegulon on more challenging types of gene lists, including microRNA target sets, protein-protein interaction networks, and genetic perturbation data. In particular, we over-activate p53 in breast cancer cells, followed by RNA-seq and ChIP-seq, and could identify an extensive up-regulated network controlled directly by p53. Similarly we map a repressive network with no indication of direct p53 regulation but rather an indirect effect via E2F and NFY. Finally, we generalize our computational framework to include regulatory tracks such as ChIP-seq data and show how motif and track discovery can be combined to map functional regulatory interactions among co-expressed genes. iRegulon is available as a Cytoscape plugin from http://iregulon.aertslab.org.

  15. In vitro gene regulatory networks predict in vivo function of liver

    Directory of Open Access Journals (Sweden)

    Ang Choo Y

    2010-11-01

    Full Text Available Abstract Background Evolution of toxicity testing is predicated upon using in vitro cell based systems to rapidly screen and predict how a chemical might cause toxicity to an organ in vivo. However, the degree to which we can extend in vitro results to in vivo activity and possible mechanisms of action remains to be fully addressed. Results Here we use the nitroaromatic 2,4,6-trinitrotoluene (TNT as a model chemical to compare and determine how we might extrapolate from in vitro data to in vivo effects. We found 341 transcripts differentially expressed in common among in vitro and in vivo assays in response to TNT. The major functional term corresponding to these transcripts was cell cycle. Similarly modulated common pathways were identified between in vitro and in vivo. Furthermore, we uncovered the conserved common transcriptional gene regulatory networks between in vitro and in vivo cellular liver systems that responded to TNT exposure, which mainly contain 2 subnetwork modules: PTTG1 and PIR centered networks. Interestingly, all 7 genes in the PTTG1 module were involved in cell cycle and downregulated by TNT both in vitro and in vivo. Conclusions The results of our investigation of TNT effects on gene expression in liver suggest that gene regulatory networks obtained from an in vitro system can predict in vivo function and mechanisms. Inhibiting PTTG1 and its targeted cell cyle related genes could be key machanism for TNT induced liver toxicity.

  16. In vitro gene regulatory networks predict in vivo function of liver

    Science.gov (United States)

    2010-01-01

    Background Evolution of toxicity testing is predicated upon using in vitro cell based systems to rapidly screen and predict how a chemical might cause toxicity to an organ in vivo. However, the degree to which we can extend in vitro results to in vivo activity and possible mechanisms of action remains to be fully addressed. Results Here we use the nitroaromatic 2,4,6-trinitrotoluene (TNT) as a model chemical to compare and determine how we might extrapolate from in vitro data to in vivo effects. We found 341 transcripts differentially expressed in common among in vitro and in vivo assays in response to TNT. The major functional term corresponding to these transcripts was cell cycle. Similarly modulated common pathways were identified between in vitro and in vivo. Furthermore, we uncovered the conserved common transcriptional gene regulatory networks between in vitro and in vivo cellular liver systems that responded to TNT exposure, which mainly contain 2 subnetwork modules: PTTG1 and PIR centered networks. Interestingly, all 7 genes in the PTTG1 module were involved in cell cycle and downregulated by TNT both in vitro and in vivo. Conclusions The results of our investigation of TNT effects on gene expression in liver suggest that gene regulatory networks obtained from an in vitro system can predict in vivo function and mechanisms. Inhibiting PTTG1 and its targeted cell cyle related genes could be key machanism for TNT induced liver toxicity. PMID:21073692

  17. Adaptive modelling of gene regulatory network using Bayesian information criterion-guided sparse regression approach.

    Science.gov (United States)

    Shi, Ming; Shen, Weiming; Wang, Hong-Qiang; Chong, Yanwen

    2016-12-01

    Inferring gene regulatory networks (GRNs) from microarray expression data are an important but challenging issue in systems biology. In this study, the authors propose a Bayesian information criterion (BIC)-guided sparse regression approach for GRN reconstruction. This approach can adaptively model GRNs by optimising the l1-norm regularisation of sparse regression based on a modified version of BIC. The use of the regularisation strategy ensures the inferred GRNs to be as sparse as natural, while the modified BIC allows incorporating prior knowledge on expression regulation and thus avoids the overestimation of expression regulators as usual. Especially, the proposed method provides a clear interpretation of combinatorial regulations of gene expression by optimally extracting regulation coordination for a given target gene. Experimental results on both simulation data and real-world microarray data demonstrate the competent performance of discovering regulatory relationships in GRN reconstruction.

  18. Localizing potentially active post-transcriptional regulations in the Ewing's sarcoma gene regulatory network

    Directory of Open Access Journals (Sweden)

    Delyon Bernard

    2010-11-01

    Full Text Available Abstract Background A wide range of techniques is now available for analyzing regulatory networks. Nonetheless, most of these techniques fail to interpret large-scale transcriptional data at the post-translational level. Results We address the question of using large-scale transcriptomic observation of a system perturbation to analyze a regulatory network which contained several types of interactions - transcriptional and post-translational. Our method consisted of post-processing the outputs of an open-source tool named BioQuali - an automatic constraint-based analysis mimicking biologist's local reasoning on a large scale. The post-processing relied on differences in the behavior of the transcriptional and post-translational levels in the network. As a case study, we analyzed a network representation of the genes and proteins controlled by an oncogene in the context of Ewing's sarcoma. The analysis allowed us to pinpoint active interactions specific to this cancer. We also identified the parts of the network which were incomplete and should be submitted for further investigation. Conclusions The proposed approach is effective for the qualitative analysis of cancer networks. It allows the integrative use of experimental data of various types in order to identify the specific information that should be considered a priority in the initial - and possibly very large - experimental dataset. Iteratively, new dataset can be introduced into the analysis to improve the network representation and make it more specific.

  19. A recursive network approach can identify constitutive regulatory circuits in gene expression data

    Science.gov (United States)

    Blasi, Monica Francesca; Casorelli, Ida; Colosimo, Alfredo; Blasi, Francesco Simone; Bignami, Margherita; Giuliani, Alessandro

    2005-03-01

    The activity of the cell is often coordinated by the organisation of proteins into regulatory circuits that share a common function. Genome-wide expression profiles might contain important information on these circuits. Current approaches for the analysis of gene expression data include clustering the individual expression measurements and relating them to biological functions as well as modelling and simulation of gene regulation processes by additional computer tools. The identification of the regulative programmes from microarray experiments is limited, however, by the intrinsic difficulty of linear methods to detect low-variance signals and by the sensitivity of the different approaches. Here we face the problem of recognising invariant patterns of correlations among gene expression reminiscent of regulation circuits. We demonstrate that a recursive neural network approach can identify genetic regulation circuits from expression data for ribosomal and genome stability genes. The proposed method, by greatly enhancing the sensitivity of microarray studies, allows the identification of important aspects of genetic regulation networks and might be useful for the discrimination of the different players involved in regulation circuits. Our results suggest that the constitutive regulatory networks involved in the generic organisation of the cell display a high degree of clustering depending on a modular architecture.

  20. Modularity of gene-regulatory networks revealed in sea-star development

    Directory of Open Access Journals (Sweden)

    Degnan Bernard M

    2011-01-01

    Full Text Available Abstract Evidence that conserved developmental gene-regulatory networks can change as a unit during deutersostome evolution emerges from a study published in BMC Biology. This shows that genes consistently expressed in anterior brain patterning in hemichordates and chordates are expressed in a similar spatial pattern in another deuterostome, an asteroid echinoderm (sea star, but in a completely different developmental context (the animal-vegetal axis. This observation has implications for hypotheses on the type of development present in the deuterostome common ancestor. See research article: http://www.biomedcentral.com/1741-7007/8/143/abstract

  1. Dissecting early regulatory relationships in the lamprey neural crest gene network.

    Science.gov (United States)

    Nikitina, Natalya; Sauka-Spengler, Tatjana; Bronner-Fraser, Marianne

    2008-12-23

    The neural crest, a multipotent embryonic cell type, originates at the border between neural and nonneural ectoderm. After neural tube closure, these cells undergo an epithelial-mesenchymal transition, migrate to precise, often distant locations, and differentiate into diverse derivatives. Analyses of expression and function of signaling and transcription factors in higher vertebrates has led to the proposal that a neural crest gene regulatory network (NC-GRN) orchestrates neural crest formation. Here, we interrogate the NC-GRN in the lamprey, taking advantage of its slow development and basal phylogenetic position to resolve early inductive events, 1 regulatory step at the time. To establish regulatory relationships at the neural plate border, we assess relative expression of 6 neural crest network genes and effects of individually perturbing each on the remaining 5. The results refine an upstream portion of the NC-GRN and reveal unexpected order and linkages therein; e.g., lamprey AP-2 appears to function early as a neural plate border rather than a neural crest specifier and in a pathway linked to MsxA but independent of ZicA. These findings provide an ancestral framework for performing comparative tests in higher vertebrates in which network linkages may be more difficult to resolve because of their rapid development.

  2. The Max-Min High-Order Dynamic Bayesian Network for Learning Gene Regulatory Networks with Time-Delayed Regulations.

    Science.gov (United States)

    Li, Yifeng; Chen, Haifen; Zheng, Jie; Ngom, Alioune

    2016-01-01

    Accurately reconstructing gene regulatory network (GRN) from gene expression data is a challenging task in systems biology. Although some progresses have been made, the performance of GRN reconstruction still has much room for improvement. Because many regulatory events are asynchronous, learning gene interactions with multiple time delays is an effective way to improve the accuracy of GRN reconstruction. Here, we propose a new approach, called Max-Min high-order dynamic Bayesian network (MMHO-DBN) by extending the Max-Min hill-climbing Bayesian network technique originally devised for learning a Bayesian network's structure from static data. Our MMHO-DBN can explicitly model the time lags between regulators and targets in an efficient manner. It first uses constraint-based ideas to limit the space of potential structures, and then applies search-and-score ideas to search for an optimal HO-DBN structure. The performance of MMHO-DBN to GRN reconstruction was evaluated using both synthetic and real gene expression time-series data. Results show that MMHO-DBN is more accurate than current time-delayed GRN learning methods, and has an intermediate computing performance. Furthermore, it is able to learn long time-delayed relationships between genes. We applied sensitivity analysis on our model to study the performance variation along different parameter settings. The result provides hints on the setting of parameters of MMHO-DBN.

  3. Similarity in gene-regulatory networks suggests that cancer cells share characteristics of embryonic neural cells.

    Science.gov (United States)

    Zhang, Zan; Lei, Anhua; Xu, Liyang; Chen, Lu; Chen, Yonglong; Zhang, Xuena; Gao, Yan; Yang, Xiaoli; Zhang, Min; Cao, Ying

    2017-08-04

    Cancer cells are immature cells resulting from cellular reprogramming by gene misregulation, and redifferentiation is expected to reduce malignancy. It is unclear, however, whether cancer cells can undergo terminal differentiation. Here, we show that inhibition of the epigenetic modification enzyme enhancer of zeste homolog 2 (EZH2), histone deacetylases 1 and 3 (HDAC1 and -3), lysine demethylase 1A (LSD1), or DNA methyltransferase 1 (DNMT1), which all promote cancer development and progression, leads to postmitotic neuron-like differentiation with loss of malignant features in distinct solid cancer cell lines. The regulatory effect of these enzymes in neuronal differentiation resided in their intrinsic activity in embryonic neural precursor/progenitor cells. We further found that a major part of pan-cancer-promoting genes and the signal transducers of the pan-cancer-promoting signaling pathways, including the epithelial-to-mesenchymal transition (EMT) mesenchymal marker genes, display neural specific expression during embryonic neurulation. In contrast, many tumor suppressor genes, including the EMT epithelial marker gene that encodes cadherin 1 (CDH1), exhibited non-neural or no expression. This correlation indicated that cancer cells and embryonic neural cells share a regulatory network, mediating both tumorigenesis and neural development. This observed similarity in regulatory mechanisms suggests that cancer cells might share characteristics of embryonic neural cells. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  4. The Caenorhabditis elegans vulva: a post-embryonic gene regulatory network controlling organogenesis.

    Science.gov (United States)

    Ririe, Ted O; Fernandes, Jolene S; Sternberg, Paul W

    2008-12-23

    The Caenorhabditis elegans vulva is an elegant model for dissecting a gene regulatory network (GRN) that directs postembryonic organogenesis. The mature vulva comprises seven cell types (vulA, vulB1, vulB2, vulC, vulD, vulE, and vulF), each with its own unique pattern of spatial and temporal gene expression. The mechanisms that specify these cell types in a precise spatial pattern are not well understood. Using reverse genetic screens, we identified novel components of the vulval GRN, including nhr-113 in vulA. Several transcription factors (lin-11, lin-29, cog-1, egl-38, and nhr-67) interact with each other and act in concert to regulate target gene expression in the diverse vulval cell types. For example, egl-38 (Pax2/5/8) stabilizes the vulF fate by positively regulating vulF characteristics and by inhibiting characteristics associated with the neighboring vulE cells. nhr-67 and egl-38 regulate cog-1, helping restrict its expression to vulE. Computational approaches have been successfully used to identify functional cis-regulatory motifs in the zmp-1 (zinc metalloproteinase) promoter. These results provide an overview of the regulatory network architecture for each vulval cell type.

  5. High Dimensional ODEs Coupled with Mixed-Effects Modeling Techniques for Dynamic Gene Regulatory Network Identification.

    Science.gov (United States)

    Lu, Tao; Liang, Hua; Li, Hongzhe; Wu, Hulin

    2011-01-01

    Gene regulation is a complicated process. The interaction of many genes and their products forms an intricate biological network. Identification of this dynamic network will help us understand the biological process in a systematic way. However, the construction of such a dynamic network is very challenging for a high-dimensional system. In this article we propose to use a set of ordinary differential equations (ODE), coupled with dimensional reduction by clustering and mixed-effects modeling techniques, to model the dynamic gene regulatory network (GRN). The ODE models allow us to quantify both positive and negative gene regulations as well as feedback effects of one set of genes in a functional module on the dynamic expression changes of the genes in another functional module, which results in a directed graph network. A five-step procedure, Clustering, Smoothing, regulation Identification, parameter Estimates refining and Function enrichment analysis (CSIEF) is developed to identify the ODE-based dynamic GRN. In the proposed CSIEF procedure, a series of cutting-edge statistical methods and techniques are employed, that include non-parametric mixed-effects models with a mixture distribution for clustering, nonparametric mixed-effects smoothing-based methods for ODE models, the smoothly clipped absolute deviation (SCAD)-based variable selection, and stochastic approximation EM (SAEM) approach for mixed-effects ODE model parameter estimation. The key step, the SCAD-based variable selection of the proposed procedure is justified by investigating its asymptotic properties and validated by Monte Carlo simulations. We apply the proposed method to identify the dynamic GRN for yeast cell cycle progression data. We are able to annotate the identified modules through function enrichment analyses. Some interesting biological findings are discussed. The proposed procedure is a promising tool for constructing a general dynamic GRN and more complicated dynamic networks.

  6. State of the Art of Fuzzy Methods for Gene Regulatory Networks Inference

    Science.gov (United States)

    Al Qazlan, Tuqyah Abdullah; Kara-Mohamed, Chafia

    2015-01-01

    To address one of the most challenging issues at the cellular level, this paper surveys the fuzzy methods used in gene regulatory networks (GRNs) inference. GRNs represent causal relationships between genes that have a direct influence, trough protein production, on the life and the development of living organisms and provide a useful contribution to the understanding of the cellular functions as well as the mechanisms of diseases. Fuzzy systems are based on handling imprecise knowledge, such as biological information. They provide viable computational tools for inferring GRNs from gene expression data, thus contributing to the discovery of gene interactions responsible for specific diseases and/or ad hoc correcting therapies. Increasing computational power and high throughput technologies have provided powerful means to manage these challenging digital ecosystems at different levels from cell to society globally. The main aim of this paper is to report, present, and discuss the main contributions of this multidisciplinary field in a coherent and structured framework. PMID:25879048

  7. State of the Art of Fuzzy Methods for Gene Regulatory Networks Inference

    Directory of Open Access Journals (Sweden)

    Tuqyah Abdullah Al Qazlan

    2015-01-01

    Full Text Available To address one of the most challenging issues at the cellular level, this paper surveys the fuzzy methods used in gene regulatory networks (GRNs inference. GRNs represent causal relationships between genes that have a direct influence, trough protein production, on the life and the development of living organisms and provide a useful contribution to the understanding of the cellular functions as well as the mechanisms of diseases. Fuzzy systems are based on handling imprecise knowledge, such as biological information. They provide viable computational tools for inferring GRNs from gene expression data, thus contributing to the discovery of gene interactions responsible for specific diseases and/or ad hoc correcting therapies. Increasing computational power and high throughput technologies have provided powerful means to manage these challenging digital ecosystems at different levels from cell to society globally. The main aim of this paper is to report, present, and discuss the main contributions of this multidisciplinary field in a coherent and structured framework.

  8. Two different modes of oscillation in a gene transcription regulatory network with interlinked positive and negative feedback loops

    Science.gov (United States)

    Karmakar, Rajesh

    2016-12-01

    We study the oscillatory behavior of a gene regulatory network with interlinked positive and negative feedback loop. The frequency and amplitude are two important properties of oscillation. The studied network produces two different modes of oscillation. In one mode (mode-I), frequency of oscillation remains constant over a wide range of amplitude and in the other mode (mode-II) the amplitude of oscillation remains constant over a wide range of frequency. Our study reproduces both features of oscillations in a single gene regulatory network and shows that the negative plus positive feedback loops in gene regulatory network offer additional advantage. We identified the key parameters/variables responsible for different modes of oscillation. The network is flexible in switching between different modes by choosing appropriately the required parameters/variables.

  9. Neural model of gene regulatory network: a survey on supportive meta-heuristics.

    Science.gov (United States)

    Biswas, Surama; Acharyya, Sriyankar

    2016-06-01

    Gene regulatory network (GRN) is produced as a result of regulatory interactions between different genes through their coded proteins in cellular context. Having immense importance in disease detection and drug finding, GRN has been modelled through various mathematical and computational schemes and reported in survey articles. Neural and neuro-fuzzy models have been the focus of attraction in bioinformatics. Predominant use of meta-heuristic algorithms in training neural models has proved its excellence. Considering these facts, this paper is organized to survey neural modelling schemes of GRN and the efficacy of meta-heuristic algorithms towards parameter learning (i.e. weighting connections) within the model. This survey paper renders two different structure-related approaches to infer GRN which are global structure approach and substructure approach. It also describes two neural modelling schemes, such as artificial neural network/recurrent neural network based modelling and neuro-fuzzy modelling. The meta-heuristic algorithms applied so far to learn the structure and parameters of neutrally modelled GRN have been reviewed here.

  10. A model of gene expression based on random dynamical systems reveals modularity properties of gene regulatory networks.

    Science.gov (United States)

    Antoneli, Fernando; Ferreira, Renata C; Briones, Marcelo R S

    2016-06-01

    Here we propose a new approach to modeling gene expression based on the theory of random dynamical systems (RDS) that provides a general coupling prescription between the nodes of any given regulatory network given the dynamics of each node is modeled by a RDS. The main virtues of this approach are the following: (i) it provides a natural way to obtain arbitrarily large networks by coupling together simple basic pieces, thus revealing the modularity of regulatory networks; (ii) the assumptions about the stochastic processes used in the modeling are fairly general, in the sense that the only requirement is stationarity; (iii) there is a well developed mathematical theory, which is a blend of smooth dynamical systems theory, ergodic theory and stochastic analysis that allows one to extract relevant dynamical and statistical information without solving the system; (iv) one may obtain the classical rate equations form the corresponding stochastic version by averaging the dynamic random variables (small noise limit). It is important to emphasize that unlike the deterministic case, where coupling two equations is a trivial matter, coupling two RDS is non-trivial, specially in our case, where the coupling is performed between a state variable of one gene and the switching stochastic process of another gene and, hence, it is not a priori true that the resulting coupled system will satisfy the definition of a random dynamical system. We shall provide the necessary arguments that ensure that our coupling prescription does indeed furnish a coupled regulatory network of random dynamical systems. Finally, the fact that classical rate equations are the small noise limit of our stochastic model ensures that any validation or prediction made on the basis of the classical theory is also a validation or prediction of our model. We illustrate our framework with some simple examples of single-gene system and network motifs.

  11. Systematic Approach to Computational Design of Gene Regulatory Networks with Information Processing Capabilities.

    Science.gov (United States)

    Moskon, Miha; Mraz, Miha

    2014-01-01

    We present several measures that can be used in de novo computational design of biological systems with information processing capabilities. Their main purpose is to objectively evaluate the behavior and identify the biological information processing structures with the best dynamical properties. They can be used to define constraints that allow one to simplify the design of more complex biological systems. These measures can be applied to existent computational design approaches in synthetic biology, i.e., rational and automatic design approaches. We demonstrate their use on a) the computational models of several basic information processing structures implemented with gene regulatory networks and b) on a modular design of a synchronous toggle switch.

  12. MicroRNA Gene Regulatory Networks in Peripheral Nerve Sheath Tumors

    Science.gov (United States)

    2012-09-01

    NUMBER (include area code) Standard Form 298 (Rev. 8-98) Prescribed by ANSI Std . Z39.18 MicroRNA Gene Regulatory Networks in Peripheral Nerve...Department of Surgery, University of Minnesota, 11-212 Moos Tower (Mail Code: MMC 195), 515 Delaware St, S.E, Minneapolis, MN 55455, USA e-mail...malignant bone tumor with an incidence of 4–5 cases per million. It arises from the metaphysis of the long bones of adolescents and young adults. Two

  13. Redeployment of a conserved gene regulatory network during Aedes aegypti development.

    Science.gov (United States)

    Suryamohan, Kushal; Hanson, Casey; Andrews, Emily; Sinha, Saurabh; Scheel, Molly Duman; Halfon, Marc S

    2016-08-15

    Changes in gene regulatory networks (GRNs) underlie the evolution of morphological novelty and developmental system drift. The fruitfly Drosophila melanogaster and the dengue and Zika vector mosquito Aedes aegypti have substantially similar nervous system morphology. Nevertheless, they show significant divergence in a set of genes co-expressed in the midline of the Drosophila central nervous system, including the master regulator single minded and downstream genes including short gastrulation, Star, and NetrinA. In contrast to Drosophila, we find that midline expression of these genes is either absent or severely diminished in A. aegypti. Instead, they are co-expressed in the lateral nervous system. This suggests that in A. aegypti this "midline GRN" has been redeployed to a new location while lost from its previous site of activity. In order to characterize the relevant GRNs, we employed the SCRMshaw method we previously developed to identify transcriptional cis-regulatory modules in both species. Analysis of these regulatory sequences in transgenic Drosophila suggests that the altered gene expression observed in A. aegypti is the result of trans-dependent redeployment of the GRN, potentially stemming from cis-mediated changes in the expression of sim and other as-yet unidentified regulators. Our results illustrate a novel "repeal, replace, and redeploy" mode of evolution in which a conserved GRN acquires a different function at a new site while its original function is co-opted by a different GRN. This represents a striking example of developmental system drift in which the dramatic shift in gene expression does not result in gross morphological changes, but in more subtle differences in development and function of the late embryonic nervous system. Copyright © 2016 Elsevier Inc. All rights reserved.

  14. Inference of Gene Regulatory Networks Using Bayesian Nonparametric Regression and Topology Information

    Science.gov (United States)

    2017-01-01

    Gene regulatory networks (GRNs) play an important role in cellular systems and are important for understanding biological processes. Many algorithms have been developed to infer the GRNs. However, most algorithms only pay attention to the gene expression data but do not consider the topology information in their inference process, while incorporating this information can partially compensate for the lack of reliable expression data. Here we develop a Bayesian group lasso with spike and slab priors to perform gene selection and estimation for nonparametric models. B-spline basis functions are used to capture the nonlinear relationships flexibly and penalties are used to avoid overfitting. Further, we incorporate the topology information into the Bayesian method as a prior. We present the application of our method on DREAM3 and DREAM4 datasets and two real biological datasets. The results show that our method performs better than existing methods and the topology information prior can improve the result. PMID:28133490

  15. Inference of Gene Regulatory Networks Using Bayesian Nonparametric Regression and Topology Information

    Directory of Open Access Journals (Sweden)

    Yue Fan

    2017-01-01

    Full Text Available Gene regulatory networks (GRNs play an important role in cellular systems and are important for understanding biological processes. Many algorithms have been developed to infer the GRNs. However, most algorithms only pay attention to the gene expression data but do not consider the topology information in their inference process, while incorporating this information can partially compensate for the lack of reliable expression data. Here we develop a Bayesian group lasso with spike and slab priors to perform gene selection and estimation for nonparametric models. B-spline basis functions are used to capture the nonlinear relationships flexibly and penalties are used to avoid overfitting. Further, we incorporate the topology information into the Bayesian method as a prior. We present the application of our method on DREAM3 and DREAM4 datasets and two real biological datasets. The results show that our method performs better than existing methods and the topology information prior can improve the result.

  16. Evolution of gene regulatory network architectures: examples of subcircuit conservation and plasticity between classes of echinoderms.

    Science.gov (United States)

    Hinman, Veronica F; Yankura, Kristen A; McCauley, Brenna S

    2009-04-01

    Developmental gene regulatory networks (GRNs) explain how regulatory states are established in particular cells during development and how these states then determine the final form of the embryo. Evolutionary changes to the sequence of the genome will direct reorganization of GRN architectures, which in turn will lead to the alteration of developmental programs. A comparison of GRN architectures must consequently reveal the molecular basis for the evolution of developmental programs among different organisms. This review highlights some of the important findings that have emerged from the most extensive direct comparison of GRN architectures to date. Comparison of the orthologous GRNs for endomesodermal specification in the sea urchin and sea star, provides examples of several discrete, functional GRN subcircuits and shows that they are subject to diverse selective pressures. This demonstrates that different regulatory linkages may be more or less amenable to evolutionary change. One of the more surprising findings from this comparison is that GRN-level functions may be maintained while the factors performing the functions have changed, suggesting that GRNs have a high capacity for compensatory changes involving transcription factor binding to cis regulatory modules.

  17. Inference of Gene Regulatory Networks Based on a Universal Minimum Description Length

    Directory of Open Access Journals (Sweden)

    Dougherty John

    2008-01-01

    Full Text Available The Boolean network paradigm is a simple and effective way to interpret genomic systems, but discovering the structure of these networks remains a difficult task. The minimum description length (MDL principle has already been used for inferring genetic regulatory networks from time-series expression data and has proven useful for recovering the directed connections in Boolean networks. However, the existing method uses an ad hoc measure of description length that necessitates a tuning parameter for artificially balancing the model and error costs and, as a result, directly conflicts with the MDL principle's implied universality. In order to surpass this difficulty, we propose a novel MDL-based method in which the description length is a theoretical measure derived from a universal normalized maximum likelihood model. The search space is reduced by applying an implementable analogue of Kolmogorov's structure function. The performance of the proposed method is demonstrated on random synthetic networks, for which it is shown to improve upon previously published network inference algorithms with respect to both speed and accuracy. Finally, it is applied to time-series Drosophila gene expression measurements.

  18. Inference of Gene Regulatory Networks Based on a Universal Minimum Description Length

    Directory of Open Access Journals (Sweden)

    Jaakko Astola

    2008-02-01

    Full Text Available The Boolean network paradigm is a simple and effective way to interpret genomic systems, but discovering the structure of these networks remains a difficult task. The minimum description length (MDL principle has already been used for inferring genetic regulatory networks from time-series expression data and has proven useful for recovering the directed connections in Boolean networks. However, the existing method uses an ad hoc measure of description length that necessitates a tuning parameter for artificially balancing the model and error costs and, as a result, directly conflicts with the MDL principle's implied universality. In order to surpass this difficulty, we propose a novel MDL-based method in which the description length is a theoretical measure derived from a universal normalized maximum likelihood model. The search space is reduced by applying an implementable analogue of Kolmogorov's structure function. The performance of the proposed method is demonstrated on random synthetic networks, for which it is shown to improve upon previously published network inference algorithms with respect to both speed and accuracy. Finally, it is applied to time-series Drosophila gene expression measurements.

  19. Majority Rules with Random Tie-Breaking in Boolean Gene Regulatory Networks

    Science.gov (United States)

    Chaouiya, Claudine; Ourrad, Ouerdia; Lima, Ricardo

    2013-01-01

    We consider threshold Boolean gene regulatory networks, where the update function of each gene is described as a majority rule evaluated among the regulators of that gene: it is turned ON when the sum of its regulator contributions is positive (activators contribute positively whereas repressors contribute negatively) and turned OFF when this sum is negative. In case of a tie (when contributions cancel each other out), it is often assumed that the gene keeps it current state. This framework has been successfully used to model cell cycle control in yeast. Moreover, several studies consider stochastic extensions to assess the robustness of such a model. Here, we introduce a novel, natural stochastic extension of the majority rule. It consists in randomly choosing the next value of a gene only in case of a tie. Hence, the resulting model includes deterministic and probabilistic updates. We present variants of the majority rule, including alternate treatments of the tie situation. Impact of these variants on the corresponding dynamical behaviours is discussed. After a thorough study of a class of two-node networks, we illustrate the interest of our stochastic extension using a published cell cycle model. In particular, we demonstrate that steady state analysis can be rigorously performed and can lead to effective predictions; these relate for example to the identification of interactions whose addition would ensure that a specific state is absorbing. PMID:23922761

  20. Majority rules with random tie-breaking in Boolean gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Claudine Chaouiya

    Full Text Available We consider threshold boolean gene regulatory networks, where the update function of each gene is described as a majority rule evaluated among the regulators of that gene: it is turned ON when the sum of its regulator contributions is positive (activators contribute positively whereas repressors contribute negatively and turned OFF when this sum is negative. In case of a tie (when contributions cancel each other out, it is often assumed that the gene keeps it current state. This framework has been successfully used to model cell cycle control in yeast. Moreover, several studies consider stochastic extensions to assess the robustness of such a model. Here, we introduce a novel, natural stochastic extension of the majority rule. It consists in randomly choosing the next value of a gene only in case of a tie. Hence, the resulting model includes deterministic and probabilistic updates. We present variants of the majority rule, including alternate treatments of the tie situation. Impact of these variants on the corresponding dynamical behaviours is discussed. After a thorough study of a class of two-node networks, we illustrate the interest of our stochastic extension using a published cell cycle model. In particular, we demonstrate that steady state analysis can be rigorously performed and can lead to effective predictions; these relate for example to the identification of interactions whose addition would ensure that a specific state is absorbing.

  1. Developmental gene regulatory network evolution: insights from comparative studies in echinoderms.

    Science.gov (United States)

    Hinman, Veronica F; Cheatle Jarvela, Alys M

    2014-03-01

    One of the central concerns of Evolutionary Developmental biology is to understand how the specification of cell types can change during evolution. In the last decade, developmental biology has progressed toward a systems level understanding of cell specification processes. In particular, the focus has been on determining the regulatory interactions of the repertoire of genes that make up gene regulatory networks (GRNs). Echinoderms provide an extraordinary model system for determining how GRNs evolve. This review highlights the comparative GRN analyses arising from the echinoderm system. This work shows that certain types of GRN subcircuits or motifs, i.e., those involving positive feedback, tend to be conserved and may provide a constraint on development. This conservation may be due to a required arrangement of transcription factor binding sites in cis regulatory modules. The review will also discuss ways in which novelty may arise, in particular through the co-option of regulatory genes and subcircuits. The development of the sea urchin larval skeleton, a novel feature that arose in echinoderms, has provided a model for study of co-option mechanisms. Finally, the types of GRNs that can permit the great diversity in the patterns of ciliary bands and their associated neurons found among these taxa are discussed. The availability of genomic resources is rapidly expanding for echinoderms, including genome sequences not only for multiple species of sea urchins but also a species of sea star, sea cucumber, and brittle star. This will enable echinoderms to become a particularly powerful system for understanding how developmental GRNs evolve.

  2. A Semiquantitative Framework for Gene Regulatory Networks: Increasing the Time and Quantitative Resolution of Boolean Networks

    Science.gov (United States)

    Kerkhofs, Johan; Geris, Liesbet

    2015-01-01

    Boolean models have been instrumental in predicting general features of gene networks and more recently also as explorative tools in specific biological applications. In this study we introduce a basic quantitative and a limited time resolution to a discrete (Boolean) framework. Quantitative resolution is improved through the employ of normalized variables in unison with an additive approach. Increased time resolution stems from the introduction of two distinct priority classes. Through the implementation of a previously published chondrocyte network and T helper cell network, we show that this addition of quantitative and time resolution broadens the scope of biological behaviour that can be captured by the models. Specifically, the quantitative resolution readily allows models to discern qualitative differences in dosage response to growth factors. The limited time resolution, in turn, can influence the reachability of attractors, delineating the likely long term system behaviour. Importantly, the information required for implementation of these features, such as the nature of an interaction, is typically obtainable from the literature. Nonetheless, a trade-off is always present between additional computational cost of this approach and the likelihood of extending the model’s scope. Indeed, in some cases the inclusion of these features does not yield additional insight. This framework, incorporating increased and readily available time and semi-quantitative resolution, can help in substantiating the litmus test of dynamics for gene networks, firstly by excluding unlikely dynamics and secondly by refining falsifiable predictions on qualitative behaviour. PMID:26067297

  3. A bayesian framework that integrates heterogeneous data for inferring gene regulatory networks.

    Science.gov (United States)

    Santra, Tapesh

    2014-01-01

    Reconstruction of gene regulatory networks (GRNs) from experimental data is a fundamental challenge in systems biology. A number of computational approaches have been developed to infer GRNs from mRNA expression profiles. However, expression profiles alone are proving to be insufficient for inferring GRN topologies with reasonable accuracy. Recently, it has been shown that integration of external data sources (such as gene and protein sequence information, gene ontology data, protein-protein interactions) with mRNA expression profiles may increase the reliability of the inference process. Here, I propose a new approach that incorporates transcription factor binding sites (TFBS) and physical protein interactions (PPI) among transcription factors (TFs) in a Bayesian variable selection (BVS) algorithm which can infer GRNs from mRNA expression profiles subjected to genetic perturbations. Using real experimental data, I show that the integration of TFBS and PPI data with mRNA expression profiles leads to significantly more accurate networks than those inferred from expression profiles alone. Additionally, the performance of the proposed algorithm is compared with a series of least absolute shrinkage and selection operator (LASSO) regression-based network inference methods that can also incorporate prior knowledge in the inference framework. The results of this comparison suggest that BVS can outperform LASSO regression-based method in some circumstances.

  4. A Bayesian Framework that integrates heterogeneous data for inferring gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Tapesh eSantra

    2014-05-01

    Full Text Available Reconstruction of gene regulatory networks (GRNs from experimental data is a fundamental challenge in systems biology. A number of computational approaches have been developed to infer GRNs from mRNA expression profiles. However, expression profiles alone are proving to be insufficient for inferring GRN topologies with reasonable accuracy. Recently, it has been shown that integration of external data sources (such as gene and protein sequence information, gene ontology data, protein protein interactions with mRNA expression profiles may increase the reliability of the inference process. Here, I propose a new approach that incorporates transcription factor binding sites (TFBS and physical protein interactions (PPI among transcription factors (TFs in a Bayesian Variable Selection (BVS algorithm which can infer GRNs from mRNA expression profiles subjected to genetic perturbations. Using real experimental data, I show that the integration of TFBS and PPI data with mRNA expression profiles leads to significantly more accurate networks than those inferred from expression profiles alone. Additionally, the performance of the proposed algorithm is compared with a series of LASSO regression based network inference methods that can also incorporate prior knowledge in the inference framework. The results of this comparison suggest that BVS can outperform LASSO regression based method in some circumstances.

  5. Gene regulatory network inference and validation using relative change ratio analysis and time-delayed dynamic Bayesian network.

    Science.gov (United States)

    Li, Peng; Gong, Ping; Li, Haoni; Perkins, Edward J; Wang, Nan; Zhang, Chaoyang

    2014-12-01

    The Dialogue for Reverse Engineering Assessments and Methods (DREAM) project was initiated in 2006 as a community-wide effort for the development of network inference challenges for rigorous assessment of reverse engineering methods for biological networks. We participated in the in silico network inference challenge of DREAM3 in 2008. Here we report the details of our approach and its performance on the synthetic challenge datasets. In our methodology, we first developed a model called relative change ratio (RCR), which took advantage of the heterozygous knockdown data and null-mutant knockout data provided by the challenge, in order to identify the potential regulators for the genes. With this information, a time-delayed dynamic Bayesian network (TDBN) approach was then used to infer gene regulatory networks from time series trajectory datasets. Our approach considerably reduced the searching space of TDBN; hence, it gained a much higher efficiency and accuracy. The networks predicted using our approach were evaluated comparatively along with 29 other submissions by two metrics (area under the ROC curve and area under the precision-recall curve). The overall performance of our approach ranked the second among all participating teams.

  6. Enriching regulatory networks by bootstrap learning using optimised GO-based gene similarity and gene links mined from PubMed abstracts

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, Ronald C.; Sanfilippo, Antonio P.; McDermott, Jason E.; Baddeley, Robert L.; Riensche, Roderick M.; Jensen, Russell S.; Verhagen, Marc; Pustejovsky, James

    2011-02-18

    Transcriptional regulatory networks are being determined using “reverse engineering” methods that infer connections based on correlations in gene state. Corroboration of such networks through independent means such as evidence from the biomedical literature is desirable. Here, we explore a novel approach, a bootstrapping version of our previous Cross-Ontological Analytic method (XOA) that can be used for semi-automated annotation and verification of inferred regulatory connections, as well as for discovery of additional functional relationships between the genes. First, we use our annotation and network expansion method on a biological network learned entirely from the literature. We show how new relevant links between genes can be iteratively derived using a gene similarity measure based on the Gene Ontology that is optimized on the input network at each iteration. Second, we apply our method to annotation, verification, and expansion of a set of regulatory connections found by the Context Likelihood of Relatedness algorithm.

  7. Inference of gene regulatory networks from time series by Tsallis entropy

    Directory of Open Access Journals (Sweden)

    de Oliveira Evaldo A

    2011-05-01

    Full Text Available Abstract Background The inference of gene regulatory networks (GRNs from large-scale expression profiles is one of the most challenging problems of Systems Biology nowadays. Many techniques and models have been proposed for this task. However, it is not generally possible to recover the original topology with great accuracy, mainly due to the short time series data in face of the high complexity of the networks and the intrinsic noise of the expression measurements. In order to improve the accuracy of GRNs inference methods based on entropy (mutual information, a new criterion function is here proposed. Results In this paper we introduce the use of generalized entropy proposed by Tsallis, for the inference of GRNs from time series expression profiles. The inference process is based on a feature selection approach and the conditional entropy is applied as criterion function. In order to assess the proposed methodology, the algorithm is applied to recover the network topology from temporal expressions generated by an artificial gene network (AGN model as well as from the DREAM challenge. The adopted AGN is based on theoretical models of complex networks and its gene transference function is obtained from random drawing on the set of possible Boolean functions, thus creating its dynamics. On the other hand, DREAM time series data presents variation of network size and its topologies are based on real networks. The dynamics are generated by continuous differential equations with noise and perturbation. By adopting both data sources, it is possible to estimate the average quality of the inference with respect to different network topologies, transfer functions and network sizes. Conclusions A remarkable improvement of accuracy was observed in the experimental results by reducing the number of false connections in the inferred topology by the non-Shannon entropy. The obtained best free parameter of the Tsallis entropy was on average in the range 2.5

  8. Mean field analysis of a spatial stochastic model of a gene regulatory network.

    Science.gov (United States)

    Sturrock, M; Murray, P J; Matzavinos, A; Chaplain, M A J

    2015-10-01

    A gene regulatory network may be defined as a collection of DNA segments which interact with each other indirectly through their RNA and protein products. Such a network is said to contain a negative feedback loop if its products inhibit gene transcription, and a positive feedback loop if a gene product promotes its own production. Negative feedback loops can create oscillations in mRNA and protein levels while positive feedback loops are primarily responsible for signal amplification. It is often the case in real biological systems that both negative and positive feedback loops operate in parameter regimes that result in low copy numbers of gene products. In this paper we investigate the spatio-temporal dynamics of a single feedback loop in a eukaryotic cell. We first develop a simplified spatial stochastic model of a canonical feedback system (either positive or negative). Using a Gillespie's algorithm, we compute sample trajectories and analyse their corresponding statistics. We then derive a system of equations that describe the spatio-temporal evolution of the stochastic means. Subsequently, we examine the spatially homogeneous case and compare the results of numerical simulations with the spatially explicit case. Finally, using a combination of steady-state analysis and data clustering techniques, we explore model behaviour across a subregion of the parameter space that is difficult to access experimentally and compare the parameter landscape of our spatio-temporal and spatially-homogeneous models.

  9. A systems biology approach to construct the gene regulatory network of systemic inflammation via microarray and databases mining

    Directory of Open Access Journals (Sweden)

    Lan Chung-Yu

    2008-09-01

    Full Text Available Abstract Background Inflammation is a hallmark of many human diseases. Elucidating the mechanisms underlying systemic inflammation has long been an important topic in basic and clinical research. When primary pathogenetic events remains unclear due to its immense complexity, construction and analysis of the gene regulatory network of inflammation at times becomes the best way to understand the detrimental effects of disease. However, it is difficult to recognize and evaluate relevant biological processes from the huge quantities of experimental data. It is hence appealing to find an algorithm which can generate a gene regulatory network of systemic inflammation from high-throughput genomic studies of human diseases. Such network will be essential for us to extract valuable information from the complex and chaotic network under diseased conditions. Results In this study, we construct a gene regulatory network of inflammation using data extracted from the Ensembl and JASPAR databases. We also integrate and apply a number of systematic algorithms like cross correlation threshold, maximum likelihood estimation method and Akaike Information Criterion (AIC on time-lapsed microarray data to refine the genome-wide transcriptional regulatory network in response to bacterial endotoxins in the context of dynamic activated genes, which are regulated by transcription factors (TFs such as NF-κB. This systematic approach is used to investigate the stochastic interaction represented by the dynamic leukocyte gene expression profiles of human subject exposed to an inflammatory stimulus (bacterial endotoxin. Based on the kinetic parameters of the dynamic gene regulatory network, we identify important properties (such as susceptibility to infection of the immune system, which may be useful for translational research. Finally, robustness of the inflammatory gene network is also inferred by analyzing the hubs and "weak ties" structures of the gene network

  10. Robust dynamic balance of AP-1 transcription factors in a neuronal gene regulatory network

    Directory of Open Access Journals (Sweden)

    Schwaber James S

    2010-12-01

    Full Text Available Abstract Background The octapeptide Angiotensin II is a key hormone that acts via its receptor AT1R in the brainstem to modulate the blood pressure control circuits and thus plays a central role in the cardiac and respiratory homeostasis. This modulation occurs via activation of a complex network of signaling proteins and transcription factors, leading to changes in levels of key genes and proteins. AT1R initiated activity in the nucleus tractus solitarius (NTS, which regulates blood pressure, has been the subject of extensive molecular analysis. But the adaptive network interactions in the NTS response to AT1R, plausibly related to the development of hypertension, are not understood. Results We developed and analyzed a mathematical model of AT1R-activated signaling kinases and a downstream gene regulatory network, with structural basis in our transcriptomic data analysis and literature. To our knowledge, our report presents the first computational model of this key regulatory network. Our simulations and analysis reveal a dynamic balance among distinct dimers of the AP-1 family of transcription factors. We investigated the robustness of this behavior to simultaneous perturbations in the network parameters using a novel multivariate approach that integrates global sensitivity analysis with decision-tree methods. Our analysis implicates a subset of Fos and Jun dependent mechanisms, with dynamic sensitivities shifting from Fos-regulating kinase (FRK-mediated processes to those downstream of c-Jun N-terminal kinase (JNK. Decision-tree analysis indicated that while there may be a large combinatorial functional space feasible for neuronal states and parameters, the network behavior is constrained to a small set of AP-1 response profiles. Many of the paths through the combinatorial parameter space lead to a dynamic balance of AP-1 dimer forms, yielding a robust AP-1 response counteracting the biological variability. Conclusions Based on the simulation

  11. Recursive random forest algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways

    Science.gov (United States)

    Zhang, Kui; Busov, Victor; Wei, Hairong

    2017-01-01

    Background Present knowledge indicates a multilayered hierarchical gene regulatory network (ML-hGRN) often operates above a biological pathway. Although the ML-hGRN is very important for understanding how a pathway is regulated, there is almost no computational algorithm for directly constructing ML-hGRNs. Results A backward elimination random forest (BWERF) algorithm was developed for constructing the ML-hGRN operating above a biological pathway. For each pathway gene, the BWERF used a random forest model to calculate the importance values of all transcription factors (TFs) to this pathway gene recursively with a portion (e.g. 1/10) of least important TFs being excluded in each round of modeling, during which, the importance values of all TFs to the pathway gene were updated and ranked until only one TF was remained in the list. The above procedure, termed BWERF. After that, the importance values of a TF to all pathway genes were aggregated and fitted to a Gaussian mixture model to determine the TF retention for the regulatory layer immediately above the pathway layer. The acquired TFs at the secondary layer were then set to be the new bottom layer to infer the next upper layer, and this process was repeated until a ML-hGRN with the expected layers was obtained. Conclusions BWERF improved the accuracy for constructing ML-hGRNs because it used backward elimination to exclude the noise genes, and aggregated the individual importance values for determining the TFs retention. We validated the BWERF by using it for constructing ML-hGRNs operating above mouse pluripotency maintenance pathway and Arabidopsis lignocellulosic pathway. Compared to GENIE3, BWERF showed an improvement in recognizing authentic TFs regulating a pathway. Compared to the bottom-up Gaussian graphical model algorithm we developed for constructing ML-hGRNs, the BWERF can construct ML-hGRNs with significantly reduced edges that enable biologists to choose the implicit edges for experimental

  12. A relative variation-based method to unraveling gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Yali Wang

    Full Text Available Gene regulatory network (GRN reconstruction is essential in understanding the functioning and pathology of a biological system. Extensive models and algorithms have been developed to unravel a GRN. The DREAM project aims to clarify both advantages and disadvantages of these methods from an application viewpoint. An interesting yet surprising observation is that compared with complicated methods like those based on nonlinear differential equations, etc., methods based on a simple statistics, such as the so-called Z-score, usually perform better. A fundamental problem with the Z-score, however, is that direct and indirect regulations can not be easily distinguished. To overcome this drawback, a relative expression level variation (RELV based GRN inference algorithm is suggested in this paper, which consists of three major steps. Firstly, on the basis of wild type and single gene knockout/knockdown experimental data, the magnitude of RELV of a gene is estimated. Secondly, probability for the existence of a direct regulation from a perturbed gene to a measured gene is estimated, which is further utilized to estimate whether a gene can be regulated by other genes. Finally, the normalized RELVs are modified to make genes with an estimated zero in-degree have smaller RELVs in magnitude than the other genes, which is used afterwards in queuing possibilities of the existence of direct regulations among genes and therefore leads to an estimate on the GRN topology. This method can in principle avoid the so-called cascade errors under certain situations. Computational results with the Size 100 sub-challenges of DREAM3 and DREAM4 show that, compared with the Z-score based method, prediction performances can be substantially improved, especially the AUPR specification. Moreover, it can even outperform the best team of both DREAM3 and DREAM4. Furthermore, the high precision of the obtained most reliable predictions shows that the suggested algorithm may be

  13. Signaling and Gene Regulatory Networks Governing Definitive Endoderm Derivation From Pluripotent Stem Cells.

    Science.gov (United States)

    Mohammadnia, Abdulshakour; Yaqubi, Moein; Pourasgari, Farzaneh; Neely, Eric; Fallahi, Hossein; Massumi, Mohammad

    2016-09-01

    The generation of definitive endoderm (DE) from pluripotent stem cells (PSCs) is a fundamental stage in the formation of highly organized visceral organs, such as the liver and pancreas. Currently, there is a need for a comprehensive study that illustrates the involvement of different signaling pathways and their interactions in the derivation of DE cells from PSCs. This study aimed to identify signaling pathways that have the greatest influence on DE formation using analyses of transcriptional profiles, protein-protein interactions, protein-DNA interactions, and protein localization data. Using this approach, signaling networks involved in DE formation were constructed using systems biology and data mining tools, and the validity of the predicted networks was confirmed experimentally by measuring the mRNA levels of hub genes in several PSCs-derived DE cell lines. Based on our analyses, seven signaling pathways, including the BMP, ERK1-ERK2, FGF, TGF-beta, MAPK, Wnt, and PIP signaling pathways and their interactions, were found to play a role in the derivation of DE cells from PSCs. Lastly, the core gene regulatory network governing this differentiation process was constructed. The results of this study could improve our understanding surrounding the efficient generation of DE cells for the regeneration of visceral organs. J. Cell. Physiol. 231: 1994-2006, 2016. © 2016 Wiley Periodicals, Inc.

  14. Transcriptional regulatory network refinement and quantification through kinetic modeling, gene expression microarray data and information theory

    Science.gov (United States)

    Sayyed-Ahmad, Abdallah; Tuncay, Kagan; Ortoleva, Peter J

    2007-01-01

    Background Gene expression microarray and other multiplex data hold promise for addressing the challenges of cellular complexity, refined diagnoses and the discovery of well-targeted treatments. A new approach to the construction and quantification of transcriptional regulatory networks (TRNs) is presented that integrates gene expression microarray data and cell modeling through information theory. Given a partial TRN and time series data, a probability density is constructed that is a functional of the time course of transcription factor (TF) thermodynamic activities at the site of gene control, and is a function of mRNA degradation and transcription rate coefficients, and equilibrium constants for TF/gene binding. Results Our approach yields more physicochemical information that compliments the results of network structure delineation methods, and thereby can serve as an element of a comprehensive TRN discovery/quantification system. The most probable TF time courses and values of the aforementioned parameters are obtained by maximizing the probability obtained through entropy maximization. Observed time delays between mRNA expression and activity are accounted for implicitly since the time course of the activity of a TF is coupled by probability functional maximization, and is not assumed to be proportional to expression level of the mRNA type that translates into the TF. This allows one to investigate post-translational and TF activation mechanisms of gene regulation. Accuracy and robustness of the method are evaluated. A kinetic formulation is used to facilitate the analysis of phenomena with a strongly dynamical character while a physically-motivated regularization of the TF time course is found to overcome difficulties due to omnipresent noise and data sparsity that plague other methods of gene expression data analysis. An application to Escherichia coli is presented. Conclusion Multiplex time series data can be used for the construction of the network of

  15. Transcriptional regulatory network refinement and quantification through kinetic modeling, gene expression microarray data and information theory

    Directory of Open Access Journals (Sweden)

    Tuncay Kagan

    2007-01-01

    Full Text Available Abstract Background Gene expression microarray and other multiplex data hold promise for addressing the challenges of cellular complexity, refined diagnoses and the discovery of well-targeted treatments. A new approach to the construction and quantification of transcriptional regulatory networks (TRNs is presented that integrates gene expression microarray data and cell modeling through information theory. Given a partial TRN and time series data, a probability density is constructed that is a functional of the time course of transcription factor (TF thermodynamic activities at the site of gene control, and is a function of mRNA degradation and transcription rate coefficients, and equilibrium constants for TF/gene binding. Results Our approach yields more physicochemical information that compliments the results of network structure delineation methods, and thereby can serve as an element of a comprehensive TRN discovery/quantification system. The most probable TF time courses and values of the aforementioned parameters are obtained by maximizing the probability obtained through entropy maximization. Observed time delays between mRNA expression and activity are accounted for implicitly since the time course of the activity of a TF is coupled by probability functional maximization, and is not assumed to be proportional to expression level of the mRNA type that translates into the TF. This allows one to investigate post-translational and TF activation mechanisms of gene regulation. Accuracy and robustness of the method are evaluated. A kinetic formulation is used to facilitate the analysis of phenomena with a strongly dynamical character while a physically-motivated regularization of the TF time course is found to overcome difficulties due to omnipresent noise and data sparsity that plague other methods of gene expression data analysis. An application to Escherichia coli is presented. Conclusion Multiplex time series data can be used for the

  16. Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems

    Directory of Open Access Journals (Sweden)

    Faridah Hani Mohamed Salleh

    2017-01-01

    Full Text Available Gene regulatory network (GRN reconstruction is the process of identifying regulatory gene interactions from experimental data through computational analysis. One of the main reasons for the reduced performance of previous GRN methods had been inaccurate prediction of cascade motifs. Cascade error is defined as the wrong prediction of cascade motifs, where an indirect interaction is misinterpreted as a direct interaction. Despite the active research on various GRN prediction methods, the discussion on specific methods to solve problems related to cascade errors is still lacking. In fact, the experiments conducted by the past studies were not specifically geared towards proving the ability of GRN prediction methods in avoiding the occurrences of cascade errors. Hence, this research aims to propose Multiple Linear Regression (MLR to infer GRN from gene expression data and to avoid wrongly inferring of an indirect interaction (A → B → C as a direct interaction (A → C. Since the number of observations of the real experiment datasets was far less than the number of predictors, some predictors were eliminated by extracting the random subnetworks from global interaction networks via an established extraction method. In addition, the experiment was extended to assess the effectiveness of MLR in dealing with cascade error by using a novel experimental procedure that had been proposed in this work. The experiment revealed that the number of cascade errors had been very minimal. Apart from that, the Belsley collinearity test proved that multicollinearity did affect the datasets used in this experiment greatly. All the tested subnetworks obtained satisfactory results, with AUROC values above 0.5.

  17. Reconstructing gene regulatory networks from knock-out data using Gaussian Noise Model and Pearson Correlation Coefficient.

    Science.gov (United States)

    Mohamed Salleh, Faridah Hani; Arif, Shereena Mohd; Zainudin, Suhaila; Firdaus-Raih, Mohd

    2015-12-01

    A gene regulatory network (GRN) is a large and complex network consisting of interacting elements that, over time, affect each other's state. The dynamics of complex gene regulatory processes are difficult to understand using intuitive approaches alone. To overcome this problem, we propose an algorithm for inferring the regulatory interactions from knock-out data using a Gaussian model combines with Pearson Correlation Coefficient (PCC). There are several problems relating to GRN construction that have been outlined in this paper. We demonstrated the ability of our proposed method to (1) predict the presence of regulatory interactions between genes, (2) their directionality and (3) their states (activation or suppression). The algorithm was applied to network sizes of 10 and 50 genes from DREAM3 datasets and network sizes of 10 from DREAM4 datasets. The predicted networks were evaluated based on AUROC and AUPR. We discovered that high false positive values were generated by our GRN prediction methods because the indirect regulations have been wrongly predicted as true relationships. We achieved satisfactory results as the majority of sub-networks achieved AUROC values above 0.5.

  18. The core regulatory network in human cells.

    Science.gov (United States)

    Kim, Man-Sun; Kim, Dongsan; Kang, Nam Sook; Kim, Jeong-Rae

    2017-03-04

    In order to discover the common characteristics of various cell types in the human body, many researches have been conducted to find the set of genes commonly expressed in various cell types and tissues. However, the functional characteristics of a cell is determined by the complex regulatory relationships among the genes rather than by expressed genes themselves. Therefore, it is more important to identify and analyze a core regulatory network where all regulatory relationship between genes are active across all cell types to uncover the common features of various cell types. Here, based on hundreds of tissue-specific gene regulatory networks constructed by recent genome-wide experimental data, we constructed the core regulatory network. Interestingly, we found that the core regulatory network is organized by simple cascade and has few complex regulations such as feedback or feed-forward loops. Moreover, we discovered that the regulatory links from genes in the core regulatory network to genes in the peripheral regulatory network are much more abundant than the reverse direction links. These results suggest that the core regulatory network locates at the top of regulatory network and plays a role as a 'hub' in terms of information flow, and the information that is common to all cells can be modified to achieve the tissue-specific characteristics through various types of feedback and feed-forward loops in the peripheral regulatory networks. We also found that the genes in the core regulatory network are evolutionary conserved, essential and non-disease, non-druggable genes compared to the peripheral genes. Overall, our study provides an insight into how all human cells share a common function and generate tissue-specific functional traits by transmitting and processing information through regulatory network.

  19. A regulatory network modeled from wild-type gene expression data guides functional predictions in Caenorhabditis elegans development

    Directory of Open Access Journals (Sweden)

    Stigler Brandilyn

    2012-06-01

    Full Text Available Abstract Background Complex gene regulatory networks underlie many cellular and developmental processes. While a variety of experimental approaches can be used to discover how genes interact, few biological systems have been systematically evaluated to the extent required for an experimental definition of the underlying network. Therefore, the development of computational methods that can use limited experimental data to define and model a gene regulatory network would provide a useful tool to evaluate many important but incompletely understood biological processes. Such methods can assist in extracting all relevant information from data that are available, identify unexpected regulatory relationships and prioritize future experiments. Results To facilitate the analysis of gene regulatory networks, we have developed a computational modeling pipeline method that complements traditional evaluation of experimental data. For a proof-of-concept example, we have focused on the gene regulatory network in the nematode C. elegans that mediates the developmental choice between mesodermal (muscle and ectodermal (skin cell fates in the embryonic C lineage. We have used gene expression data to build two models: a knowledge-driven model based on gene expression changes following gene perturbation experiments, and a data-driven mathematical model derived from time-course gene expression data recovered from wild-type animals. We show that both models can identify a rich set of network gene interactions. Importantly, the mathematical model built only from wild-type data can predict interactions demonstrated by the perturbation experiments better than chance, and better than an existing knowledge-driven model built from the same data set. The mathematical model also provides new biological insight, including a dissection of zygotic from maternal functions of a key transcriptional regulator, PAL-1, and identification of non-redundant activities of the T-box genes

  20. A regulatory network modeled from wild-type gene expression data guides functional predictions in Caenorhabditis elegans development.

    Science.gov (United States)

    Stigler, Brandilyn; Chamberlin, Helen M

    2012-06-26

    Complex gene regulatory networks underlie many cellular and developmental processes. While a variety of experimental approaches can be used to discover how genes interact, few biological systems have been systematically evaluated to the extent required for an experimental definition of the underlying network. Therefore, the development of computational methods that can use limited experimental data to define and model a gene regulatory network would provide a useful tool to evaluate many important but incompletely understood biological processes. Such methods can assist in extracting all relevant information from data that are available, identify unexpected regulatory relationships and prioritize future experiments. To facilitate the analysis of gene regulatory networks, we have developed a computational modeling pipeline method that complements traditional evaluation of experimental data. For a proof-of-concept example, we have focused on the gene regulatory network in the nematode C. elegans that mediates the developmental choice between mesodermal (muscle) and ectodermal (skin) cell fates in the embryonic C lineage. We have used gene expression data to build two models: a knowledge-driven model based on gene expression changes following gene perturbation experiments, and a data-driven mathematical model derived from time-course gene expression data recovered from wild-type animals. We show that both models can identify a rich set of network gene interactions. Importantly, the mathematical model built only from wild-type data can predict interactions demonstrated by the perturbation experiments better than chance, and better than an existing knowledge-driven model built from the same data set. The mathematical model also provides new biological insight, including a dissection of zygotic from maternal functions of a key transcriptional regulator, PAL-1, and identification of non-redundant activities of the T-box genes tbx-8 and tbx-9. This work provides a strong

  1. A Dynamic Gene Regulatory Network Model That Recovers the Cyclic Behavior of Arabidopsis thaliana Cell Cycle

    Science.gov (United States)

    Ortiz-Gutiérrez, Elizabeth; García-Cruz, Karla; Azpeitia, Eugenio; Castillo, Aaron; Sánchez, María de la Paz; Álvarez-Buylla, Elena R.

    2015-01-01

    Cell cycle control is fundamental in eukaryotic development. Several modeling efforts have been used to integrate the complex network of interacting molecular components involved in cell cycle dynamics. In this paper, we aimed at recovering the regulatory logic upstream of previously known components of cell cycle control, with the aim of understanding the mechanisms underlying the emergence of the cyclic behavior of such components. We focus on Arabidopsis thaliana, but given that many components of cell cycle regulation are conserved among eukaryotes, when experimental data for this system was not available, we considered experimental results from yeast and animal systems. We are proposing a Boolean gene regulatory network (GRN) that converges into only one robust limit cycle attractor that closely resembles the cyclic behavior of the key cell-cycle molecular components and other regulators considered here. We validate the model by comparing our in silico configurations with data from loss- and gain-of-function mutants, where the endocyclic behavior also was recovered. Additionally, we approximate a continuous model and recovered the temporal periodic expression profiles of the cell-cycle molecular components involved, thus suggesting that the single limit cycle attractor recovered with the Boolean model is not an artifact of its discrete and synchronous nature, but rather an emergent consequence of the inherent characteristics of the regulatory logic proposed here. This dynamical model, hence provides a novel theoretical framework to address cell cycle regulation in plants, and it can also be used to propose novel predictions regarding cell cycle regulation in other eukaryotes. PMID:26340681

  2. Gene Regulatory Network Analysis Reveals Differences in Site-specific Cell Fate Determination in Mammalian Brain

    Directory of Open Access Journals (Sweden)

    Gokhan eErtaylan

    2014-12-01

    Full Text Available Neurogenesis - the generation of new neurons - is an ongoing process that persists in the adult mammalian brain of several species, including humans. In this work we analyze two discrete brain regions: the subventricular zone (SVZ lining the walls of the lateral ventricles; and the subgranular zone (SGZ of the dentate gyrus of the hippocampus in mice and shed light on the SVZ and SGZ specific neurogenesis. We propose a computational model that relies on the construction and analysis of region specific gene regulatory networks from the publicly available data on these two regions. Using this model a number of putative factors involved in neuronal stem cell (NSC identity and maintenance were identified. We also demonstrate potential gender and niche-derived differences based on cell surface and nuclear receptors via Ar, Hif1a and Nr3c1.We have also conducted cell fate determinant analysis for SVZ NSC populations to Olfactory Bulb interneurons and SGZ NSC populations to the granule cells of the Granular Cell Layer. We report thirty-one candidate cell fate determinant gene pairs, ready to be validated. We focus on Ar - Pax6 in SVZ and Sox2 - Ncor1 in SGZ. Both pairs are expressed and localized in the suggested anatomical structures as shown by in situ hybridization and found to physically interact.Finally, we conclude that there are fundamental differences between SGZ and SVZ neurogenesis. We argue that these regulatory mechanisms are linked to the observed differential neurogenic potential of these regions. The presence of nuclear and cell surface receptors in the region specific regulatory circuits indicate the significance of niche derived extracellular factors, hormones and region specific factors such as the oxygen sensitivity, dictating SGZ and SVZ specific neurogenesis.

  3. A review of integration strategies to support gene regulatory network construction.

    Science.gov (United States)

    Chen, Hailin; VanBuren, Vincent

    2012-01-01

    Gene regulatory network (GRN) construction is a central task of systems biology. Integration of different data sources to infer and construct GRNs is an important consideration for the success of this effort. In this paper, we will discuss distinctive strategies of data integration for GRN construction. Basically, the process of integration of different data sources is divided into two phases: the first phase is collection of the required data and the second phase is data processing with advanced algorithms to infer the GRNs. In this paper these two phases are called "structural integration" and "analytic integration," respectively. Compared with the nonintegration strategies, the integration strategies perform quite well and have better agreement with the experimental evidence.

  4. A Review of Integration Strategies to Support Gene Regulatory Network Construction

    Directory of Open Access Journals (Sweden)

    Hailin Chen

    2012-01-01

    Full Text Available Gene regulatory network (GRN construction is a central task of systems biology. Integration of different data sources to infer and construct GRNs is an important consideration for the success of this effort. In this paper, we will discuss distinctive strategies of data integration for GRN construction. Basically, the process of integration of different data sources is divided into two phases: the first phase is collection of the required data and the second phase is data processing with advanced algorithms to infer the GRNs. In this paper these two phases are called “structural integration” and “analytic integration,” respectively. Compared with the nonintegration strategies, the integration strategies perform quite well and have better agreement with the experimental evidence.

  5. The vertebrate Hox gene regulatory network for hindbrain segmentation: Evolution and diversification: Coupling of a Hox gene regulatory network to hindbrain segmentation is an ancient trait originating at the base of vertebrates.

    Science.gov (United States)

    Parker, Hugo J; Bronner, Marianne E; Krumlauf, Robb

    2016-06-01

    Hindbrain development is orchestrated by a vertebrate gene regulatory network that generates segmental patterning along the anterior-posterior axis via Hox genes. Here, we review analyses of vertebrate and invertebrate chordate models that inform upon the evolutionary origin and diversification of this network. Evidence from the sea lamprey reveals that the hindbrain regulatory network generates rhombomeric compartments with segmental Hox expression and an underlying Hox code. We infer that this basal feature was present in ancestral vertebrates and, as an evolutionarily constrained developmental state, is fundamentally important for patterning of the vertebrate hindbrain across diverse lineages. Despite the common ground plan, vertebrates exhibit neuroanatomical diversity in lineage-specific patterns, with different vertebrates revealing variations of Hox expression in the hindbrain that could underlie this diversification. Invertebrate chordates lack hindbrain segmentation but exhibit some conserved aspects of this network, with retinoic acid signaling playing a role in establishing nested domains of Hox expression. © 2016 WILEY Periodicals, Inc.

  6. Semi-supervised prediction of gene regulatory networks using machine learning algorithms

    Indian Academy of Sciences (India)

    Nihir Patel; T L Wang

    2015-10-01

    Use of computational methods to predict gene regulatory networks (GRNs) from gene expression data is a challenging task. Many studies have been conducted using unsupervised methods to fulfill the task; however, such methods usually yield low prediction accuracies due to the lack of training data. In this article, we propose semi-supervised methods for GRN prediction by utilizing two machine learning algorithms, namely, support vector machines (SVM) and random forests (RF). The semi-supervised methods make use of unlabelled data for training. We investigated inductive and transductive learning approaches, both of which adopt an iterative procedure to obtain reliable negative training data from the unlabelled data. We then applied our semi-supervised methods to gene expression data of Escherichia coli and Saccharomyces cerevisiae, and evaluated the performance of our methods using the expression data. Our analysis indicated that the transductive learning approach outperformed the inductive learning approach for both organisms. However, there was no conclusive difference identified in the performance of SVM and RF. Experimental results also showed that the proposed semi-supervised methods performed better than existing supervised methods for both organisms.

  7. Modelling non-stationary gene regulatory processes with a non-homogeneous Bayesian network and the allocation sampler

    NARCIS (Netherlands)

    Grzegorczyk, Marco; Husmeier, Dirk; Edwards, Kieron D.; Ghazal, Peter; Millar, Andrew J.

    2008-01-01

    Method: The objective of the present article is to propose and evaluate a probabilistic approach based on Bayesian networks for modelling non-homogeneous and non-linear gene regulatory processes. The method is based on a mixture model, using latent variables to assign individual measurements to diff

  8. Modelling non-stationary gene regulatory processes with a non-homogeneous Bayesian network and the allocation sampler

    NARCIS (Netherlands)

    Grzegorczyk, Marco; Husmeier, Dirk; Edwards, Kieron D.; Ghazal, Peter; Millar, Andrew J.

    2008-01-01

    Method: The objective of the present article is to propose and evaluate a probabilistic approach based on Bayesian networks for modelling non-homogeneous and non-linear gene regulatory processes. The method is based on a mixture model, using latent variables to assign individual measurements to

  9. Sub-circuits of a gene regulatory network control a developmental epithelial-mesenchymal transition.

    Science.gov (United States)

    Saunders, Lindsay R; McClay, David R

    2014-04-01

    Epithelial-mesenchymal transition (EMT) is a fundamental cell state change that transforms epithelial to mesenchymal cells during embryonic development, adult tissue repair and cancer metastasis. EMT includes a complex series of intermediate cell state changes including remodeling of the basement membrane, apical constriction, epithelial de-adhesion, directed motility, loss of apical-basal polarity, and acquisition of mesenchymal adhesion and polarity. Transcriptional regulatory state changes must ultimately coordinate the timing and execution of these cell biological processes. A well-characterized gene regulatory network (GRN) in the sea urchin embryo was used to identify the transcription factors that control five distinct cell changes during EMT. Single transcription factors were perturbed and the consequences followed with in vivo time-lapse imaging or immunostaining assays. The data show that five different sub-circuits of the GRN control five distinct cell biological activities, each part of the complex EMT process. Thirteen transcription factors (TFs) expressed specifically in pre-EMT cells were required for EMT. Three TFs highest in the GRN specified and activated EMT (alx1, ets1, tbr) and the 10 TFs downstream of those (tel, erg, hex, tgif, snail, twist, foxn2/3, dri, foxb, foxo) were also required for EMT. No single TF functioned in all five sub-circuits, indicating that there is no EMT master regulator. Instead, the resulting sub-circuit topologies suggest EMT requires multiple simultaneous regulatory mechanisms: forward cascades, parallel inputs and positive-feedback lock downs. The interconnected and overlapping nature of the sub-circuits provides one explanation for the seamless orchestration by the embryo of cell state changes leading to successful EMT.

  10. Urothelial cancer gene regulatory networks inferred from large-scale RNAseq, Bead and Oligo gene expression data.

    Science.gov (United States)

    de Matos Simoes, Ricardo; Dalleau, Sabine; Williamson, Kate E; Emmert-Streib, Frank

    2015-05-14

    Urothelial pathogenesis is a complex process driven by an underlying network of interconnected genes. The identification of novel genomic target regions and gene targets that drive urothelial carcinogenesis is crucial in order to improve our current limited understanding of urothelial cancer (UC) on the molecular level. The inference of genome-wide gene regulatory networks (GRN) from large-scale gene expression data provides a promising approach for a detailed investigation of the underlying network structure associated to urothelial carcinogenesis. In our study we inferred and compared three GRNs by the application of the BC3Net inference algorithm to large-scale transitional cell carcinoma gene expression data sets from Illumina RNAseq (179 samples), Illumina Bead arrays (165 samples) and Affymetrix Oligo microarrays (188 samples). We investigated the structural and functional properties of GRNs for the identification of molecular targets associated to urothelial cancer. We found that the urothelial cancer (UC) GRNs show a significant enrichment of subnetworks that are associated with known cancer hallmarks including cell cycle, immune response, signaling, differentiation and translation. Interestingly, the most prominent subnetworks of co-located genes were found on chromosome regions 5q31.3 (RNAseq), 8q24.3 (Oligo) and 1q23.3 (Bead), which all represent known genomic regions frequently deregulated or aberated in urothelial cancer and other cancer types. Furthermore, the identified hub genes of the individual GRNs, e.g., HID1/DMC1 (tumor development), RNF17/TDRD4 (cancer antigen) and CYP4A11 (angiogenesis/ metastasis) are known cancer associated markers. The GRNs were highly dataset specific on the interaction level between individual genes, but showed large similarities on the biological function level represented by subnetworks. Remarkably, the RNAseq UC GRN showed twice the proportion of significant functional subnetworks. Based on our analysis of inferential

  11. A new asynchronous parallel algorithm for inferring large-scale gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Xiangyun Xiao

    Full Text Available The reconstruction of gene regulatory networks (GRNs from high-throughput experimental data has been considered one of the most important issues in systems biology research. With the development of high-throughput technology and the complexity of biological problems, we need to reconstruct GRNs that contain thousands of genes. However, when many existing algorithms are used to handle these large-scale problems, they will encounter two important issues: low accuracy and high computational cost. To overcome these difficulties, the main goal of this study is to design an effective parallel algorithm to infer large-scale GRNs based on high-performance parallel computing environments. In this study, we proposed a novel asynchronous parallel framework to improve the accuracy and lower the time complexity of large-scale GRN inference by combining splitting technology and ordinary differential equation (ODE-based optimization. The presented algorithm uses the sparsity and modularity of GRNs to split whole large-scale GRNs into many small-scale modular subnetworks. Through the ODE-based optimization of all subnetworks in parallel and their asynchronous communications, we can easily obtain the parameters of the whole network. To test the performance of the proposed approach, we used well-known benchmark datasets from Dialogue for Reverse Engineering Assessments and Methods challenge (DREAM, experimentally determined GRN of Escherichia coli and one published dataset that contains more than 10 thousand genes to compare the proposed approach with several popular algorithms on the same high-performance computing environments in terms of both accuracy and time complexity. The numerical results demonstrate that our parallel algorithm exhibits obvious superiority in inferring large-scale GRNs.

  12. The MYB98 subcircuit of the synergid gene regulatory network includes genes directly and indirectly regulated by MYB98.

    Science.gov (United States)

    Punwani, Jayson A; Rabiger, David S; Lloyd, Alan; Drews, Gary N

    2008-08-01

    The female gametophyte contains two synergid cells that play a role in many steps of the angiosperm reproductive process, including pollen tube guidance. At their micropylar poles, the synergid cells have a thickened and elaborated cell wall: the filiform apparatus that is thought to play a role in the secretion of the pollen tube attractant(s). MYB98 regulates an important subcircuit of the synergid gene regulatory network (GRN) that functions to activate the expression of genes required for pollen tube guidance and filiform apparatus formation. The MYB98 subcircuit comprises at least 83 downstream genes, including 48 genes within four gene families (CRP810, CRP3700, CRP3730 and CRP3740) that encode Cys-rich proteins. We show that the 11 CRP3700 genes, which include DD11 and DD18, are regulated by a common cis-element, GTAACNT, and that a multimer of this sequence confers MYB98-dependent synergid expression. The GTAACNT element contains the MYB98-binding site identified in vitro, suggesting that the 11 CRP3700 genes are direct targets of MYB98. We also show that five of the CRP810 genes, which include DD2, lack a functional GTAACNT element, suggesting that they are not directly regulated by MYB98. In addition, we show that the five CRP810 genes are regulated by the cis-element AACGT, and that a multimer of this sequence confers synergid expression. Together, these results suggest that the MYB98 branch of the synergid GRN is multi-tiered and, therefore, contains at least one additional downstream transcription factor.

  13. Gene regulatory networks reused to build novel traits: co-option of an eye-related gene regulatory network in eye-like organs and red wing patches on insect wings is suggested by optix expression.

    Science.gov (United States)

    Monteiro, Antónia

    2012-03-01

    Co-option of the eye developmental gene regulatory network may have led to the appearance of novel functional traits on the wings of flies and butterflies. The first trait is a recently described wing organ in a species of extinct midge resembling the outer layers of the midge's own compound eye. The second trait is red pigment patches on Heliconius butterfly wings connected to the expression of an eye selector gene, optix. These examples, as well as others, are discussed regarding the type of empirical evidence and burden of proof that have been used to infer gene network co-option underlying the origin of novel traits. A conceptual framework describing increasing confidence in inference of network co-option is proposed. Novel research directions to facilitate inference of network co-option are also highlighted, especially in cases where the pre-existent and novel traits do not resemble each other.

  14. Large-scale genetic perturbations reveal regulatory networks and an abundance of gene-specific repressors

    NARCIS (Netherlands)

    Kemmeren, P.P.C.W.; Sameith, K.; van de Pasch, L.A.L.; Benschop, J.J.; Lenstra, T.L.; Margaritis, A.; O'Duibhir, E.; Apweiler, E.; van Wageningen, S.; Ko, C.W.; van Heesch, S.A.A.C.; Kashani, M.M.; Ampatziadis-Michailidis, G.; Brok, M.O.; Brabers, N.A.C.H.; Miles, A.J.; Bouwmeester, D.; van Hooff, S.R.; van Bakel, H.H.M.J.; Sluiters, E.C.; Bakker, L.V.; Snel, B.; Lijnzaad, P.; van Leenen, D.; Groot Koerkamp, M.J.A.; Holstege, F.C.P.

    2014-01-01

    To understand regulatory systems, it would be useful to uniformly determine how different components contribute to the expression of all other genes. We therefore monitored mRNA expression genomewide, for individual deletions of one-quarter of yeast genes, focusing on (putative) regulators. The resu

  15. Developmental gene regulatory networks in sea urchins and what we can learn from them [version 1; referees: 3 approved

    Directory of Open Access Journals (Sweden)

    Megan L. Martik

    2016-02-01

    Full Text Available Sea urchin embryos begin zygotic transcription shortly after the egg is fertilized.  Throughout the cleavage stages a series of transcription factors are activated and, along with signaling through a number of pathways, at least 15 different cell types are specified by the beginning of gastrulation.  Experimentally, perturbation of contributing transcription factors, signals and receptors and their molecular consequences enabled the assembly of an extensive gene regulatory network model.  That effort, pioneered and led by Eric Davidson and his laboratory, with many additional insights provided by other laboratories, provided the sea urchin community with a valuable resource.  Here we describe the approaches used to enable the assembly of an advanced gene regulatory network model describing molecular diversification during early development.  We then provide examples to show how a relatively advanced authenticated network can be used as a tool for discovery of how diverse developmental mechanisms are controlled and work.

  16. Escherichia coli transcriptional regulatory network

    Directory of Open Access Journals (Sweden)

    Agustino Martinez-Antonio

    2011-06-01

    Full Text Available Escherichia coli is the most well-know bacterial model about the function of its molecular components. In this review are presented several structural and functional aspects of their transcriptional regulatory network constituted by transcription factors and target genes. The network discussed here represent to 1531 genes and 3421 regulatory interactions. This network shows a power-law distribution with a few global regulators and most of genes poorly connected. 176 of genes in the network correspond to transcription factors, which form a sub-network of seven hierarchical layers where global regulators tend to be set in superior layers while local regulators are located in the lower ones. There is a small set of proteins know as nucleoid-associated proteins, which are in a high cellular concentrations and reshape the nucleoid structure to influence the running of global transcriptional programs, to this mode of regulation is named analog regulation. Specific signal effectors assist the activity of most of transcription factors in E. coli. These effectors switch and tune the activity of transcription factors. To this type of regulation, depending of environmental signals is named the digital-precise-regulation. The integration of regulatory programs have place in the promoter region of transcription units where it is common to observe co-regulation among global and local TFs as well as of TFs sensing exogenous and endogenous conditions. The mechanistic logic to understand the harmonious operation of regulatory programs in the network should consider the globalism of TFs, their signal perceived, coregulation, genome position, and cellular concentration. Finally, duplicated TFs and their horizontal transfer influence the evolvability of members of the network. The most duplicated and transferred TFs are located in the network periphery.

  17. Deciphering ascorbic acid regulatory pathways in ripening tomato fruit using a weighted gene correlation network analysis approach.

    Science.gov (United States)

    Gao, Chao; Ju, Zheng; Li, Shan; Zuo, Jinhua; Fu, Daqi; Tian, Huiqin; Luo, Yunbo; Zhu, Benzhong

    2013-11-01

    Genotype is generally determined by the co-expression of diverse genes and multiple regulatory pathways in plants. Gene co-expression analysis combining with physiological trait data provides very important information about the gene function and regulatory mechanism. L-Ascorbic acid (AsA), which is an essential nutrient component for human health and plant metabolism, plays key roles in diverse biological processes such as cell cycle, cell expansion, stress resistance, hormone synthesis, and signaling. Here, we applied a weighted gene correlation network analysis approach based on gene expression values and AsA content data in ripening tomato (Solanum lycopersicum L.) fruit with different AsA content levels, which leads to identification of AsA relevant modules and vital genes in AsA regulatory pathways. Twenty-four modules were compartmentalized according to gene expression profiling. Among these modules, one negatively related module containing genes involved in redox processes and one positively related module enriched with genes involved in AsA biosynthetic and recycling pathways were further analyzed. The present work herein indicates that redox pathways as well as hormone-signal pathways are closely correlated with AsA accumulation in ripening tomato fruit, and allowed us to prioritize candidate genes for follow-up studies to dissect this interplay at the biochemical and molecular level.

  18. Deciphering Ascorbic Acid Regulatory Pathways in Ripening Tomato Fruit Using a Weighted Gene Correlation Network Analysis Approach

    Institute of Scientific and Technical Information of China (English)

    Chao Gao; Zheng Ju; Shan Li; Jinhua Zuo; Daqi Fu; Huiqin Tian; Yunbo Luo; Benzhong Zhu

    2013-01-01

    Genotype is generally determined by the co-expression of diverse genes and multiple regulatory pathways in plants. Gene co-expression analysis combining with physiological trait data provides very important information about the gene function and regulatory mechanism. L-Ascorbic acid (AsA), which is an essential nutrient component for human health and plant metabolism, plays key roles in diverse biological processes such as cell cycle, cell expansion, stress resistance, hormone synthesis, and signaling. Here, we applied a weighted gene correlation network analysis approach based on gene expression values and AsA content data in ripening tomato (Solanum lycopersicum L.) fruit with different AsA content levels, which leads to identification of AsA relevant modules and vital genes in AsA regulatory pathways. Twenty-four modules were compartmentalized according to gene expression profiling. Among these modules, one negatively related module containing genes involved in redox processes and one positively related module enriched with genes involved in AsA biosynthetic and recycling pathways were further analyzed. The present work herein indicates that redox pathways as well as hormone-signal pathways are closely correlated with AsA accumulation in ripening tomato fruit, and allowed us to prioritize candidate genes for follow-up studies to dissect this interplay at the biochemical and molecular level.

  19. Gene and metabolite regulatory network analysis of early developing fruit tissues highlights new candidate genes for the control of tomato fruit composition and development.

    Science.gov (United States)

    Mounet, Fabien; Moing, Annick; Garcia, Virginie; Petit, Johann; Maucourt, Michael; Deborde, Catherine; Bernillon, Stéphane; Le Gall, Gwénaëlle; Colquhoun, Ian; Defernez, Marianne; Giraudel, Jean-Luc; Rolin, Dominique; Rothan, Christophe; Lemaire-Chamley, Martine

    2009-03-01

    Variations in early fruit development and composition may have major impacts on the taste and the overall quality of ripe tomato (Solanum lycopersicum) fruit. To get insights into the networks involved in these coordinated processes and to identify key regulatory genes, we explored the transcriptional and metabolic changes in expanding tomato fruit tissues using multivariate analysis and gene-metabolite correlation networks. To this end, we demonstrated and took advantage of the existence of clear structural and compositional differences between expanding mesocarp and locular tissue during fruit development (12-35 d postanthesis). Transcriptome and metabolome analyses were carried out with tomato microarrays and analytical methods including proton nuclear magnetic resonance and liquid chromatography-mass spectrometry, respectively. Pairwise comparisons of metabolite contents and gene expression profiles detected up to 37 direct gene-metabolite correlations involving regulatory genes (e.g. the correlations between glutamine, bZIP, and MYB transcription factors). Correlation network analyses revealed the existence of major hub genes correlated with 10 or more regulatory transcripts and embedded in a large regulatory network. This approach proved to be a valuable strategy for identifying specific subsets of genes implicated in key processes of fruit development and metabolism, which are therefore potential targets for genetic improvement of tomato fruit quality.

  20. Genome-wide identification of regulatory elements and reconstruction of gene regulatory networks of the green alga Chlamydomonas reinhardtii under carbon deprivation.

    Directory of Open Access Journals (Sweden)

    Flavia Vischi Winck

    Full Text Available The unicellular green alga Chlamydomonas reinhardtii is a long-established model organism for studies on photosynthesis and carbon metabolism-related physiology. Under conditions of air-level carbon dioxide concentration [CO2], a carbon concentrating mechanism (CCM is induced to facilitate cellular carbon uptake. CCM increases the availability of carbon dioxide at the site of cellular carbon fixation. To improve our understanding of the transcriptional control of the CCM, we employed FAIRE-seq (formaldehyde-assisted Isolation of Regulatory Elements, followed by deep sequencing to determine nucleosome-depleted chromatin regions of algal cells subjected to carbon deprivation. Our FAIRE data recapitulated the positions of known regulatory elements in the promoter of the periplasmic carbonic anhydrase (Cah1 gene, which is upregulated during CCM induction, and revealed new candidate regulatory elements at a genome-wide scale. In addition, time series expression patterns of 130 transcription factor (TF and transcription regulator (TR genes were obtained for cells cultured under photoautotrophic condition and subjected to a shift from high to low [CO2]. Groups of co-expressed genes were identified and a putative directed gene-regulatory network underlying the CCM was reconstructed from the gene expression data using the recently developed IOTA (inner composition alignment method. Among the candidate regulatory genes, two members of the MYB-related TF family, Lcr1 (Low-CO 2 response regulator 1 and Lcr2 (Low-CO2 response regulator 2, may play an important role in down-regulating the expression of a particular set of TF and TR genes in response to low [CO2]. The results obtained provide new insights into the transcriptional control of the CCM and revealed more than 60 new candidate regulatory genes. Deep sequencing of nucleosome-depleted genomic regions indicated the presence of new, previously unknown regulatory elements in the C. reinhardtii genome

  1. Bach2 represses plasma cell gene regulatory network in B cells to promote antibody class switch.

    Science.gov (United States)

    Muto, Akihiko; Ochiai, Kyoko; Kimura, Yoshitaka; Itoh-Nakadai, Ari; Calame, Kathryn L; Ikebe, Dai; Tashiro, Satoshi; Igarashi, Kazuhiko

    2010-12-01

    Two transcription factors, Pax5 and Blimp-1, form a gene regulatory network (GRN) with a double-negative loop, which defines either B-cell (Pax5 high) or plasma cell (Blimp-1 high) status as a binary switch. However, it is unclear how this B-cell GRN registers class switch DNA recombination (CSR), an event that takes place before the terminal differentiation to plasma cells. In the absence of Bach2 encoding a transcription factor required for CSR, mouse splenic B cells more frequently and rapidly expressed Blimp-1 and differentiated to IgM plasma cells as compared with wild-type cells. Genetic loss of Blimp-1 in Bach2(-/-) B cells was sufficient to restore CSR. These data with mathematical modelling of the GRN indicate that Bach2 achieves a time delay in Blimp-1 induction, which inhibits plasma cell differentiation and promotes CSR (Delay-Driven Diversity model for CSR). Reduction in mature B-cell numbers in Bach2(-/-) mice was not rescued by Blimp-1 ablation, indicating that Bach2 regulates B-cell differentiation and function through Blimp-1-dependent and -independent GRNs.

  2. Aggregation for Computing Multi-Modal Stationary Distributions in 1-D Gene Regulatory Networks.

    Science.gov (United States)

    Avcu, Neslihan; Pekergin, Nihal; Pekergin, Ferhan; Guzelis, Cuneyt

    2017-04-27

    This paper proposes aggregation-based, three-stage algorithms to overcome the numerical problems encountered in computing stationary distributions and mean first passage times for multi-modal birth-death processes of large state space sizes. The considered birth-death processes which are defined by Chemical Master Equations are used in modeling stochastic behavior of gene regulatory networks. Computing stationary probabilities for a multi-modal distribution from Chemical Master Equations is subject to have numerical problems due to the probability values running out of the representation range of the standard programming languages with the increasing size of the state space. The aggregation is shown to provide a solution to this problem by analyzing first reduced size subsystems in isolation and then considering the transitions between these subsystems. The proposed algorithms are applied to study the bimodal behavior of the lac operon of E. coli described with a one-dimensional birth-death model. Thus the determination of the entire parameter range of bimodality for the stochastic model of lac operon is achieved.

  3. Parallel mutual information estimation for inferring gene regulatory networks on GPUs

    Directory of Open Access Journals (Sweden)

    Liu Weiguo

    2011-06-01

    Full Text Available Abstract Background Mutual information is a measure of similarity between two variables. It has been widely used in various application domains including computational biology, machine learning, statistics, image processing, and financial computing. Previously used simple histogram based mutual information estimators lack the precision in quality compared to kernel based methods. The recently introduced B-spline function based mutual information estimation method is competitive to the kernel based methods in terms of quality but at a lower computational complexity. Results We present a new approach to accelerate the B-spline function based mutual information estimation algorithm with commodity graphics hardware. To derive an efficient mapping onto this type of architecture, we have used the Compute Unified Device Architecture (CUDA programming model to design and implement a new parallel algorithm. Our implementation, called CUDA-MI, can achieve speedups of up to 82 using double precision on a single GPU compared to a multi-threaded implementation on a quad-core CPU for large microarray datasets. We have used the results obtained by CUDA-MI to infer gene regulatory networks (GRNs from microarray data. The comparisons to existing methods including ARACNE and TINGe show that CUDA-MI produces GRNs of higher quality in less time. Conclusions CUDA-MI is publicly available open-source software, written in CUDA and C++ programming languages. It obtains significant speedup over sequential multi-threaded implementation by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs.

  4. Mapping Gene Regulatory Networks in Drosophila Eye Development by Large-Scale Transcriptome Perturbations and Motif Inference

    Directory of Open Access Journals (Sweden)

    Delphine Potier

    2014-12-01

    Full Text Available Genome control is operated by transcription factors (TFs controlling their target genes by binding to promoters and enhancers. Conceptually, the interactions between TFs, their binding sites, and their functional targets are represented by gene regulatory networks (GRNs. Deciphering in vivo GRNs underlying organ development in an unbiased genome-wide setting involves identifying both functional TF-gene interactions and physical TF-DNA interactions. To reverse engineer the GRNs of eye development in Drosophila, we performed RNA-seq across 72 genetic perturbations and sorted cell types and inferred a coexpression network. Next, we derived direct TF-DNA interactions using computational motif inference, ultimately connecting 241 TFs to 5,632 direct target genes through 24,926 enhancers. Using this network, we found network motifs, cis-regulatory codes, and regulators of eye development. We validate the predicted target regions of Grainyhead by ChIP-seq and identify this factor as a general cofactor in the eye network, being bound to thousands of nucleosome-free regions.

  5. Small RNA-Controlled Gene Regulatory Networks in Pseudomonas putida

    DEFF Research Database (Denmark)

    Bojanovic, Klara

    evolved numerous mechanisms to controlgene expression in response to specific environmental signals. In addition to two-component systems, small regulatory RNAs (sRNAs) have emerged as major regulators of gene expression. The majority of sRNAs bind to mRNA and regulate their expression. They often have...

  6. LoTo: a graphlet based method for the comparison of local topology between gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Alberto J. Martin

    2017-02-01

    Full Text Available One of the main challenges of the post-genomic era is the understanding of how gene expression is controlled. Changes in gene expression lay behind diverse biological phenomena such as development, disease and the adaptation to different environmental conditions. Despite the availability of well-established methods to identify these changes, tools to discern how gene regulation is orchestrated are still required. The regulation of gene expression is usually depicted as a Gene Regulatory Network (GRN where changes in the network structure (i.e., network topology represent adjustments of gene regulation. Like other networks, GRNs are composed of basic building blocks; small induced subgraphs called graphlets. Here we present LoTo, a novel method that using Graphlet Based Metrics (GBMs identifies topological variations between different states of a GRN. Under our approach, different states of a GRN are analyzed to determine the types of graphlet formed by all triplets of nodes in the network. Subsequently, graphlets occurring in a state of the network are compared to those formed by the same three nodes in another version of the network. Once the comparisons are performed, LoTo applies metrics from binary classification problems calculated on the existence and absence of graphlets to assess the topological similarity between both network states. Experiments performed on randomized networks demonstrate that GBMs are more sensitive to topological variation than the same metrics calculated on single edges. Additional comparisons with other common metrics demonstrate that our GBMs are capable to identify nodes whose local topology changes between different states of the network. Notably, due to the explicit use of graphlets, LoTo captures topological variations that are disregarded by other approaches. LoTo is freely available as an online web server at http://dlab.cl/loto.

  7. Evolutionary approaches for the reverse-engineering of gene regulatory networks: A study on a biologically realistic dataset

    Directory of Open Access Journals (Sweden)

    Gidrol Xavier

    2008-02-01

    Full Text Available Abstract Background Inferring gene regulatory networks from data requires the development of algorithms devoted to structure extraction. When only static data are available, gene interactions may be modelled by a Bayesian Network (BN that represents the presence of direct interactions from regulators to regulees by conditional probability distributions. We used enhanced evolutionary algorithms to stochastically evolve a set of candidate BN structures and found the model that best fits data without prior knowledge. Results We proposed various evolutionary strategies suitable for the task and tested our choices using simulated data drawn from a given bio-realistic network of 35 nodes, the so-called insulin network, which has been used in the literature for benchmarking. We assessed the inferred models against this reference to obtain statistical performance results. We then compared performances of evolutionary algorithms using two kinds of recombination operators that operate at different scales in the graphs. We introduced a niching strategy that reinforces diversity through the population and avoided trapping of the algorithm in one local minimum in the early steps of learning. We show the limited effect of the mutation operator when niching is applied. Finally, we compared our best evolutionary approach with various well known learning algorithms (MCMC, K2, greedy search, TPDA, MMHC devoted to BN structure learning. Conclusion We studied the behaviour of an evolutionary approach enhanced by niching for the learning of gene regulatory networks with BN. We show that this approach outperforms classical structure learning methods in elucidating the original model. These results were obtained for the learning of a bio-realistic network and, more importantly, on various small datasets. This is a suitable approach for learning transcriptional regulatory networks from real datasets without prior knowledge.

  8. Inferring the time-invariant topology of a nonlinear sparse gene regulatory network using fully Bayesian spline autoregression.

    Science.gov (United States)

    Morrissey, Edward R; Juárez, Miguel A; Denby, Katherine J; Burroughs, Nigel J

    2011-10-01

    We propose a semiparametric Bayesian model, based on penalized splines, for the recovery of the time-invariant topology of a causal interaction network from longitudinal data. Our motivation is inference of gene regulatory networks from low-resolution microarray time series, where existence of nonlinear interactions is well known. Parenthood relations are mapped by augmenting the model with kinship indicators and providing these with either an overall or gene-wise hierarchical structure. Appropriate specification of the prior is crucial to control the flexibility of the splines, especially under circumstances of scarce data; thus, we provide an informative, proper prior. Substantive improvement in network inference over a linear model is demonstrated using synthetic data drawn from ordinary differential equation models and gene expression from an experimental data set of the Arabidopsis thaliana circadian rhythm.

  9. Influence of statistical estimators of mutual information and data heterogeneity on the inference of gene regulatory networks.

    Directory of Open Access Journals (Sweden)

    Ricardo de Matos Simoes

    Full Text Available The inference of gene regulatory networks from gene expression data is a difficult problem because the performance of the inference algorithms depends on a multitude of different factors. In this paper we study two of these. First, we investigate the influence of discrete mutual information (MI estimators on the global and local network inference performance of the C3NET algorithm. More precisely, we study 4 different MI estimators (Empirical, Miller-Madow, Shrink and Schürmann-Grassberger in combination with 3 discretization methods (equal frequency, equal width and global equal width discretization. We observe the best global and local inference performance of C3NET for the Miller-Madow estimator with an equal width discretization. Second, our numerical analysis can be considered as a systems approach because we simulate gene expression data from an underlying gene regulatory network, instead of making a distributional assumption to sample thereof. We demonstrate that despite the popularity of the latter approach, which is the traditional way of studying MI estimators, this is in fact not supported by simulated and biological expression data because of their heterogeneity. Hence, our study provides guidance for an efficient design of a simulation study in the context of network inference, supporting a systems approach.

  10. Influence of statistical estimators of mutual information and data heterogeneity on the inference of gene regulatory networks.

    Science.gov (United States)

    de Matos Simoes, Ricardo; Emmert-Streib, Frank

    2011-01-01

    The inference of gene regulatory networks from gene expression data is a difficult problem because the performance of the inference algorithms depends on a multitude of different factors. In this paper we study two of these. First, we investigate the influence of discrete mutual information (MI) estimators on the global and local network inference performance of the C3NET algorithm. More precisely, we study 4 different MI estimators (Empirical, Miller-Madow, Shrink and Schürmann-Grassberger) in combination with 3 discretization methods (equal frequency, equal width and global equal width discretization). We observe the best global and local inference performance of C3NET for the Miller-Madow estimator with an equal width discretization. Second, our numerical analysis can be considered as a systems approach because we simulate gene expression data from an underlying gene regulatory network, instead of making a distributional assumption to sample thereof. We demonstrate that despite the popularity of the latter approach, which is the traditional way of studying MI estimators, this is in fact not supported by simulated and biological expression data because of their heterogeneity. Hence, our study provides guidance for an efficient design of a simulation study in the context of network inference, supporting a systems approach.

  11. An efficient data assimilation schema for restoration and extension of gene regulatory networks using time-course observation data.

    Science.gov (United States)

    Hasegawa, Takanori; Mori, Tomoya; Yamaguchi, Rui; Imoto, Seiya; Miyano, Satoru; Akutsu, Tatsuya

    2014-11-01

    Gene regulatory networks (GRNs) play a central role in sustaining complex biological systems in cells. Although we can construct GRNs by integrating biological interactions that have been recorded in literature, they can include suspicious data and a lack of information. Therefore, there has been an urgent need for an approach by which the validity of constructed networks can be evaluated; simulation-based methods have been applied in which biological observational data are assimilated. However, these methods apply nonlinear models that require high computational power to evaluate even one network consisting of only several genes. Therefore, to explore candidate networks whose simulation models can better predict the data by modifying and extending literature-based GRNs, an efficient and versatile method is urgently required. We applied a combinatorial transcription model, which can represent combinatorial regulatory effects of genes, as a biological simulation model, to reproduce the dynamic behavior of gene expressions within a state space model. Under the model, we applied the unscented Kalman filter to obtain the approximate posterior probability distribution of the hidden state to efficiently estimate parameter values maximizing prediction ability for observational data by the EM-algorithm. Utilizing the method, we propose a novel algorithm to modify GRNs reported in the literature so that their simulation models become consistent with observed data. The effectiveness of our approach was validated through comparison analysis to the previous methods using synthetic networks. Finally, as an application example, a Kyoto Encyclopedia of Genes and Genomes (KEGG)-based yeast cell cycle network was extended with additional candidate genes to better predict the real mRNA expressions data using the proposed method.

  12. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Cirera Salicio, Susanna; Zhernakova, Daria V.

    2014-01-01

    interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model...... in a porcine model. Methods We selected 36 animals for RNA Sequencing from a previously created F2 pig population representing three extreme groups based on their predicted genetic risks for obesity. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to detect clusters of highly co-expressed genes...... in humans and rodents, e.g. CSF1R and MARC2. Conclusions To our knowledge, this is the first study to apply systems biology approaches using porcine adipose tissue RNA-Sequencing data in a genetically characterized porcine model for obesity. We revealed complex networks, pathways, candidate and regulatory...

  13. Ancestral regulatory circuits governing ectoderm patterning downstream of Nodal and BMP2/4 revealed by gene regulatory network analysis in an echinoderm.

    Directory of Open Access Journals (Sweden)

    Alexandra Saudemont

    Full Text Available Echinoderms, which are phylogenetically related to vertebrates and produce large numbers of transparent embryos that can be experimentally manipulated, offer many advantages for the analysis of the gene regulatory networks (GRN regulating germ layer formation. During development of the sea urchin embryo, the ectoderm is the source of signals that pattern all three germ layers along the dorsal-ventral axis. How this signaling center controls patterning and morphogenesis of the embryo is not understood. Here, we report a large-scale analysis of the GRN deployed in response to the activity of this signaling center in the embryos of the Mediterranean sea urchin Paracentrotus lividus, in which studies with high spatial resolution are possible. By using a combination of in situ hybridization screening, overexpression of mRNA, recombinant ligand treatments, and morpholino-based loss-of-function studies, we identified a cohort of transcription factors and signaling molecules expressed in the ventral ectoderm, dorsal ectoderm, and interposed neurogenic ("ciliary band" region in response to the known key signaling molecules Nodal and BMP2/4 and defined the epistatic relationships between the most important genes. The resultant GRN showed a number of striking features. First, Nodal was found to be essential for the expression of all ventral and dorsal marker genes, and BMP2/4 for all dorsal genes. Second, goosecoid was identified as a central player in a regulatory sub-circuit controlling mouth formation, while tbx2/3 emerged as a critical factor for differentiation of the dorsal ectoderm. Finally, and unexpectedly, a neurogenic ectoderm regulatory circuit characterized by expression of "ciliary band" genes was triggered in the absence of TGF beta signaling. We propose a novel model for ectoderm regionalization, in which neural ectoderm is the default fate in the absence of TGF beta signaling, and suggest that the stomodeal and neural subcircuits that we

  14. On the trail of EHEC/EAEC--unraveling the gene regulatory networks of human pathogenic Escherichia coli bacteria.

    Science.gov (United States)

    Pauling, Josch; Röttger, Richard; Neuner, Andreas; Salgado, Heladia; Collado-Vides, Julio; Kalaghatgi, Prabhav; Azevedo, Vasco; Tauch, Andreas; Pühler, Alfred; Baumbach, Jan

    2012-07-01

    Pathogenic Escherichia coli, such as Enterohemorrhagic E. coli (EHEC) and Enteroaggregative E. coli (EAEC), are globally widespread bacteria. Some may cause the hemolytic uremic syndrome (HUS). Varying strains cause epidemics all over the world. Recently, we observed an epidemic outbreak of a multi-resistant EHEC strain in Western Europe, mainly in Germany. The Robert Koch Institute reports >4300 infections and >50 deaths (July, 2011). Farmers lost several million EUR since the origin of infection was unclear. Here, we contribute to the currently ongoing research with a computer-aided study of EHEC transcriptional regulatory interactions, a network of genetic switches that control, for instance, pathogenicity, survival and reproduction of bacterial cells. Our strategy is to utilize knowledge of gene regulatory networks from the evolutionary relative E. coli K-12, a harmless strain mainly used for wet lab studies. In order to provide high-potential candidates for human pathogenic E. coli bacteria, such as EHEC, we developed the integrated online database and an analysis platform EhecRegNet. We utilize 3489 known regulations from E. coli K-12 for predictions of yet unknown gene regulatory interactions in 16 human pathogens. For these strains we predict 40,913 regulatory interactions. EhecRegNet is based on the identification of evolutionarily conserved regulatory sites within the DNA of the harmless E. coli K-12 and the pathogens. Identifying and characterizing EHEC's genetic control mechanism network on a large scale will allow for a better understanding of its survival and infection strategies. This will support the development of urgently needed new treatments. EhecRegNet is online via http://www.ehecregnet.de.

  15. LmSmdB: an integrated database for metabolic and gene regulatory network in Leishmania major and Schistosoma mansoni.

    Science.gov (United States)

    Patel, Priyanka; Mandlik, Vineetha; Singh, Shailza

    2016-03-01

    A database that integrates all the information required for biological processing is essential to be stored in one platform. We have attempted to create one such integrated database that can be a one stop shop for the essential features required to fetch valuable result. LmSmdB (L. major and S. mansoni database) is an integrated database that accounts for the biological networks and regulatory pathways computationally determined by integrating the knowledge of the genome sequences of the mentioned organisms. It is the first database of its kind that has together with the network designing showed the simulation pattern of the product. This database intends to create a comprehensive canopy for the regulation of lipid metabolism reaction in the parasite by integrating the transcription factors, regulatory genes and the protein products controlled by the transcription factors and hence operating the metabolism at genetic level.

  16. Evolutionary dynamics of prokaryotic transcriptional regulatory networks.

    Science.gov (United States)

    Madan Babu, M; Teichmann, Sarah A; Aravind, L

    2006-04-28

    The structure of complex transcriptional regulatory networks has been studied extensively in certain model organisms. However, the evolutionary dynamics of these networks across organisms, which would reveal important principles of adaptive regulatory changes, are poorly understood. We use the known transcriptional regulatory network of Escherichia coli to analyse the conservation patterns of this network across 175 prokaryotic genomes, and predict components of the regulatory networks for these organisms. We observe that transcription factors are typically less conserved than their target genes and evolve independently of them, with different organisms evolving distinct repertoires of transcription factors responding to specific signals. We show that prokaryotic transcriptional regulatory networks have evolved principally through widespread tinkering of transcriptional interactions at the local level by embedding orthologous genes in different types of regulatory motifs. Different transcription factors have emerged independently as dominant regulatory hubs in various organisms, suggesting that they have convergently acquired similar network structures approximating a scale-free topology. We note that organisms with similar lifestyles across a wide phylogenetic range tend to conserve equivalent interactions and network motifs. Thus, organism-specific optimal network designs appear to have evolved due to selection for specific transcription factors and transcriptional interactions, allowing responses to prevalent environmental stimuli. The methods for biological network analysis introduced here can be applied generally to study other networks, and these predictions can be used to guide specific experiments.

  17. An overview of the gene regulatory network controlling trichome development in the model plant, Arabidopsis

    Directory of Open Access Journals (Sweden)

    Sitakanta ePattanaik

    2014-06-01

    Full Text Available Trichomes are specialized epidermal cells located on aerial parts of plants and are associated with a wide array of biological processes. Trichomes protect plants from adverse conditions including UV light and herbivore attack and are also an important source of a number of phytochemicals. The simple unicellular trichomes of Arabidopsis serve as an excellent model to study molecular mechanism of cell differentiation and pattern formation in plants. The emerging picture suggests that the developmental process is controlled by a transcriptional network involving three major groups of transcription factors: the R2R3 MYB, basic helix-loop-helix (bHLH and WD40 repeat (WDR protein. These regulatory proteins form a trimeric activator complex that positively regulates trichome development. The single repeat R3 MYBs act as negative regulators of trichome development. They compete with the R2R3 MYBs to bind the bHLH factor and form a repressor complex. In addition to activator-repressor mechanism, a depletion mechanism may operate in parallel during trichome development. In this mechanism, the bHLH factor traps the WDR protein which results in depletion of WDR protein in neighboring cells. Consequently, the cells with high levels of bHLH and WDR proteins are developed into trichomes. A group of C2H2 zinc finger TFs has also been implicated in trichome development. Phytohormones, including gibberellins and jasmonic acid, play significant roles in this developmental process. Recently, microRNAs have been shown to be involved in trichome development. Furthermore, it has been demonstrated that the activities of the key regulatory proteins involved in trichome development are controlled by the 26S/ubiquitin proteasome system (UPS, highlighting the complexity of the regulatory network controlling this developmental process. To complement several excellent recent relevant reviews, this review focuses on the transcriptional network and hormonal interplay

  18. Integrative analysis of the zinc finger transcription factor Lame duck in the Drosophila myogenic gene regulatory network.

    Science.gov (United States)

    Busser, Brian W; Huang, Di; Rogacki, Kevin R; Lane, Elizabeth A; Shokri, Leila; Ni, Ting; Gamble, Caitlin E; Gisselbrecht, Stephen S; Zhu, Jun; Bulyk, Martha L; Ovcharenko, Ivan; Michelson, Alan M

    2012-12-11

    Contemporary high-throughput technologies permit the rapid identification of transcription factor (TF) target genes on a genome-wide scale, yet the functional significance of TFs requires knowledge of target gene expression patterns, cooperating TFs, and cis-regulatory element (CRE) structures. Here we investigated the myogenic regulatory network downstream of the Drosophila zinc finger TF Lame duck (Lmd) by combining both previously published and newly performed genomic data sets, including ChIP sequencing (ChIP-seq), genome-wide mRNA profiling, cell-specific expression patterns of putative transcriptional targets, analysis of histone mark signatures, studies of TF cooccupancy by additional mesodermal regulators, TF binding site determination using protein binding microarrays (PBMs), and machine learning of candidate CRE motif compositions. Our findings suggest that Lmd orchestrates an extensive myogenic regulatory network, a conclusion supported by the identification of Lmd-dependent genes, histone signatures of Lmd-bound genomic regions, and the relationship of these features to cell-specific gene expression patterns. The heterogeneous cooccupancy of Lmd-bound regions with additional mesodermal regulators revealed that different transcriptional inputs are used to mediate similar myogenic gene expression patterns. Machine learning further demonstrated diverse combinatorial motif patterns within tissue-specific Lmd-bound regions. PBM analysis established the complete spectrum of Lmd DNA binding specificities, and site-directed mutagenesis of Lmd and additional newly discovered motifs in known enhancers demonstrated the critical role of these TF binding sites in supporting full enhancer activity. Collectively, these findings provide insights into the transcriptional codes regulating muscle gene expression and offer a generalizable approach for similar studies in other systems.

  19. DREISS: Using State-Space Models to Infer the Dynamics of Gene Expression Driven by External and Internal Regulatory Networks.

    Science.gov (United States)

    Wang, Daifeng; He, Fei; Maslov, Sergei; Gerstein, Mark

    2016-10-01

    Gene expression is controlled by the combinatorial effects of regulatory factors from different biological subsystems such as general transcription factors (TFs), cellular growth factors and microRNAs. A subsystem's gene expression may be controlled by its internal regulatory factors, exclusively, or by external subsystems, or by both. It is thus useful to distinguish the degree to which a subsystem is regulated internally or externally-e.g., how non-conserved, species-specific TFs affect the expression of conserved, cross-species genes during evolution. We developed a computational method (DREISS, dreiss.gerteinlab.org) for analyzing the Dynamics of gene expression driven by Regulatory networks, both External and Internal based on State Space models. Given a subsystem, the "state" and "control" in the model refer to its own (internal) and another subsystem's (external) gene expression levels. The state at a given time is determined by the state and control at a previous time. Because typical time-series data do not have enough samples to fully estimate the model's parameters, DREISS uses dimensionality reduction, and identifies canonical temporal expression trajectories (e.g., degradation, growth and oscillation) representing the regulatory effects emanating from various subsystems. To demonstrate capabilities of DREISS, we study the regulatory effects of evolutionarily conserved vs. divergent TFs across distant species. In particular, we applied DREISS to the time-series gene expression datasets of C. elegans and D. melanogaster during their embryonic development. We analyzed the expression dynamics of the conserved, orthologous genes (orthologs), seeing the degree to which these can be accounted for by orthologous (internal) versus species-specific (external) TFs. We found that between two species, the orthologs have matched, internally driven expression patterns but very different externally driven ones. This is particularly true for genes with evolutionarily

  20. DREISS: Using State-Space Models to Infer the Dynamics of Gene Expression Driven by External and Internal Regulatory Networks

    Science.gov (United States)

    Gerstein, Mark

    2016-01-01

    Gene expression is controlled by the combinatorial effects of regulatory factors from different biological subsystems such as general transcription factors (TFs), cellular growth factors and microRNAs. A subsystem’s gene expression may be controlled by its internal regulatory factors, exclusively, or by external subsystems, or by both. It is thus useful to distinguish the degree to which a subsystem is regulated internally or externally–e.g., how non-conserved, species-specific TFs affect the expression of conserved, cross-species genes during evolution. We developed a computational method (DREISS, dreiss.gerteinlab.org) for analyzing the Dynamics of gene expression driven by Regulatory networks, both External and Internal based on State Space models. Given a subsystem, the “state” and “control” in the model refer to its own (internal) and another subsystem’s (external) gene expression levels. The state at a given time is determined by the state and control at a previous time. Because typical time-series data do not have enough samples to fully estimate the model’s parameters, DREISS uses dimensionality reduction, and identifies canonical temporal expression trajectories (e.g., degradation, growth and oscillation) representing the regulatory effects emanating from various subsystems. To demonstrate capabilities of DREISS, we study the regulatory effects of evolutionarily conserved vs. divergent TFs across distant species. In particular, we applied DREISS to the time-series gene expression datasets of C. elegans and D. melanogaster during their embryonic development. We analyzed the expression dynamics of the conserved, orthologous genes (orthologs), seeing the degree to which these can be accounted for by orthologous (internal) versus species-specific (external) TFs. We found that between two species, the orthologs have matched, internally driven expression patterns but very different externally driven ones. This is particularly true for genes with

  1. Identifying Tmem59 related gene regulatory network of mouse neural stem cell from a compendium of expression profiles

    Directory of Open Access Journals (Sweden)

    Guo Xiuyun

    2011-09-01

    Full Text Available Abstract Background Neural stem cells offer potential treatment for neurodegenerative disorders, such like Alzheimer's disease (AD. While much progress has been made in understanding neural stem cell function, a precise description of the molecular mechanisms regulating neural stem cells is not yet established. This lack of knowledge is a major barrier holding back the discovery of therapeutic uses of neural stem cells. In this paper, the regulatory mechanism of mouse neural stem cell (NSC differentiation by tmem59 is explored on the genome-level. Results We identified regulators of tmem59 during the differentiation of mouse NSCs from a compendium of expression profiles. Based on the microarray experiment, we developed the parallelized SWNI algorithm to reconstruct gene regulatory networks of mouse neural stem cells. From the inferred tmem59 related gene network including 36 genes, pou6f1 was identified to regulate tmem59 significantly and might play an important role in the differentiation of NSCs in mouse brain. There are four pathways shown in the gene network, indicating that tmem59 locates in the downstream of the signalling pathway. The real-time RT-PCR results shown that the over-expression of pou6f1 could significantly up-regulate tmem59 expression in C17.2 NSC line. 16 out of 36 predicted genes in our constructed network have been reported to be AD-related, including Ace, aqp1, arrdc3, cd14, cd59a, cds1, cldn1, cox8b, defb11, folr1, gdi2, mmp3, mgp, myrip, Ripk4, rnd3, and sncg. The localization of tmem59 related genes and functional-related gene groups based on the Gene Ontology (GO annotation was also identified. Conclusions Our findings suggest that the expression of tmem59 is an important factor contributing to AD. The parallelized SWNI algorithm increased the efficiency of network reconstruction significantly. This study enables us to highlight novel genes that may be involved in NSC differentiation and provides a shortcut to

  2. Proteomic shifts in embryonic stem cells with gene dose modifications suggest the presence of balancer proteins in protein regulatory networks.

    Directory of Open Access Journals (Sweden)

    Lei Mao

    Full Text Available Large numbers of protein expression changes are usually observed in mouse models for neurodegenerative diseases, even when only a single gene was mutated in each case. To study the effect of gene dose alterations on the cellular proteome, we carried out a proteomic investigation on murine embryonic stem cells that either overexpressed individual genes or displayed aneuploidy over a genomic region encompassing 14 genes. The number of variant proteins detected per cell line ranged between 70 and 110, and did not correlate with the number of modified genes. In cell lines with single gene mutations, up and down-regulated proteins were always in balance in comparison to parental cell lines regarding number as well as concentration of differentially expressed proteins. In contrast, dose alteration of 14 genes resulted in an unequal number of up and down-regulated proteins, though the balance was kept at the level of protein concentration. We propose that the observed protein changes might partially be explained by a proteomic network response. Hence, we hypothesize the existence of a class of "balancer" proteins within the proteomic network, defined as proteins that buffer or cushion a system, and thus oppose multiple system disturbances. Through database queries and resilience analysis of the protein interaction network, we found that potential balancer proteins are of high cellular abundance, possess a low number of direct interaction partners, and show great allelic variation. Moreover, balancer proteins contribute more heavily to the network entropy, and thus are of high importance in terms of system resilience. We propose that the "elasticity" of the proteomic regulatory network mediated by balancer proteins may compensate for changes that occur under diseased conditions.

  3. Application of R to investigate common gene regulatory network pathway among bipolar disorder and associate diseases

    Directory of Open Access Journals (Sweden)

    Nahida Habib

    2016-12-01

    Full Text Available Depression, Major Depression or mental disorder creates severe diseases. Mental illness such as Unipolar Major Depression, Bipolar Disorder, Dysthymia, Schizophrenia, Cardiovascular Diseases (Hypertension, Coronary Heart Disease, Stroke etc., are known as Major Depression. Several studies have revealed the possibilities about the association among Bipolar Disorder, Schizophrenia, Coronary Heart Diseases and Stroke with each other. The current study aimed to investigate the relationships between genetic variants in the above four diseases and to create a common pathway or PPI network. The associated genes of each disease are collected from different gene database with verification using R. After performing some preprocessing, mining and operations using R on collected genes, seven (7 common associated genes are discovered on selected four diseases (SZ, BD, CHD and Stroke. In each of the iteration, the numbers of collected genes are reduced up to 51%, 36%, 10%, 2% and finally less than 1% respectively. Moreover, common pathway on selected diseases has been investigated in this research.

  4. Uncovering early response of gene regulatory networks in ES cells by systematic induction of transcription factors

    Science.gov (United States)

    Nishiyama, Akira; Xin, Li; Sharov, Alexei A.; Thomas, Marshall; Mowrer, Gregory; Meyers, Emily; Piao, Yulan; Mehta, Samir; Yee, Sarah; Nakatake, Yuhki; Stagg, Carole; Sharova, Lioudmila; Correa-Cerro, Lina S.; Bassey, Uwem; Hoang, Hien; Kim, Eugene; Tapnio, Richard; Qian, Yong; Dudekula, Dawood; Zalzman, Michal; Li, Manxiang; Falco, Geppino; Yang, Hsih-Te; Lee, Sung-Lim; Monti, Manuela; Stanghellini, Ilaria; Islam, Md. Nurul; Nagaraja, Ramaiah; Goldberg, Ilya; Wang, Weidong; Longo, Dan L.; Schlessinger, David; Ko, Minoru S. H.

    2009-01-01

    SUMMARY To examine transcription factor (TF) network(s), we created mouse ES cell lines, in each of which one of 50 TFs tagged with a FLAG moiety is inserted into a ubiquitously controllable tetracycline-repressible locus. Of the 50 TFs, Cdx2 provoked the most extensive transcriptome perturbation in ES cells, followed by Esx1, Sox9, Tcf3, Klf4, and Gata3. ChIP-Seq revealed that CDX2 binds to promoters of up-regulated target genes. By contrast, genes down-regulated by CDX2 did not show CDX2 binding, but were enriched with binding sites for POU5F1, SOX2, and NANOG. Genes with binding sites for these core TFs were also down-regulated by the induction of at least 15 other TFs, suggesting a common initial step for ES cell differentiation mediated by interference with the binding of core TFs to their target genes. These ES cell lines provide a fundamental resource to study biological networks in ES cells and mice. PMID:19796622

  5. Construction and analysis of regulatory genetic networks in cervical cancer based on involved microRNAs, target genes, transcription factors and host genes.

    Science.gov (United States)

    Wang, Ning; Xu, Zhiwen; Wang, Kunhao; Zhu, Minghui; Li, Yang

    2014-04-01

    Over recent years, genes and microRNA (miRNA/miR) have been considered as key biological factors in human carcinogenesis. During cancer development, genes may act as multiple identities, including target genes of miRNA, transcription factors and host genes. The present study concentrated on the regulatory networks consisting of the biological factors involved in cervical cancer in order to investigate their features and affect on this specific pathology. Numerous raw data was collected and organized into purposeful structures, and adaptive procedures were defined for application to the prepared data. The networks were therefore built with the factors as basic components according to their interacting associations. The networks were constructed at three levels of interdependency, including a differentially-expressed network, a related network and a global network. Comparisons and analyses were made at a systematic level rather than from an isolated gene or miRNA. Critical hubs were extracted in the core networks and notable features were discussed, including self-adaption feedback regulation. The present study expounds the pathogenesis from a novel point of view and is proposed to provide inspiration for further investigation and therapy.

  6. GRNsight: a web application and service for visualizing models of small- to medium-scale gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Kam D. Dahlquist

    2016-09-01

    Full Text Available GRNsight is a web application and service for visualizing models of gene regulatory networks (GRNs. A gene regulatory network (GRN consists of genes, transcription factors, and the regulatory connections between them which govern the level of expression of mRNA and protein from genes. The original motivation came from our efforts to perform parameter estimation and forward simulation of the dynamics of a differential equations model of a small GRN with 21 nodes and 31 edges. We wanted a quick and easy way to visualize the weight parameters from the model which represent the direction and magnitude of the influence of a transcription factor on its target gene, so we created GRNsight. GRNsight automatically lays out either an unweighted or weighted network graph based on an Excel spreadsheet containing an adjacency matrix where regulators are named in the columns and target genes in the rows, a Simple Interaction Format (SIF text file, or a GraphML XML file. When a user uploads an input file specifying an unweighted network, GRNsight automatically lays out the graph using black lines and pointed arrowheads. For a weighted network, GRNsight uses pointed and blunt arrowheads, and colors the edges and adjusts their thicknesses based on the sign (positive for activation or negative for repression and magnitude of the weight parameter. GRNsight is written in JavaScript, with diagrams facilitated by D3.js, a data visualization library. Node.js and the Express framework handle server-side functions. GRNsight’s diagrams are based on D3.js’s force graph layout algorithm, which was then extensively customized to support the specific needs of GRNs. Nodes are rectangular and support gene labels of up to 12 characters. The edges are arcs, which become straight lines when the nodes are close together. Self-regulatory edges are indicated by a loop. When a user mouses over an edge, the numerical value of the weight parameter is displayed. Visualizations can

  7. MapReduce Algorithms for Inferring Gene Regulatory Networks from Time-Series Microarray Data Using an Information-Theoretic Approach

    Directory of Open Access Journals (Sweden)

    Yasser Abduallah

    2017-01-01

    Full Text Available Gene regulation is a series of processes that control gene expression and its extent. The connections among genes and their regulatory molecules, usually transcription factors, and a descriptive model of such connections are known as gene regulatory networks (GRNs. Elucidating GRNs is crucial to understand the inner workings of the cell and the complexity of gene interactions. To date, numerous algorithms have been developed to infer gene regulatory networks. However, as the number of identified genes increases and the complexity of their interactions is uncovered, networks and their regulatory mechanisms become cumbersome to test. Furthermore, prodding through experimental results requires an enormous amount of computation, resulting in slow data processing. Therefore, new approaches are needed to expeditiously analyze copious amounts of experimental data resulting from cellular GRNs. To meet this need, cloud computing is promising as reported in the literature. Here, we propose new MapReduce algorithms for inferring gene regulatory networks on a Hadoop cluster in a cloud environment. These algorithms employ an information-theoretic approach to infer GRNs using time-series microarray data. Experimental results show that our MapReduce program is much faster than an existing tool while achieving slightly better prediction accuracy than the existing tool.

  8. MapReduce Algorithms for Inferring Gene Regulatory Networks from Time-Series Microarray Data Using an Information-Theoretic Approach.

    Science.gov (United States)

    Abduallah, Yasser; Turki, Turki; Byron, Kevin; Du, Zongxuan; Cervantes-Cervantes, Miguel; Wang, Jason T L

    2017-01-01

    Gene regulation is a series of processes that control gene expression and its extent. The connections among genes and their regulatory molecules, usually transcription factors, and a descriptive model of such connections are known as gene regulatory networks (GRNs). Elucidating GRNs is crucial to understand the inner workings of the cell and the complexity of gene interactions. To date, numerous algorithms have been developed to infer gene regulatory networks. However, as the number of identified genes increases and the complexity of their interactions is uncovered, networks and their regulatory mechanisms become cumbersome to test. Furthermore, prodding through experimental results requires an enormous amount of computation, resulting in slow data processing. Therefore, new approaches are needed to expeditiously analyze copious amounts of experimental data resulting from cellular GRNs. To meet this need, cloud computing is promising as reported in the literature. Here, we propose new MapReduce algorithms for inferring gene regulatory networks on a Hadoop cluster in a cloud environment. These algorithms employ an information-theoretic approach to infer GRNs using time-series microarray data. Experimental results show that our MapReduce program is much faster than an existing tool while achieving slightly better prediction accuracy than the existing tool.

  9. Genome-wide analysis of the p53 gene regulatory network in the developing mouse kidney.

    Science.gov (United States)

    Li, Yuwen; Liu, Jiao; McLaughlin, Nathan; Bachvarov, Dimcho; Saifudeen, Zubaida; El-Dahr, Samir S

    2013-10-16

    Despite mounting evidence that p53 senses and responds to physiological cues in vivo, existing knowledge regarding p53 function and target genes is largely derived from studies in cancer or stressed cells. Herein we utilize p53 transcriptome and ChIP-Seq (chromatin immunoprecipitation-high throughput sequencing) analyses to identify p53 regulated pathways in the embryonic kidney, an organ that develops via mesenchymal-epithelial interactions. This integrated approach allowed identification of novel genes that are possible direct p53 targets during kidney development. We find the p53-regulated transcriptome in the embryonic kidney is largely composed of genes regulating developmental, morphogenesis, and metabolic pathways. Surprisingly, genes in cell cycle and apoptosis pathways account for kidney lie within proximal promoters of annotated genes compared with 7% in a representative cancer cell line; 25% of the differentially expressed p53-bound genes are present in nephron progenitors and nascent nephrons, including key transcriptional regulators, components of Fgf, Wnt, Bmp, and Notch pathways, and ciliogenesis genes. The results indicate widespread p53 binding to the genome in vivo and context-dependent differences in the p53 regulon between cancer, stress, and development. To our knowledge, this is the first comprehensive analysis of the p53 transcriptome and cistrome in a developing mammalian organ, substantiating the role of p53 as a bona fide developmental regulator. We conclude p53 targets transcriptional networks regulating nephrogenesis and cellular metabolism during kidney development.

  10. Pitx2 modulates a Tbx5-dependent gene regulatory network to maintain atrial rhythm

    Science.gov (United States)

    Nadadur, Rangarajan D.; Broman, Michael T.; Boukens, Bastiaan; Mazurek, Stefan R.; Yang, Xinan; van den Boogaard, Malou; Bekeny, Jenna; Gadek, Margaret; Ward, Tarsha; Zhang, Min; Qiao, Yun; Martin, James F.; Seidman, Christine E.; Seidman, Jon; Christoffels, Vincent; Efimov, Igor R.; McNally, Elizabeth M.; Weber, Christopher R.; Moskowitz, Ivan P.

    2017-01-01

    Cardiac rhythm is extremely robust, generating 2 billion contraction cycles during the average human life span. Transcriptional control of cardiac rhythm is poorly understood. We found that removal of the transcription factor gene Tbx5 from the adult mouse caused primary spontaneous and sustained atrial fibrillation (AF). Atrial cardiomyocytes from the Tbx5-mutant mice exhibited action potential abnormalities, including spontaneous depolarizations, which were rescued by chelating free calcium. We identified a multitiered transcriptional network that linked seven previously defined AF risk loci: TBX5 directly activated PITX2, and TBX5 and PITX2 antagonistically regulated membrane effector genes Scn5a, Gja1, Ryr2, Dsp, and Atp2a2. In addition, reduced Tbx5 dose by adult-specific haploinsufficiency caused decreased target gene expression, myocardial automaticity, and AF inducibility, which were all rescued by Pitx2 haploinsufficiency in mice. These results defined a transcriptional architecture for atrial rhythm control organized as an incoherent feed-forward loop, driven by TBX5 and modulated by PITX2. TBX5/PITX2 interplay provides tight control of atrial rhythm effector gene expression, and perturbation of the co-regulated network caused AF susceptibility. This work provides a model for the molecular mechanisms underpinning the genetic implication of multiple AF genome-wide association studies loci and will contribute to future efforts to stratify patients for AF risk by genotype. PMID:27582060

  11. Pitx2 modulates a Tbx5-dependent gene regulatory network to maintain atrial rhythm.

    Science.gov (United States)

    Nadadur, Rangarajan D; Broman, Michael T; Boukens, Bastiaan; Mazurek, Stefan R; Yang, Xinan; van den Boogaard, Malou; Bekeny, Jenna; Gadek, Margaret; Ward, Tarsha; Zhang, Min; Qiao, Yun; Martin, James F; Seidman, Christine E; Seidman, Jon; Christoffels, Vincent; Efimov, Igor R; McNally, Elizabeth M; Weber, Christopher R; Moskowitz, Ivan P

    2016-08-31

    Cardiac rhythm is extremely robust, generating 2 billion contraction cycles during the average human life span. Transcriptional control of cardiac rhythm is poorly understood. We found that removal of the transcription factor gene Tbx5 from the adult mouse caused primary spontaneous and sustained atrial fibrillation (AF). Atrial cardiomyocytes from the Tbx5-mutant mice exhibited action potential abnormalities, including spontaneous depolarizations, which were rescued by chelating free calcium. We identified a multitiered transcriptional network that linked seven previously defined AF risk loci: TBX5 directly activated PITX2, and TBX5 and PITX2 antagonistically regulated membrane effector genes Scn5a, Gja1, Ryr2, Dsp, and Atp2a2 In addition, reduced Tbx5 dose by adult-specific haploinsufficiency caused decreased target gene expression, myocardial automaticity, and AF inducibility, which were all rescued by Pitx2 haploinsufficiency in mice. These results defined a transcriptional architecture for atrial rhythm control organized as an incoherent feed-forward loop, driven by TBX5 and modulated by PITX2. TBX5/PITX2 interplay provides tight control of atrial rhythm effector gene expression, and perturbation of the co-regulated network caused AF susceptibility. This work provides a model for the molecular mechanisms underpinning the genetic implication of multiple AF genome-wide association studies loci and will contribute to future efforts to stratify patients for AF risk by genotype.

  12. Evolutionary rewiring of bacterial regulatory networks

    Directory of Open Access Journals (Sweden)

    Tiffany B. Taylor

    2015-07-01

    Full Text Available Bacteria have evolved complex regulatory networks that enable integration of multiple intracellular and extracellular signals to coordinate responses to environmental changes. However, our knowledge of how regulatory systems function and evolve is still relatively limited. There is often extensive homology between components of different networks, due to past cycles of gene duplication, divergence, and horizontal gene transfer, raising the possibility of cross-talk or redundancy. Consequently, evolutionary resilience is built into gene networks – homology between regulators can potentially allow rapid rescue of lost regulatory function across distant regions of the genome. In our recent study [Taylor, et al. Science (2015, 347(6225] we find that mutations that facilitate cross-talk between pathways can contribute to gene network evolution, but that such mutations come with severe pleiotropic costs. Arising from this work are a number of questions surrounding how this phenomenon occurs.

  13. Improved inference of gene regulatory networks through integrated Bayesian clustering and dynamic modeling of time-course expression data.

    Science.gov (United States)

    Godsey, Brian

    2013-01-01

    Inferring gene regulatory networks from expression data is difficult, but it is common and often useful. Most network problems are under-determined--there are more parameters than data points--and therefore data or parameter set reduction is often necessary. Correlation between variables in the model also contributes to confound network coefficient inference. In this paper, we present an algorithm that uses integrated, probabilistic clustering to ease the problems of under-determination and correlated variables within a fully Bayesian framework. Specifically, ours is a dynamic Bayesian network with integrated Gaussian mixture clustering, which we fit using variational Bayesian methods. We show, using public, simulated time-course data sets from the DREAM4 Challenge, that our algorithm outperforms non-clustering methods in many cases (7 out of 25) with fewer samples, rarely underperforming (1 out of 25), and often selects a non-clustering model if it better describes the data. Source code (GNU Octave) for BAyesian Clustering Over Networks (BACON) and sample data are available at: http://code.google.com/p/bacon-for-genetic-networks.

  14. Whole blood transcriptomics in cardiac surgery identifies a gene regulatory network connecting ischemia reperfusion with systemic inflammation.

    Directory of Open Access Journals (Sweden)

    Orfeas Liangos

    Full Text Available BACKGROUND: Cardiac surgery with cardiopulmonary bypass (CS/CPB is associated with increased risk for postoperative complications causing substantial morbidity and mortality. To identify the molecular mechanisms underlying CS/CPB-induced pathophysiology we employed an integrative systems biology approach using the whole blood transcriptome as the sentinel organ. METHODOLOGY/PRINCIPAL FINDINGS: Total RNA was isolated and globin mRNA depleted from whole blood samples prospectively collected from 10 patients at time points prior (0, 2 and 24 hours following CS/CPB. Genome-wide transcriptional analysis revealed differential expression of 610 genes after CS/CPB (p<0.01. Among the 375 CS/CPB-upregulated genes, we found a gene-regulatory network consisting of 50 genes, reminiscent of activation of a coordinated genetic program triggered by CS/CPB. Intriguingly, the highly connected hub nodes of the identified network included key sensors of ischemia-reperfusion (HIF-1alpha and C/EBPbeta. Activation of this network initiated a concerted inflammatory response via upregulation of TLR-4/5, IL1R2/IL1RAP, IL6, IL18/IL18R1/IL18RAP, MMP9, HGF/HGFR, CalgranulinA/B, and coagulation factors F5/F12 among others. Differential regulation of 13 candidate genes including novel, not hitherto CS/CBP-associated genes, such as PTX3, PGK1 and Resistin, was confirmed using real-time quantitative RT-PCR. In support of the mRNA data, differential expression of MMP9, MIP1alpha and MIP1beta plasma proteins was further confirmed in 34 additional patients. CONCLUSIONS: Analysis of blood transcriptome uncovered critical signaling pathways governing the CS/CPB-induced pathophysiology. The molecular signaling underlying ischemia reperfusion and inflammatory response is highly intertwined and includes pro-inflammatory as well as cardioprotective elements. The herein identified candidate genes and pathways may provide promising prognostic biomarker and therapeutic targets.

  15. GeNeDA: An Open-Source Workflow for Design Automation of Gene Regulatory Networks Inspired from Microelectronics.

    Science.gov (United States)

    Madec, Morgan; Pecheux, François; Gendrault, Yves; Rosati, Elise; Lallement, Christophe; Haiech, Jacques

    2016-10-01

    The topic of this article is the development of an open-source automated design framework for synthetic biology, specifically for the design of artificial gene regulatory networks based on a digital approach. In opposition to other tools, GeNeDA is an open-source online software based on existing tools used in microelectronics that have proven their efficiency over the last 30 years. The complete framework is composed of a computation core directly adapted from an Electronic Design Automation tool, input and output interfaces, a library of elementary parts that can be achieved with gene regulatory networks, and an interface with an electrical circuit simulator. Each of these modules is an extension of microelectronics tools and concepts: ODIN II, ABC, the Verilog language, SPICE simulator, and SystemC-AMS. GeNeDA is first validated on a benchmark of several combinatorial circuits. The results highlight the importance of the part library. Then, this framework is used for the design of a sequential circuit including a biological state machine.

  16. Inferring polymorphism-induced regulatory gene networks active in human lymphocyte cell lines by weighted linear mixed model analysis of multiple RNA-Seq datasets.

    Directory of Open Access Journals (Sweden)

    Wensheng Zhang

    Full Text Available Single-nucleotide polymorphisms (SNPs contribute to the between-individual expression variation of many genes. A regulatory (trait-associated SNP is usually located near or within a (host gene, possibly influencing the gene's transcription or/and post-transcriptional modification. But its targets may also include genes that are physically farther away from it. A heuristic explanation of such multiple-target interferences is that the host gene transfers the SNP genotypic effects to the distant gene(s by a transcriptional or signaling cascade. These connections between the host genes (regulators and the distant genes (targets make the genetic analysis of gene expression traits a promising approach for identifying unknown regulatory relationships. In this study, through a mixed model analysis of multi-source digital expression profiling for 140 human lymphocyte cell lines (LCLs and the genotypes distributed by the international HapMap project, we identified 45 thousands of potential SNP-induced regulatory relationships among genes (the significance level for the underlying associations between expression traits and SNP genotypes was set at FDR < 0.01. We grouped the identified relationships into four classes (paradigms according to the two different mechanisms by which the regulatory SNPs affect their cis- and trans- regulated genes, modifying mRNA level or altering transcript splicing patterns. We further organized the relationships in each class into a set of network modules with the cis- regulated genes as hubs. We found that the target genes in a network module were often characterized by significant functional similarity, and the distributions of the target genes in three out of the four networks roughly resemble a power-law, a typical pattern of gene networks obtained from mutation experiments. By two case studies, we also demonstrated that significant biological insights can be inferred from the identified network modules.

  17. Genetic flexibility of regulatory networks.

    Science.gov (United States)

    Hunziker, Alexander; Tuboly, Csaba; Horváth, Péter; Krishna, Sandeep; Semsey, Szabolcs

    2010-07-20

    Gene regulatory networks are based on simple building blocks such as promoters, transcription factors (TFs) and their binding sites on DNA. But how diverse are the functions that can be obtained by different arrangements of promoters and TF binding sites? In this work we constructed synthetic regulatory regions using promoter elements and binding sites of two noninteracting TFs, each sensing a single environmental input signal. We show that simply by combining these three kinds of elements, we can obtain 11 of the 16 Boolean logic gates that integrate two environmental signals in vivo. Further, we demonstrate how combination of logic gates can result in new logic functions. Our results suggest that simple elements of transcription regulation form a highly flexible toolbox that can generate diverse functions under natural selection.

  18. Reiterative AP2a activity controls sequential steps in the neural crest gene regulatory network.

    Science.gov (United States)

    de Crozé, Noémie; Maczkowiak, Frédérique; Monsoro-Burq, Anne H

    2011-01-04

    The neural crest (NC) emerges from combinatorial inductive events occurring within its progenitor domain, the neural border (NB). Several transcription factors act early at the NB, but the initiating molecular events remain elusive. Recent data from basal vertebrates suggest that ap2 might have been critical for NC emergence; however, the role of AP2 factors at the NB remains unclear. We show here that AP2a initiates NB patterning and is sufficient to elicit a NB-like pattern in neuralized ectoderm. In contrast, the other early regulators do not participate in ap2a initiation at the NB, but cooperate to further establish a robust NB pattern. The NC regulatory network uses a multistep cascade of secreted inducers and transcription factors, first at the NB and then within the NC progenitors. Here we report that AP2a acts at two distinct steps of this cascade. As the earliest known NB specifier, AP2a mediates Wnt signals to initiate the NB and activate pax3; as a NC specifier, AP2a regulates further NC development independent of and downstream of NB patterning. Our findings reconcile conflicting observations from various vertebrate organisms. AP2a provides a paradigm for the reiterated use of multifunctional molecules, thereby facilitating emergence of the NC in vertebrates.

  19. Detecting shifts in gene regulatory networks during time-course experiments at single-time-point temporal resolution.

    Science.gov (United States)

    Takenaka, Yoichi; Seno, Shigeto; Matsuda, Hideo

    2015-10-01

    Comprehensively understanding the dynamics of biological systems is one of the greatest challenges in biology. Vastly improved biological technologies have provided vast amounts of information that must be understood by bioinformatics and systems biology researchers. Gene regulations have been frequently modeled by ordinary differential equations or graphical models based on time-course gene expression profiles. The state-of-the-art computational approaches for analyzing gene regulations assume that their models are same throughout time-course experiments. However, these approaches cannot easily analyze transient changes at a time point, such as diauxic shift. We propose a score that analyzes the gene regulations at each time point. The score is based on the information gains of information criterion values. The method detects the shifts in gene regulatory networks (GRNs) during time-course experiments with single-time-point resolution. The effectiveness of the method is evaluated on the diauxic shift from glucose to lactose in Escherichia coli. Gene regulation shifts were detected at two time points: the first corresponding to the time at which the growth of E. coli ceased and the second corresponding to the end of the experiment, when the nutrient sources (glucose and lactose) had become exhausted. According to these results, the proposed score and method can appropriately detect the time of gene regulation shifts. The method based on the proposed score provides a new tool for analyzing dynamic biological systems. Because the score value indicates the strength of gene regulation at each time point in a gene expression profile, it can potentially infer hidden GRNs from time-course experiments.

  20. Investigation of Gene Regulatory Networks Associated with Autism Spectrum Disorder Based on MiRNA Expression in China.

    Directory of Open Access Journals (Sweden)

    Fengzhen Huang

    Full Text Available Autism spectrum disorder (ASD comprise a group of neurodevelopmental disorders characterized by deficits in social and communication capacities and repetitive behaviors. Increasing neuroscientific evidence indicates that the neuropathology of ASD is widespread and involves epigenetic regulation in the brain. Differentially expressed miRNAs in the peripheral blood from autism patients were identified by high-throughput miRNA microarray analyses. Five of these miRNAs were confirmed through quantitative reverse transcription-polymerase chain reaction (qRT-PCR analysis. A search for candidate target genes of the five confirmed miRNAs was performed through a Kyoto encyclopedia of genes and genomes (KEGG biological pathways and Gene Ontology enrichment analysis of gene function to identify gene regulatory networks. To the best of our knowledge, this study provides the first global miRNA expression profile of ASD in China. The differentially expressed miR-34b may potentially explain the higher percentage of male ASD patients, and the aberrantly expressed miR-103a-3p may contribute to the abnormal ubiquitin-mediated proteolysis observed in ASD.

  1. Intercellular delay regulates the collective period of repressively coupled gene regulatory oscillator networks.

    Science.gov (United States)

    Wang, Yongqiang; Hori, Yutaka; Hara, Shinji; Doyle, Francis J

    2014-01-01

    Most biological rhythms are generated by a population of cellular oscillators coupled through intercellular signaling. Recent experimental evidence shows that the collective period may differ significantly from the autonomous period in the presence of intercellular delays. The phenomenon has been investigated using delay-coupled phase oscillators, but the proposed phase model contains no direct biological mechanism, which may weaken the model's reliability in unraveling biophysical principles. Based on a published gene regulatory oscillator model, we analyze the collective period of delay-coupled biological oscillators using the multivariable harmonic balance technique. We prove that, in contradiction to the common intuition that the collective period increases linearly with the coupling delay, the collective period turns out to be a periodic function of the intercellular delay. More surprisingly, the collective period may even decrease with the intercellular delay when the delay resides in certain regions. The collective period is given in a closed-form in terms of biochemical reaction constants and thus provides biological insights as well as guidance in synthetic-biological-oscillator design. Simulation results are given based on a segmentation clock model to confirm the theoretical predictions.

  2. Transcription regulatory networks analysis using CAGE

    KAUST Repository

    Tegnér, Jesper N.

    2009-10-01

    Mapping out cellular networks in general and transcriptional networks in particular has proved to be a bottle-neck hampering our understanding of biological processes. Integrative approaches fusing computational and experimental technologies for decoding transcriptional networks at a high level of resolution is therefore of uttermost importance. Yet, this is challenging since the control of gene expression in eukaryotes is a complex multi-level process influenced by several epigenetic factors and the fine interplay between regulatory proteins and the promoter structure governing the combinatorial regulation of gene expression. In this chapter we review how the CAGE data can be integrated with other measurements such as expression, physical interactions and computational prediction of regulatory motifs, which together can provide a genome-wide picture of eukaryotic transcriptional regulatory networks at a new level of resolution. © 2010 by Pan Stanford Publishing Pte. Ltd. All rights reserved.

  3. Evolving gene regulatory networks into cellular networks guiding adaptive behavior: an outline how single cells could have evolved into a centralized neurosensory system.

    Science.gov (United States)

    Fritzsch, Bernd; Jahan, Israt; Pan, Ning; Elliott, Karen L

    2015-01-01

    Understanding the evolution of the neurosensory system of man, able to reflect on its own origin, is one of the major goals of comparative neurobiology. Details of the origin of neurosensory cells, their aggregation into central nervous systems and associated sensory organs and their localized patterning leading to remarkably different cell types aggregated into variably sized parts of the central nervous system have begun to emerge. Insights at the cellular and molecular level have begun to shed some light on the evolution of neurosensory cells, partially covered in this review. Molecular evidence suggests that high mobility group (HMG) proteins of pre-metazoans evolved into the definitive Sox [SRY (sex determining region Y)-box] genes used for neurosensory precursor specification in metazoans. Likewise, pre-metazoan basic helix-loop-helix (bHLH) genes evolved in metazoans into the group A bHLH genes dedicated to neurosensory differentiation in bilaterians. Available evidence suggests that the Sox and bHLH genes evolved a cross-regulatory network able to synchronize expansion of precursor populations and their subsequent differentiation into novel parts of the brain or sensory organs. Molecular evidence suggests metazoans evolved patterning gene networks early, which were not dedicated to neuronal development. Only later in evolution were these patterning gene networks tied into the increasing complexity of diffusible factors, many of which were already present in pre-metazoans, to drive local patterning events. It appears that the evolving molecular basis of neurosensory cell development may have led, in interaction with differentially expressed patterning genes, to local network modifications guiding unique specializations of neurosensory cells into sensory organs and various areas of the central nervous system.

  4. Integrative identification of deregulated miRNA/TF-mediated gene regulatory loops and networks in prostate cancer.

    Science.gov (United States)

    Afshar, Ali Sobhi; Xu, Joseph; Goutsias, John

    2014-01-01

    MicroRNAs (miRNAs) have attracted a great deal of attention in biology and medicine. It has been hypothesized that miRNAs interact with transcription factors (TFs) in a coordinated fashion to play key roles in regulating signaling and transcriptional pathways and in achieving robust gene regulation. Here, we propose a novel integrative computational method to infer certain types of deregulated miRNA-mediated regulatory circuits at the transcriptional, post-transcriptional and signaling levels. To reliably predict miRNA-target interactions from mRNA/miRNA expression data, our method collectively utilizes sequence-based miRNA-target predictions obtained from several algorithms, known information about mRNA and miRNA targets of TFs available in existing databases, certain molecular structures identified to be statistically over-represented in gene regulatory networks, available molecular subtyping information, and state-of-the-art statistical techniques to appropriately constrain the underlying analysis. In this way, the method exploits almost every aspect of extractable information in the expression data. We apply our procedure on mRNA/miRNA expression data from prostate tumor and normal samples and detect numerous known and novel miRNA-mediated deregulated loops and networks in prostate cancer. We also demonstrate instances of the results in a number of distinct biological settings, which are known to play crucial roles in prostate and other types of cancer. Our findings show that the proposed computational method can be used to effectively achieve notable insights into the poorly understood molecular mechanisms of miRNA-mediated interactions and dissect their functional roles in cancer in an effort to pave the way for miRNA-based therapeutics in clinical settings.

  5. The role of genome and gene regulatory network canalization in the evolution of multi-trait polymorphisms and sympatric speciation

    Directory of Open Access Journals (Sweden)

    Hogeweg Paulien

    2009-07-01

    Full Text Available Abstract Background Sexual reproduction has classically been considered as a barrier to the buildup of discrete phenotypic differentiation. This notion has been confirmed by models of sympatric speciation in which a fixed genetic architecture and a linear genotype phenotype mapping were assumed. In this paper we study the influence of a flexible genetic architecture and non-linear genotype phenotype map on differentiation under sexual reproduction. We use an individual based model in which organisms have a genome containing genes and transcription factor binding sites. Mutations involve single genes or binding sites or stretches of genome. The genome codes for a regulatory network that determines the gene expression pattern and hence the phenotype of the organism, resulting in a non-linear genotype phenotype map. The organisms compete in a multi-niche environment, imposing selection for phenotypic differentiation. Results We find as a generic outcome the evolution of discrete clusters of organisms adapted to different niches, despite random mating. Organisms from different clusters are distinct on the genotypic, the network and the phenotypic level. However, the genome and network differences are constrained to a subset of the genome locations, a process we call genotypic canalization. We demonstrate how this canalization leads to an increased robustness to recombination and increasing hybrid fitness. Finally, in case of assortative mating, we explain how this canalization increases the effectiveness of assortativeness. Conclusion We conclude that in case of a flexible genetic architecture and a non-linear genotype phenotype mapping, sexual reproduction does not constrain phenotypic differentiation, but instead constrains the genotypic differences underlying it. We hypothesize that, as genotypic canalization enables differentiation despite random mating and increases the effectiveness of assortative mating, sympatric speciation is more likely

  6. Splitting Strategy for Simulating Genetic Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Xiong You

    2014-01-01

    Full Text Available The splitting approach is developed for the numerical simulation of genetic regulatory networks with a stable steady-state structure. The numerical results of the simulation of a one-gene network, a two-gene network, and a p53-mdm2 network show that the new splitting methods constructed in this paper are remarkably more effective and more suitable for long-term computation with large steps than the traditional general-purpose Runge-Kutta methods. The new methods have no restriction on the choice of stepsize due to their infinitely large stability regions.

  7. General theory of genotype to phenotype mapping: derivation of epigenetic landscapes from N-node complex gene regulatory networks.

    Science.gov (United States)

    Villarreal, Carlos; Padilla-Longoria, Pablo; Alvarez-Buylla, Elena R

    2012-09-14

    We propose a systematic methodology to construct a probabilistic epigenetic landscape of cell-fate attainment associated with N-node Boolean genetic regulatory networks. The general derivation proposed here is exemplified with an Arabidopsis thaliana network underlying floral organ determination grounded on qualitative experimental data.

  8. A study of bacterial gene regulatory mechanisms

    DEFF Research Database (Denmark)

    Hansen, Sabine

    the different regulatory mechanisms affect system dynamics. We have designed a synthetic gene regulatory network (GRN) in bacterial cells that enables us to study the dynamics of GRNs. The results presented in this PhD thesis show that model equations based on the established mechanisms of action of each...... of a particular type of regulatory mechanism. The synthetic system presented in this thesis is, to our knowledge, the first of its kind to allow a direct comparison of the dynamic behaviors of gene regulatory networks that employ different mechanisms of regulation. In addition to studying the dynamic behavior...... of GRNs this thesis also provided the first evidence of the sensor histidine kinase VC1831 being an additional player in the Vibrio cholerae quorum sensing (QS) GRN. Bacteria use a process of cell-cell communication called QS which enable the bacterial cells to collectively control their gene expression...

  9. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model

    Science.gov (United States)

    2014-01-01

    Background Obesity is a complex metabolic condition in strong association with various diseases, like type 2 diabetes, resulting in major public health and economic implications. Obesity is the result of environmental and genetic factors and their interactions, including genome-wide genetic interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model for human obesity, offering the possibility to study in-depth organ-level transcriptomic regulations of obesity, unfeasible in humans. Our aim was to reveal adipose tissue co-expression networks, pathways and transcriptional regulations of obesity using RNA Sequencing based systems biology approaches in a porcine model. Methods We selected 36 animals for RNA Sequencing from a previously created F2 pig population representing three extreme groups based on their predicted genetic risks for obesity. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to detect clusters of highly co-expressed genes (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. Results WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P < 0.001). Functional annotation identified pathways enlightening the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using confident scores, for the WGCNA module which was associated with osteoclast differentiation: CCR1, MSR1 and SI1 (probability scores respectively 95.30, 62.28, and

  10. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model.

    Science.gov (United States)

    Kogelman, Lisette J A; Cirera, Susanna; Zhernakova, Daria V; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N

    2014-09-30

    Obesity is a complex metabolic condition in strong association with various diseases, like type 2 diabetes, resulting in major public health and economic implications. Obesity is the result of environmental and genetic factors and their interactions, including genome-wide genetic interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model for human obesity, offering the possibility to study in-depth organ-level transcriptomic regulations of obesity, unfeasible in humans. Our aim was to reveal adipose tissue co-expression networks, pathways and transcriptional regulations of obesity using RNA Sequencing based systems biology approaches in a porcine model. We selected 36 animals for RNA Sequencing from a previously created F2 pig population representing three extreme groups based on their predicted genetic risks for obesity. We applied Weighted Gene Co-expression Network Analysis (WGCNA) to detect clusters of highly co-expressed genes (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using confident scores, for the WGCNA module which was associated with osteoclast differentiation: CCR1, MSR1 and SI1 (probability scores respectively 95.30, 62.28, and 34.58). Moreover, detection of differentially connected genes identified various genes previously identified to be

  11. Plasticity of gene-regulatory networks controlling sex determination: of masters, slaves, usual suspects, newcomers, and usurpators.

    Science.gov (United States)

    Herpin, Amaury; Schartl, Manfred

    2015-10-01

    Sexual dimorphism is one of the most pervasive and diverse features of animal morphology, physiology, and behavior. Despite the generality of the phenomenon itself, the mechanisms controlling how sex is determined differ considerably among various organismic groups, have evolved repeatedly and independently, and the underlying molecular pathways can change quickly during evolution. Even within closely related groups of organisms for which the development of gonads on the morphological, histological, and cell biological level is undistinguishable, the molecular control and the regulation of the factors involved in sex determination and gonad differentiation can be substantially different. The biological meaning of the high molecular plasticity of an otherwise common developmental program is unknown. While comparative studies suggest that the downstream effectors of sex-determining pathways tend to be more stable than the triggering mechanisms at the top, it is still unclear how conserved the downstream networks are and how all components work together. After many years of stasis, when the molecular basis of sex determination was amenable only in the few classical model organisms (fly, worm, mouse), recently, sex-determining genes from several animal species have been identified and new studies have elucidated some novel regulatory interactions and biological functions of the downstream network, particularly in vertebrates. These data have considerably changed our classical perception of a simple linear developmental cascade that makes the decision for the embryo to develop as male or female, and how it evolves.

  12. The Transcriptional and Gene Regulatory Network of Lactococcus lactis MG1363 during Growth in Milk

    DEFF Research Database (Denmark)

    de Jong, Anne; Hansen, Morten Ejby; Kuipers, Oscar P.;

    2013-01-01

    analysis of gene expression over time showed that L. lactis adapted quickly to the environmental changes. Using upstream sequences of genes with correlated gene expression profiles, we uncovered a substantial number of putative DNA binding motifs that may be relevant for L. lactis fermentative growth...

  13. Non-transcriptional regulatory processes shape transcriptional network dynamics

    OpenAIRE

    Ray, J. Christian J; Tabor, Jeffrey J.; Igoshin, Oleg A.

    2011-01-01

    Information about the extra- or intracellular environment is often captured as biochemical signals propagating through regulatory networks. These signals eventually drive phenotypic changes, typically by altering gene expression programs in the cell. Reconstruction of transcriptional regulatory networks has given a compelling picture of bacterial physiology, but transcriptional network maps alone often fail to describe phenotypes. In many cases, the dynamical performance of transcriptional re...

  14. FlyOde - a platform for community curation and interactive visualization of dynamic gene regulatory networks in Drosophila eye development [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Stefan A. Koestler

    2015-12-01

    Full Text Available Motivation: Understanding the regulatory mechanisms governing eye development of the model organism Drosophila melanogaster (D. m. requires structured knowledge of the involved genes and proteins, their interactions, and dynamic expression patterns. Especially the latter information is however to a large extent scattered throughout the literature. Results: FlyOde is an online platform for the systematic assembly of data on D. m. eye development. It consists of data on eye development obtained from the literature, and a web interface for users to interactively display these data as a gene regulatory network. Our manual curation process provides high standard structured data, following a specifically designed ontology. Visualization of gene interactions provides an overview of network topology, and filtering according to user-defined expression patterns makes it a versatile tool for daily tasks, as demonstrated by usage examples. Users are encouraged to submit additional data via a simple online form.

  15. Regulatory network operations in the Pathway Tools software

    Directory of Open Access Journals (Sweden)

    Paley Suzanne M

    2012-09-01

    Full Text Available Abstract Background Biologists are elucidating complex collections of genetic regulatory data for multiple organisms. Software is needed for such regulatory network data. Results The Pathway Tools software supports storage and manipulation of regulatory information through a variety of strategies. The Pathway Tools regulation ontology captures transcriptional and translational regulation, substrate-level regulation of enzyme activity, post-translational modifications, and regulatory pathways. Regulatory visualizations include a novel diagram that summarizes all regulatory influences on a gene; a transcription-unit diagram, and an interactive visualization of a full transcriptional regulatory network that can be painted with gene expression data to probe correlations between gene expression and regulatory mechanisms. We introduce a novel type of enrichment analysis that asks whether a gene-expression dataset is over-represented for known regulators. We present algorithms for ranking the degree of regulatory influence of genes, and for computing the net positive and negative regulatory influences on a gene. Conclusions Pathway Tools provides a comprehensive environment for manipulating molecular regulatory interactions that integrates regulatory data with an organism’s genome and metabolic network. Curated collections of regulatory data authored using Pathway Tools are available for Escherichia coli, Bacillus subtilis, and Shewanella oneidensis.

  16. Discovery of Putative Herbicide Resistance Genes and Its Regulatory Network in Chickpea Using Transcriptome Sequencing

    Directory of Open Access Journals (Sweden)

    Mir A. Iquebal

    2017-06-01

    Full Text Available Background: Chickpea (Cicer arietinum L. contributes 75% of total pulse production. Being cheaper than animal protein, makes it important in dietary requirement of developing countries. Weed not only competes with chickpea resulting into drastic yield reduction but also creates problem of harboring fungi, bacterial diseases and insect pests. Chemical approach having new herbicide discovery has constraint of limited lead molecule options, statutory regulations and environmental clearance. Through genetic approach, transgenic herbicide tolerant crop has given successful result but led to serious concern over ecological safety thus non-transgenic approach like marker assisted selection is desirable. Since large variability in tolerance limit of herbicide already exists in chickpea varieties, thus the genes offering herbicide tolerance can be introgressed in variety improvement programme. Transcriptome studies can discover such associated key genes with herbicide tolerance in chickpea.Results: This is first transcriptomic studies of chickpea or even any legume crop using two herbicide susceptible and tolerant genotypes exposed to imidazoline (Imazethapyr. Approximately 90 million paired-end reads generated from four samples were processed and assembled into 30,803 contigs using reference based assembly. We report 6,310 differentially expressed genes (DEGs, of which 3,037 were regulated by 980 miRNAs, 1,528 transcription factors associated with 897 DEGs, 47 Hub proteins, 3,540 putative Simple Sequence Repeat-Functional Domain Marker (SSR-FDM, 13,778 genic Single Nucleotide Polymorphism (SNP putative markers and 1,174 Indels. Randomly selected 20 DEGs were validated using qPCR. Pathway analysis suggested that xenobiotic degradation related gene, glutathione S-transferase (GST were only up-regulated in presence of herbicide. Down-regulation of DNA replication genes and up-regulation of abscisic acid pathway genes were observed. Study further reveals

  17. Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci

    NARCIS (Netherlands)

    Keurentjes, Joost J.B.; Fu, Jingyuan; Terpstra, Inez R.; Garcia, Juan M.; Ackerveken, Guido van den; Snoek, L. Basten; Peeters, Anton J.M.; Vreugdenhil, Dick; Koornneef, Maarten; Jansen, Ritsert C.

    2007-01-01

    Accessions of a plant species can show considerable genetic differences that are analyzed effectively by using recombinant inbred line (RIL) populations. Here we describe the results of genome-wide expression variation analysis in an RIL population of Arabidopsis thaliana. For many genes, variation

  18. Identification of the Rage-dependent gene regulatory network in a mouse model of skin inflammation

    Directory of Open Access Journals (Sweden)

    Gebhardt Christoffer

    2010-10-01

    Full Text Available Abstract Background In the past, molecular mechanisms that drive the initiation of an inflammatory response have been studied intensively. However, corresponding mechanisms that sustain the expression of inflammatory response genes and hence contribute to the establishment of chronic disorders remain poorly understood. Recently, we provided genetic evidence that signaling via the receptor for advanced glycation end products (Rage drives the strength and maintenance of an inflammatory reaction. In order to decipher the mode of Rage function on gene transcription levels during inflammation, we applied global gene expression profiling on time-resolved samples of mouse back skin, which had been treated with the phorbol ester TPA, a potent inducer of skin inflammation. Results Ranking of TPA-regulated genes according to their time average mean and peak expression and superimposition of data sets from wild-type (wt and Rage-deficient mice revealed that Rage signaling is not essential for initial changes in TPA-induced transcription, but absolutely required for sustained alterations in transcript levels. Next, we used a data set of differentially expressed genes between TPA-treated wt and Rage-deficient skin and performed computational analysis of their proximal promoter regions. We found a highly significant enrichment for several transcription factor binding sites (TFBS leading to the prediction that corresponding transcription factors, such as Sp1, Tcfap2, E2f, Myc and Egr, are regulated by Rage signaling. Accordingly, we could confirm aberrant expression and regulation of members of the E2f protein family in epidermal keratinocytes of Rage-deficient mice. Conclusions In summary, our data support the model that engagement of Rage converts a transient cellular stimulation into sustained cellular dysfunction and highlight a novel role of the Rb-E2f pathway in Rage-dependent inflammation during pathological conditions.

  19. Information transmission in genetic regulatory networks: a review

    Science.gov (United States)

    Tkačik, Gašper; Walczak, Aleksandra M.

    2011-04-01

    Genetic regulatory networks enable cells to respond to changes in internal and external conditions by dynamically coordinating their gene expression profiles. Our ability to make quantitative measurements in these biochemical circuits has deepened our understanding of what kinds of computations genetic regulatory networks can perform, and with what reliability. These advances have motivated researchers to look for connections between the architecture and function of genetic regulatory networks. Transmitting information between a network's inputs and outputs has been proposed as one such possible measure of function, relevant in certain biological contexts. Here we summarize recent developments in the application of information theory to gene regulatory networks. We first review basic concepts in information theory necessary for understanding recent work. We then discuss the functional complexity of gene regulation, which arises from the molecular nature of the regulatory interactions. We end by reviewing some experiments that support the view that genetic networks responsible for early development of multicellular organisms might be maximizing transmitted 'positional information'.

  20. Information transmission in genetic regulatory networks: a review.

    Science.gov (United States)

    Tkačik, Gašper; Walczak, Aleksandra M

    2011-04-20

    Genetic regulatory networks enable cells to respond to changes in internal and external conditions by dynamically coordinating their gene expression profiles. Our ability to make quantitative measurements in these biochemical circuits has deepened our understanding of what kinds of computations genetic regulatory networks can perform, and with what reliability. These advances have motivated researchers to look for connections between the architecture and function of genetic regulatory networks. Transmitting information between a network's inputs and outputs has been proposed as one such possible measure of function, relevant in certain biological contexts. Here we summarize recent developments in the application of information theory to gene regulatory networks. We first review basic concepts in information theory necessary for understanding recent work. We then discuss the functional complexity of gene regulation, which arises from the molecular nature of the regulatory interactions. We end by reviewing some experiments that support the view that genetic networks responsible for early development of multicellular organisms might be maximizing transmitted 'positional information'.

  1. Large-Scale Recurrent Neural Network Based Modelling of Gene Regulatory Network Using Cuckoo Search-Flower Pollination Algorithm.

    Science.gov (United States)

    Mandal, Sudip; Khan, Abhinandan; Saha, Goutam; Pal, Rajat K

    2016-01-01

    The accurate prediction of genetic networks using computational tools is one of the greatest challenges in the postgenomic era. Recurrent Neural Network is one of the most popular but simple approaches to model the network dynamics from time-series microarray data. To date, it has been successfully applied to computationally derive small-scale artificial and real-world genetic networks with high accuracy. However, they underperformed for large-scale genetic networks. Here, a new methodology has been proposed where a hybrid Cuckoo Search-Flower Pollination Algorithm has been implemented with Recurrent Neural Network. Cuckoo Search is used to search the best combination of regulators. Moreover, Flower Pollination Algorithm is applied to optimize the model parameters of the Recurrent Neural Network formalism. Initially, the proposed method is tested on a benchmark large-scale artificial network for both noiseless and noisy data. The results obtained show that the proposed methodology is capable of increasing the inference of correct regulations and decreasing false regulations to a high degree. Secondly, the proposed methodology has been validated against the real-world dataset of the DNA SOS repair network of Escherichia coli. However, the proposed method sacrifices computational time complexity in both cases due to the hybrid optimization process.

  2. HIDEN: Hierarchical decomposition of regulatory networks

    Directory of Open Access Journals (Sweden)

    Gülsoy Günhan

    2012-09-01

    Full Text Available Abstract Background Transcription factors regulate numerous cellular processes by controlling the rate of production of each gene. The regulatory relations are modeled using transcriptional regulatory networks. Recent studies have shown that such networks have an underlying hierarchical organization. We consider the problem of discovering the underlying hierarchy in transcriptional regulatory networks. Results We first transform this problem to a mixed integer programming problem. We then use existing tools to solve the resulting problem. For larger networks this strategy does not work due to rapid increase in running time and space usage. We use divide and conquer strategy for such networks. We use our method to analyze the transcriptional regulatory networks of E. coli, H. sapiens and S. cerevisiae. Conclusions Our experiments demonstrate that: (i Our method gives statistically better results than three existing state of the art methods; (ii Our method is robust against errors in the data and (iii Our method’s performance is not affected by the different topologies in the data.

  3. The PLETHORA Gene Regulatory Network Guides Growth and Cell Differentiation in Arabidopsis Roots.

    Science.gov (United States)

    Santuari, Luca; Sanchez-Perez, Gabino F; Luijten, Marijn; Rutjens, Bas; Terpstra, Inez; Berke, Lidija; Gorte, Maartje; Prasad, Kalika; Bao, Dongping; Timmermans-Hereijgers, Johanna L P M; Maeo, Kenichiro; Nakamura, Kenzo; Shimotohno, Akie; Pencik, Ales; Novak, Ondrej; Ljung, Karin; van Heesch, Sebastiaan; de Bruijn, Ewart; Cuppen, Edwin; Willemsen, Viola; Mähönen, Ari Pekka; Lukowitz, Wolfgang; Snel, Berend; de Ridder, Dick; Scheres, Ben; Heidstra, Renze

    2016-12-01

    Organ formation in animals and plants relies on precise control of cell state transitions to turn stem cell daughters into fully differentiated cells. In plants, cells cannot rearrange due to shared cell walls. Thus, differentiation progression and the accompanying cell expansion must be tightly coordinated across tissues. PLETHORA (PLT) transcription factor gradients are unique in their ability to guide the progression of cell differentiation at different positions in the growing Arabidopsis thaliana root, which contrasts with well-described transcription factor gradients in animals specifying distinct cell fates within an essentially static context. To understand the output of the PLT gradient, we studied the gene set transcriptionally controlled by PLTs. Our work reveals how the PLT gradient can regulate cell state by region-specific induction of cell proliferation genes and repression of differentiation. Moreover, PLT targets include major patterning genes and autoregulatory feedback components, enforcing their role as master regulators of organ development. © 2016 American Society of Plant Biologists. All rights reserved.

  4. Differential network analysis reveals dysfunctional regulatory networks in gastric carcinogenesis.

    Science.gov (United States)

    Cao, Mu-Shui; Liu, Bing-Ya; Dai, Wen-Tao; Zhou, Wei-Xin; Li, Yi-Xue; Li, Yuan-Yuan

    2015-01-01

    Gastric Carcinoma is one of the most common cancers in the world. A large number of differentially expressed genes have been identified as being associated with gastric cancer progression, however, little is known about the underlying regulatory mechanisms. To address this problem, we developed a differential networking approach that is characterized by including a nascent methodology, differential coexpression analysis (DCEA), and two novel quantitative methods for differential regulation analysis. We first applied DCEA to a gene expression dataset of gastric normal mucosa, adenoma and carcinoma samples to identify gene interconnection changes during cancer progression, based on which we inferred normal, adenoma, and carcinoma-specific gene regulation networks by using linear regression model. It was observed that cancer genes and drug targets were enriched in each network. To investigate the dynamic changes of gene regulation during carcinogenesis, we then designed two quantitative methods to prioritize differentially regulated genes (DRGs) and gene pairs or links (DRLs) between adjacent stages. It was found that known cancer genes and drug targets are significantly higher ranked. The top 4% normal vs. adenoma DRGs (36 genes) and top 6% adenoma vs. carcinoma DRGs (56 genes) proved to be worthy of further investigation to explore their association with gastric cancer. Out of the 16 DRGs involved in two top-10 DRG lists of normal vs. adenoma and adenoma vs. carcinoma comparisons, 15 have been reported to be gastric cancer or cancer related. Based on our inferred differential networking information and known signaling pathways, we generated testable hypotheses on the roles of GATA6, ESRRG and their signaling pathways in gastric carcinogenesis. Compared with established approaches which build genome-scale GRNs, or sub-networks around differentially expressed genes, the present one proved to be better at enriching cancer genes and drug targets, and prioritizing

  5. Reconstruction of extended Petri nets from time series data and its application to signal transduction and to gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Marwan Wolfgang

    2011-07-01

    Full Text Available Abstract Background Network inference methods reconstruct mathematical models of molecular or genetic networks directly from experimental data sets. We have previously reported a mathematical method which is exclusively data-driven, does not involve any heuristic decisions within the reconstruction process, and deliveres all possible alternative minimal networks in terms of simple place/transition Petri nets that are consistent with a given discrete time series data set. Results We fundamentally extended the previously published algorithm to consider catalysis and inhibition of the reactions that occur in the underlying network. The results of the reconstruction algorithm are encoded in the form of an extended Petri net involving control arcs. This allows the consideration of processes involving mass flow and/or regulatory interactions. As a non-trivial test case, the phosphate regulatory network of enterobacteria was reconstructed using in silico-generated time-series data sets on wild-type and in silico mutants. Conclusions The new exact algorithm reconstructs extended Petri nets from time series data sets by finding all alternative minimal networks that are consistent with the data. It suggested alternative molecular mechanisms for certain reactions in the network. The algorithm is useful to combine data from wild-type and mutant cells and may potentially integrate physiological, biochemical, pharmacological, and genetic data in the form of a single model.

  6. Identification of New Genes Positively Regulated by Tri10 and a Regulatory Network for Trichothecene Mycotoxin Production

    OpenAIRE

    Peplow, Andrew W.; Tag, Andrew G.; Garifullina, Gulnara F.; Beremand, Marian N.

    2003-01-01

    Tri10, a regulatory gene in trichothecene mycotoxin-producing Fusarium species, is required for trichothecene biosynthesis and the coordinated expression of four trichothecene pathway-specific genes (Tri4, Tri5, Tri6, and Tri101) and the isoprenoid biosynthetic gene for farnesyl pyrophosphate synthetase (FPPS). We showed that six more trichothecene genes (Tri3, Tri7, Tri8, Tri9, Tri11, and Tri12) are regulated by Tri10. We also constructed a cDNA library from a strain of Fusarium sporotrichio...

  7. A Gene Regulatory Network Cooperatively Controlled by Pdx1 and Sox9 Governs Lineage Allocation of Foregut Progenitor Cells

    DEFF Research Database (Denmark)

    Shih, Hung Ping; Seymour, Philip A; Patel, Nisha A;

    2015-01-01

    The generation of pancreas, liver, and intestine from a common pool of progenitors in the foregut endoderm requires the establishment of organ boundaries. How dorsal foregut progenitors activate pancreatic genes and evade the intestinal lineage choice remains unclear. Here, we identify Pdx1 and Sox...... pancreatic fate and sheds light on the gene regulatory circuitry that governs the development of distinct organs from multi-lineage-competent foregut progenitors....

  8. Towards a systems-level understanding of gene regulatory, protein interaction, and metabolic networks in cyanobacteria

    Directory of Open Access Journals (Sweden)

    Miguel Angel Hernández-Prieto

    2014-07-01

    Full Text Available Cyanobacteria are essential primary producers in marine ecosystems, playing an important role in both carbon and nitrogen cycles. In the last decade, various genome sequencing and metagenomic projects have generated large amounts of genetic data for cyanobacteria. This wealth of data provides researchers with a new basis for the study of molecular adaptation, ecology and evolution of cyanobacteria, as well as for developing biotechnological applications. It also facilitates the use of multiplex techniques, i.e., expression profiling by high-throughput technologies such as microarrays, RNA-seq, and proteomics. However, exploration and analysis of these data is challenging, and often requires advanced computational methods. Also, they need to be integrated into our existing framework of knowledge to use them to draw reliable biological conclusions. Here, systems biology provides important tools. Especially, the construction and analysis of molecular networks has emerged as a powerful systems-level framework, with which to integrate such data, and to better understand biological relevant processes in these organisms.In this review, we provide an overview of the advances and experimental approaches undertaken using multiplex data from genomic, transcriptomic, proteomic, and metabolomic studies in cyanobacteria. Furthermore, we summarize currently available web-based tools dedicated to cyanobacteria, i.e., CyanoBase, CyanoEXpress, ProPortal, Cyanorak, CyanoBIKE, and CINPER. Finally, we present a case study for the freshwater model cyanobacteria, Synechocystis sp. PCC6803, to show the power of meta-analysis, and the potential to extrapolate acquired knowledge to the ecologically important marine cyanobacteria genus, Prochlorococcus.

  9. Integrated analysis of the miRNA, gene and pathway regulatory network in gastric cancer.

    Science.gov (United States)

    Zhang, Haiyang; Qu, Yanjun; Duan, Jingjing; Deng, Ting; Liu, Rui; Zhang, Le; Bai, Ming; Li, Jialu; Zhou, Likun; Ning, Tao; Li, Hongli; Ge, Shaohua; Li, Hua; Ying, Guoguang; Huang, Dingzhi; Ba, Yi

    2016-02-01

    Gastric cancer is one of the most common malignant tumors worldwide; however, the efficacy of clinical treatment is limited. MicroRNAs (miRNAs) are a class of small non-coding RNAs that have been reported to play a key role in the development of cancer. They also provide novel candidates for targeted therapy. To date, in-depth studies on the molecular mechanisms of gastric cancer involving miRNAs are still absent. We previously reported that 5 miRNAs were identified as being significantly increased in gastric cancer, and the role of these miRNAs was investigated in the present study. By using bioinformatics tools, we found that more than 4,000 unique genes are potential downstream targets of gastric cancer miRNAs, and these targets belong to the protein class of nucleic acid binding, transcription factor, enzyme modulator, transferase and receptor. Pathway mapping showed that the targets of gastric cancer miRNAs are involved in the MAPK signaling pathway, pathways in cancer, the PI3K-Akt signaling pathway, the HTLV-1 signaling pathway and Ras signaling pathway, thus regulating cell growth, differentiation, apoptosis and metastasis. Analysis of the pathways related to miRNAs may provides potential drug targets for future therapy of gastric cancer.

  10. A gene regulatory network for apical organ neurogenesis and its spatial control in sea star embryos.

    Science.gov (United States)

    Cheatle Jarvela, Alys M; Yankura, Kristen A; Hinman, Veronica F

    2016-11-15

    How neural stem cells generate the correct number and type of differentiated neurons in appropriate places remains an important question. Although nervous systems are diverse across phyla, in many taxa the larva forms an anterior concentration of serotonergic neurons, or apical organ. The sea star embryo initially has a pan-neurogenic ectoderm, but the genetic mechanism that directs a subset of these cells to generate serotonergic neurons in a particular location is unresolved. We show that neurogenesis in sea star larvae begins with soxc-expressing multipotent progenitors. These give rise to restricted progenitors that express lhx2/9 soxc- and lhx2/9-expressing cells can undergo both asymmetric divisions, allowing for progression towards a particular neural fate, and symmetric proliferative divisions. We show that nested concentric domains of gene expression along the anterior-posterior (AP) axis, which are observed in a great diversity of metazoans, control neurogenesis in the sea star larva by promoting particular division modes and progression towards becoming a neuron. This work explains how spatial patterning in the ectoderm controls progression of neurogenesis in addition to providing spatial cues for neuron location. Modification to the sizes of these AP territories provides a simple mechanism to explain the diversity of neuron number among apical organs.

  11. Information transmission in genetic regulatory networks: a review

    CERN Document Server

    Walczak, Aleksandra M

    2011-01-01

    Genetic regulatory networks enable cells to respond to the changes in internal and external conditions by dynamically coordinating their gene expression profiles. Our ability to make quantitative measurements in these biochemical circuits has deepened our understanding of what kinds of computations genetic regulatory networks can perform and with what reliability. These advances have motivated researchers to look for connections between the architecture and function of genetic regulatory networks. Transmitting information between network's inputs and its outputs has been proposed as one such possible measure of function, relevant in certain biological contexts. Here we summarize recent developments in the application of information theory to gene regulatory networks. We first review basic concepts in information theory necessary to understand recent work. We then discuss the functional complexity of gene regulation which arrises from the molecular nature of the regulatory interactions. We end by reviewing som...

  12. Comparative RNA-Seq Analysis Reveals That Regulatory Network of Maize Root Development Controls the Expression of Genes in Response to N Stress.

    Directory of Open Access Journals (Sweden)

    Xiujing He

    Full Text Available Nitrogen (N is an essential nutrient for plants, and it directly affects grain yield and protein content in cereal crops. Plant root systems are not only critical for anchorage in the soil, but also for N acquisition. Therefore, genes controlling root development might also affect N uptake by plants. In this study, the responses of nitrogen on root architecture of mutant rtcs and wild-type of maize were investigated by morphological and physiological analysis. Subsequently, we performed a comparative RNA-Seq analysis to compare gene expression profiles between mutant rtcs roots and wild-type roots under different N conditions. We identified 786 co-modulated differentially expressed genes (DEGs related to root development. These genes participated in various metabolic processes. A co-expression cluster analysis and a cis-regulatory motifs analysis revealed the importance of the AP2-EREBP transcription factor family in the rtcs-dependent regulatory network. Some genotype-specific DEGs contained at least one LBD motif in their promoter region. Further analyses of the differences in gene transcript levels between rtcs and wild-type under different N conditions revealed 403 co-modulated DEGs with distinct functions. A comparative analysis revealed that the regulatory network controlling root development also controlled gene expression in response to N-deficiency. Several AP2-EREBP family members involved in multiple hormone signaling pathways were among the DEGs. These transcription factors might play important roles in the rtcs-dependent regulatory network related to root development and the N-deficiency response. Genes encoding the nitrate transporters NRT2-1, NAR2.1, NAR2.2, and NAR2.3 showed much higher transcript levels in rtcs than in wild-type under normal-N conditions. This result indicated that the LBD gene family mainly functions as transcriptional repressors, as noted in other studies. In summary, using a comparative RNA-Seq-based approach

  13. Comparative RNA-Seq Analysis Reveals That Regulatory Network of Maize Root Development Controls the Expression of Genes in Response to N Stress.

    Science.gov (United States)

    He, Xiujing; Ma, Haixia; Zhao, Xiongwei; Nie, Shujun; Li, Yuhua; Zhang, Zhiming; Shen, Yaou; Chen, Qi; Lu, Yanli; Lan, Hai; Zhou, Shufeng; Gao, Shibin; Pan, Guangtang; Lin, Haijian

    2016-01-01

    Nitrogen (N) is an essential nutrient for plants, and it directly affects grain yield and protein content in cereal crops. Plant root systems are not only critical for anchorage in the soil, but also for N acquisition. Therefore, genes controlling root development might also affect N uptake by plants. In this study, the responses of nitrogen on root architecture of mutant rtcs and wild-type of maize were investigated by morphological and physiological analysis. Subsequently, we performed a comparative RNA-Seq analysis to compare gene expression profiles between mutant rtcs roots and wild-type roots under different N conditions. We identified 786 co-modulated differentially expressed genes (DEGs) related to root development. These genes participated in various metabolic processes. A co-expression cluster analysis and a cis-regulatory motifs analysis revealed the importance of the AP2-EREBP transcription factor family in the rtcs-dependent regulatory network. Some genotype-specific DEGs contained at least one LBD motif in their promoter region. Further analyses of the differences in gene transcript levels between rtcs and wild-type under different N conditions revealed 403 co-modulated DEGs with distinct functions. A comparative analysis revealed that the regulatory network controlling root development also controlled gene expression in response to N-deficiency. Several AP2-EREBP family members involved in multiple hormone signaling pathways were among the DEGs. These transcription factors might play important roles in the rtcs-dependent regulatory network related to root development and the N-deficiency response. Genes encoding the nitrate transporters NRT2-1, NAR2.1, NAR2.2, and NAR2.3 showed much higher transcript levels in rtcs than in wild-type under normal-N conditions. This result indicated that the LBD gene family mainly functions as transcriptional repressors, as noted in other studies. In summary, using a comparative RNA-Seq-based approach, we identified

  14. Evolutionary algorithms in genetic regulatory networks model

    CERN Document Server

    Raza, Khalid

    2012-01-01

    Genetic Regulatory Networks (GRNs) plays a vital role in the understanding of complex biological processes. Modeling GRNs is significantly important in order to reveal fundamental cellular processes, examine gene functions and understanding their complex relationships. Understanding the interactions between genes gives rise to develop better method for drug discovery and diagnosis of the disease since many diseases are characterized by abnormal behaviour of the genes. In this paper we have reviewed various evolutionary algorithms-based approach for modeling GRNs and discussed various opportunities and challenges.

  15. Genetic Regulatory Networks in Embryogenesis and Evolution

    Science.gov (United States)

    1998-01-01

    The article introduces a series of papers that were originally presented at a workshop titled Genetic Regulatory Network in Embryogenesis and Evaluation. Contents include the following: evolution of cleavage programs in relationship to axial specification and body plan evolution, changes in cell lineage specification elucidate evolutionary relations in spiralia, axial patterning in the leech: developmental mechanisms and evolutionary implications, hox genes in arthropod development and evolution, heterochronic genes in development and evolution, a common theme for LIM homeobox gene function across phylogeny, and mechanisms of specification in ascidian embryos.

  16. Regulatory Snapshots: integrative mining of regulatory modules from expression time series and regulatory networks.

    Directory of Open Access Journals (Sweden)

    Joana P Gonçalves

    Full Text Available Explaining regulatory mechanisms is crucial to understand complex cellular responses leading to system perturbations. Some strategies reverse engineer regulatory interactions from experimental data, while others identify functional regulatory units (modules under the assumption that biological systems yield a modular organization. Most modular studies focus on network structure and static properties, ignoring that gene regulation is largely driven by stimulus-response behavior. Expression time series are key to gain insight into dynamics, but have been insufficiently explored by current methods, which often (1 apply generic algorithms unsuited for expression analysis over time, due to inability to maintain the chronology of events or incorporate time dependency; (2 ignore local patterns, abundant in most interesting cases of transcriptional activity; (3 neglect physical binding or lack automatic association of regulators, focusing mainly on expression patterns; or (4 limit the discovery to a predefined number of modules. We propose Regulatory Snapshots, an integrative mining approach to identify regulatory modules over time by combining transcriptional control with response, while overcoming the above challenges. Temporal biclustering is first used to reveal transcriptional modules composed of genes showing coherent expression profiles over time. Personalized ranking is then applied to prioritize prominent regulators targeting the modules at each time point using a network of documented regulatory associations and the expression data. Custom graphics are finally depicted to expose the regulatory activity in a module at consecutive time points (snapshots. Regulatory Snapshots successfully unraveled modules underlying yeast response to heat shock and human epithelial-to-mesenchymal transition, based on regulations documented in the YEASTRACT and JASPAR databases, respectively, and available expression data. Regulatory players involved in

  17. Pax3 and Zic1 trigger the early neural crest gene regulatory network by the direct activation of multiple key neural crest specifiers.

    Science.gov (United States)

    Plouhinec, Jean-Louis; Roche, Daniel D; Pegoraro, Caterina; Figueiredo, Ana Leonor; Maczkowiak, Frédérique; Brunet, Lisa J; Milet, Cécile; Vert, Jean-Philippe; Pollet, Nicolas; Harland, Richard M; Monsoro-Burq, Anne H

    2014-02-15

    Neural crest development is orchestrated by a complex and still poorly understood gene regulatory network. Premigratory neural crest is induced at the lateral border of the neural plate by the combined action of signaling molecules and transcription factors such as AP2, Gbx2, Pax3 and Zic1. Among them, Pax3 and Zic1 are both necessary and sufficient to trigger a complete neural crest developmental program. However, their gene targets in the neural crest regulatory network remain unknown. Here, through a transcriptome analysis of frog microdissected neural border, we identified an extended gene signature for the premigratory neural crest, and we defined novel potential members of the regulatory network. This signature includes 34 novel genes, as well as 44 known genes expressed at the neural border. Using another microarray analysis which combined Pax3 and Zic1 gain-of-function and protein translation blockade, we uncovered 25 Pax3 and Zic1 direct targets within this signature. We demonstrated that the neural border specifiers Pax3 and Zic1 are direct upstream regulators of neural crest specifiers Snail1/2, Foxd3, Twist1, and Tfap2b. In addition, they may modulate the transcriptional output of multiple signaling pathways involved in neural crest development (Wnt, Retinoic Acid) through the induction of key pathway regulators (Axin2 and Cyp26c1). We also found that Pax3 could maintain its own expression through a positive autoregulatory feedback loop. These hierarchical inductions, feedback loops, and pathway modulations provide novel tools to understand the neural crest induction network.

  18. Modeling Emergence in Neuroprotective Regulatory Networks

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Haack, Jereme N.; McDermott, Jason E.; Stevens, S.L.; Stenzel-Poore, Mary

    2013-01-05

    The use of predictive modeling in the analysis of gene expression data can greatly accelerate the pace of scientific discovery in biomedical research by enabling in silico experimentation to test disease triggers and potential drug therapies. Techniques that focus on modeling emergence, such as agent-based modeling and multi-agent simulations, are of particular interest as they support the discovery of pathways that may have never been observed in the past. Thus far, these techniques have been primarily applied at the multi-cellular level, or have focused on signaling and metabolic networks. We present an approach where emergence modeling is extended to regulatory networks and demonstrate its application to the discovery of neuroprotective pathways. An initial evaluation of the approach indicates that emergence modeling provides novel insights for the analysis of regulatory networks that can advance the discovery of acute treatments for stroke and other diseases.

  19. Reconstruction of the gene regulatory network involved in the sonic hedgehog pathway with a potential role in early development of the mouse brain.

    Directory of Open Access Journals (Sweden)

    Jinhua Liu

    2014-10-01

    Full Text Available The Sonic hedgehog (Shh signaling pathway is crucial for pattern formation in early central nervous system development. By systematically analyzing high-throughput in situ hybridization data of E11.5 mouse brain, we found that Shh and its receptor Ptch1 define two adjacent mutually exclusive gene expression domains: Shh+Ptch1- and Shh-Ptch1+. These two domains are associated respectively with Foxa2 and Gata3, two transcription factors that play key roles in specifying them. Gata3 ChIP-seq experiments and RNA-seq assays on Gata3-knockdown cells revealed that Gata3 up-regulates the genes that are enriched in the Shh-Ptch1+ domain. Important Gata3 targets include Slit2 and Slit3, which are involved in the process of axon guidance, as well as Slc18a1, Th and Qdpr, which are associated with neurotransmitter synthesis and release. By contrast, Foxa2 both up-regulates the genes expressed in the Shh+Ptch1- domain and down-regulates the genes characteristic of the Shh-Ptch1+ domain. From these and other data, we were able to reconstruct a gene regulatory network governing both domains. Our work provides the first genome-wide characterization of the gene regulatory network involved in the Shh pathway that underlies pattern formation in the early mouse brain.

  20. Using regulatory and epistatic networks to extend the findings of a genome scan: identifying the gene drivers of pigmentation in merino sheep.

    Science.gov (United States)

    García-Gámez, Elsa; Reverter, Antonio; Whan, Vicki; McWilliam, Sean M; Arranz, Juan José; Kijas, James

    2011-01-01

    Extending genome wide association analysis by the inclusion of gene expression data may assist in the dissection of complex traits. We examined piebald, a pigmentation phenotype in both human and Merino sheep, by analysing multiple data types using a systems approach. First, a case control analysis of 49,034 ovine SNP was performed which confirmed a multigenic basis for the condition. We combined these results with gene expression data from five tissue types analysed with a skin-specific microarray. Promoter sequence analysis of differentially expressed genes allowed us to reverse-engineer a regulatory network. Likewise, by testing two-loci models derived from all pair-wise comparisons across piebald-associated SNP, we generated an epistatic network. At the intersection of both networks, we identified thirteen genes with insulin-like growth factor binding protein 7 (IGFBP7), platelet-derived growth factor alpha (PDGFRA) and the tetraspanin platelet activator CD9 at the kernel of the intersection. Further, we report a number of differentially expressed genes in regions containing highly associated SNP including ATRN, DOCK7, FGFR1OP, GLI3, SILV and TBX15. The application of network theory facilitated co-analysis of genetic variation with gene expression, recapitulated aspects of the known molecular biology of skin pigmentation and provided insights into the transcription regulation and epistatic interactions involved in piebald Merino sheep.

  1. Using regulatory and epistatic networks to extend the findings of a genome scan: identifying the gene drivers of pigmentation in merino sheep.

    Directory of Open Access Journals (Sweden)

    Elsa García-Gámez

    Full Text Available Extending genome wide association analysis by the inclusion of gene expression data may assist in the dissection of complex traits. We examined piebald, a pigmentation phenotype in both human and Merino sheep, by analysing multiple data types using a systems approach. First, a case control analysis of 49,034 ovine SNP was performed which confirmed a multigenic basis for the condition. We combined these results with gene expression data from five tissue types analysed with a skin-specific microarray. Promoter sequence analysis of differentially expressed genes allowed us to reverse-engineer a regulatory network. Likewise, by testing two-loci models derived from all pair-wise comparisons across piebald-associated SNP, we generated an epistatic network. At the intersection of both networks, we identified thirteen genes with insulin-like growth factor binding protein 7 (IGFBP7, platelet-derived growth factor alpha (PDGFRA and the tetraspanin platelet activator CD9 at the kernel of the intersection. Further, we report a number of differentially expressed genes in regions containing highly associated SNP including ATRN, DOCK7, FGFR1OP, GLI3, SILV and TBX15. The application of network theory facilitated co-analysis of genetic variation with gene expression, recapitulated aspects of the known molecular biology of skin pigmentation and provided insights into the transcription regulation and epistatic interactions involved in piebald Merino sheep.

  2. Non-transcriptional regulatory processes shape transcriptional network dynamics.

    Science.gov (United States)

    Ray, J Christian J; Tabor, Jeffrey J; Igoshin, Oleg A

    2011-10-11

    Information about the extra- or intracellular environment is often captured as biochemical signals that propagate through regulatory networks. These signals eventually drive phenotypic changes, typically by altering gene expression programmes in the cell. Reconstruction of transcriptional regulatory networks has given a compelling picture of bacterial physiology, but transcriptional network maps alone often fail to describe phenotypes. Cellular response dynamics are ultimately determined by interactions between transcriptional and non-transcriptional networks, with dramatic implications for physiology and evolution. Here, we provide an overview of non-transcriptional interactions that can affect the performance of natural and synthetic bacterial regulatory networks.

  3. Attribute Exploration of Gene Regulatory Processes

    CERN Document Server

    Wollbold, Johannes

    2012-01-01

    This thesis aims at the logical analysis of discrete processes, in particular of such generated by gene regulatory networks. States, transitions and operators from temporal logics are expressed in the language of Formal Concept Analysis. By the attribute exploration algorithm, an expert or a computer program is enabled to validate a minimal and complete set of implications, e.g. by comparison of predictions derived from literature with observed data. Here, these rules represent temporal dependencies within gene regulatory networks including coexpression of genes, reachability of states, invariants or possible causal relationships. This new approach is embedded into the theory of universal coalgebras, particularly automata, Kripke structures and Labelled Transition Systems. A comparison with the temporal expressivity of Description Logics is made. The main theoretical results concern the integration of background knowledge into the successive exploration of the defined data structures (formal contexts). Applyi...

  4. Computational modeling with forward and reverse engineering links signaling network and genomic regulatory responses: NF-κB signaling-induced gene expression responses in inflammation

    Directory of Open Access Journals (Sweden)

    Peng Chien

    2010-06-01

    Full Text Available Abstract Background Signal transduction is the major mechanism through which cells transmit external stimuli to evoke intracellular biochemical responses. Diverse cellular stimuli create a wide variety of transcription factor activities through signal transduction pathways, resulting in different gene expression patterns. Understanding the relationship between external stimuli and the corresponding cellular responses, as well as the subsequent effects on downstream genes, is a major challenge in systems biology. Thus, a systematic approach is needed to integrate experimental data and theoretical hypotheses to identify the physiological consequences of environmental stimuli. Results We proposed a systematic approach that combines forward and reverse engineering to link the signal transduction cascade with the gene responses. To demonstrate the feasibility of our strategy, we focused on linking the NF-κB signaling pathway with the inflammatory gene regulatory responses because NF-κB has long been recognized to play a crucial role in inflammation. We first utilized forward engineering (Hybrid Functional Petri Nets to construct the NF-κB signaling pathway and reverse engineering (Network Components Analysis to build a gene regulatory network (GRN. Then, we demonstrated that the corresponding IKK profiles can be identified in the GRN and are consistent with the experimental validation of the IKK kinase assay. We found that the time-lapse gene expression of several cytokines and chemokines (TNF-α, IL-1, IL-6, CXCL1, CXCL2 and CCL3 is concordant with the NF-κB activity profile, and these genes have stronger influence strength within the GRN. Such regulatory effects have highlighted the crucial roles of NF-κB signaling in the acute inflammatory response and enhance our understanding of the systemic inflammatory response syndrome. Conclusion We successfully identified and distinguished the corresponding signaling profiles among three microarray

  5. Metabolic constraint-based refinement of transcriptional regulatory networks.

    Science.gov (United States)

    Chandrasekaran, Sriram; Price, Nathan D

    2013-01-01

    There is a strong need for computational frameworks that integrate different biological processes and data-types to unravel cellular regulation. Current efforts to reconstruct transcriptional regulatory networks (TRNs) focus primarily on proximal data such as gene co-expression and transcription factor (TF) binding. While such approaches enable rapid reconstruction of TRNs, the overwhelming combinatorics of possible networks limits identification of mechanistic regulatory interactions. Utilizing growth phenotypes and systems-level constraints to inform regulatory network reconstruction is an unmet challenge. We present our approach Gene Expression and Metabolism Integrated for Network Inference (GEMINI) that links a compendium of candidate regulatory interactions with the metabolic network to predict their systems-level effect on growth phenotypes. We then compare predictions with experimental phenotype data to select phenotype-consistent regulatory interactions. GEMINI makes use of the observation that only a small fraction of regulatory network states are compatible with a viable metabolic network, and outputs a regulatory network that is simultaneously consistent with the input genome-scale metabolic network model, gene expression data, and TF knockout phenotypes. GEMINI preferentially recalls gold-standard interactions (p-value = 10(-172)), significantly better than using gene expression alone. We applied GEMINI to create an integrated metabolic-regulatory network model for Saccharomyces cerevisiae involving 25,000 regulatory interactions controlling 1597 metabolic reactions. The model quantitatively predicts TF knockout phenotypes in new conditions (p-value = 10(-14)) and revealed potential condition-specific regulatory mechanisms. Our results suggest that a metabolic constraint-based approach can be successfully used to help reconstruct TRNs from high-throughput data, and highlights the potential of using a biochemically-detailed mechanistic framework to

  6. CluGene: A Bioinformatics Framework for the Identification of Co-Localized, Co-Expressed and Co-Regulated Genes Aimed at the Investigation of Transcriptional Regulatory Networks from High-Throughput Expression Data.

    Directory of Open Access Journals (Sweden)

    Tania Dottorini

    Full Text Available The full understanding of the mechanisms underlying transcriptional regulatory networks requires unravelling of complex causal relationships. Genome high-throughput technologies produce a huge amount of information pertaining gene expression and regulation; however, the complexity of the available data is often overwhelming and tools are needed to extract and organize the relevant information. This work starts from the assumption that the observation of co-occurrent events (in particular co-localization, co-expression and co-regulation may provide a powerful starting point to begin unravelling transcriptional regulatory networks. Co-expressed genes often imply shared functional pathways; co-expressed and functionally related genes are often co-localized, too; moreover, co-expressed and co-localized genes are also potential targets for co-regulation; finally, co-regulation seems more frequent for genes mapped to proximal chromosome regions. Despite the recognized importance of analysing co-occurrent events, no bioinformatics solution allowing the simultaneous analysis of co-expression, co-localization and co-regulation is currently available. Our work resulted in developing and valuating CluGene, a software providing tools to analyze multiple types of co-occurrences within a single interactive environment allowing the interactive investigation of combined co-expression, co-localization and co-regulation of genes. The use of CluGene will enhance the power of testing hypothesis and experimental approaches aimed at unravelling transcriptional regulatory networks. The software is freely available at http://bioinfolab.unipg.it/.

  7. Population Dynamics of Genetic Regulatory Networks

    Science.gov (United States)

    Braun, Erez

    2005-03-01

    Unlike common objects in physics, a biological cell processes information. The cell interprets its genome and transforms the genomic information content, through the action of genetic regulatory networks, into proteins which in turn dictate its metabolism, functionality and morphology. Understanding the dynamics of a population of biological cells presents a unique challenge. It requires to link the intracellular dynamics of gene regulation, through the mechanism of cell division, to the level of the population. We present experiments studying adaptive dynamics of populations of genetically homogeneous microorganisms (yeast), grown for long durations under steady conditions. We focus on population dynamics that do not involve random genetic mutations. Our experiments follow the long-term dynamics of the population distributions and allow to quantify the correlations among generations. We focus on three interconnected issues: adaptation of genetically homogeneous populations following environmental changes, selection processes on the population and population variability and expression distributions. We show that while the population exhibits specific short-term responses to environmental inputs, it eventually adapts to a robust steady-state, largely independent of external conditions. Cycles of medium-switch show that the adapted state is imprinted in the population and that this memory is maintained for many generations. To further study population adaptation, we utilize the process of gene recruitment whereby a gene naturally regulated by a specific promoter is placed under a different regulatory system. This naturally occurring process has been recognized as a major driving force in evolution. We have recruited an essential gene to a foreign regulatory network and followed the population long-term dynamics. Rewiring of the regulatory network allows us to expose their complex dynamics and phase space structure.

  8. Adaptation by Plasticity of Genetic Regulatory Networks

    Science.gov (United States)

    Brenner, Naama

    2007-03-01

    Genetic regulatory networks have an essential role in adaptation and evolution of cell populations. This role is strongly related to their dynamic properties over intermediate-to-long time scales. We have used the budding yeast as a model Eukaryote to study the long-term dynamics of the genetic regulatory system and its significance in evolution. A continuous cell growth technique (chemostat) allows us to monitor these systems over long times under controlled condition, enabling a quantitative characterization of dynamics: steady states and their stability, transients and relaxation. First, we have demonstrated adaptive dynamics in the GAL system, a classic model for a Eukaryotic genetic switch, induced and repressed by different carbon sources in the environment. We found that both induction and repression are only transient responses; over several generations, the system converges to a single robust steady state, independent of external conditions. Second, we explored the functional significance of such plasticity of the genetic regulatory network in evolution. We used genetic engineering to mimic the natural process of gene recruitment, placing the gene HIS3 under the regulation of the GAL system. Such genetic rewiring events are important in the evolution of gene regulation, but little is known about the physiological processes supporting them and the dynamics of their assimilation in a cell population. We have shown that cells carrying the rewired genome adapted to a demanding change of environment and stabilized a population, maintaining the adaptive state for hundreds of generations. Using genome-wide expression arrays we showed that underlying the observed adaptation is a global transcriptional programming that allowed tuning expression of the recruited gene to demands. Our results suggest that non-specific properties reflecting the natural plasticity of the regulatory network support adaptation of cells to novel challenges and enhance their evolvability.

  9. Delay-independent stability of genetic regulatory networks.

    Science.gov (United States)

    Wu, Fang-Xiang

    2011-11-01

    Genetic regulatory networks can be described by nonlinear differential equations with time delays. In this paper, we study both locally and globally delay-independent stability of genetic regulatory networks, taking messenger ribonucleic acid alternative splicing into consideration. Based on nonnegative matrix theory, we first develop necessary and sufficient conditions for locally delay-independent stability of genetic regulatory networks with multiple time delays. Compared to the previous results, these conditions are easy to verify. Then we develop sufficient conditions for global delay-independent stability for genetic regulatory networks. Compared to the previous results, this sufficient condition is less conservative. To illustrate theorems developed in this paper, we analyze delay-independent stability of two genetic regulatory networks: a real-life repressilatory network with three genes and three proteins, and a synthetic gene regulatory network with five genes and seven proteins. The simulation results show that the theorems developed in this paper can effectively determine the delay-independent stability of genetic regulatory networks.

  10. A Systems genetics approach identifies gene regulatory networks associated with fatty acid composition in brassica rapa seed

    NARCIS (Netherlands)

    Basnet, Ram Kumar; Pino Del Carpio, Dunia; Xiao, Dong; Bucher, Johan; Jin, Mina; Boyle, Kerry; Fobert, Pierre; Visser, R.G.F.; Maliepaard, Chris; Bonnema, Guusje

    2016-01-01

    Fatty acids in seeds affect seed germination and seedling vigor, and fatty acid composition determines the quality of seed oil. In this study, quantitative trait locus (QTL) mapping of fatty acid and transcript abundance was integrated with gene network analysis to unravel the genetic regulation

  11. Adaptive Dynamics of Regulatory Networks: Size Matters

    Directory of Open Access Journals (Sweden)

    2009-03-01

    Full Text Available To accomplish adaptability, all living organisms are constructed of regulatory networks on different levels which are capable to differentially respond to a variety of environmental inputs. Structure of regulatory networks determines their phenotypical plasticity, that is, the degree of detail and appropriateness of regulatory replies to environmental or developmental challenges. This regulatory network structure is encoded within the genotype. Our conceptual simulation study investigates how network structure constrains the evolution of networks and their adaptive abilities. The focus is on the structural parameter network size. We show that small regulatory networks adapt fast, but not as good as larger networks in the longer perspective. Selection leads to an optimal network size dependent on heterogeneity of the environment and time pressure of adaptation. Optimal mutation rates are higher for smaller networks. We put special emphasis on discussing our simulation results on the background of functional observations from experimental and evolutionary biology.

  12. Adaptive Dynamics of Regulatory Networks: Size Matters

    Directory of Open Access Journals (Sweden)

    Martinetz Thomas

    2009-01-01

    Full Text Available To accomplish adaptability, all living organisms are constructed of regulatory networks on different levels which are capable to differentially respond to a variety of environmental inputs. Structure of regulatory networks determines their phenotypical plasticity, that is, the degree of detail and appropriateness of regulatory replies to environmental or developmental challenges. This regulatory network structure is encoded within the genotype. Our conceptual simulation study investigates how network structure constrains the evolution of networks and their adaptive abilities. The focus is on the structural parameter network size. We show that small regulatory networks adapt fast, but not as good as larger networks in the longer perspective. Selection leads to an optimal network size dependent on heterogeneity of the environment and time pressure of adaptation. Optimal mutation rates are higher for smaller networks. We put special emphasis on discussing our simulation results on the background of functional observations from experimental and evolutionary biology.

  13. Identifying regulational alterations in gene regulatory networks by state space representation of vector autoregressive models and variational annealing.

    Science.gov (United States)

    Kojima, Kaname; Imoto, Seiya; Yamaguchi, Rui; Fujita, André; Yamauchi, Mai; Gotoh, Noriko; Miyano, Satoru

    2012-01-01

    In the analysis of effects by cell treatment such as drug dosing, identifying changes on gene network structures between normal and treated cells is a key task. A possible way for identifying the changes is to compare structures of networks estimated from data on normal and treated cells separately. However, this approach usually fails to estimate accurate gene networks due to the limited length of time series data and measurement noise. Thus, approaches that identify changes on regulations by using time series data on both conditions in an efficient manner are demanded. We propose a new statistical approach that is based on the state space representation of the vector autoregressive model and estimates gene networks on two different conditions in order to identify changes on regulations between the conditions. In the mathematical model of our approach, hidden binary variables are newly introduced to indicate the presence of regulations on each condition. The use of the hidden binary variables enables an efficient data usage; data on both conditions are used for commonly existing regulations, while for condition specific regulations corresponding data are only applied. Also, the similarity of networks on two conditions is automatically considered from the design of the potential function for the hidden binary variables. For the estimation of the hidden binary variables, we derive a new variational annealing method that searches the configuration of the binary variables maximizing the marginal likelihood. For the performance evaluation, we use time series data from two topologically similar synthetic networks, and confirm that our proposed approach estimates commonly existing regulations as well as changes on regulations with higher coverage and precision than other existing approaches in almost all the experimental settings. For a real data application, our proposed approach is applied to time series data from normal Human lung cells and Human lung cells treated by

  14. State Observer Design for Delayed Genetic Regulatory Networks

    Directory of Open Access Journals (Sweden)

    Li-Ping Tian

    2014-01-01

    Full Text Available Genetic regulatory networks are dynamic systems which describe the interactions among gene products (mRNAs and proteins. The internal states of a genetic regulatory network consist of the concentrations of mRNA and proteins involved in it, which are very helpful in understanding its dynamic behaviors. However, because of some limitations such as experiment techniques, not all internal states of genetic regulatory network can be effectively measured. Therefore it becomes an important issue to estimate the unmeasured states via the available measurements. In this study, we design a state observer to estimate the states of genetic regulatory networks with time delays from available measurements. Furthermore, based on linear matrix inequality (LMI approach, a criterion is established to guarantee that the dynamic of estimation error is globally asymptotically stable. A gene repressillatory network is employed to illustrate the effectiveness of our design approach.

  15. Differential expression analysis and regulatory network reconstruction for genes associated with muscle growth and adipose deposition in obese and lean pigs

    Institute of Scientific and Technical Information of China (English)

    Mingzhou Li; Xuewei Li; Li Zhu; Xiaokun Teng; Huasheng Xiao; Surong Shuai; Lei Chen; Qiang Li; Yujiao Guo

    2008-01-01

    During the growth and development of skeletal muscle cells and adipose cells, the regulatory mechanism of micro-effect polygenes determines porcine meat quality, carcass characteristics and other relative quantitative traits. Obese and lean type pig breeds show obvious differences in muscle growth and adipose deposition; however, the molecular mechanism underlying this phenotypic variation remains unknown. We used pathway-focused oligo microarray studies to examine the expression changes of 140 genes associated with muscle growth and adipose deposition in longissimus dorsi muscle at six growth stages (birth, 1, 2, 3, 4 and 5 months) of Landrace (a leaner, Western breed) and Taihu pigs (a fatty, indigenous, Chinese breed). Variance analysis (ANOVA) revealed that differences in the expression of 18 genes in Landrace pigs and three genes in Taihu pigs were very significant (FDR-adjusted permutation, P<0.01) and differences for 22 genes in Landrace pigs and seven genes in Taihu pigs were significant (FDR-adjusted permutation, P<0.05) among six growth stages. Clustering analysis revealed a high level of significance (FDR-adjusted, P<0.01) for four gene expression patterns, in which genes that strongly up-regulated were mainly associated with the positive regulation of myofiber formation and fatty acid biogenesis and genes that strongly down-regulated were mainly associated with the inhibition of cell proliferation and positive regulation of fatty acid P-oxidation. Based on a dynamic Bayesian network (DBN) model, gene regulatory networks (GRNs) were reconstructed from time-series data for each pig breed. These two GRNs initially revealed the distinct differences in physiological and biochemical aspects of muscle growth and adipose deposition between the two pig breeds; from these results, some potential key genes could be identified. Quantitative real-time RT-PCR (QRT-PCR) was used to verify the microarray data for five modulated genes, and a good correlation between the

  16. Comparative Developmental Transcriptomics Reveals Rewiring of a Highly Conserved Gene Regulatory Network during a Major Life History Switch in the Sea Urchin Genus Heliocidaris.

    Science.gov (United States)

    Israel, Jennifer W; Martik, Megan L; Byrne, Maria; Raff, Elizabeth C; Raff, Rudolf A; McClay, David R; Wray, Gregory A

    2016-03-01

    The ecologically significant shift in developmental strategy from planktotrophic (feeding) to lecithotrophic (nonfeeding) development in the sea urchin genus Heliocidaris is one of the most comprehensively studied life history transitions in any animal. Although the evolution of lecithotrophy involved substantial changes to larval development and morphology, it is not known to what extent changes in gene expression underlie the developmental differences between species, nor do we understand how these changes evolved within the context of the well-defined gene regulatory network (GRN) underlying sea urchin development. To address these questions, we used RNA-seq to measure expression dynamics across development in three species: the lecithotroph Heliocidaris erythrogramma, the closely related planktotroph H. tuberculata, and an outgroup planktotroph Lytechinus variegatus. Using well-established statistical methods, we developed a novel framework for identifying, quantifying, and polarizing evolutionary changes in gene expression profiles across the transcriptome and within the GRN. We found that major changes in gene expression profiles were more numerous during the evolution of lecithotrophy than during the persistence of planktotrophy, and that genes with derived expression profiles in the lecithotroph displayed specific characteristics as a group that are consistent with the dramatically altered developmental program in this species. Compared to the transcriptome, changes in gene expression profiles within the GRN were even more pronounced in the lecithotroph. We found evidence for conservation and likely divergence of particular GRN regulatory interactions in the lecithotroph, as well as significant changes in the expression of genes with known roles in larval skeletogenesis. We further use coexpression analysis to identify genes of unknown function that may contribute to both conserved and derived developmental traits between species. Collectively, our results

  17. Dynamics of network motifs in genetic regulatory networks

    Institute of Scientific and Technical Information of China (English)

    Li Ying; Liu Zeng-Rong; Zhang Jian-Bao

    2007-01-01

    Network motifs hold a very important status in genetic regulatory networks. This paper aims to analyse the dynamical property of the network motifs in genetic regulatory networks. The main result we obtained is that the dynamical property of a single motif is very simple with only an asymptotically stable equilibrium point, but the combination of several motifs can make more complicated dynamical properties emerge such as limit cycles. The above-mentioned result shows that network motif is a stable substructure in genetic regulatory networks while their combinations make the genetic regulatory network more complicated.

  18. Complex regulatory network controls initial adhesion and biofilm formation in Escherichia coli via regulation of the csgD gene.

    Science.gov (United States)

    Prigent-Combaret, C; Brombacher, E; Vidal, O; Ambert, A; Lejeune, P; Landini, P; Dorel, C

    2001-12-01

    The Escherichia coli OmpR/EnvZ two-component regulatory system, which senses environmental osmolarity, also regulates biofilm formation. Up mutations in the ompR gene, such as the ompR234 mutation, stimulate laboratory strains of E. coli to grow as a biofilm community rather than in a planktonic state. In this report, we show that the OmpR234 protein promotes biofilm formation by binding the csgD promoter region and stimulating its transcription. The csgD gene encodes the transcription regulator CsgD, which in turn activates transcription of the csgBA operon encoding curli, extracellular structures involved in bacterial adhesion. Consistent with the role of the ompR gene as part of an osmolarity-sensing regulatory system, we also show that the formation of biofilm by E. coli is inhibited by increasing osmolarity in the growth medium. The ompR234 mutation counteracts adhesion inhibition by high medium osmolarity; we provide evidence that the ompR234 mutation promotes biofilm formation by strongly increasing the initial adhesion of bacteria to an abiotic surface. This increase in initial adhesion is stationary phase dependent, but it is negatively regulated by the stationary-phase-specific sigma factor RpoS. We propose that this negative regulation takes place via rpoS-dependent transcription of the transcription regulator cpxR; cpxR-mediated repression of csgB and csgD promoters is also triggered by osmolarity and by curli overproduction, in a feedback regulation loop.

  19. Simple mathematical models of gene regulatory dynamics

    CERN Document Server

    Mackey, Michael C; Tyran-Kamińska, Marta; Zeron, Eduardo S

    2016-01-01

    This is a short and self-contained introduction to the field of mathematical modeling of gene-networks in bacteria. As an entry point to the field, we focus on the analysis of simple gene-network dynamics. The notes commence with an introduction to the deterministic modeling of gene-networks, with extensive reference to applicable results coming from dynamical systems theory. The second part of the notes treats extensively several approaches to the study of gene-network dynamics in the presence of noise—either arising from low numbers of molecules involved, or due to noise external to the regulatory process. The third and final part of the notes gives a detailed treatment of three well studied and concrete examples of gene-network dynamics by considering the lactose operon, the tryptophan operon, and the lysis-lysogeny switch. The notes contain an index for easy location of particular topics as well as an extensive bibliography of the current literature. The target audience of these notes are mainly graduat...

  20. Multiple horizontal gene transfer events and domain fusions have created novel regulatory and metabolic networks in the oomycete genome.

    Directory of Open Access Journals (Sweden)

    Paul Francis Morris

    Full Text Available Complex enzymes with multiple catalytic activities are hypothesized to have evolved from more primitive precursors. Global analysis of the Phytophthora sojae genome using conservative criteria for evaluation of complex proteins identified 273 novel multifunctional proteins that were also conserved in P. ramorum. Each of these proteins contains combinations of protein motifs that are not present in bacterial, plant, animal, or fungal genomes. A subset of these proteins were also identified in the two diatom genomes, but the majority of these proteins have formed after the split between diatoms and oomycetes. Documentation of multiple cases of domain fusions that are common to both oomycetes and diatom genomes lends additional support for the hypothesis that oomycetes and diatoms are monophyletic. Bifunctional proteins that catalyze two steps in a metabolic pathway can be used to infer the interaction of orthologous proteins that exist as separate entities in other genomes. We postulated that the novel multifunctional proteins of oomycetes could function as potential Rosetta Stones to identify interacting proteins of conserved metabolic and regulatory networks in other eukaryotic genomes. However ortholog analysis of each domain within our set of 273 multifunctional proteins against 39 sequenced bacterial and eukaryotic genomes, identified only 18 candidate Rosetta Stone proteins. Thus the majority of multifunctional proteins are not Rosetta Stones, but they may nonetheless be useful in identifying novel metabolic and regulatory networks in oomycetes. Phylogenetic analysis of all the enzymes in three pathways with one or more novel multifunctional proteins was conducted to determine the probable origins of individual enzymes. These analyses revealed multiple examples of horizontal transfer from both bacterial genomes and the photosynthetic endosymbiont in the ancestral genome of Stramenopiles. The complexity of the phylogenetic origins of these

  1. Partially observed bipartite network analysis to identify predictive connections in transcriptional regulatory networks

    Directory of Open Access Journals (Sweden)

    Woolf Peter J

    2011-05-01

    Full Text Available Abstract Background Messenger RNA expression is regulated by a complex interplay of different regulatory proteins. Unfortunately, directly measuring the individual activity of these regulatory proteins is difficult, leaving us with only the resulting gene expression pattern as a marker for the underlying regulatory network or regulator-gene associations. Furthermore, traditional methods to predict these regulator-gene associations do not define the relative importance of each association, leading to a large number of connections in the global regulatory network that, although true, are not useful. Results Here we present a Bayesian method that identifies which known transcriptional relationships in a regulatory network are consistent with a given body of static gene expression data by eliminating the non-relevant ones. The Partially Observed Bipartite Network (POBN approach developed here is tested using E. coli expression data and a transcriptional regulatory network derived from RegulonDB. When the regulatory network for E. coli was integrated with 266 E. coli gene chip observations, POBN identified 93 out of 570 connections that were either inconsistent or not adequately supported by the expression data. Conclusion POBN provides a systematic way to integrate known transcriptional networks with observed gene expression data to better identify which transcriptional pathways are likely responsible for the observed gene expression pattern.

  2. Identification and functional characterization of the miRNA-gene regulatory network in chronic myeloid leukemia lineage negative cells

    Science.gov (United States)

    Agatheeswaran, S.; Pattnayak, N. C.; Chakraborty, S.

    2016-09-01

    Chronic myeloid leukemia (CML) is maintained by leukemic stem cells (LSCs) which are resistant to the existing TKI therapy. Hence a better understanding of the CML LSCs is necessary to eradicate these cells and achieve complete cure. Using the miRNA-gene interaction networks from the CML lin(-) cells we identified a set of up/down-regulated miRNAs and corresponding target genes. Association studies (Pearson correlation) from the miRNA and gene expression data showed that miR-1469 and miR-1972 have significantly higher number of target genes, 75 and 50 respectively. We observed that miR-1972 induces G2-M cell cycle arrest and miR-1469 moderately arrested G1 cell cycle when overexpressed in KCL22 cells. We have earlier shown that a combination of imatinib and JAK inhibitor I can significantly bring down the proliferation of CML lineage negative cells. Here we observed that imatinib and JAK inhibitor I combination restored the expression pattern of the down-regulated miRNAs in primary CML lin(-) cells. Thus effective manipulation of the deregulated miRNAs can restore the miRNA-mRNA networks that can efficiently inhibit CML stem and progenitor cells and alleviate the disease.

  3. Characterizing disease states from topological properties of transcriptional regulatory networks

    Directory of Open Access Journals (Sweden)

    Kluger Harriet M

    2006-05-01

    Full Text Available Abstract Background High throughput gene expression experiments yield large amounts of data that can augment our understanding of disease processes, in addition to classifying samples. Here we present new paradigms of data Separation based on construction of transcriptional regulatory networks for normal and abnormal cells using sequence predictions, literature based data and gene expression studies. We analyzed expression datasets from a number of diseased and normal cells, including different types of acute leukemia, and breast cancer with variable clinical outcome. Results We constructed sample-specific regulatory networks to identify links between transcription factors (TFs and regulated genes that differentiate between healthy and diseased states. This approach carries the advantage of identifying key transcription factor-gene pairs with differential activity between healthy and diseased states rather than merely using gene expression profiles, thus alluding to processes that may be involved in gene deregulation. We then generalized this approach by studying simultaneous changes in functionality of multiple regulatory links pointing to a regulated gene or emanating from one TF (or changes in gene centrality defined by its in-degree or out-degree measures, respectively. We found that samples can often be separated based on these measures of gene centrality more robustly than using individual links. We examined distributions of distances (the number of links needed to traverse the path between each pair of genes in the transcriptional networks for gene subsets whose collective expression profiles could best separate each dataset into predefined groups. We found that genes that optimally classify samples are concentrated in neighborhoods in the gene regulatory networks. This suggests that genes that are deregulated in diseased states exhibit a remarkable degree of connectivity. Conclusion Transcription factor-regulated gene links and

  4. Analysis of the Salmonella regulatory network suggests involvement of SsrB and H-NS in σ(E)-regulated SPI-2 gene expression.

    Science.gov (United States)

    Li, Jie; Overall, Christopher C; Nakayasu, Ernesto S; Kidwai, Afshan S; Jones, Marcus B; Johnson, Rudd C; Nguyen, Nhu T; McDermott, Jason E; Ansong, Charles; Heffron, Fred; Cambronne, Eric D; Adkins, Joshua N

    2015-01-01

    The extracytoplasmic functioning sigma factor σ(E) is known to play an essential role for Salmonella enterica serovar Typhimurium to survive and proliferate in macrophages and mice. However, its regulatory network is not well-characterized, especially during infection. Here we used microarray to identify genes regulated by σ(E) in Salmonella grown in three conditions: a nutrient-rich condition and two others that mimic early and late intracellular infection. We found that in each condition σ(E) regulated different sets of genes, and notably, several global regulators. When comparing nutrient-rich and infection-like conditions, large changes were observed in the expression of genes involved in Salmonella pathogenesis island (SPI)-1 type-three secretion system (TTSS), SPI-2 TTSS, protein synthesis, and stress responses. In total, the expression of 58% of Salmonella genes was affected by σ(E) in at least one of the three conditions. An important finding is that σ(E) up-regulates SPI-2 genes, which are essential for Salmonella intracellular survival, by up-regulating SPI-2 activator ssrB expression at the early stage of infection and down-regulating SPI-2 repressor hns expression at a later stage. Moreover, σ(E) is capable of countering the silencing of H-NS, releasing the expression of SPI-2 genes. This connection between σ(E) and SPI-2 genes, combined with the global regulatory effect of σ(E), may account for the lethality of rpoE-deficient Salmonella in murine infection.

  5. Murine hyperglycemic vasculopathy and cardiomyopathy: whole-genome gene expression analysis predicts cellular targets and regulatory networks influenced by mannose binding lectin

    Directory of Open Access Journals (Sweden)

    Chenhui eZou

    2012-02-01

    Full Text Available Hyperglycemia, in the absence of type 1 or 2 diabetes, is an independent risk factor for cardiovascular disease. We have previously demonstrated a central role for mannose binding lectin (MBL-mediated cardiac dysfunction in acute hyperglycemic mice. In this study, we applied whole genome microarray data analysis to investigate MBL’s role in systematic gene expression changes. The data predict possible intracellular events taking place in multiple cellular compartments such as enhanced insulin signaling pathway sensitivity, promoted mitochondrial respiratory function, improved cellular energy expenditure and protein quality control, improved cytoskeleton structure and facilitated intracellular trafficking, all of which may contribute to the organismal health of MBL null mice against acute hyperglycemia. Our data show a tight association between gene expression profile and tissue function which might be a very useful tool in predicting cellular targets and regulatory networks connected with in vivo observations, providing clues for further mechanistic studies.

  6. Understanding regulatory networks requires more than computing a multitude of graph statistics. Comment on "Drivers of structural features in gene regulatory networks: From biophysical constraints to biological function" by O.C. Martin et al.

    Science.gov (United States)

    Tkačik, Gašper

    2016-07-01

    The article by O. Martin and colleagues provides a much needed systematic review of a body of work that relates the topological structure of genetic regulatory networks to evolutionary selection for function. This connection is very important. Using the current wealth of genomic data, statistical features of regulatory networks (e.g., degree distributions, motif composition, etc.) can be quantified rather easily; it is, however, often unclear how to interpret the results. On a graph theoretic level the statistical significance of the results can be evaluated by comparing observed graphs to "randomized" ones (bravely ignoring the issue of how precisely to randomize!) and comparing the frequency of appearance of a particular network structure relative to a randomized null expectation. While this is a convenient operational test for statistical significance, its biological meaning is questionable. In contrast, an in-silico genotype-to-phenotype model makes explicit the assumptions about the network function, and thus clearly defines the expected network structures that can be compared to the case of no selection for function and, ultimately, to data.

  7. ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

    Science.gov (United States)

    Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2017-01-04

    The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Brachyury and SMAD signalling collaboratively orchestrate distinct mesoderm and endoderm gene regulatory networks in differentiating human embryonic stem cells.

    Science.gov (United States)

    Faial, Tiago; Bernardo, Andreia S; Mendjan, Sasha; Diamanti, Evangelia; Ortmann, Daniel; Gentsch, George E; Mascetti, Victoria L; Trotter, Matthew W B; Smith, James C; Pedersen, Roger A

    2015-06-15

    The transcription factor brachyury (T, BRA) is one of the first markers of gastrulation and lineage specification in vertebrates. Despite its wide use and importance in stem cell and developmental biology, its functional genomic targets in human cells are largely unknown. Here, we use differentiating human embryonic stem cells to study the role of BRA in activin A-induced endoderm and BMP4-induced mesoderm progenitors. We show that BRA has distinct genome-wide binding landscapes in these two cell populations, and that BRA interacts and collaborates with SMAD1 or SMAD2/3 signalling to regulate the expression of its target genes in a cell-specific manner. Importantly, by manipulating the levels of BRA in cells exposed to different signalling environments, we demonstrate that BRA is essential for mesoderm but not for endoderm formation. Together, our data illuminate the function of BRA in the context of human embryonic development and show that the regulatory role of BRA is context dependent. Our study reinforces the importance of analysing the functions of a transcription factor in different cellular and signalling environments.

  9. General trends in the evolution of prokaryotic transcriptional regulatory networks.

    Science.gov (United States)

    Madan Babu, M; Balaji, S; Aravind, L

    2007-01-01

    Gene expression in organisms is controlled by regulatory proteins termed transcription factors, which recognize and bind to specific nucleotide sequences. Over the years, considerable information has accumulated on the regulatory interactions between transcription factors and their target genes in various model prokaryotes, such as Escherichia coli and Bacillus subtilis. This has allowed the representation of this information in the form of a directed graph, which is commonly referred to as the transcriptional regulatory network. The network representation provides us with an excellent conceptual framework to understand the structure of the transcriptional regulation, both at local and global levels of organization. Several studies suggest that the transcriptional network inferred from model organisms may be approximated by a scale-free topology, which in turn implies the presence of a relatively small group of highly connected regulators (hubs or global regulators). While the graph theoretical principles have been applied to infer various properties of such networks, there have been few studies that have actually investigated the evolution of the transcriptional regulatory networks across diverse organisms. Using recently developed computational methods that exploit various evolutionary principles, we have attempted to reconstruct and compare these networks across a wide-range of prokaryotes. This has provided several insights on the modification and diversification of network structures of various organisms in course of evolution. Firstly, we observed that target genes show a much higher level of conservation than their transcriptional regulators. This in turn suggested that the same set of functions could be differently controlled across diverse organisms, contributing significantly to their adaptive radiations. In particular, at the local level of network structure, organism-specific optimization of the transcription network has evolved primarily via tinkering

  10. XAANTAL2 (AGL14) Is an Important Component of the Complex Gene Regulatory Network that Underlies Arabidopsis Shoot Apical Meristem Transitions.

    Science.gov (United States)

    Pérez-Ruiz, Rigoberto V; García-Ponce, Berenice; Marsch-Martínez, Nayelli; Ugartechea-Chirino, Yamel; Villajuana-Bonequi, Mitzi; de Folter, Stefan; Azpeitia, Eugenio; Dávila-Velderrain, José; Cruz-Sánchez, David; Garay-Arroyo, Adriana; Sánchez, María de la Paz; Estévez-Palmas, Juan M; Álvarez-Buylla, Elena R

    2015-05-01

    In Arabidopsis thaliana, multiple genes involved in shoot apical meristem (SAM) transitions have been characterized, but the mechanisms required for the dynamic attainment of vegetative, inflorescence, and floral meristem (VM, IM, FM) cell fates during SAM transitions are not well understood. Here we show that a MADS-box gene, XAANTAL2 (XAL2/AGL14), is necessary and sufficient to induce flowering, and its regulation is important in FM maintenance and determinacy. xal2 mutants are late flowering, particularly under short-day (SD) condition, while XAL2 overexpressing plants are early flowering, but their flowers have vegetative traits. Interestingly, inflorescences of the latter plants have higher expression levels of LFY, AP1, and TFL1 than wild-type plants. In addition we found that XAL2 is able to bind the TFL1 regulatory regions. On the other hand, the basipetal carpels of the 35S::XAL2 lines lose determinacy and maintain high levels of WUS expression under SD condition. To provide a mechanistic explanation for the complex roles of XAL2 in SAM transitions and the apparently paradoxical phenotypes of XAL2 and other MADS-box (SOC1, AGL24) overexpressors, we conducted dynamic gene regulatory network (GRN) and epigenetic landscape modeling. We uncovered a GRN module that underlies VM, IM, and FM gene configurations and transition patterns in wild-type plants as well as loss and gain of function lines characterized here and previously. Our approach thus provides a novel mechanistic framework for understanding the complex basis of SAM development. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.

  11. Dissecting microregulation of a master regulatory network

    Directory of Open Access Journals (Sweden)

    Kaimal Vivek

    2008-02-01

    Full Text Available Abstract Background The master regulator p53 tumor-suppressor protein through coordination of several downstream target genes and upstream transcription factors controls many pathways important for tumor suppression. While it has been reported that some of the p53's functions are microRNA-mediated, it is not known as to how many other microRNAs might contribute to the p53-mediated tumorigenesis. Results Here, we use bioinformatics-based integrative approach to identify and prioritize putative p53-regulated miRNAs, and unravel the miRNA-based microregulation of the p53 master regulatory network. Specifically, we identify putative microRNA regulators of a transcription factors that are upstream or downstream to p53 and b p53 interactants. The putative p53-miRs and their targets are prioritized using current knowledge of cancer biology and literature-reported cancer-miRNAs. Conclusion Our predicted p53-miRNA-gene networks strongly suggest that coordinated transcriptional and p53-miR mediated networks could be integral to tumorigenesis and the underlying processes and pathways.

  12. A data integration approach to mapping OCT4 gene regulatory networks operative in embryonic stem cells and embryonal carcinoma cells.

    Directory of Open Access Journals (Sweden)

    Marc Jung

    Full Text Available It is essential to understand the network of transcription factors controlling self-renewal of human embryonic stem cells (ESCs and human embryonal carcinoma cells (ECs if we are to exploit these cells in regenerative medicine regimes. Correlating gene expression levels after RNAi-based ablation of OCT4 function with its downstream targets enables a better prediction of motif-specific driven expression modules pertinent for self-renewal and differentiation of embryonic stem cells and induced pluripotent stem cells.We initially identified putative direct downstream targets of OCT4 by employing CHIP-on-chip analysis. A comparison of three peak analysis programs revealed a refined list of OCT4 targets in the human EC cell line NCCIT, this list was then compared to previously published OCT4 CHIP-on-chip datasets derived from both ES and EC cells. We have verified an enriched POU-motif, discovered by a de novo approach, thus enabling us to define six distinct modules of OCT4 binding and regulation of its target genes.A selection of these targets has been validated, like NANOG, which harbours the evolutionarily conserved OCT4-SOX2 binding motif within its proximal promoter. Other validated targets, which do not harbour the classical HMG motif are USP44 and GADD45G, a key regulator of the cell cycle. Over-expression of GADD45G in NCCIT cells resulted in an enrichment and up-regulation of genes associated with the cell cycle (CDKN1B, CDKN1C, CDK6 and MAPK4 and developmental processes (BMP4, HAND1, EOMES, ID2, GATA4, GATA5, ISL1 and MSX1. A comparison of positively regulated OCT4 targets common to EC and ES cells identified genes such as NANOG, PHC1, USP44, SOX2, PHF17 and OCT4, thus further confirming their universal role in maintaining self-renewal in both cell types. Finally we have created a user-friendly database (http://biit.cs.ut.ee/escd/, integrating all OCT4 and stem cell related datasets in both human and mouse ES and EC cells.In the current

  13. Regulatory networks contributing to psoriasis susceptibility.

    Science.gov (United States)

    Szabó, Kornélia; Bata-Csörgő, Zsuzsanna; Dallos, Attila; Bebes, Attila; Francziszti, László; Dobozy, Attila; Kemény, Lajos; Széll, Márta

    2014-07-01

    The non-involved, healthy-looking skin of psoriatic patients displays inherent characteristics that make it prone to develop typical psoriatic symptoms. Our primary aim was to identify genes and proteins that are differentially regulated in the non-involved psoriatic and the normal epidermis, and to discover regulatory networks responsible for these differences. A cDNA microarray experiment was performed to compare the gene expression profiles of 4 healthy and 4 psoriatic non-involved epidermis samples in response to T-cell lymphokine induction in organotypic cultures. We identified 61 annotated genes and another 11 expressed transcripts that were differentially regulated in the psoriatic tissues. Bioinformatics analysis suggested that the regulation of cell morphology, development and cell death is abnormal, and that the metabolism of small molecules and lipids is differentially regulated in psoriatic epidermis. Our results indicate that one of the early steps of psoriasis pathogenesis may be the abnormal regulation of IL-23A and IL-1B genes in psoriatic keratinocytes.

  14. A stochastic differential equation model for transcriptional regulatory networks

    Directory of Open Access Journals (Sweden)

    Quirk Michelle D

    2007-05-01

    Full Text Available Abstract Background This work explores the quantitative characteristics of the local transcriptional regulatory network based on the availability of time dependent gene expression data sets. The dynamics of the gene expression level are fitted via a stochastic differential equation model, yielding a set of specific regulators and their contribution. Results We show that a beta sigmoid function that keeps track of temporal parameters is a novel prototype of a regulatory function, with the effect of improving the performance of the profile prediction. The stochastic differential equation model follows well the dynamic of the gene expression levels. Conclusion When adapted to biological hypotheses and combined with a promoter analysis, the method proposed here leads to improved models of the transcriptional regulatory networks.

  15. Cancer genomics identifies regulatory gene networks associated with the transition from dysplasia to advanced lung adenocarcinomas induced by c-Raf-1.

    Directory of Open Access Journals (Sweden)

    Astrid Rohrbeck

    Full Text Available BACKGROUND: Lung cancer is a leading cause of cancer morbidity. To improve an understanding of molecular causes of disease a transgenic mouse model was investigated where targeted expression of the serine threonine kinase c-Raf to respiratory epithelium induced initially dysplasia and subsequently adenocarcinomas. This enables dissection of genetic events associated with precancerous and cancerous lesions. METHODOLOGY/PRINCIPAL FINDINGS: By laser microdissection cancer cell populations were harvested and subjected to whole genome expression analyses. Overall 473 and 541 genes were significantly regulated, when cancer versus transgenic and non-transgenic cells were compared, giving rise to three distinct and one common regulatory gene network. At advanced stages of tumor growth predominately repression of gene expression was observed, but genes previously shown to be up-regulated in dysplasia were also up-regulated in solid tumors. Regulation of developmental programs as well as epithelial mesenchymal and mesenchymal endothelial transition was a hall mark of adenocarcinomas. Additionally, genes coding for cell adhesion, i.e. the integrins and the tight and gap junction proteins were repressed, whereas ligands for receptor tyrosine kinase such as epi- and amphiregulin were up-regulated. Notably, Vegfr- 2 and its ligand Vegfd, as well as Notch and Wnt signalling cascades were regulated as were glycosylases that influence cellular recognition. Other regulated signalling molecules included guanine exchange factors that play a role in an activation of the MAP kinases while several tumor suppressors i.e. Mcc, Hey1, Fat3, Armcx1 and Reck were significantly repressed. Finally, probable molecular switches forcing dysplastic cells into malignantly transformed cells could be identified. CONCLUSIONS/SIGNIFICANCE: This study provides insight into molecular pertubations allowing dysplasia to progress further to adenocarcinoma induced by exaggerted c-Raf kinase

  16. MicroRNA and transcription factor mediated regulatory network for ovarian cancer: regulatory network of ovarian cancer.

    Science.gov (United States)

    Ying, Huanchun; Lv, Jing; Ying, Tianshu; Li, Jun; Yang, Qing; Ma, Yuan

    2013-10-01

    A better understanding on the regulatory interactions of microRNA (miRNA) target genes and transcription factor (TF) target genes in ovarian cancer may be conducive for developing early diagnosis strategy. Thus, gene expression data and miRNA expression data were downloaded from The Cancer Genome Atlas in this study. Differentially expressed genes and miRNAs were selected out with t test, and Gene Ontology enrichment analysis was performed with DAVID tools. Regulatory interactions were retrieved from miRTarBase, TRED, and TRANSFAC, and then networks for miRNA target genes and TF target genes were constructed to globally present the mechanisms. As a result, a total of 1,939 differentially expressed genes were identified, and they were enriched in 28 functions, among which cell cycle was affected to the most degree. Besides, 213 differentially expressed miRNAs were identified. Two regulatory networks for miRNA target genes and TF target genes were established and then both were combined, in which E2F transcription factor 1, cyclin-dependent kinase inhibitor 1A, cyclin E1, and miR-16 were the hub genes. These genes may be potential biomarkers for ovarian cancer.

  17. Candidate Genes from Molecular Pathways Related to Appetite Regulatory Neural Network and Adipocyte Homeostasis and Obesity: the Coronary Artery Risk Development in Young Adults (CARDIA) Study

    Science.gov (United States)

    Friedlander, Yechiel; Li, Guo; Fornage, Myriam; Williams, O. Dale; Lewis, Cora E.; Schreiner, Pamela; Pletcher, Mark J.; Enquobahrie, Daniel; Williams, Michelle; Siscovick, David S.

    2010-01-01

    Background Appetite regulatory neural network and adipocyte homeostasis molecular pathways are critical to long-term weight maintenance. Genetic variation in these pathways may explain variability of obesity in the general population. Aims The associations of four genes in these pathways (leptin (LEP), leptin receptor (LEPR), neuropeptide Y2 receptor (NPY2R) and peptide YY (PYY)) with obesity-related phenotypes were examined among participants in the CARDIA Study. Participants were 18-30 years old upon recruitment (1985-86). Weight, BMI and waist circumference were measured at baseline and at years 2, 5, 7, 10, 15, and 20. Genotyping was conducted using tag SNPs that characterize the common pattern of genetic variation in these genes. Race-specific linear regression models were used to examine associations of the various SNPs with obesity-related measurements, controlling for sex and age. The overall association based on the 7 repeated anthropometric measurements was tested with GEE. False discovery rate was used to adjust for multiple testing. Results In African-Americans, SNPs across the LEP gene demonstrated significant overall associations with obesity-related phenotypes. The associations between rs17151919 in LEP gene with weight tended to increase with time (SNP × time interaction p=0.0193). The difference in weight levels associated with each additional minor allele ranged from 2.6 kg at entry to 4.8 kg at year 20. Among African-American men, the global tests indicated that SNPs across the NPY2R gene were also associated with waist circumference measurements (p=0.0462). In Caucasians, SNPs across the LEP gene also tended to be associated with weight measurements (p=0.0471) and rs11684664 in PYY gene was associated with obesity-related phenotypes (p= 0.010-0.026) in women only. Conclusions Several SNPs in the LEP, NPY2R and PYY but not the LEPR genes were associated with obesity-related phenotypes in young adults. The associations were more prominent for the

  18. An Integrative Approach to Computational Modelling of the Gene Regulatory Network Controlling Clostridium botulinum Type A1 Toxin Production

    Science.gov (United States)

    Walshaw, John; Peck, Michael W.; Barker, Gary C.

    2016-01-01

    Clostridium botulinum produces botulinum neurotoxins (BoNTs), highly potent substances responsible for botulism. Currently, mathematical models of C. botulinum growth and toxigenesis are largely aimed at risk assessment and do not include explicit genetic information beyond group level but integrate many component processes, such as signalling, membrane permeability and metabolic activity. In this paper we present a scheme for modelling neurotoxin production in C. botulinum Group I type A1, based on the integration of diverse information coming from experimental results available in the literature. Experiments show that production of BoNTs depends on the growth-phase and is under the control of positive and negative regulatory elements at the intracellular level. Toxins are released as large protein complexes and are associated with non-toxic components. Here, we systematically review and integrate those regulatory elements previously described in the literature for C. botulinum Group I type A1 into a population dynamics model, to build the very first computational model of toxin production at the molecular level. We conduct a validation of our model against several items of published experimental data for different wild type and mutant strains of C. botulinum Group I type A1. The result of this process underscores the potential of mathematical modelling at the cellular level, as a means of creating opportunities in developing new strategies that could be used to prevent botulism; and potentially contribute to improved methods for the production of toxin that is used for therapeutics. PMID:27855161

  19. Modular genetic regulatory networks increase organization during pattern formation.

    Science.gov (United States)

    Mohamadlou, Hamid; Podgorski, Gregory J; Flann, Nicholas S

    2016-08-01

    Studies have shown that genetic regulatory networks (GRNs) consist of modules that are densely connected subnetworks that function quasi-autonomously. Modules may be recognized motifs that comprise of two or three genes with particular regulatory functions and connectivity or be purely structural and identified through connection density. It is unclear what evolutionary and developmental advantages modular structure and in particular motifs provide that have led to this enrichment. This study seeks to understand how modules within developmental GRNs influence the complexity of multicellular patterns that emerge from the dynamics of the regulatory networks. We apply an algorithmic complexity to measure the organization of the patterns. A computational study was performed by creating Boolean intracellular networks within a simulated epithelial field of embryonic cells, where each cell contains the same network and communicates with adjacent cells using contact-mediated signaling. Intracellular networks with random connectivity were compared to those with modular connectivity and with motifs. Results show that modularity effects network dynamics and pattern organization significantly. In particular: (1) modular connectivity alone increases complexity in network dynamics and patterns; (2) bistable switch motifs simplify both the pattern and network dynamics; (3) all other motifs with feedback loops increase multicellular pattern complexity while simplifying the network dynamics; (4) negative feedback loops affect the dynamics complexity more significantly than positive feedback loops.

  20. Evolutionary tinkering with conserved components of a transcriptional regulatory network.

    OpenAIRE

    Hugo Lavoie; Hervé Hogues; Jaideep Mallick; Adnane Sellam; André Nantel; Malcolm Whiteway

    2010-01-01

    Gene expression variation between species is a major contributor to phenotypic diversity, yet the underlying flexibility of transcriptional regulatory networks remains largely unexplored. Transcription of the ribosomal regulon is a critical task for all cells; in S. cerevisiae the transcription factors Rap1, Fhl1, Ifh1, and Hmo1 form a multi-subunit complex that controls ribosomal gene expression, while in C. albicans this regulation is under the control of Tbf1 and Cbf1. Here, we analyzed, u...

  1. Topology of transcriptional regulatory networks: testing and improving.

    Directory of Open Access Journals (Sweden)

    Dicle Hasdemir

    Full Text Available With the increasing amount and complexity of data generated in biological experiments it is becoming necessary to enhance the performance and applicability of existing statistical data analysis methods. This enhancement is needed for the hidden biological information to be better resolved and better interpreted. Towards that aim, systematic incorporation of prior information in biological data analysis has been a challenging problem for systems biology. Several methods have been proposed to integrate data from different levels of information most notably from metabolomics, transcriptomics and proteomics and thus enhance biological interpretation. However, in order not to be misled by the dominance of incorrect prior information in the analysis, being able to discriminate between competing prior information is required. In this study, we show that discrimination between topological information in competing transcriptional regulatory network models is possible solely based on experimental data. We use network topology dependent decomposition of synthetic gene expression data to introduce both local and global discriminating measures. The measures indicate how well the gene expression data can be explained under the constraints of the model network topology and how much each regulatory connection in the model refuses to be constrained. Application of the method to the cell cycle regulatory network of Saccharomyces cerevisiae leads to the prediction of novel regulatory interactions, improving the information content of the hypothesized network model.

  2. Evidence for Alteration of Gene Regulatory Networks through MicroRNAs of the HIV-infected brain: novel analysis of retrospective cases.

    Science.gov (United States)

    Tatro, Erick T; Scott, Erick R; Nguyen, Timothy B; Salaria, Shahid; Banerjee, Sugato; Moore, David J; Masliah, Eliezer; Achim, Cristian L; Everall, Ian P

    2010-04-26

    -sites for dysregulated miRNAs. We provide evidence that certain miRNAs serve as key elements in gene regulatory networks in HIV-infected FC and may be implicated in neurobehavioral disorder. Finally, our data indicates that some genes may serve as hubs of miRNA activity.

  3. Genome-wide study of KNOX regulatory network reveals brassinosteroid catabolic genes important for shoot meristem function in rice

    Science.gov (United States)

    In flowering plants, knotted1-like homeobox (KNOX) transcription factors play crucial roles in establishment and maintenance of the shoot apical meristem (SAM), from which aerial organs such as leaves, stems, and flowers initiate. We report that a rice (Oryza sativa) KNOX gene Oryza sativa homeobox1...

  4. Gene expression profiling of cultured human NF1 heterozygous (NF1+/-) melanocytes reveals downregulation of a transcriptional cis-regulatory network mediating activation of the melanocyte-specific dopachrome tautomerase (DCT) gene.

    Science.gov (United States)

    Boucneau, Joachim; De Schepper, Sofie; Vuylsteke, Marnik; Van Hummelen, Paul; Naeyaert, Jean-Marie; Lambert, Jo

    2005-08-01

    One of the major primary features of the neurocutaneous genetic disorder Neurofibromatosis type 1 are the hyperpigmentary café-au-lait macules where disregulation of melanocyte biology is supposed to play a key etiopathogenic role. To gain better insight into the possible role of the tumor suppressor gene NF1, a transcriptomic microarray analysis was performed on human NF1 heterozygous (NF1+/-) melanocytes of a Neurofibromatosis type 1 patient and NF1 wild type (NF1+/+) melanocytes of a healthy control patient, both cultured from normally pigmented skin and hyperpigmented lesional café-au-lait skin. From the magnitude of gene effects, we found that gene expression was affected most strongly by genotype and less so by lesional type. A total of 137 genes had a significant twofold or more up- (72) or downregulated (65) expression in NF1+/- melanocytes compared with NF1+/+ melanocytes. Melanocytes cultured from hyperpigmented café-au-lait skin showed 37 upregulated genes whereas only 14 were downregulated compared with normal skin melanocytes. In addition, significant genotype xlesional type interactions were observed for 465 genes. Differentially expressed genes were mainly involved in regulating cell proliferation and cell adhesion. A high number of transcription factor genes, among which a specific subset important in melanocyte lineage development, were downregulated in the cis-regulatory network governing the activation of the melanocyte-specific dopachrome tautomerase (DCT) gene. Although the results presented have been obtained with a restricted number of patients (one NF1 patient and one control) and using cDNA microarrays that may limit their interpretation, the data nevertheless addresses for the first time the effect of a heterozygous NF1 gene on the expression of the human melanocyte transcriptome and has generated several interesting candidate genes helpful in elucidating the etiopathology of café-au-lait macules in NF1 patients.

  5. Dynamics of regulatory networks in gastrin-treated adenocarcinoma cells.

    Directory of Open Access Journals (Sweden)

    Naresh Doni Jayavelu

    Full Text Available Understanding gene transcription regulatory networks is critical to deciphering the molecular mechanisms of different cellular states. Most studies focus on static transcriptional networks. In the current study, we used the gastrin-regulated system as a model to understand the dynamics of transcriptional networks composed of transcription factors (TFs and target genes (TGs. The hormone gastrin activates and stimulates signaling pathways leading to various cellular states through transcriptional programs. Dysregulation of gastrin can result in cancerous tumors, for example. However, the regulatory networks involving gastrin are highly complex, and the roles of most of the components of these networks are unknown. We used time series microarray data of AR42J adenocarcinoma cells treated with gastrin combined with static TF-TG relationships integrated from different sources, and we reconstructed the dynamic activities of TFs using network component analysis (NCA. Based on the peak expression of TGs and activity of TFs, we created active sub-networks at four time ranges after gastrin treatment, namely immediate-early (IE, mid-early (ME, mid-late (ML and very late (VL. Network analysis revealed that the active sub-networks were topologically different at the early and late time ranges. Gene ontology analysis unveiled that each active sub-network was highly enriched in a particular biological process. Interestingly, network motif patterns were also distinct between the sub-networks. This analysis can be applied to other time series microarray datasets, focusing on smaller sub-networks that are activated in a cascade, allowing better overview of the mechanisms involved at each time range.

  6. Master regulators, regulatory networks, and pathways of glioblastoma subtypes.

    Science.gov (United States)

    Bozdag, Serdar; Li, Aiguo; Baysan, Mehmet; Fine, Howard A

    2014-01-01

    Glioblastoma multiforme (GBM) is the most common malignant brain tumor. GBM samples are classified into subtypes based on their transcriptomic and epigenetic profiles. Despite numerous studies to better characterize GBM biology, a comprehensive study to identify GBM subtype- specific master regulators, gene regulatory networks, and pathways is missing. Here, we used FastMEDUSA to compute master regulators and gene regulatory networks for each GBM subtype. We also ran Gene Set Enrichment Analysis and Ingenuity Pathway Analysis on GBM expression dataset from The Cancer Genome Atlas Project to compute GBM- and GBM subtype-specific pathways. Our analysis was able to recover some of the known master regulators and pathways in GBM as well as some putative novel regulators and pathways, which will aide in our understanding of the unique biology of GBM subtypes.

  7. Differential Regulatory Analysis Based on Coexpression Network in Cancer Research

    Directory of Open Access Journals (Sweden)

    Junyi Li

    2016-01-01

    Full Text Available With rapid development of high-throughput techniques and accumulation of big transcriptomic data, plenty of computational methods and algorithms such as differential analysis and network analysis have been proposed to explore genome-wide gene expression characteristics. These efforts are aiming to transform underlying genomic information into valuable knowledges in biological and medical research fields. Recently, tremendous integrative research methods are dedicated to interpret the development and progress of neoplastic diseases, whereas differential regulatory analysis (DRA based on gene coexpression network (GCN increasingly plays a robust complement to regular differential expression analysis in revealing regulatory functions of cancer related genes such as evading growth suppressors and resisting cell death. Differential regulatory analysis based on GCN is prospective and shows its essential role in discovering the system properties of carcinogenesis features. Here we briefly review the paradigm of differential regulatory analysis based on GCN. We also focus on the applications of differential regulatory analysis based on GCN in cancer research and point out that DRA is necessary and extraordinary to reveal underlying molecular mechanism in large-scale carcinogenesis studies.

  8. A marker-derived gene network reveals the regulatory role of PPARGC1A, HNF4G, and FOXP3 in intramuscular fat deposition of beef cattle.

    Science.gov (United States)

    Ramayo-Caldas, Y; Fortes, M R S; Hudson, N J; Porto-Neto, L R; Bolormaa, S; Barendse, W; Kelly, M; Moore, S S; Goddard, M E; Lehnert, S A; Reverter, A

    2014-07-01

    High intramuscular fat (IMF) awards price premiums to beef producers and is associated with meat quality and flavor. Studying gene interactions and pathways that affect IMF might unveil causative physiological mechanisms and inform genomic selection, leading to increased accuracy of predictions of breeding value. To study gene interactions and pathways, a gene network was derived from genetic markers associated with direct measures of IMF, other fat phenotypes, feedlot performance, and a number of meat quality traits relating to body conformation, development, and metabolism that might be plausibly expected to interact with IMF biology. Marker associations were inferred from genomewide association studies (GWAS) based on high density genotypes and 29 traits measured on 10,181 beef cattle animals from 3 breed types. For the network inference, SNP pairs were assessed according to the strength of the correlation between their additive association effects across the 29 traits. The co-association inferred network was formed by 2,434 genes connected by 28,283 edges. Topological network parameters suggested a highly cohesive network, in which the genes are strongly functionally interconnected. Pathway and network analyses pointed towards a trio of transcription factors (TF) as key regulators of carcass IMF: PPARGC1A, HNF4G, and FOXP3. Importantly, none of these genes would have been deemed as significantly associated with IMF from the GWAS. Instead, a total of 313 network genes show significant co-association with the 3 TF. These genes belong to a wide variety of biological functions, canonical pathways, and genetic networks linked to IMF-related phenotypes. In summary, our GWAS and network predictions are supported by the current literature and suggest a cooperative role for the 3 TF and other interacting genes including CAPN6, STC2, MAP2K4, EYA1, COPS5, XKR4, NR2E1, TOX, ATF1, ASPH, TGS1, and TTPA as modulators of carcass and meat quality traits in beef cattle.

  9. Regulatory Networks:. Inferring Functional Relationships Through Co-Expression

    Science.gov (United States)

    Wanke, Dierk; Hahn, Achim; Kilian, Joachim; Harter, Klaus; Berendzen, Kenneth W.

    2010-01-01

    Gene expression data not only provide us insights into discrete transcript abundance of specific genes, but contain cryptic information that can not readily be assessed without interpretation. We again used data of the plant Arabidopsis thaliana as our reference organism, yet the analysis presented herein can be performed with any organism with various data sources. Within the cell, information is transduced via different signaling cascades and results in differential gene expression responses. The incoming signals are perceived from upstream signaling components and handed to downstream messengers that further deliver the signals to effector proteins which can directly influence gene expression. In most cases, we can assume that proteins, which are connected to other signaling components within such a regulatory network, exhibit similar expression trajectories. Thus, we extracted a known functional network from literature and demonstrated that it is possible to superimpose microarray expression data onto the pathways. Thereby, we could follow the information flow through time reflected by gene expression changes. This allowed us to predict, whether the upstream signal was transmitted from known elements contained in the network or relayed from outside components. We next conducted the vice versa approach and used large scale microarray expression data to build a co-expression matrix for all genes present on the array. From this, we computed a regulatory network, which allowed us to deduce known and novel signaling pathways.

  10. Genomic analysis of the hierarchical structure of regulatory networks

    Science.gov (United States)

    Yu, Haiyuan; Gerstein, Mark

    2006-01-01

    A fundamental question in biology is how the cell uses transcription factors (TFs) to coordinate the expression of thousands of genes in response to various stimuli. The relationships between TFs and their target genes can be modeled in terms of directed regulatory networks. These relationships, in turn, can be readily compared with commonplace “chain-of-command” structures in social networks, which have characteristic hierarchical layouts. Here, we develop algorithms for identifying generalized hierarchies (allowing for various loop structures) and use these approaches to illuminate extensive pyramid-shaped hierarchical structures existing in the regulatory networks of representative prokaryotes (Escherichia coli) and eukaryotes (Saccharomyces cerevisiae), with most TFs at the bottom levels and only a few master TFs on top. These masters are situated near the center of the protein–protein interaction network, a different type of network from the regulatory one, and they receive most of the input for the whole regulatory hierarchy through protein interactions. Moreover, they have maximal influence over other genes, in terms of affecting expression-level changes. Surprisingly, however, TFs at the bottom of the regulatory hierarchy are more essential to the viability of the cell. Finally, one might think master TFs achieve their wide influence through directly regulating many targets, but TFs with most direct targets are in the middle of the hierarchy. We find, in fact, that these midlevel TFs are “control bottlenecks” in the hierarchy, and this great degree of control for “middle managers” has parallels in efficient social structures in various corporate and governmental settings. PMID:17003135

  11. Protein modularity, cooperative binding, and hybrid regulatory states underlie transcriptional network diversification.

    Science.gov (United States)

    Baker, Christopher R; Booth, Lauren N; Sorrells, Trevor R; Johnson, Alexander D

    2012-09-28

    We examine how different transcriptional network structures can evolve from an ancestral network. By characterizing how the ancestral mode of gene regulation for genes specific to a-type cells in yeast species evolved from an activating paradigm to a repressing one, we show that regulatory protein modularity, conversion of one cis-regulatory sequence to another, distribution of binding energy among protein-protein and protein-DNA interactions, and exploitation of ancestral network features all contribute to the evolution of a novel regulatory mode. The formation of this derived mode of regulation did not disrupt the ancestral mode and thereby created a hybrid regulatory state where both means of transcription regulation (ancestral and derived) contribute to the conserved expression pattern of the network. Finally, we show how this hybrid regulatory state has resolved in different ways in different lineages to generate the diversity of regulatory network structures observed in modern species.

  12. The comprehensive updated regulatory network of Escherichia coli K-12

    Directory of Open Access Journals (Sweden)

    Karp Peter D

    2006-01-01

    Full Text Available Abstract Background Escherichia coli is the model organism for which our knowledge of its regulatory network is the most extensive. Over the last few years, our project has been collecting and curating the literature concerning E. coli transcription initiation and operons, providing in both the RegulonDB and EcoCyc databases the largest electronically encoded network available. A paper published recently by Ma et al. (2004 showed several differences in the versions of the network present in these two databases. Discrepancies have been corrected, annotations from this and other groups (Shen-Orr et al., 2002 have been added, making the RegulonDB and EcoCyc databases the largest comprehensive and constantly curated regulatory network of E. coli K-12. Results Several groups have been using these curated data as part of their bioinformatics and systems biology projects, in combination with external data obtained from other sources, thus enlarging the dataset initially obtained from either RegulonDB or EcoCyc of the E. coli K12 regulatory network. We kindly obtained from the groups of Uri Alon and Hong-Wu Ma the interactions they have added to enrich their public versions of the E. coli regulatory network. These were used to search for original references and curate them with the same standards we use regularly, adding in several cases the original references (instead of reviews or missing references, as well as adding the corresponding experimental evidence codes. We also corrected all discrepancies in the two databases available as explained below. Conclusion One hundred and fifty new interactions have been added to our databases as a result of this specific curation effort, in addition to those added as a result of our continuous curation work. RegulonDB gene names are now based on those of EcoCyc to avoid confusion due to gene names and synonyms, and the public releases of RegulonDB and EcoCyc are henceforth synchronized to avoid confusion due to

  13. Integrative analysis of miRNA and gene expression reveals regulatory networks in tamoxifen-resistant breast cancer

    DEFF Research Database (Denmark)

    Joshi, Tejal; Elias, Daniel; Stenvang, Jan

    2016-01-01

    Tamoxifen is an effective anti-estrogen treatment for patients with estrogen receptor-positive (ER+) breast cancer, however, tamoxifen resistance is frequently observed. To elucidate the underlying molecular mechanisms of tamoxifen resistance, we performed a systematic analysis of mi...... and siRNAmediated gene knockdown, we showed that both SNAI2 and FYN significantly affect the growth of TamR cell lines. Finally, we show that a combination of 2 miRNAs (miR-190b and miR-516a-5p) exhibiting altered expression in TamR cell lines were predictive of treatment outcome in a cohort of ER......+ breast cancer patients receiving adjuvant tamoxifen mono-therapy. Our results provide new insight into the molecular mechanisms of tamoxifen resistance and may form the basis for future medical intervention for the large number of women with tamoxifen-resistant ER+ breast cancer....

  14. Distinct and overlapping gene regulatory networks in BMP- and HDAC-controlled cell fate determination in the embryonic forebrain

    Directory of Open Access Journals (Sweden)

    Scholl Catharina

    2012-07-01

    Full Text Available Abstract Background Both bone morphogenetic proteins (BMPs and histone deacetylases (HDACs have previously been established to play a role in the development of the three major cell types of the central nervous system: neurons, astrocytes, and oligodendrocytes. We have previously established a connection between these two protein families, showing that HDACs suppress BMP-promoted astrogliogenesis in the embryonic striatum. Since HDACs act in the nucleus to effect changes in transcription, an unbiased analysis of their transcriptional targets could shed light on their downstream effects on BMP-signaling. Results Using neurospheres from the embryonic striatum as an in vitro system to analyze this phenomenon, we have performed microarray expression profiling on BMP2- and TSA-treated cultures, followed by validation of the findings with quantitative RT-PCR and protein analysis. In BMP-treated cultures we first observed an upregulation of genes involved in cell-cell communication and developmental processes such as members of BMP and canonical Wnt signaling pathways. In contrast, in TSA-treated cultures we first observed an upregulation of genes involved in chromatin modification and transcription. Interestingly, we could not record direct changes in the protein levels of canonical members of BMP2 signaling, but we did observe an upregulation of both the transcription factor STAT3 and its active isoform phospho-STAT3 at the protein level. Conclusions STAT3 and SMAD1/5/8 interact synergistically to promote astrogliogenesis, and thus we show for the first time that HDACs act to suppress BMP-promoted astrogliogenesis by suppression of the crucial partner STAT3.

  15. Regulatory coordination of clustered microRNAs based on microRNA-transcription factor regulatory network

    Directory of Open Access Journals (Sweden)

    Wang Jin

    2011-12-01

    Full Text Available Abstract Background MicroRNA (miRNA is a class of small RNAs of ~22nt which play essential roles in many crucial biological processes and numerous human diseases at post-transcriptional level of gene expression. It has been revealed that miRNA genes tend to be clustered, and the miRNAs organized into one cluster are usually transcribed coordinately. This implies a coordinated regulation mode exerted by clustered miRNAs. However, how the clustered miRNAs coordinate their regulations on large scale gene expression is still unclear. Results We constructed the miRNA-transcription factor regulatory network that contains the interactions between transcription factors (TFs, miRNAs and non-TF protein-coding genes, and made a genome-wide study on the regulatory coordination of clustered miRNAs. We found that there are two types of miRNA clusters, i.e. homo-clusters that contain miRNAs of the same family and hetero-clusters that contain miRNAs of various families. In general, the homo-clustered as well as the hetero-clustered miRNAs both exhibit coordinated regulation since the miRNAs belonging to one cluster tend to be involved in the same network module, which performs a relatively isolated biological function. However, the homo-clustered miRNAs show a direct regulatory coordination that is realized by one-step regulation (i.e. the direct regulation of the coordinated targets, whereas the hetero-clustered miRNAs show an indirect regulatory coordination that is realized by a regulation comprising at least three steps (e.g. the regulation on the coordinated targets by a miRNA through a sequential action of two TFs. The direct and indirect regulation target different categories of genes, the former predominantly regulating genes involved in emergent responses, the latter targeting genes that imply long-term effects. Conclusion The genomic clustering of miRNAs is closely related to the coordinated regulation in the gene regulatory network. The pattern of

  16. Shaping Formal Networks throug the Regulatory Process

    NARCIS (Netherlands)

    Hall, Thad E.; O'Toole, Laurence J.

    2004-01-01

    Recent research has shown that, at the federal level, new or amended programs typically create networks consisting of multiactor structures spanning governments, sectors, and/or agencies. This study examines the implementation structures created through the regulatory process. We find that in a majo

  17. Integrated in silico Analyses of Regulatory and Metabolic Networks of Synechococcus sp. PCC 7002 Reveal Relationships between Gene Centrality and Essentiality

    Directory of Open Access Journals (Sweden)

    Hyun-Seob Song

    2015-03-01

    Full Text Available Cyanobacteria dynamically relay environmental inputs to intracellular adaptations through a coordinated adjustment of photosynthetic efficiency and carbon processing rates. The output of such adaptations is reflected through changes in transcriptional patterns and metabolic flux distributions that ultimately define growth strategy. To address interrelationships between metabolism and regulation, we performed integrative analyses of metabolic and gene co-expression networks in a model cyanobacterium, Synechococcus sp. PCC 7002. Centrality analyses using the gene co-expression network identified a set of key genes, which were defined here as “topologically important.” Parallel in silico gene knock-out simulations, using the genome-scale metabolic network, classified what we termed as “functionally important” genes, deletion of which affected growth or metabolism. A strong positive correlation was observed between topologically and functionally important genes. Functionally important genes exhibited variable levels of topological centrality; however, the majority of topologically central genes were found to be functionally essential for growth. Subsequent functional enrichment analysis revealed that both functionally and topologically important genes in Synechococcus sp. PCC 7002 are predominantly associated with translation and energy metabolism, two cellular processes critical for growth. This research demonstrates how synergistic network-level analyses can be used for reconciliation of metabolic and gene expression data to uncover fundamental biological principles.

  18. Modeling regulatory cascades using Artificial Neural Networks: the case of transcriptional regulatory networks shaped during the yeast stress response.

    Science.gov (United States)

    Manioudaki, Maria E; Poirazi, Panayiota

    2013-01-01

    Over the last decade, numerous computational methods have been developed in order to infer and model biological networks. Transcriptional networks in particular have attracted significant attention due to their critical role in cell survival. The majority of network inference methods use genome-wide experimental data to search for modules of genes with coherent expression profiles and common regulators, often ignoring the multi-layer structure of transcriptional cascades. Modeling methodologies on the other hand assume a given network structure and vary significantly in their algorithmic approach, ranging from over-simplified representations (e.g., Boolean networks) to detailed -but computationally expensive-network simulations (e.g., with differential equations). In this work we use Artificial Neural Networks (ANNs) to model transcriptional regulatory cascades that emerge during the stress response in Saccharomyces cerevisiae and extend in three layers. We confine the structure of the ANNs to match the structure of the biological networks as determined by gene expression, DNA-protein interaction and experimental evidence provided in publicly available databases. Trained ANNs are able to predict the expression profile of 11 target genes across multiple experimental conditions with a correlation coefficient >0.7. When time-dependent interactions between upstream transcription factors (TFs) and their indirect targets are also included in the ANNs, accurate predictions are achieved for 30/34 target genes. Moreover, heterodimer formation is taken into account. We show that ANNs can be used to (1) accurately predict the expression of downstream genes in a 3-layer transcriptional cascade based on the expression of their indirect regulators and (2) infer the condition- and time-dependent activity of various TFs as well as during heterodimer formation. We show that a three-layer regulatory cascade whose structure is determined by co-expressed gene modules and their

  19. On the underlying assumptions of threshold Boolean networks as a model for genetic regulatory network behavior

    Science.gov (United States)

    Tran, Van; McCall, Matthew N.; McMurray, Helene R.; Almudevar, Anthony

    2013-01-01

    Boolean networks (BoN) are relatively simple and interpretable models of gene regulatory networks. Specifying these models with fewer parameters while retaining their ability to describe complex regulatory relationships is an ongoing methodological challenge. Additionally, extending these models to incorporate variable gene decay rates, asynchronous gene response, and synergistic regulation while maintaining their Markovian nature increases the applicability of these models to genetic regulatory networks (GRN). We explore a previously-proposed class of BoNs characterized by linear threshold functions, which we refer to as threshold Boolean networks (TBN). Compared to traditional BoNs with unconstrained transition functions, these models require far fewer parameters and offer a more direct interpretation. However, the functional form of a TBN does result in a reduction in the regulatory relationships which can be modeled. We show that TBNs can be readily extended to permit self-degradation, with explicitly modeled degradation rates. We note that the introduction of variable degradation compromises the Markovian property fundamental to BoN models but show that a simple state augmentation procedure restores their Markovian nature. Next, we study the effect of assumptions regarding self-degradation on the set of possible steady states. Our findings are captured in two theorems relating self-degradation and regulatory feedback to the steady state behavior of a TBN. Finally, we explore assumptions of synchronous gene response and asynergistic regulation and show that TBNs can be easily extended to relax these assumptions. Applying our methods to the budding yeast cell-cycle network revealed that although the network is complex, its steady state is simplified by the presence of self-degradation and lack of purely positive regulatory cycles. PMID:24376454

  20. Dynamic simulation of regulatory networks using SQUAD

    Directory of Open Access Journals (Sweden)

    Xenarios Ioannis

    2007-11-01

    Full Text Available Abstract Background The ambition of most molecular biologists is the understanding of the intricate network of molecular interactions that control biological systems. As scientists uncover the components and the connectivity of these networks, it becomes possible to study their dynamical behavior as a whole and discover what is the specific role of each of their components. Since the behavior of a network is by no means intuitive, it becomes necessary to use computational models to understand its behavior and to be able to make predictions about it. Unfortunately, most current computational models describe small networks due to the scarcity of kinetic data available. To overcome this problem, we previously published a methodology to convert a signaling network into a dynamical system, even in the total absence of kinetic information. In this paper we present a software implementation of such methodology. Results We developed SQUAD, a software for the dynamic simulation of signaling networks using the standardized qualitative dynamical systems approach. SQUAD converts the network into a discrete dynamical system, and it uses a binary decision diagram algorithm to identify all the steady states of the system. Then, the software creates a continuous dynamical system and localizes its steady states which are located near the steady states of the discrete system. The software permits to make simulations on the continuous system, allowing for the modification of several parameters. Importantly, SQUAD includes a framework for perturbing networks in a manner similar to what is performed in experimental laboratory protocols, for example by activating receptors or knocking out molecular components. Using this software we have been able to successfully reproduce the behavior of the regulatory network implicated in T-helper cell differentiation. Conclusion The simulation of regulatory networks aims at predicting the behavior of a whole system when subject

  1. Genome-Wide Investigation Using sRNA-Seq, Degradome-Seq and Transcriptome-Seq Reveals Regulatory Networks of microRNAs and Their Target Genes in Soybean during Soybean mosaic virus Infection.

    Directory of Open Access Journals (Sweden)

    Hui Chen

    Full Text Available MicroRNAs (miRNAs play key roles in a variety of cellular processes through regulation of their target gene expression. Accumulated experimental evidence has demonstrated that infections by viruses are associated with the altered expression profile of miRNAs and their mRNA targets in the host. However, the regulatory network of miRNA-mRNA interactions during viral infection remains largely unknown. In this study, we performed small RNA (sRNA-seq, degradome-seq and as well as a genome-wide transcriptome analysis to profile the global gene and miRNA expression in soybean following infections by three different Soybean mosaic virus (SMV isolates, L (G2 strain, LRB (G2 strain and G7 (G7 strain. sRNA-seq analyses revealed a total of 253 soybean miRNAs with a two-fold or greater change in abundance compared with the mock-inoculated control. 125 transcripts were identified as the potential cleavage targets of 105 miRNAs and validated by degradome-seq analyses. Genome-wide transcriptome analysis showed that total 2679 genes are differentially expressed in response to SMV infection including 71 genes predicted as involved in defense response. Finally, complex miRNA-mRNA regulatory networks were derived using the RNAseq, small RNAseq and degradome data. This work represents a comprehensive, global approach to examining virus-host interactions. Genes responsive to SMV infection are identified as are their potential miRNA regulators. Additionally, regulatory changes of the miRNAs themselves are described and the regulatory relationships were supported with degradome data. Taken together these data provide new insights into molecular SMV-soybean interactions and offer candidate miRNAs and their targets for further elucidation of the SMV infection process.

  2. Pleiotropy constrains the evolution of protein but not regulatory sequences in a transcription regulatory network influencing complex social behaviours

    Directory of Open Access Journals (Sweden)

    Daria eMolodtsova

    2014-12-01

    Full Text Available It is increasingly apparent that genes and networks that influence complex behaviour are evolutionary conserved, which is paradoxical considering that behaviour is labile over evolutionary timescales. How does adaptive change in behaviour arise if behaviour is controlled by conserved, pleiotropic, and likely evolutionary constrained genes? Pleiotropy and connectedness are known to constrain the general rate of protein evolution, prompting some to suggest that the evolution of complex traits, including behaviour, is fuelled by regulatory sequence evolution. However, we seldom have data on the strength of selection on mutations in coding and regulatory sequences, and this hinders our ability to study how pleiotropy influences coding and regulatory sequence evolution. Here we use population genomics to estimate the strength of selection on coding and regulatory mutations for a transcriptional regulatory network that influences complex behaviour of honey bees. We found that replacement mutations in highly connected transcription factors and target genes experience significantly stronger negative selection relative to weakly connected transcription factors and targets. Adaptively evolving proteins were significantly more likely to reside at the periphery of the regulatory network, while proteins with signs of negative selection were near the core of the network. Interestingly, connectedness and network structure had minimal influence on the strength of selection on putative regulatory sequences for both transcription factors and their targets. Our study indicates that adaptive evolution of complex behaviour can arise because of positive selection on protein-coding mutations in peripheral genes, and on regulatory sequence mutations in both transcription factors and their targets throughout the network.

  3. Identification of transcriptional regulatory networks specific to pilocytic astrocytoma

    Directory of Open Access Journals (Sweden)

    Gutmann David H

    2011-07-01

    Full Text Available Abstract Background Pilocytic Astrocytomas (PAs are common low-grade central nervous system malignancies for which few recurrent and specific genetic alterations have been identified. In an effort to better understand the molecular biology underlying the pathogenesis of these pediatric brain tumors, we performed higher-order transcriptional network analysis of a large gene expression dataset to identify gene regulatory pathways that are specific to this tumor type, relative to other, more aggressive glial or histologically distinct brain tumours. Methods RNA derived from frozen human PA tumours was subjected to microarray-based gene expression profiling, using Affymetrix U133Plus2 GeneChip microarrays. This data set was compared to similar data sets previously generated from non-malignant human brain tissue and other brain tumour types, after appropriate normalization. Results In this study, we examined gene expression in 66 PA tumors compared to 15 non-malignant cortical brain tissues, and identified 792 genes that demonstrated consistent differential expression between independent sets of PA and non-malignant specimens. From this entire 792 gene set, we used the previously described PAP tool to assemble a core transcriptional regulatory network composed of 6 transcription factor genes (TFs and 24 target genes, for a total of 55 interactions. A similar analysis of oligodendroglioma and glioblastoma multiforme (GBM gene expression data sets identified distinct, but overlapping, networks. Most importantly, comparison of each of the brain tumor type-specific networks revealed a network unique to PA that included repressed expression of ONECUT2, a gene frequently methylated in other tumor types, and 13 other uniquely predicted TF-gene interactions. Conclusions These results suggest specific transcriptional pathways that may operate to create the unique molecular phenotype of PA and thus opportunities for corresponding targeted therapeutic

  4. Collagen induced arthritis (CIA) in mice features regulatory transcriptional network connecting major histocompatibility complex (MHC H2) with autoantigen genes in the thymus.

    Science.gov (United States)

    Donate, Paula B; Fornari, Thaís A; Junta, Cristina M; Magalhães, Danielle A; Macedo, Cláudia; Cunha, Thiago M; Nguyen, Catherine; Cunha, Fernando Q; Passos, Geraldo A

    2011-05-01

    Considering that imbalance of central tolerance in the thymus contributes to aggressive autoimmunity, we compared the expression of peripheral tissue autoantigens (PTA) genes, which are involved in self-representation in the thymic stroma, of two mouse strains; DBA-1/J (MHC-H2(q)) susceptible and DBA-2/J (MHC-H2(d)) resistant to collagen induced arthritis (CIA). We evaluate whether these strains differ in their thymic gene expression, allowing identification of genes that might play a role in susceptibility/resistance to CIA. Microarray profiling showed that 1093 PTA genes were differentially modulated between collagen immunized DBA-1/J and DBA-2/J mice. These genes were assigned to 17 different tissues/organs, including joints/bone, characterizing the promiscuous gene expression (PGE), which is implicated in self-representation. Hierarchical clustering of microarray data and quantitative RT-PCR analysis showed that Aire (autoimmune regulator), an important regulator of the PGE process, Aire-dependent (insulin), Aire-independent (Col2A1 and Gad67), and other 22 joint/bone autoantigen genes were down-regulated in DBA-1/J compared with DBA-2/J in the thymus. Considering the importance of MHC-H2 in peptide-self presentation and autoimmunity susceptibility, we reconstructed transcriptional networks of both strains based on actual microarray data. The networks clearly demonstrated different MHC-H2 transcriptional interactions with PTAs genes. DBA-1/J strain featured MHC-H2 as a node influencing downstream genes. Differently, in DBA-2/J strain network MHC-H2 was exclusively self-regulated and does not control other genes. These findings provide evidence that CIA susceptibility in mice may be a reflex of a cascade-like transcriptional control connecting different genes to MHC-H2 in the thymus.

  5. Do scale-free regulatory networks allow more expression than random ones?

    Science.gov (United States)

    Fortuna, Miguel A; Melián, Carlos J

    2007-07-21

    In this paper, we compile the network of software packages with regulatory interactions (dependences and conflicts) from Debian GNU/Linux operating system and use it as an analogy for a gene regulatory network. Using a trace-back algorithm we assemble networks from the pool of packages with both scale-free (real data) and exponential (null model) topologies. We record the maximum number of packages that can be functionally installed in the system (i.e., the active network size). We show that scale-free regulatory networks allow a larger active network size than random ones. This result might have implications for the number of expressed genes at steady state. Small genomes with scale-free regulatory topologies could allow much more expression than large genomes with exponential topologies. This may have implications for the dynamics, robustness and evolution of genomes.

  6. Gene regulatory mechanisms in infected fish

    DEFF Research Database (Denmark)

    Schyth, Brian Dall; Hajiabadi, Seyed Amir Hossein Jalali; Kristensen, Lasse Bøgelund Juel

    2011-01-01

    This talk will highlight the regulatory mechanisms of gene expression especially the programmed form of mRNA decay which is known as RNA interference (RNAi) and how this and other mechanisms contribute to the regulation of genes involved in immunity. In the RNAi mechanism small double stranded RNA...... whole pathways for the fine-tuning of physiological states like immunological reaction. But miRNAs are themselves under control of regulatory sequences for their timed expression. We will give an example of the finding of two rainbow trout microRNAs, which are up-regulated in the liver during infection...

  7. Evolution of communication protocols using an artificial regulatory network.

    Science.gov (United States)

    Mitchener, W Garrett

    2014-01-01

    I describe the Utrecht Machine (UM), a discrete artificial regulatory network designed for studying how evolution discovers biochemical computation mechanisms. The corresponding binary genome format is compatible with gene deletion, duplication, and recombination. In the simulation presented here, an agent consisting of two UMs, a sender and a receiver, must encode, transmit, and decode a binary word over time using the narrow communication channel between them. This communication problem has chicken-and-egg structure in that a sending mechanism is useless without a corresponding receiving mechanism. An in-depth case study reveals that a coincidence creates a minimal partial solution, from which a sequence of partial sending and receiving mechanisms evolve. Gene duplications contribute by enlarging the regulatory network. Analysis of 60,000 sample runs under a variety of parameter settings confirms that crossover accelerates evolution, that stronger selection tends to find clumsier solutions and finds them more slowly, and that there is implicit selection for robust mechanisms and genomes at the codon level. Typical solutions associate each input bit with an activation speed and combine them almost additively. The parents of breakthrough organisms sometimes have lower fitness scores than others in the population, indicating that populations can cross valleys in the fitness landscape via outlying members. The simulation exhibits back mutations and population-level memory effects not accounted for in traditional population genetics models. All together, these phenomena suggest that new evolutionary models are needed that incorporate regulatory network structure.

  8. Genome-wide inference of regulatory networks in Streptomyces coelicolor

    Directory of Open Access Journals (Sweden)

    Takano Eriko

    2010-10-01

    Full Text Available Abstract Background The onset of antibiotics production in Streptomyces species is co-ordinated with differentiation events. An understanding of the genetic circuits that regulate these coupled biological phenomena is essential to discover and engineer the pharmacologically important natural products made by these species. The availability of genomic tools and access to a large warehouse of transcriptome data for the model organism, Streptomyces coelicolor, provides incentive to decipher the intricacies of the regulatory cascades and develop biologically meaningful hypotheses. Results In this study, more than 500 samples of genome-wide temporal transcriptome data, comprising wild-type and more than 25 regulatory gene mutants of Streptomyces coelicolor probed across multiple stress and medium conditions, were investigated. Information based on transcript and functional similarity was used to update a previously-predicted whole-genome operon map and further applied to predict transcriptional networks constituting modules enriched in diverse functions such as secondary metabolism, and sigma factor. The predicted network displays a scale-free architecture with a small-world property observed in many biological networks. The networks were further investigated to identify functionally-relevant modules that exhibit functional coherence and a consensus motif in the promoter elements indicative of DNA-binding elements. Conclusions Despite the enormous experimental as well as computational challenges, a systems approach for integrating diverse genome-scale datasets to elucidate complex regulatory networks is beginning to emerge. We present an integrated analysis of transcriptome data and genomic features to refine a whole-genome operon map and to construct regulatory networks at the cistron level in Streptomyces coelicolor. The functionally-relevant modules identified in this study pose as potential targets for further studies and verification.

  9. Evolution of Intra-specific Regulatory Networks in a Multipartite Bacterial Genome.

    Directory of Open Access Journals (Sweden)

    Marco Galardini

    2015-09-01

    Full Text Available Reconstruction of the regulatory network is an important step in understanding how organisms control the expression of gene products and therefore phenotypes. Recent studies have pointed out the importance of regulatory network plasticity in bacterial adaptation and evolution. The evolution of such networks within and outside the species boundary is however still obscure. Sinorhizobium meliloti is an ideal species for such study, having three large replicons, many genomes available and a significant knowledge of its transcription factors (TF. Each replicon has a specific functional and evolutionary mark; which might also emerge from the analysis of their regulatory signatures. Here we have studied the plasticity of the regulatory network within and outside the S. meliloti species, looking for the presence of 41 TFs binding motifs in 51 strains and 5 related rhizobial species. We have detected a preference of several TFs for one of the three replicons, and the function of regulated genes was found to be in accordance with the overall replicon functional signature: house-keeping functions for the chromosome, metabolism for the chromid, symbiosis for the megaplasmid. This therefore suggests a replicon-specific wiring of the regulatory network in the S. meliloti species. At the same time a significant part of the predicted regulatory network is shared between the chromosome and the chromid, thus adding an additional layer by which the chromid integrates itself in the core genome. Furthermore, the regulatory network distance was found to be correlated with both promoter regions and accessory genome evolution inside the species, indicating that both pangenome compartments are involved in the regulatory network evolution. We also observed that genes which are not included in the species regulatory network are more likely to belong to the accessory genome, indicating that regulatory interactions should also be considered to predict gene conservation in

  10. Evolution of Intra-specific Regulatory Networks in a Multipartite Bacterial Genome.

    Science.gov (United States)

    Galardini, Marco; Brilli, Matteo; Spini, Giulia; Rossi, Matteo; Roncaglia, Bianca; Bani, Alessia; Chiancianesi, Manuela; Moretto, Marco; Engelen, Kristof; Bacci, Giovanni; Pini, Francesco; Biondi, Emanuele G; Bazzicalupo, Marco; Mengoni, Alessio

    2015-09-01

    Reconstruction of the regulatory network is an important step in understanding how organisms control the expression of gene products and therefore phenotypes. Recent studies have pointed out the importance of regulatory network plasticity in bacterial adaptation and evolution. The evolution of such networks within and outside the species boundary is however still obscure. Sinorhizobium meliloti is an ideal species for such study, having three large replicons, many genomes available and a significant knowledge of its transcription factors (TF). Each replicon has a specific functional and evolutionary mark; which might also emerge from the analysis of their regulatory signatures. Here we have studied the plasticity of the regulatory network within and outside the S. meliloti species, looking for the presence of 41 TFs binding motifs in 51 strains and 5 related rhizobial species. We have detected a preference of several TFs for one of the three replicons, and the function of regulated genes was found to be in accordance with the overall replicon functional signature: house-keeping functions for the chromosome, metabolism for the chromid, symbiosis for the megaplasmid. This therefore suggests a replicon-specific wiring of the regulatory network in the S. meliloti species. At the same time a significant part of the predicted regulatory network is shared between the chromosome and the chromid, thus adding an additional layer by which the chromid integrates itself in the core genome. Furthermore, the regulatory network distance was found to be correlated with both promoter regions and accessory genome evolution inside the species, indicating that both pangenome compartments are involved in the regulatory network evolution. We also observed that genes which are not included in the species regulatory network are more likely to belong to the accessory genome, indicating that regulatory interactions should also be considered to predict gene conservation in bacterial

  11. Computer-assisted curation of a human regulatory core network from the biological literature

    NARCIS (Netherlands)

    Thomas, P.; Durek, P.; Solt, I.; Klinger, B.; Witzel, F.; Schulthess, P.; Mayer, Y.; Tikk, D.; Blüthgen, N.; Leser, U.

    2015-01-01

    Motivation: A highly interlinked network of transcription factors (TFs) orchestrates the context-dependent expression of human genes. ChIP-chip experiments that interrogate the binding of particular TFs to genomic regions are used to reconstruct gene regulatory networks at genome-scale, but are plag

  12. SAGA: a hybrid search algorithm for Bayesian Network structure learning of transcriptional regulatory networks.

    Science.gov (United States)

    Adabor, Emmanuel S; Acquaah-Mensah, George K; Oduro, Francis T

    2015-02-01

    Bayesian Networks have been used for the inference of transcriptional regulatory relationships among genes, and are valuable for obtaining biological insights. However, finding optimal Bayesian Network (BN) is NP-hard. Thus, heuristic approaches have sought to effectively solve this problem. In this work, we develop a hybrid search method combining Simulated Annealing with a Greedy Algorithm (SAGA). SAGA explores most of the search space by undergoing a two-phase search: first with a Simulated Annealing search and then with a Greedy search. Three sets of background-corrected and normalized microarray datasets were used to test the algorithm. BN structure learning was also conducted using the datasets, and other established search methods as implemented in BANJO (Bayesian Network Inference with Java Objects). The Bayesian Dirichlet Equivalence (BDe) metric was used to score the networks produced with SAGA. SAGA predicted transcriptional regulatory relationships among genes in networks that evaluated to higher BDe scores with high sensitivities and specificities. Thus, the proposed method competes well with existing search algorithms for Bayesian Network structure learning of transcriptional regulatory networks.

  13. Reconstruction of gene regulatory modules in cancer cell cycle by multi-source data integration.

    Directory of Open Access Journals (Sweden)

    Yuji Zhang

    Full Text Available BACKGROUND: Precise regulation of the cell cycle is crucial to the growth and development of all organisms. Understanding the regulatory mechanism of the cell cycle is crucial to unraveling many complicated diseases, most notably cancer. Multiple sources of biological data are available to study the dynamic interactions among many genes that are related to the cancer cell cycle. Integrating these informative and complementary data sources can help to infer a mutually consistent gene transcriptional regulatory network with strong similarity to the underlying gene regulatory relationships in cancer cells. RESULTS AND PRINCIPAL FINDINGS: We propose an integrative framework that infers gene regulatory modules from the cell cycle of cancer cells by incorporating multiple sources of biological data, including gene expression profiles, gene ontology, and molecular interaction. Among 846 human genes with putative roles in cell cycle regulation, we identified 46 transcription factors and 39 gene ontology groups. We reconstructed regulatory modules to infer the underlying regulatory relationships. Four regulatory network motifs were identified from the interaction network. The relationship between each transcription factor and predicted target gene groups was examined by training a recurrent neural network whose topology mimics the network motif(s to which the transcription factor was assigned. Inferred network motifs related to eight well-known cell cycle genes were confirmed by gene set enrichment analysis, binding site enrichment analysis, and comparison with previously published experimental results. CONCLUSIONS: We established a robust method that can accurately infer underlying relationships between a given transcription factor and its downstream target genes by integrating different layers of biological data. Our method could also be beneficial to biologists for predicting the components of regulatory modules in which any candidate gene is involved

  14. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Cirera Salicio, Susanna; Zhernakova, Daria V.;

    2014-01-01

    (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. Results WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P ... the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using...

  15. On the underlying assumptions of threshold Boolean networks as a model for genetic regulatory network behavior

    Directory of Open Access Journals (Sweden)

    Van eTran

    2013-12-01

    Full Text Available Boolean networks (BoN are relatively simple and interpretable models of gene regulatorynetworks. Specifying these models with fewer parameters while retaining their ability to describe complex regulatory relationships is an ongoing methodological challenge. Additionally, extending these models to incorporate variable gene decay rates, asynchronous gene response, and synergistic regulation while maintaining their Markovian nature increases the applicability of these models to genetic regulatory networks.We explore a previously-proposed class of BoNs characterized by linear threshold functions, which we refer to as threshold Boolean networks (TBN. Compared to traditional BoNs with unconstrained transition functions, these models require far fewer parameters and offer a more direct interpretation. However, the functional form of a TBN does result in a reduction in the regulatory relationships which can be modeled.We show that TBNs can be readily extended to permit self-degradation, with explicitly modeled degradation rates. We note that the introduction of variable degradation compromises the Markovian property fundamental to BoN models but show that a simple state augmentation procedure restores their Markovian nature. Next, we study the effect of assumptions regarding self-degradation on the set of possible steady states. Our findings are captured in two theorems relating self-degradation and regulatory feedback to the steady state behavior of a TBN. Finally, we explore assumptions of synchronous gene response and asynergistic regulation and show that TBNs can be easily extended to relax these assumptions.Applying our methods to the budding yeast cell-cycle network revealed that although the network is complex, its steady state is simplified by the presence of self-degradation and lack of purely positive regulatory cycles.

  16. RMOD: a tool for regulatory motif detection in signaling network.

    Directory of Open Access Journals (Sweden)

    Jinki Kim

    Full Text Available Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present a RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it into a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways and it showed that the speed and scalability of this algorithm outperforms those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manipulate the whole processes from building signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and mechanism underlying their regulatory motif activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.

  17. Linking network topology to function. Comment on "Drivers of structural features in gene regulatory networks: From biophysical constraints to biological function" by O.C. Martin, A. Krzywicki and M. Zagorski

    Science.gov (United States)

    di Bernardo, Diego

    2016-07-01

    The review by Martin et al. deals with a long standing problem at the interface of complex systems and molecular biology, that is the relationship between the topology of a complex network and its function. In biological terms the problem translates to relating the topology of gene regulatory networks (GRNs) to specific cellular functions. GRNs control the spatial and temporal activity of the genes encoded in the cell's genome by means of specialised proteins called Transcription Factors (TFs). A TF is able to recognise and bind specifically to a sequence (TF biding site) of variable length (order of magnitude of 10) found upstream of the sequence encoding one or more genes (at least in prokaryotes) and thus activating or repressing their transcription. TFs can thus be distinguished in activator and repressor. The picture can become more complex since some classes of TFs can form hetero-dimers consisting of a protein complex whose subunits are the individual TFs. Heterodimers can have completely different binding sites and activity compared to their individual parts. In this review the authors limit their attention to prokaryotes where the complexity of GRNs is somewhat reduced. Moreover they exploit a unique feature of living systems, i.e. evolution, to understand whether function can shape network topology. Indeed, prokaryotes such as bacteria are among the oldest living systems that have become perfectly adapted to their environment over geological scales and thus have reached an evolutionary steady-state where the fitness of the population has reached a plateau. By integrating in silico analysis and comparative evolution, the authors show that indeed function does tend to shape the structure of a GRN, however this trend is not always present and depends on the properties of the network being examined. Interestingly, the trend is more apparent for sparse networks, i.e. where the density of edges is very low. Sparsity is indeed one of the most prominent features

  18. Controllability analysis of transcriptional regulatory networks reveals circular control patterns among transcription factors

    DEFF Research Database (Denmark)

    Österlund, Tobias; Bordel, Sergio; Nielsen, Jens

    2015-01-01

    Transcriptional regulation is the most committed type of regulation in living cells where transcription factors (TFs) control the expression of their target genes and TF expression is controlled by other TFs forming complex transcriptional regulatory networks that can be highly interconnected. Here...... we analyze the topology and organization of nine transcriptional regulatory networks for E. coli, yeast, mouse and human, and we evaluate how the structure of these networks influences two of their key properties, namely controllability and stability. We calculate the controllability for each network...... as a measure of the organization and interconnectivity of the network. We find that the number of driver nodes n(D) needed to control the whole network is 64% of the TFs in the E. coli transcriptional regulatory network in contrast to only 17% for the yeast network, 4% for the mouse network and 8...

  19. microRNAs与TP53基因调控网络研究进展%Advances in microRNAs and TP53 Gene Regulatory Network

    Institute of Scientific and Technical Information of China (English)

    龚朝建; 黄宏斌; 徐柯; 梁芳; 李小玲; 熊炜; 曾朝阳; 李桂源

    2012-01-01

    TP53基因(编码p53蛋白)作为一个重要的抑瘤基因,通过调控一系列信号转导通路广泛参与了多种恶性肿瘤的发生发展,一直是肿瘤分子生物学研究领域的热点.最近的研究发现,microRNAs(miRNAs)参与了TP53的信号通路,它们之间存在着复杂的调控网络.一方面,p53通过调控一些miRNAs的转录及转录后成熟,促进细胞周期阻滞、诱导细胞凋亡和衰老,抑制肿瘤发生.另一方面,许多miRNAs,如miR-25、miR-30d、miR-125b和miR-504等可直接调控p53的表达与活性,参与TP53信号通路的调节,还有一些miRNAs则通过调节p53上下游基因,发挥重要的生物学功能.其中,最具有代表性的是miR-34家族,它们受p53直接调控并参与TP53信号通路,通过靶向抑制多个TP53信号通路关键分子的表达,发挥抑瘤作用.此外,它们还可以通过抑制沉默信息调节子,增强p53的活性,反馈调节TP53信号通路.miRNAs与TP53之间调控网络的研究,是对TP53抑瘤机制的重要补充.%The tumor suppressor TP53 gene, which encodes p53 protein, is a hotspot of all time in molecular oncology. p53 suppresses tumor initiation and progression through its regulation of many downstream genes. Recent studies have revealed that microRNAs (miRNAs) interact with the p53 pathway and form a complex regulatory network. On one hand, p53 promotes cell cycle arrest and induces cell apoptosis and senescence to suppress tumorigenesis by regulating the transcription and post-transcriptional maturation of multiple miRNAs. On the other hand, many miRNAs fine-tune the p53 pathway through regulation of TP53 and its upstream regulators or downstream effectors. The miR-34s family, directly transactivated by p53 represents a large number of p53-regulated miRNAs. They exert their tumor suppressing function via targeted inhibition of multiple key molecules in the p53 pathway. Furthermore, miR-34s enhance p53 activity through a feedback loop by inhibiting silent

  20. Network modeling reveals prevalent negative regulatory relationships between signaling sectors in Arabidopsis immune signaling.

    Directory of Open Access Journals (Sweden)

    Masanao Sato

    Full Text Available Biological signaling processes may be mediated by complex networks in which network components and network sectors interact with each other in complex ways. Studies of complex networks benefit from approaches in which the roles of individual components are considered in the context of the network. The plant immune signaling network, which controls inducible responses to pathogen attack, is such a complex network. We studied the Arabidopsis immune signaling network upon challenge with a strain of the bacterial pathogen Pseudomonas syringae expressing the effector protein AvrRpt2 (Pto DC3000 AvrRpt2. This bacterial strain feeds multiple inputs into the signaling network, allowing many parts of the network to be activated at once. mRNA profiles for 571 immune response genes of 22 Arabidopsis immunity mutants and wild type were collected 6 hours after inoculation with Pto DC3000 AvrRpt2. The mRNA profiles were analyzed as detailed descriptions of changes in the network state resulting from the genetic perturbations. Regulatory relationships among the genes corresponding to the mutations were inferred by recursively applying a non-linear dimensionality reduction procedure to the mRNA profile data. The resulting static network model accurately predicted 23 of 25 regulatory relationships reported in the literature, suggesting that predictions of novel regulatory relationships are also accurate. The network model revealed two striking features: (i the components of the network are highly interconnected; and (ii negative regulatory relationships are common between signaling sectors. Complex regulatory relationships, including a novel negative regulatory relationship between the early microbe-associated molecular pattern-triggered signaling sectors and the salicylic acid sector, were further validated. We propose that prevalent negative regulatory relationships among the signaling sectors make the plant immune signaling network a "sector

  1. Impact of Transcription Units rearrangement on the evolution of the regulatory network of gamma-proteobacteria

    Directory of Open Access Journals (Sweden)

    Vasconcelos Ana

    2008-03-01

    Full Text Available Abstract Background In the past years, several studies begun to unravel the structure, dynamical properties, and evolution of transcriptional regulatory networks. However, even those comparative studies that focus on a group of closely related organisms are limited by the rather scarce knowledge on regulatory interactions outside a few model organisms, such as E. coli among the prokaryotes. Results In this paper we used the information annotated in Tractor_DB (a database of regulatory networks in gamma-proteobacteria to calculate a normalized Site Orthology Score (SOS that quantifies the conservation of a regulatory link across thirty genomes of this subclass. Then we used this SOS to assess how regulatory connections have evolved in this group, and how the variation of basic regulatory connection is reflected on the structure of the chromosome. We found that individual regulatory interactions shift between different organisms, a process that may be described as rewiring the network. At this evolutionary scale (the gamma-proteobacteria subclass this rewiring process may be an important source of variation of regulatory incoming interactions for individual networks. We also noticed that the regulatory links that form feed forward motifs are conserved in a better correlated manner than triads of random regulatory interactions or pairs of co-regulated genes. Furthermore, the rewiring process that takes place at the most basic level of the regulatory network may be linked to rearrangements of genetic material within bacterial chromosomes, which change the structure of Transcription Units and therefore the regulatory connections between Transcription Factors and structural genes. Conclusion The rearrangements that occur in bacterial chromosomes-mostly inversion or horizontal gene transfer events – are important sources of variation of gene regulation at this evolutionary scale.

  2. MicroRNA and transcription factor mediated regulatory network analysis reveals critical regulators and regulatory modules in myocardial infarction.

    Directory of Open Access Journals (Sweden)

    Guangde Zhang

    Full Text Available Myocardial infarction (MI is a severe coronary artery disease and a leading cause of mortality and morbidity worldwide. However, the molecular mechanisms of MI have yet to be fully elucidated. In this study, we compiled MI-related genes, MI-related microRNAs (miRNAs and known human transcription factors (TFs, and we then identified 1,232 feed-forward loops (FFLs among these miRNAs, TFs and their co-regulated target genes through integrating target prediction. By merging these FFLs, the first miRNA and TF mediated regulatory network for MI was constructed, from which four regulators (SP1, ESR1, miR-21-5p and miR-155-5p and three regulatory modules that might play crucial roles in MI were then identified. Furthermore, based on the miRNA and TF mediated regulatory network and literature survey, we proposed a pathway model for miR-21-5p, the miR-29 family and SP1 to demonstrate their potential co-regulatory mechanisms in cardiac fibrosis, apoptosis and angiogenesis. The majority of the regulatory relations in the model were confirmed by previous studies, which demonstrated the reliability and validity of this miRNA and TF mediated regulatory network. Our study will aid in deciphering the complex regulatory mechanisms involved in MI and provide putative therapeutic targets for MI.

  3. Synthetic tetracycline-inducible regulatory networks: computer-aided design of dynamic phenotypes

    Directory of Open Access Journals (Sweden)

    Kaznessis Yiannis N

    2007-01-01

    Full Text Available Abstract Background Tightly regulated gene networks, precisely controlling the expression of protein molecules, have received considerable interest by the biomedical community due to their promising applications. Among the most well studied inducible transcription systems are the tetracycline regulatory expression systems based on the tetracycline resistance operon of Escherichia coli, Tet-Off (tTA and Tet-On (rtTA. Despite their initial success and improved designs, limitations still persist, such as low inducer sensitivity. Instead of looking at these networks statically, and simply changing or mutating the promoter and operator regions with trial and error, a systematic investigation of the dynamic behavior of the network can result in rational design of regulatory gene expression systems. Sophisticated algorithms can accurately capture the dynamical behavior of gene networks. With computer aided design, we aim to improve the synthesis of regulatory networks and propose new designs that enable tighter control of expression. Results In this paper we engineer novel networks by recombining existing genes or part of genes. We synthesize four novel regulatory networks based on the Tet-Off and Tet-On systems. We model all the known individual biomolecular interactions involved in transcription, translation, regulation and induction. With multiple time-scale stochastic-discrete and stochastic-continuous models we accurately capture the transient and steady state dynamics of these networks. Important biomolecular interactions are identified and the strength of the interactions engineered to satisfy design criteria. A set of clear design rules is developed and appropriate mutants of regulatory proteins and operator sites are proposed. Conclusion The complexity of biomolecular interactions is accurately captured through computer simulations. Computer simulations allow us to look into the molecular level, portray the dynamic behavior of gene regulatory

  4. Filtering Genes for Cluster and Network Analysis

    Directory of Open Access Journals (Sweden)

    Parkhomenko Elena

    2009-06-01

    Full Text Available Abstract Background Prior to cluster analysis or genetic network analysis it is customary to filter, or remove genes considered to be irrelevant from the set of genes to be analyzed. Often genes whose variation across samples is less than an arbitrary threshold value are deleted. This can improve interpretability and reduce bias. Results This paper introduces modular models for representing network structure in order to study the relative effects of different filtering methods. We show that cluster analysis and principal components are strongly affected by filtering. Filtering methods intended specifically for cluster and network analysis are introduced and compared by simulating modular networks with known statistical properties. To study more realistic situations, we analyze simulated "real" data based on well-characterized E. coli and S. cerevisiae regulatory networks. Conclusion The methods introduced apply very generally, to any similarity matrix describing gene expression. One of the proposed methods, SUMCOV, performed well for all models simulated.

  5. Comparative study of discretization methods of microarray data for inferring transcriptional regulatory networks

    Directory of Open Access Journals (Sweden)

    Ji Wei

    2010-10-01

    Full Text Available Abstract Background Microarray data discretization is a basic preprocess for many algorithms of gene regulatory network inference. Some common discretization methods in informatics are used to discretize microarray data. Selection of the discretization method is often arbitrary and no systematic comparison of different discretization has been conducted, in the context of gene regulatory network inference from time series gene expression data. Results In this study, we propose a new discretization method "bikmeans", and compare its performance with four other widely-used discretization methods using different datasets, modeling algorithms and number of intervals. Sensitivities, specificities and total accuracies were calculated and statistical analysis was carried out. Bikmeans method always gave high total accuracies. Conclusions Our results indicate that proper discretization methods can consistently improve gene regulatory network inference independent of network modeling algorithms and datasets. Our new method, bikmeans, resulted in significant better total accuracies than other methods.

  6. Integrating Transcriptomic and Proteomic Data Using Predictive Regulatory Network Models of Host Response to Pathogens.

    Directory of Open Access Journals (Sweden)

    Deborah Chasman

    2016-07-01

    Full Text Available Mammalian host response to pathogenic infections is controlled by a complex regulatory network connecting regulatory proteins such as transcription factors and signaling proteins to target genes. An important challenge in infectious disease research is to understand molecular similarities and differences in mammalian host response to diverse sets of pathogens. Recently, systems biology studies have produced rich collections of omic profiles measuring host response to infectious agents such as influenza viruses at multiple levels. To gain a comprehensive understanding of the regulatory network driving host response to multiple infectious agents, we integrated host transcriptomes and proteomes using a network-based approach. Our approach combines expression-based regulatory network inference, structured-sparsity based regression, and network information flow to infer putative physical regulatory programs for expression modules. We applied our approach to identify regulatory networks, modules and subnetworks that drive host response to multiple influenza infections. The inferred regulatory network and modules are significantly enriched for known pathways of immune response and implicate apoptosis, splicing, and interferon signaling processes in the differential response of viral infections of different pathogenicities. We used the learned network to prioritize regulators and study virus and time-point specific networks. RNAi-based knockdown of predicted regulators had significant impact on viral replication and include several previously unknown regulators. Taken together, our integrated analysis identified novel module level patterns that capture strain and pathogenicity-specific patterns of expression and helped identify important regulators of host response to influenza infection.

  7. Integrating Transcriptomic and Proteomic Data Using Predictive Regulatory Network Models of Host Response to Pathogens.

    Science.gov (United States)

    Chasman, Deborah; Walters, Kevin B; Lopes, Tiago J S; Eisfeld, Amie J; Kawaoka, Yoshihiro; Roy, Sushmita

    2016-07-01

    Mammalian host response to pathogenic infections is controlled by a complex regulatory network connecting regulatory proteins such as transcription factors and signaling proteins to target genes. An important challenge in infectious disease research is to understand molecular similarities and differences in mammalian host response to diverse sets of pathogens. Recently, systems biology studies have produced rich collections of omic profiles measuring host response to infectious agents such as influenza viruses at multiple levels. To gain a comprehensive understanding of the regulatory network driving host response to multiple infectious agents, we integrated host transcriptomes and proteomes using a network-based approach. Our approach combines expression-based regulatory network inference, structured-sparsity based regression, and network information flow to infer putative physical regulatory programs for expression modules. We applied our approach to identify regulatory networks, modules and subnetworks that drive host response to multiple influenza infections. The inferred regulatory network and modules are significantly enriched for known pathways of immune response and implicate apoptosis, splicing, and interferon signaling processes in the differential response of viral infections of different pathogenicities. We used the learned network to prioritize regulators and study virus and time-point specific networks. RNAi-based knockdown of predicted regulators had significant impact on viral replication and include several previously unknown regulators. Taken together, our integrated analysis identified novel module level patterns that capture strain and pathogenicity-specific patterns of expression and helped identify important regulators of host response to influenza infection.

  8. Research on Application of Artificial Fish Swarm Algorithm in Gene Regulatory Network%人工鱼群算法在基因调控网络中的应用研究

    Institute of Scientific and Technical Information of China (English)

    田旺兰; 李加升

    2014-01-01

    在分析基因调控网络现状及优缺点的基础上,提出利用人工鱼群算法对阈值布尔网络模型构建下的基因调控网络进行研究。将阈值布尔网络模型应用于花发育形态模型,构建基于预定义吸引子和极限环的综合网络。比较人工鱼群算法与模拟退火算法在基因调控网络中的应用情况,分析网络节点更新机制变化时布尔网络保留吸引子的能力,发现在极限环长度为2和特定网络拓扑下网络才具有鲁棒性。实验结果表明,与模拟退火算法相比,人工鱼群算法在网络发现、鲁棒性方面具有更好的性能,因此利用人工鱼群算法学习布尔网络结构是有效可行的。%Based on the analysis of the advantages and disadvantages of the current appliance of swarm intelligence algorithm into Gene Regulatory Network ( GRN ) , this paper studies the gene regulatory network constructed under Boolean network model using Artificial Fish Swarm Algorithm ( AFSA ) . Especially, the comprehensive network of predefined attractors and limit cycle is formulated by applying Boolean network model into flower growth morphogenesis. After comparing AFSA with Simulated Annealing( SA) and analyzing the ability of the networks to preserve the attractors when the updating schemes is changed from parallel to sequential,the paper finds the network has robustness within the limit cycle length equal to two and specific network topologies. Experimental results show that the intelligence algorithm outperforms simulated annealing in network discovery and robustness. Therefore,it is feasible to learn Boolean network using AFSA.

  9. Modeling genomic regulatory networks with big data.

    Science.gov (United States)

    Bolouri, Hamid

    2014-05-01

    High-throughput sequencing, large-scale data generation projects, and web-based cloud computing are changing how computational biology is performed, who performs it, and what biological insights it can deliver. I review here the latest developments in available data, methods, and software, focusing on the modeling and analysis of the gene regulatory interactions in cells. Three key findings are: (i) although sophisticated computational resources are increasingly available to bench biologists, tailored ongoing education is necessary to avoid the erroneous use of these resources. (ii) Current models of the regulation of gene expression are far too simplistic and need updating. (iii) Integrative computational analysis of large-scale datasets is becoming a fundamental component of molecular biology. I discuss current and near-term opportunities and challenges related to these three points.

  10. Critical dynamics in genetic regulatory networks: examples from four kingdoms.

    Science.gov (United States)

    Balleza, Enrique; Alvarez-Buylla, Elena R; Chaos, Alvaro; Kauffman, Stuart; Shmulevich, Ilya; Aldana, Maximino

    2008-06-18

    The coordinated expression of the different genes in an organism is essential to sustain functionality under the random external perturbations to which the organism might be subjected. To cope with such external variability, the global dynamics of the genetic network must possess two central properties. (a) It must be robust enough as to guarantee stability under a broad range of external conditions, and (b) it must be flexible enough to recognize and integrate specific external signals that may help the organism to change and adapt to different environments. This compromise between robustness and adaptability has been observed in dynamical systems operating at the brink of a phase transition between order and chaos. Such systems are termed critical. Thus, criticality, a precise, measurable, and well characterized property of dynamical systems, makes it possible for robustness and adaptability to coexist in living organisms. In this work we investigate the dynamical properties of the gene transcription networks reported for S. cerevisiae, E. coli, and B. subtilis, as well as the network of segment polarity genes of D. melanogaster, and the network of flower development of A. thaliana. We use hundreds of microarray experiments to infer the nature of the regulatory interactions among genes, and implement these data into the Boolean models of the genetic networks. Our results show that, to the best of the current experimental data available, the five networks under study indeed operate close to criticality. The generality of this result suggests that criticality at the genetic level might constitute a fundamental evolutionary mechanism that generates the great diversity of dynamically robust living forms that we observe around us.

  11. Global analysis of photosynthesis transcriptional regulatory networks.

    Science.gov (United States)

    Imam, Saheed; Noguera, Daniel R; Donohue, Timothy J

    2014-12-01

    Photosynthesis is a crucial biological process that depends on the interplay of many components. This work analyzed the gene targets for 4 transcription factors: FnrL, PrrA, CrpK and MppG (RSP_2888), which are known or predicted to control photosynthesis in Rhodobacter sphaeroides. Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) identified 52 operons under direct control of FnrL, illustrating its regulatory role in photosynthesis, iron homeostasis, nitrogen metabolism and regulation of sRNA synthesis. Using global gene expression analysis combined with ChIP-seq, we mapped the regulons of PrrA, CrpK and MppG. PrrA regulates ∼34 operons encoding mainly photosynthesis and electron transport functions, while CrpK, a previously uncharacterized Crp-family protein, regulates genes involved in photosynthesis and maintenance of iron homeostasis. Furthermore, CrpK and FnrL share similar DNA binding determinants, possibly explaining our observation of the ability of CrpK to partially compensate for the growth defects of a ΔFnrL mutant. We show that the Rrf2 family protein, MppG, plays an important role in photopigment biosynthesis, as part of an incoherent feed-forward loop with PrrA. Our results reveal a previously unrealized, high degree of combinatorial regulation of photosynthetic genes and significant cross-talk between their transcriptional regulators, while illustrating previously unidentified links between photosynthesis and the maintenance of iron homeostasis.

  12. MicroRNAs, Regulatory Networks, and Comorbidities

    DEFF Research Database (Denmark)

    Russo, Francesco; Belling, Kirstine; Jensen, Anders Boeck

    2017-01-01

    MicroRNAs (miRNAs) are small noncoding RNAs involved in the posttranscriptional regulation of messenger RNAs (mRNAs). Each miRNA targets a specific set of mRNAs. Upon binding the miRNA inhibits mRNA translation or facilitate mRNA degradation. miRNAs are frequently deregulated in several pathologies...... including cancer and cardiovascular diseases. Since miRNAs have a crucial role in fine-tuning the expression of their targets, they have been proposed as biomarkers of disease progression and prognostication.In this chapter we discuss different approaches for computational predictions of miRNA targets based...... on sequence complementarity and integration of expression data. In the last section of the chapter we discuss new opportunities in the study of miRNA regulatory networks in the context of temporal disease progression and comorbidities....

  13. The PaPsr1 and PaWhi2 genes are members of the regulatory network that connect stationary phase to mycelium differentiation and reproduction in Podospora anserina.

    Science.gov (United States)

    Timpano, Hélène; Chan Ho Tong, Laetitia; Gautier, Valérie; Lalucque, Hervé; Silar, Philippe

    2016-09-01

    In filamentous fungi, entrance into stationary phase is complex as it is accompanied by several differentiation and developmental processes, including the synthesis of pigments, aerial hyphae, anastomoses and sporophores. The regulatory networks that control these processes are still incompletely known. The analysis of the "Impaired in the development of Crippled Growth (IDC)" mutants of the model filamentous ascomycete Podospora anserina has already yielded important information regarding the pathway regulating entrance into stationary phase. Here, the genes affected in two additional IDC mutants are identified as orthologues of the Saccharomyces cerevisiae WHI2 and PSR1 genes, known to regulate stationary phase in this yeast, arguing for a conserved role of these proteins throughout the evolution of ascomycetes.

  14. Controllability analysis of transcriptional regulatory networks reveals circular control patterns among transcription factors.

    Science.gov (United States)

    Österlund, Tobias; Bordel, Sergio; Nielsen, Jens

    2015-05-01

    Transcriptional regulation is the most committed type of regulation in living cells where transcription factors (TFs) control the expression of their target genes and TF expression is controlled by other TFs forming complex transcriptional regulatory networks that can be highly interconnected. Here we analyze the topology and organization of nine transcriptional regulatory networks for E. coli, yeast, mouse and human, and we evaluate how the structure of these networks influences two of their key properties, namely controllability and stability. We calculate the controllability for each network as a measure of the organization and interconnectivity of the network. We find that the number of driver nodes nD needed to control the whole network is 64% of the TFs in the E. coli transcriptional regulatory network in contrast to only 17% for the yeast network, 4% for the mouse network and 8% for the human network. The high controllability (low number of drivers needed to control the system) in yeast, mouse and human is due to the presence of internal loops in their regulatory networks where the TFs regulate each other in a circular fashion. We refer to these internal loops as circular control motifs (CCM). The E. coli transcriptional regulatory network, which does not have any CCMs, shows a hierarchical structure of the transcriptional regulatory network in contrast to the eukaryal networks. The presence of CCMs also has influence on the stability of these networks, as the presence of cycles can be associated with potential unstable steady-states where even small changes in binding affinities can cause dramatic rearrangements of the state of the network.

  15. Induced pluripotent stem cell-derived neuronal cells from a sporadic Alzheimer's disease donor as a model for investigating AD-associated gene regulatory networks.

    Science.gov (United States)

    Hossini, Amir M; Megges, Matthias; Prigione, Alessandro; Lichtner, Bjoern; Toliat, Mohammad R; Wruck, Wasco; Schröter, Friederike; Nuernberg, Peter; Kroll, Hartmut; Makrantonaki, Eugenia; Zouboulis, Christos C; Zoubouliss, Christos C; Adjaye, James

    2015-02-14

    Alzheimer's disease (AD) is a complex, irreversible neurodegenerative disorder. At present there are neither reliable markers to diagnose AD at an early stage nor therapy. To investigate underlying disease mechanisms, induced pluripotent stem cells (iPSCs) allow the generation of patient-derived neuronal cells in a dish. In this study, employing iPS technology, we derived and characterized iPSCs from dermal fibroblasts of an 82-year-old female patient affected by sporadic AD. The AD-iPSCs were differentiated into neuronal cells, in order to generate disease-specific protein association networks modeling the molecular pathology on the transcriptome level of AD, to analyse the reflection of the disease phenotype in gene expression in AD-iPS neuronal cells, in particular in the ubiquitin-proteasome system (UPS), and to address expression of typical AD proteins. We detected the expression of p-tau and GSK3B, a physiological kinase of tau, in neuronal cells derived from AD-iPSCs. Treatment of neuronal cells differentiated from AD-iPSCs with an inhibitor of γ-secretase resulted in the down-regulation of p-tau. Transcriptome analysis of AD-iPS derived neuronal cells revealed significant changes in the expression of genes associated with AD and with the constitutive as well as the inducible subunits of the proteasome complex. The neuronal cells expressed numerous genes associated with sub-regions within the brain thus suggesting the usefulness of our in-vitro model. Moreover, an AD-related protein interaction network composed of APP and GSK3B among others could be generated using neuronal cells differentiated from two AD-iPS cell lines. Our study demonstrates how an iPSC-based model system could represent (i) a tool to study the underlying molecular basis of sporadic AD, (ii) a platform for drug screening and toxicology studies which might unveil novel therapeutic avenues for this debilitating neuronal disorder.

  16. Uncovering Transcriptional Regulatory Networks by Sparse Bayesian Factor Model

    Science.gov (United States)

    Meng, Jia; Zhang, Jianqiu(Michelle); Qi, Yuan(Alan); Chen, Yidong; Huang, Yufei

    2010-12-01

    The problem of uncovering transcriptional regulation by transcription factors (TFs) based on microarray data is considered. A novel Bayesian sparse correlated rectified factor model (BSCRFM) is proposed that models the unknown TF protein level activity, the correlated regulations between TFs, and the sparse nature of TF-regulated genes. The model admits prior knowledge from existing database regarding TF-regulated target genes based on a sparse prior and through a developed Gibbs sampling algorithm, a context-specific transcriptional regulatory network specific to the experimental condition of the microarray data can be obtained. The proposed model and the Gibbs sampling algorithm were evaluated on the simulated systems, and results demonstrated the validity and effectiveness of the proposed approach. The proposed model was then applied to the breast cancer microarray data of patients with Estrogen Receptor positive ([InlineEquation not available: see fulltext.]) status and Estrogen Receptor negative ([InlineEquation not available: see fulltext.]) status, respectively.

  17. Uncovering Transcriptional Regulatory Networks by Sparse Bayesian Factor Model

    Directory of Open Access Journals (Sweden)

    Qi Yuan(Alan

    2010-01-01

    Full Text Available Abstract The problem of uncovering transcriptional regulation by transcription factors (TFs based on microarray data is considered. A novel Bayesian sparse correlated rectified factor model (BSCRFM is proposed that models the unknown TF protein level activity, the correlated regulations between TFs, and the sparse nature of TF-regulated genes. The model admits prior knowledge from existing database regarding TF-regulated target genes based on a sparse prior and through a developed Gibbs sampling algorithm, a context-specific transcriptional regulatory network specific to the experimental condition of the microarray data can be obtained. The proposed model and the Gibbs sampling algorithm were evaluated on the simulated systems, and results demonstrated the validity and effectiveness of the proposed approach. The proposed model was then applied to the breast cancer microarray data of patients with Estrogen Receptor positive ( status and Estrogen Receptor negative ( status, respectively.

  18. Meta-Analysis of Transcriptome Data Related to Hippocampus Biopsies and iPSC-Derived Neuronal Cells from Alzheimer's Disease Patients Reveals an Association with FOXA1 and FOXA2 Gene Regulatory Networks.

    Science.gov (United States)

    Wruck, Wasco; Schröter, Friederike; Adjaye, James

    2016-01-01

    Although the incidence of Alzheimer's disease (AD) is continuously increasing in the aging population worldwide, effective therapies are not available. The interplay between causative genetic and environmental factors is partially understood. Meta-analyses have been performed on aspects such as polymorphisms, cytokines, and cognitive training. Here, we propose a meta-analysis approach based on hierarchical clustering analysis of a reliable training set of hippocampus biopsies, which is condensed to a gene expression signature. This gene expression signature was applied to various test sets of brain biopsies and iPSC-derived neuronal cell models to demonstrate its ability to distinguish AD samples from control. Thus, our identified AD-gene signature may form the basis for determination of biomarkers that are urgently needed to overcome current diagnostic shortfalls. Intriguingly, the well-described AD-related genes APP and APOE are not within the signature because their gene expression profiles show a lower correlation to the disease phenotype than genes from the signature. This is in line with the differing characteristics of the disease as early-/late-onset or with/without genetic predisposition. To investigate the gene signature's systemic role(s), signaling pathways, gene ontologies, and transcription factors were analyzed which revealed over-representation of response to stress, regulation of cellular metabolic processes, and reactive oxygen species. Additionally, our results clearly point to an important role of FOXA1 and FOXA2 gene regulatory networks in the etiology of AD. This finding is in corroboration with the recently reported major role of the dopaminergic system in the development of AD and its regulation by FOXA1 and FOXA2.

  19. A study of bacterial gene regulatory mechanisms

    DEFF Research Database (Denmark)

    Hansen, Sabine

    regulator studied accurately reproduced the experimental data. Simulations of system dynamics reveals that even two step regulatory cascades significantly increase response times compared to direct allosteric regulation of a transcription factor. It is observed that while many system behaviors...... of GRNs this thesis also provided the first evidence of the sensor histidine kinase VC1831 being an additional player in the Vibrio cholerae quorum sensing (QS) GRN. Bacteria use a process of cell-cell communication called QS which enable the bacterial cells to collectively control their gene expression...

  20. A provisional gene regulatory atlas for mouse heart development.

    Directory of Open Access Journals (Sweden)

    Hailin Chen

    Full Text Available Congenital Heart Disease (CHD is one of the most common birth defects. Elucidating the molecular mechanisms underlying normal cardiac development is an important step towards early identification of abnormalities during the developmental program and towards the creation of early intervention strategies. We developed a novel computational strategy for leveraging high-content data sets, including a large selection of microarray data associated with mouse cardiac development, mouse genome sequence, ChIP-seq data of selected mouse transcription factors and Y2H data of mouse protein-protein interactions, to infer the active transcriptional regulatory network of mouse cardiac development. We identified phase-specific expression activity for 765 overlapping gene co-expression modules that were defined for obtained cardiac lineage microarray data. For each co-expression module, we identified the phase of cardiac development where gene expression for that module was higher than other phases. Co-expression modules were found to be consistent with biological pathway knowledge in Wikipathways, and met expectations for enrichment of pathways involved in heart lineage development. Over 359,000 transcription factor-target relationships were inferred by analyzing the promoter sequences within each gene module for overrepresentation against the JASPAR database of Transcription Factor Binding Site (TFBS motifs. The provisional regulatory network will provide a framework of studying the genetic basis of CHD.

  1. Information theoretical methods to deconvolute genetic regulatory networks applied to thyroid neoplasms

    Science.gov (United States)

    Hernández-Lemus, Enrique; Velázquez-Fernández, David; Estrada-Gil, Jesús K.; Silva-Zolezzi, Irma; Herrera-Hernández, Miguel F.; Jiménez-Sánchez, Gerardo

    2009-12-01

    Most common pathologies in humans are not caused by the mutation of a single gene, rather they are complex diseases that arise due to the dynamic interaction of many genes and environmental factors. This plethora of interacting genes generates a complexity landscape that masks the real effects associated with the disease. To construct dynamic maps of gene interactions (also called genetic regulatory networks) we need to understand the interplay between thousands of genes. Several issues arise in the analysis of experimental data related to gene function: on the one hand, the nature of measurement processes generates highly noisy signals; on the other hand, there are far more variables involved (number of genes and interactions among them) than experimental samples. Another source of complexity is the highly nonlinear character of the underlying biochemical dynamics. To overcome some of these limitations, we generated an optimized method based on the implementation of a Maximum Entropy Formalism (MaxEnt) to deconvolute a genetic regulatory network based on the most probable meta-distribution of gene-gene interactions. We tested the methodology using experimental data for Papillary Thyroid Cancer (PTC) and Thyroid Goiter tissue samples. The optimal MaxEnt regulatory network was obtained from a pool of 25,593,993 different probability distributions. The group of observed interactions was validated by several (mostly in silico) means and sources. For the associated Papillary Thyroid Cancer Gene Regulatory Network (PTC-GRN) the majority of the nodes (genes) have very few links (interactions) whereas a small number of nodes are highly connected. PTC-GRN is also characterized by high clustering coefficients and network heterogeneity. These properties have been recognized as characteristic of topological robustness, and they have been largely described in relation to biological networks. A number of biological validity outcomes are discussed with regard to both the

  2. Regulatory Features for Odorant Receptor Genes in the Mouse Genome.

    Science.gov (United States)

    Degl'Innocenti, Andrea; D'Errico, Anna

    2017-01-01

    The odorant receptor genes, seven transmembrane receptor genes constituting the vastest mammalian gene multifamily, are expressed monogenically and monoallelicaly in each sensory neuron in the olfactory epithelium. This characteristic, often referred to as the one neuron-one receptor rule, is driven by mostly uncharacterized molecular dynamics, generally named odorant receptor gene choice. Much attention has been paid by the scientific community to the identification of sequences regulating the expression of odorant receptor genes within their loci, where related genes are usually arranged in genomic clusters. A number of studies identified transcription factor binding sites on odorant receptor promoter sequences. Similar binding sites were also found on a number of enhancers that regulate in cis their transcription, but have been proposed to form interchromosomal networks. Odorant receptor gene choice seems to occur via the local removal of strongly repressive epigenetic markings, put in place during the maturation of the sensory neuron on each odorant receptor locus. Here we review the fast-changing state of art for the study of regulatory features for odorant receptor genes.

  3. Small Rna Regulatory Networks In Pseudomonas Putida

    DEFF Research Database (Denmark)

    Bojanovic, Klara; Long, Katherine

    2015-01-01

    chemicals and has a potential to be used as an efficient cell factory for various products. P. putida KT2240 is a genome-sequenced strain and a well characterized pseudomonad. Our major aim is to identify small RNA molecules (sRNAs) and their regulatory networks. A previous study has identified 37 sRNAs...... in this strain, while in other pseudomonads many more sRNAs have been found so far.P. putida KT2440 has been grown in different conditions which are likely to be encountered in industrial fermentations with the aim of using sRNAs for generation of improved cell factories. For that, cells have been grown in LB...... and harvested in different growth phases, as well as osmotic, membrane and oxidative stress conditions. RNA sequencing data has been analysed with the open source software system Rockhopper, and it has revealed over 180 putative sRNAs. Most of them (86%) seem to be novel and uncharacterized. The majority...

  4. Modelling non-stationary dynamic gene regulatory processes with the BGM model

    NARCIS (Netherlands)

    Grzegorczyk, Marco; Husmeier, Dirk; Rahnenfuehrer, Joerg

    2011-01-01

    Recently, a Bayesian network model for inferring non-stationary regulatory processes from gene expression time series has been proposed. The Bayesian Gaussian Mixture (BGM) Bayesian network model divides the data into disjunct compartments (data subsets) by a free allocation model, and infers networ

  5. The effect of scale-free topology on the robustness and evolvability of genetic regulatory networks

    OpenAIRE

    Greenbury, Sam F.; Johnston, Iain G.; Matthew A Smith; Doye, Jonathan P. K.; Louis, Ard A.

    2010-01-01

    Abstract We investigate how scale-free (SF) and Erdos-Renyi (ER) topologies affect the interplay between evolvability and robustness of model gene regulatory networks with Boolean threshold dynamics. In agreement with Oikonomou and Cluzel (2006) we find that networks with SFin topologies, that is SF topology for incoming nodes and ER topology for outgoing nodes, are significantly more evolvable towards specific oscillatory targets than networks with ER topology for both incoming an...

  6. Steady-State Analysis of Genetic Regulatory Networks Modelled by Probabilistic Boolean Networks

    Directory of Open Access Journals (Sweden)

    Wei Zhang

    2006-04-01

    Full Text Available Probabilistic Boolean networks (PBNs have recently been introduced as a promising class of models of genetic regulatory networks. The dynamic behaviour of PBNs can be analysed in the context of Markov chains. A key goal is the determination of the steady-state (long-run behaviour of a PBN by analysing the corresponding Markov chain. This allows one to compute the long-term influence of a gene on another gene or determine the long-term joint probabilistic behaviour of a few selected genes. Because matrix-based methods quickly become prohibitive for large sizes of networks, we propose the use of Monte Carlo methods. However, the rate of convergence to the stationary distribution becomes a central issue. We discuss several approaches for determining the number of iterations necessary to achieve convergence of the Markov chain corresponding to a PBN. Using a recently introduced method based on the theory of two-state Markov chains, we illustrate the approach on a sub-network designed from human glioma gene expression data and determine the joint steadystate probabilities for several groups of genes.

  7. A novel parametric approach to mine gene regulatory relationship from microarray datasets

    Directory of Open Access Journals (Sweden)

    Zhu Yunping

    2010-12-01

    Full Text Available Abstract Background Microarray has been widely used to measure the gene expression level on the genome scale in the current decade. Many algorithms have been developed to reconstruct gene regulatory networks based on microarray data. Unfortunately, most of these models and algorithms focus on global properties of the expression of genes in regulatory networks. And few of them are able to offer intuitive parameters. We wonder whether some simple but basic characteristics of microarray datasets can be found to identify the potential gene regulatory relationship. Results Based on expression correlation, expression level variation and vectors derived from microarray expression levels, we first introduced several novel parameters to measure the characters of regulating gene pairs. Subsequently, we used the naïve Bayesian network to integrate these features as well as the functional co-annotation between transcription factors and their target genes. Then, based on the character of time-delay from the expression profile, we were able to predict the existence and direction of the regulatory relationship respectively. Conclusions Several novel parameters have been proposed and integrated to identify the regulatory relationship. This new model is proved to be of higher efficacy than that of individual features. It is believed that our parametric approach can serve as a fast approach for regulatory relationship mining.

  8. Inferring regulatory networks from expression data using tree-based methods.

    Directory of Open Access Journals (Sweden)

    Vân Anh Huynh-Thu

    Full Text Available One of the pressing open problems of computational systems biology is the elucidation of the topology of genetic regulatory networks (GRNs using high throughput genomic data, in particular microarray gene expression data. The Dialogue for Reverse Engineering Assessments and Methods (DREAM challenge aims to evaluate the success of GRN inference algorithms on benchmarks of simulated data. In this article, we present GENIE3, a new algorithm for the inference of GRNs that was best performer in the DREAM4 In Silico Multifactorial challenge. GENIE3 decomposes the prediction of a regulatory network between p genes into p different regression problems. In each of the regression problems, the expression pattern of one of the genes (target gene is predicted from the expression patterns of all the other genes (input genes, using tree-based ensemble methods Random Forests or Extra-Trees. The importance of an input gene in the prediction of the target gene expression pattern is taken as an indication of a putative regulatory link. Putative regulatory links are then aggregated over all genes to provide a ranking of interactions from which the whole network is reconstructed. In addition to performing well on the DREAM4 In Silico Multifactorial challenge simulated data, we show that GENIE3 compares favorably with existing algorithms to decipher the genetic regulatory network of Escherichia coli. It doesn't make any assumption about the nature of gene regulation, can deal with combinatorial and non-linear interactions, produces directed GRNs, and is fast and scalable. In conclusion, we propose a new algorithm for GRN inference that performs well on both synthetic and real gene expression data. The algorithm, based on feature selection with tree-based ensemble methods, is simple and generic, making it adaptable to other types of genomic data and interactions.

  9. Prediction of tissue-specific cis-regulatory modules using Bayesian networks and regression trees

    Directory of Open Access Journals (Sweden)

    Chen Xiaoyu

    2007-12-01

    Full Text Available Abstract Background In vertebrates, a large part of gene transcriptional regulation is operated by cis-regulatory modules. These modules are believed to be regulating much of the tissue-specificity of gene expression. Results We develop a Bayesian network approach for identifying cis-regulatory modules likely to regulate tissue-specific expression. The network integrates predicted transcription factor binding site information, transcription factor expression data, and target gene expression data. At its core is a regression tree modeling the effect of combinations of transcription factors bound to a module. A new unsupervised EM-like algorithm is developed to learn the parameters of the network, including the regression tree structure. Conclusion Our approach is shown to accurately identify known human liver and erythroid-specific modules. When applied to the prediction of tissue-specific modules in 10 different tissues, the network predicts a number of important transcription factor combinations whose concerted binding is associated to specific expression.

  10. CARRIE web service: automated transcriptional regulatory network inference and interactive analysis.

    Science.gov (United States)

    Haverty, Peter M; Frith, Martin C; Weng, Zhiping

    2004-07-01

    We present an intuitive and interactive web service for CARRIE (Computational Ascertainment of Regulatory Relationships Inferred from Expression). CARRIE is a computational method that analyzes microarray and promoter sequence data to infer a transcriptional regulatory network from the response to a specific stimulus. This service displays an interactive graph of the inferred network and provides easy access to the evidence for the involvement of each gene in the network. We provide functionality to include network data in KEGG XML (KGML) format in this graph. Our service also provides Gene Ontology annotation to aid the user in forming hypotheses about the role of each gene in the cellular response. The CARRIE web service is freely available at http://zlab.bu.edu/CARRIE-web.

  11. Regulatory network controlling extracellular proteins in Erwinia carotovora subsp. carotovora: FlhDC, the master regulator of flagellar genes, activates rsmB regulatory RNA production by affecting gacA and hexA (lrhA) expression.

    Science.gov (United States)

    Cui, Yaya; Chatterjee, Asita; Yang, Hailian; Chatterjee, Arun K

    2008-07-01

    Erwinia carotovora subsp. carotovora produces an array of extracellular proteins (i.e., exoproteins), including plant cell wall-degrading enzymes and Harpin, an effector responsible for eliciting hypersensitive reaction. Exoprotein genes are coregulated by the quorum-sensing signal, N-acyl homoserine lactone, plant signals, an assortment of transcriptional factors/regulators (GacS/A, ExpR1, ExpR2, KdgR, RpoS, HexA, and RsmC) and posttranscriptional regulators (RsmA, rsmB RNA). rsmB RNA production is positively regulated by GacS/A, a two-component system, and negatively regulated by HexA (PecT in Erwinia chrysanthemi; LrhA [LysR homolog A] in Escherichia coli) and RsmC, a putative transcriptional adaptor. While free RsmA, an RNA-binding protein, promotes decay of mRNAs of exoprotein genes, binding of RsmA with rsmB RNA neutralizes the RsmA effect. In the course of studies of GacA regulation, we discovered that a locus bearing strong homology to the flhDC operon of E. coli also controls extracellular enzyme production. A transposon insertion FlhDC(-) mutant produces very low levels of pectate lyase, polygalacturonase, cellulase, protease, and E. carotovora subsp. carotovora Harpin (Harpin(Ecc)) and is severely attenuated in its plant virulence. The production of these exoproteins is restored in the mutant carrying an FlhDC(+) plasmid. Sequence analysis and transcript assays disclosed that the flhD operon of E. carotovora subsp. carotovora, like those of other enterobacteria, consists of flhD and flhC. Complementation analysis revealed that the regulatory effect requires functions of both flhD and flhC products. The data presented here show that FlhDC positively regulates gacA, rsmC, and fliA and negatively regulates hexA (lrhA). Evidence shows that FlhDC controls extracellular protein production through cumulative effects on hexA and gacA. Reduced levels of GacA and elevated levels of HexA in the FlhDC(-) mutant are responsible for the inhibition of rsmB RNA

  12. Regulatory network modelling of iron acquisition by a fungal pathogen in contact with epithelial cells

    Directory of Open Access Journals (Sweden)

    Guthke Reinhard

    2010-11-01

    Full Text Available Abstract Background Reverse engineering of gene regulatory networks can be used to predict regulatory interactions of an organism faced with environmental changes, but can prove problematic, especially when focusing on complicated multi-factorial processes. Candida albicans is a major human fungal pathogen. During the infection process, this fungus is able to adapt to conditions of very low iron availability. Such adaptation is an important virulence attribute of virtually all pathogenic microbes. Understanding the regulation of iron acquisition genes will extend our knowledge of the complex regulatory changes during the infection process and might identify new potential drug targets. Thus, there is a need for efficient modelling approaches predicting key regulatory events of iron acquisition genes during the infection process. Results This study deals with the regulation of C. albicans iron uptake genes during adhesion to and invasion into human oral epithelial cells. A reverse engineering strategy is presented, which is able to infer regulatory networks on the basis of gene expression data, making use of relevant selection criteria such as sparseness and robustness. An exhaustive use of available knowledge from different data sources improved the network prediction. The predicted regulatory network proposes a number of new target genes for the transcriptional regulators Rim101, Hap3, Sef1 and Tup1. Furthermore, the molecular mode of action for Tup1 is clarified. Finally, regulatory interactions between the transcription factors themselves are proposed. This study presents a model describing how C. albicans may regulate iron acquisition during contact with and invasion of human oral epithelial cells. There is evidence that some of the proposed regulatory interactions might also occur during oral infection. Conclusions This study focuses on a typical problem in Systems Biology where an interesting biological phenomenon is studied using a small

  13. Genome-scale cold stress response regulatory networks in ten Arabidopsis thaliana ecotypes

    DEFF Research Database (Denmark)

    Barah, Pankaj; Jayavelu, Naresh Doni; Rasmussen, Simon;

    2013-01-01

    ontology (GO) categories were identified to delineate natural variation of cold stress regulated differential gene expression in the model plant A. thaliana. The predicted regulatory network model was able to identify new ecotype specific transcription factors and their regulatory interactions, which might...... using Arabidopsis NimbleGen ATH6 microarrays. In total 6061 transcripts were significantly cold regulated (p expression pattern. By using sequence data...

  14. Dynamical Analysis of Protein Regulatory Network in Budding Yeast Nucleus

    Institute of Scientific and Technical Information of China (English)

    LI Fang-Ting; JIA Xun

    2006-01-01

    @@ Recent progresses in the protein regulatory network of budding yeast Saccharomyces cerevisiae have provided a global picture of its protein network for further dynamical research. We simplify and modularize the protein regulatory networks in yeast nucleus, and study the dynamical properties of the core 37-node network by a Boolean network model, especially the evolution steps and final fixed points. Our simulation results show that the number of fixed points N(k) for a given size of the attraction basin k obeys a power-law distribution N(k)∝k-2.024. The yeast network is more similar to a scale-free network than a random network in the above dynamical properties.

  15. Motif-guided sparse decomposition of gene expression data for regulatory module identification

    Directory of Open Access Journals (Sweden)

    Hoffman Eric P

    2011-03-01

    Full Text Available Abstract Background Genes work coordinately as gene modules or gene networks. Various computational approaches have been proposed to find gene modules based on gene expression data; for example, gene clustering is a popular method for grouping genes with similar gene expression patterns. However, traditional gene clustering often yields unsatisfactory results for regulatory module identification because the resulting gene clusters are co-expressed but not necessarily co-regulated. Results We propose a novel approach, motif-guided sparse decomposition (mSD, to identify gene regulatory modules by integrating gene expression data and DNA sequence motif information. The mSD approach is implemented as a two-step algorithm comprising estimates of (1 transcription factor activity and (2 the strength of the predicted gene regulation event(s. Specifically, a motif-guided clustering method is first developed to estimate the transcription factor activity of a gene module; sparse component analysis is then applied to estimate the regulation strength, and so predict the target genes of the transcription factors. The mSD approach was first tested for its improved performance in finding regulatory modules using simulated and real yeast data, revealing functionally distinct gene modules enriched with biologically validated transcription factors. We then demonstrated the efficacy of the mSD approach on breast cancer cell line data and uncovered several important gene regulatory modules related to endocrine therapy of breast cancer. Conclusion We have developed a new integrated strategy, namely motif-guided sparse decomposition (mSD of gene expression data, for regulatory module identification. The mSD method features a novel motif-guided clustering method for transcription factor activity estimation by finding a balance between co-regulation and co-expression. The mSD method further utilizes a sparse decomposition method for regulation strength estimation. The

  16. Inferring gene regression networks with model trees

    Directory of Open Access Journals (Sweden)

    Aguilar-Ruiz Jesus S

    2010-10-01

    Full Text Available Abstract Background Novel strategies are required in order to handle the huge amount of data produced by microarray technologies. To infer gene regulatory networks, the first step is to find direct regulatory relationships between genes building the so-called gene co-expression networks. They are typically generated using correlation statistics as pairwise similarity measures. Correlation-based methods are very useful in order to determine whether two genes have a strong global similarity but do not detect local similarities. Results We propose model trees as a method to identify gene interaction networks. While correlation-based methods analyze each pair of genes, in our approach we generate a single regression tree for each gene from the remaining genes. Finally, a graph from all the relationships among output and input genes is built taking into account whether the pair of genes is statistically significant. For this reason we apply a statistical procedure to control the false discovery rate. The performance of our approach, named REGNET, is experimentally tested on two well-known data sets: Saccharomyces Cerevisiae and E.coli data set. First, the biological coherence of the results are tested. Second the E.coli transcriptional network (in the Regulon database is used as control to compare the results to that of a correlation-based method. This experiment shows that REGNET performs more accurately at detecting true gene associations than the Pearson and Spearman zeroth and first-order correlation-based methods. Conclusions REGNET generates gene association networks from gene expression data, and differs from correlation-based methods in that the relationship between one gene and others is calculated simultaneously. Model trees are very useful techniques to estimate the numerical values for the target genes by linear regression functions. They are very often more precise than linear regression models because they can add just different linear

  17. 遗传基因组学方法对Myoc基因调控网络的研究%Analysis of Myocilin gene regulatory network using a genetic genomics approach

    Institute of Scientific and Technical Information of China (English)

    陆宏; 陆璐; 管怀进; 陈辉; 张俊芳; 胡楠; 帅捷

    2013-01-01

    Background The pathogenesis of primary open angle glaucoma(POAG) and high myopia are very complex.To construct the regulatory network of virulence genes and relevant genes that involved in pathogenicity are helpful for reveal of the pathogenesis.Objective The aim of this study was to investigate myocilin(Myoc),a gene that contributes to POAG and high myopia in eyes of BXD Recombinant Inbred(BXD RI)mice and construct the regulatory network of Myoc.Methods The affymetrix microarray system was used to detect the differential expression of Myoc in the eyes of C57BL/6J(B6),DBA/2J(D2) and BXD RI mice.Expression quantitative trait loci (eQTL) mapping was performed to construct the regulatory network of Myoc gene.Results The average expression level of the Myoc gene in the BXD strains was 10.83,and the gene exhibited expression levels ranging from 8.39 in BXD55 mice tol 1.43 in B6 mice.The eQTL mapping for the Myoc gene showed a significant likelihood ratio statistic (LRS) of 21.78.The QTL was mapped in chromosome 2,and Myoc was located on chromosome 1,indicating that the Myoc gene was a trans-acting QTL.Olfml2a was identified to be a candidate upstream gene of Myoc by analysis of bioinformatics.Genetic regulatory network analysis demonstrated that a series of genes associated with Myoc probably played roles in the pathogenesis and development of POAG and high myopia.Conclusions The genetical genomics approach provides a powerful tool for constructing pathways that contribute to complex traits,such as POAG and high myopia.%背景 原发性开角型青光眼(POAG)和高度近视的发病机制复杂,可能有较多致病基因及致病相关基因参与,构建基因调控网络有助于揭示发病机制.目的 采用遗传基因组学方法研究Myoc基因的调控网络,探索其在POAG中的可能作用机制.方法 Affymetrix基因芯片用于检测确定68个品系的BXD重组近交系(BXD RI)小鼠及其亲本C57 BL/6J(B6)小鼠和DBA/2J(D2)小鼠、F1代小鼠眼组

  18. Pharyngeal mesoderm regulatory network controls cardiac and head muscle morphogenesis.

    Science.gov (United States)

    Harel, Itamar; Maezawa, Yoshiro; Avraham, Roi; Rinon, Ariel; Ma, Hsiao-Yen; Cross, Joe W; Leviatan, Noam; Hegesh, Julius; Roy, Achira; Jacob-Hirsch, Jasmine; Rechavi, Gideon; Carvajal, Jaime; Tole, Shubha; Kioussi, Chrissa; Quaggin, Susan; Tzahor, Eldad

    2012-11-13

    The search for developmental mechanisms driving vertebrate organogenesis has paved the way toward a deeper understanding of birth defects. During embryogenesis, parts of the heart and craniofacial muscles arise from pharyngeal mesoderm (PM) progenitors. Here, we reveal a hierarchical regulatory network of a set of transcription factors expressed in the PM that initiates heart and craniofacial organogenesis. Genetic perturbation of this network in mice resulted in heart and craniofacial muscle defects, revealing robust cross-regulation between its members. We identified Lhx2 as a previously undescribed player during cardiac and pharyngeal muscle development. Lhx2 and Tcf21 genetically interact with Tbx1, the major determinant in the etiology of DiGeorge/velo-cardio-facial/22q11.2 deletion syndrome. Furthermore, knockout of these genes in the mouse recapitulates specific cardiac features of this syndrome. We suggest that PM-derived cardiogenesis and myogenesis are network properties rather than properties specific to individual PM members. These findings shed new light on the developmental underpinnings of congenital defects.

  19. Identification of regulatory network hubs that control lipid metabolism in Chlamydomonas reinhardtii.

    Science.gov (United States)

    Gargouri, Mahmoud; Park, Jeong-Jin; Holguin, F Omar; Kim, Min-Jeong; Wang, Hongxia; Deshpande, Rahul R; Shachar-Hill, Yair; Hicks, Leslie M; Gang, David R

    2015-08-01

    Microalgae-based biofuels are promising sources of alternative energy, but improvements throughout the production process are required to establish them as economically feasible. One of the most influential improvements would be a significant increase in lipid yields, which could be achieved by altering the regulation of lipid biosynthesis and accumulation. Chlamydomonas reinhardtii accumulates oil (triacylglycerols, TAG) in response to nitrogen (N) deprivation. Although a few important regulatory genes have been identified that are involved in controlling this process, a global understanding of the larger regulatory network has not been developed. In order to uncover this network in this species, a combined omics (transcriptomic, proteomic and metabolomic) analysis was applied to cells grown in a time course experiment after a shift from N-replete to N-depleted conditions. Changes in transcript and protein levels of 414 predicted transcription factors (TFs) and transcriptional regulators (TRs) were monitored relative to other genes. The TF and TR genes were thus classified by two separate measures: up-regulated versus down-regulated and early response versus late response relative to two phases of polar lipid synthesis (before and after TAG biosynthesis initiation). Lipidomic and primary metabolite profiling generated compound accumulation levels that were integrated with the transcript dataset and TF profiling to produce a transcriptional regulatory network. Evaluation of this proposed regulatory network led to the identification of several regulatory hubs that control many aspects of cellular metabolism, from N assimilation and metabolism, to central metabolism, photosynthesis and lipid metabolism.

  20. Canalization and symmetry in Boolean models for genetic regulatory networks

    Energy Technology Data Exchange (ETDEWEB)

    Reichhardt, C J Olson [Theoretical Division and Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545 (United States); Bassler, Kevin E [Department of Physics, University of Houston, Houston, TX 77204-5005 (United States)

    2007-04-20

    Canalization of genetic regulatory networks has been argued to be favoured by evolutionary processes due to the stability that it can confer to phenotype expression. We explore whether a significant amount of canalization and partial canalization can arise in purely random networks in the absence of evolutionary pressures. We use a mapping of the Boolean functions in the Kauffman N-K model for genetic regulatory networks onto a k-dimensional Ising hypercube (where k = K) to show that the functions can be divided into different classes strictly due to geometrical constraints. The classes can be counted and their properties determined using results from group theory and isomer chemistry. We demonstrate that partially canalizing functions completely dominate all possible Boolean functions, particularly for higher k. This indicates that partial canalization is extremely common, even in randomly chosen networks, and has implications for how much information can be obtained in experiments on native state genetic regulatory networks.