WorldWideScience

Sample records for evolutionary gene coexpression

  1. Gene Coexpression and Evolutionary Conservation Analysis of the Human Preimplantation Embryos

    Directory of Open Access Journals (Sweden)

    Tiancheng Liu

    2015-01-01

    Full Text Available Evolutionary developmental biology (EVO-DEVO tries to decode evolutionary constraints on the stages of embryonic development. Two models—the “funnel-like” model and the “hourglass” model—have been proposed by investigators to illustrate the fluctuation of selective pressure on these stages. However, selective indices of stages corresponding to mammalian preimplantation embryonic development (PED were undetected in previous studies. Based on single cell RNA sequencing of stages during human PED, we used coexpression method to identify gene modules activated in each of these stages. Through measuring the evolutionary indices of gene modules belonging to each stage, we observed change pattern of selective constraints on PED for the first time. The selective pressure decreases from the zygote stage to the 4-cell stage and increases at the 8-cell stage and then decreases again from 8-cell stage to the late blastocyst stages. Previous EVO-DEVO studies concerning the whole embryo development neglected the fluctuation of selective pressure in these earlier stages, and the fluctuation was potentially correlated with events of earlier stages, such as zygote genome activation (ZGA. Such oscillation in an earlier stage would further affect models of the evolutionary constraints on whole embryo development. Therefore, these earlier stages should be measured intensively in future EVO-DEVO studies.

  2. Global similarity and local divergence in human and mouse gene co-expression networks

    Directory of Open Access Journals (Sweden)

    Koonin Eugene V

    2006-09-01

    Full Text Available Abstract Background A genome-wide comparative analysis of human and mouse gene expression patterns was performed in order to evaluate the evolutionary divergence of mammalian gene expression. Tissue-specific expression profiles were analyzed for 9,105 human-mouse orthologous gene pairs across 28 tissues. Expression profiles were resolved into species-specific coexpression networks, and the topological properties of the networks were compared between species. Results At the global level, the topological properties of the human and mouse gene coexpression networks are, essentially, identical. For instance, both networks have topologies with small-world and scale-free properties as well as closely similar average node degrees, clustering coefficients, and path lengths. However, the human and mouse coexpression networks are highly divergent at the local level: only a small fraction ( Conclusion The dissonance between global versus local network divergence suggests that the interspecies similarity of the global network properties is of limited biological significance, at best, and that the biologically relevant aspects of the architectures of gene coexpression are specific and particular, rather than universal. Nevertheless, there is substantial evolutionary conservation of the local network structure which is compatible with the notion that gene coexpression networks are subject to purifying selection.

  3. Multiscale Embedded Gene Co-expression Network Analysis.

    Directory of Open Access Journals (Sweden)

    Won-Min Song

    2015-11-01

    Full Text Available Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3, the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA by: i introducing quality control of co-expression similarities, ii parallelizing embedded network construction, and iii developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs. We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA. MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  4. Multiscale Embedded Gene Co-expression Network Analysis.

    Science.gov (United States)

    Song, Won-Min; Zhang, Bin

    2015-11-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  5. Link-based quantitative methods to identify differentially coexpressed genes and gene Pairs

    Directory of Open Access Journals (Sweden)

    Ye Zhi-Qiang

    2011-08-01

    Full Text Available Abstract Background Differential coexpression analysis (DCEA is increasingly used for investigating the global transcriptional mechanisms underlying phenotypic changes. Current DCEA methods mostly adopt a gene connectivity-based strategy to estimate differential coexpression, which is characterized by comparing the numbers of gene neighbors in different coexpression networks. Although it simplifies the calculation, this strategy mixes up the identities of different coexpression neighbors of a gene, and fails to differentiate significant differential coexpression changes from those trivial ones. Especially, the correlation-reversal is easily missed although it probably indicates remarkable biological significance. Results We developed two link-based quantitative methods, DCp and DCe, to identify differentially coexpressed genes and gene pairs (links. Bearing the uniqueness of exploiting the quantitative coexpression change of each gene pair in the coexpression networks, both methods proved to be superior to currently popular methods in simulation studies. Re-mining of a publicly available type 2 diabetes (T2D expression dataset from the perspective of differential coexpression analysis led to additional discoveries than those from differential expression analysis. Conclusions This work pointed out the critical weakness of current popular DCEA methods, and proposed two link-based DCEA algorithms that will make contribution to the development of DCEA and help extend it to a broader spectrum.

  6. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  7. Gene coexpression network analysis as a source of functional annotation for rice genes.

    Directory of Open Access Journals (Sweden)

    Kevin L Childs

    Full Text Available With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found that gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional

  8. GeneCAT--novel webtools that combine BLAST and co-expression analyses

    DEFF Research Database (Denmark)

    Mutwil, Marek; Obro, Jens; Willats, William G T

    2008-01-01

    The gene co-expression analysis toolbox (GeneCAT) introduces several novel microarray data analyzing tools. First, the multigene co-expression analysis, combined with co-expressed gene networks, provides a more powerful data mining technique than standard, single-gene co-expression analysis. Second...... orthologs in the plant model organisms Arabidopsis thaliana and Hordeum vulgare (Barley). GeneCAT is equipped with expression data for the model plant A. thaliana, and first to introduce co-expression mining tools for the monocot Barley. GeneCAT is available at http://genecat.mpg.de....

  9. Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Jing; Ma, Zihao; Carr, Steven A.; Mertins, Philipp; Zhang, Hui; Zhang, Zhen; Chan, Daniel W.; Ellis, Matthew J. C.; Townsend, R. Reid; Smith, Richard D.; McDermott, Jason E.; Chen, Xian; Paulovich, Amanda G.; Boja, Emily S.; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Rodland, Karin D.; Liebler, Daniel C.; Zhang, Bing

    2016-11-11

    Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies

  10. Co-Expression of Neighboring Genes in the Zebrafish (Danio rerio Genome

    Directory of Open Access Journals (Sweden)

    Daryi Wang

    2009-08-01

    Full Text Available Neighboring genes in the eukaryotic genome have a tendency to express concurrently, and the proximity of two adjacent genes is often considered a possible explanation for their co-expression behavior. However, the actual contribution of the physical distance between two genes to their co-expression behavior has yet to be defined. To further investigate this issue, we studied the co-expression of neighboring genes in zebrafish, which has a compact genome and has experienced a whole genome duplication event. Our analysis shows that the proportion of highly co-expressed neighboring pairs (Pearson’s correlation coefficient R>0.7 is low (0.24% ~ 0.67%; however, it is still significantly higher than that of random pairs. In particular, the statistical result implies that the co-expression tendency of neighboring pairs is negatively correlated with their physical distance. Our findings therefore suggest that physical distance may play an important role in the co-expression of neighboring genes. Possible mechanisms related to the neighboring genes’ co-expression are also discussed.

  11. G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Directory of Open Access Journals (Sweden)

    Lemay Danielle G

    2012-09-01

    Full Text Available Abstract Background In previous studies, gene neighborhoods—spatial clusters of co-expressed genes in the genome—have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously. Results Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods. Conclusions Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The

  12. Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways.

    Science.gov (United States)

    Obayashi, Takeshi; Kinoshita, Kengo

    2010-05-01

    Gene coexpression analyses are a powerful method to predict the function of genes and/or to identify genes that are functionally related to query genes. The basic idea of gene coexpression analyses is that genes with similar functions should have similar expression patterns under many different conditions. This approach is now widely used by many experimental researchers, especially in the field of plant biology. In this review, we will summarize recent successful examples obtained by using our gene coexpression database, ATTED-II. Specifically, the examples will describe the identification of new genes, such as the subunits of a complex protein, the enzymes in a metabolic pathway and transporters. In addition, we will discuss the discovery of a new intercellular signaling factor and new regulatory relationships between transcription factors and their target genes. In ATTED-II, we provide two basic views of gene coexpression, a gene list view and a gene network view, which can be used as guide gene approach and narrow-down approach, respectively. In addition, we will discuss the coexpression effectiveness for various types of gene sets.

  13. Characterization of differentially expressed genes using high-dimensional co-expression networks

    DEFF Research Database (Denmark)

    Coelho Goncalves de Abreu, Gabriel; Labouriau, Rodrigo S.

    2010-01-01

    We present a technique to characterize differentially expressed genes in terms of their position in a high-dimensional co-expression network. The set-up of Gaussian graphical models is used to construct representations of the co-expression network in such a way that redundancy and the propagation...... that allow to make effective inference in problems with high degree of complexity (e.g. several thousands of genes) and small number of observations (e.g. 10-100) as typically occurs in high throughput gene expression studies. Taking advantage of the internal structure of decomposable graphical models, we...... construct a compact representation of the co-expression network that allows to identify the regions with high concentration of differentially expressed genes. It is argued that differentially expressed genes located in highly interconnected regions of the co-expression network are less informative than...

  14. Large clusters of co-expressed genes in the Drosophila genome.

    Science.gov (United States)

    Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I

    2002-12-12

    Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.

  15. A methodology for the analysis of differential coexpression across the human lifespan.

    Science.gov (United States)

    Gillis, Jesse; Pavlidis, Paul

    2009-09-22

    Differential coexpression is a change in coexpression between genes that may reflect 'rewiring' of transcriptional networks. It has previously been hypothesized that such changes might be occurring over time in the lifespan of an organism. While both coexpression and differential expression of genes have been previously studied in life stage change or aging, differential coexpression has not. Generalizing differential coexpression analysis to many time points presents a methodological challenge. Here we introduce a method for analyzing changes in coexpression across multiple ordered groups (e.g., over time) and extensively test its validity and usefulness. Our method is based on the use of the Haar basis set to efficiently represent changes in coexpression at multiple time scales, and thus represents a principled and generalizable extension of the idea of differential coexpression to life stage data. We used published microarray studies categorized by age to test the methodology. We validated the methodology by testing our ability to reconstruct Gene Ontology (GO) categories using our measure of differential coexpression and compared this result to using coexpression alone. Our method allows significant improvement in characterizing these groups of genes. Further, we examine the statistical properties of our measure of differential coexpression and establish that the results are significant both statistically and by an improvement in semantic similarity. In addition, we found that our method finds more significant changes in gene relationships compared to several other methods of expressing temporal relationships between genes, such as coexpression over time. Differential coexpression over age generates significant and biologically relevant information about the genes producing it. Our Haar basis methodology for determining age-related differential coexpression performs better than other tested methods. The Haar basis set also lends itself to ready interpretation

  16. Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

    Science.gov (United States)

    Li, Lin; Briskine, Roman; Schaefer, Robert; Schnable, Patrick S; Myers, Chad L; Flagel, Lex E; Springer, Nathan M; Muehlbauer, Gary J

    2016-11-04

    Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant subgenome. However, the landscape of co-expression network divergence of duplicate genes in maize is still largely uncharacterized. To address the consequence of gene duplication on co-expression network divergence, we developed a gene co-expression network from RNA-seq data derived from 64 different tissues/stages of the maize reference inbred-B73. WGD, tandem and inserted gene duplications exhibited distinct regulatory divergence. Inserted duplicate genes were more likely to be singletons in the co-expression networks, while WGD duplicate genes were likely to be co-expressed with other genes. Tandem duplicate genes were enriched in the co-expression pattern where co-expressed genes were nearly identical for the duplicates in the network. Older gene duplications exhibit more extensive co-expression variation than younger duplications. Overall, non-syntenic genes primarily from inserted duplications show more co-expression divergence. Also, such enlarged co-expression divergence is significantly related to duplication age. Moreover, subgenome dominance was not observed in the co-expression networks - maize1 and maize2 exhibit similar levels of intra subgenome correlations. Intriguingly, the level of inter subgenome co-expression was similar to the level of intra subgenome correlations, and genes from specific subgenomes were not likely to be the enriched in co-expression network modules and the hub genes were not predominantly from any specific subgenomes in maize. Our work provides a comprehensive analysis of maize co-expression network divergence for three different types of gene duplications and identifies potential relationships between duplication types

  17. Gene coexpression measures in large heterogeneous samples using count statistics.

    Science.gov (United States)

    Wang, Y X Rachel; Waterman, Michael S; Huang, Haiyan

    2014-11-18

    With the advent of high-throughput technologies making large-scale gene expression data readily available, developing appropriate computational tools to process these data and distill insights into systems biology has been an important part of the "big data" challenge. Gene coexpression is one of the earliest techniques developed that is still widely in use for functional annotation, pathway analysis, and, most importantly, the reconstruction of gene regulatory networks, based on gene expression data. However, most coexpression measures do not specifically account for local features in expression profiles. For example, it is very likely that the patterns of gene association may change or only exist in a subset of the samples, especially when the samples are pooled from a range of experiments. We propose two new gene coexpression statistics based on counting local patterns of gene expression ranks to take into account the potentially diverse nature of gene interactions. In particular, one of our statistics is designed for time-course data with local dependence structures, such as time series coupled over a subregion of the time domain. We provide asymptotic analysis of their distributions and power, and evaluate their performance against a wide range of existing coexpression measures on simulated and real data. Our new statistics are fast to compute, robust against outliers, and show comparable and often better general performance.

  18. Elucidating gene function and function evolution through comparison of co-expression networks in plants

    Directory of Open Access Journals (Sweden)

    Marek eMutwil

    2014-08-01

    Full Text Available The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23. In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We show that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that, in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.

  19. Characterization of chemically induced liver injuries using gene co-expression modules.

    Directory of Open Access Journals (Sweden)

    Gregory J Tawa

    Full Text Available Liver injuries due to ingestion or exposure to chemicals and industrial toxicants pose a serious health risk that may be hard to assess due to a lack of non-invasive diagnostic tests. Mapping chemical injuries to organ-specific damage and clinical outcomes via biomarkers or biomarker panels will provide the foundation for highly specific and robust diagnostic tests. Here, we have used DrugMatrix, a toxicogenomics database containing organ-specific gene expression data matched to dose-dependent chemical exposures and adverse clinical pathology assessments in Sprague Dawley rats, to identify groups of co-expressed genes (modules specific to injury endpoints in the liver. We identified 78 such gene co-expression modules associated with 25 diverse injury endpoints categorized from clinical pathology, organ weight changes, and histopathology. Using gene expression data associated with an injury condition, we showed that these modules exhibited different patterns of activation characteristic of each injury. We further showed that specific module genes mapped to 1 known biochemical pathways associated with liver injuries and 2 clinically used diagnostic tests for liver fibrosis. As such, the gene modules have characteristics of both generalized and specific toxic response pathways. Using these results, we proposed three gene signature sets characteristic of liver fibrosis, steatosis, and general liver injury based on genes from the co-expression modules. Out of all 92 identified genes, 18 (20% genes have well-documented relationships with liver disease, whereas the rest are novel and have not previously been associated with liver disease. In conclusion, identifying gene co-expression modules associated with chemically induced liver injuries aids in generating testable hypotheses and has the potential to identify putative biomarkers of adverse health effects.

  20. Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks

    Directory of Open Access Journals (Sweden)

    Kohane Isaac S

    2005-09-01

    Full Text Available Abstract Background Biological processes are carried out by coordinated modules of interacting molecules. As clustering methods demonstrate that genes with similar expression display increased likelihood of being associated with a common functional module, networks of coexpressed genes provide one framework for assigning gene function. This has informed the guilt-by-association (GBA heuristic, widely invoked in functional genomics. Yet although the idea of GBA is accepted, the breadth of GBA applicability is uncertain. Results We developed methods to systematically explore the breadth of GBA across a large and varied corpus of expression data to answer the following question: To what extent is the GBA heuristic broadly applicable to the transcriptome and conversely how broadly is GBA captured by a priori knowledge represented in the Gene Ontology (GO? Our study provides an investigation of the functional organization of five coexpression networks using data from three mammalian organisms. Our method calculates a probabilistic score between each gene and each Gene Ontology category that reflects coexpression enrichment of a GO module. For each GO category we use Receiver Operating Curves to assess whether these probabilistic scores reflect GBA. This methodology applied to five different coexpression networks demonstrates that the signature of guilt-by-association is ubiquitous and reproducible and that the GBA heuristic is broadly applicable across the population of nine hundred Gene Ontology categories. We also demonstrate the existence of highly reproducible patterns of coexpression between some pairs of GO categories. Conclusion We conclude that GBA has universal value and that transcriptional control may be more modular than previously realized. Our analyses also suggest that methodologies combining coexpression measurements across multiple genes in a biologically-defined module can aid in characterizing gene function or in characterizing

  1. VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

    Science.gov (United States)

    Wong, Darren C J; Sweetman, Crystal; Drew, Damian P; Ford, Christopher M

    2013-12-16

    Gene expression datasets in model plants such as Arabidopsis have contributed to our understanding of gene function and how a single underlying biological process can be governed by a diverse network of genes. The accumulation of publicly available microarray data encompassing a wide range of biological and environmental conditions has enabled the development of additional capabilities including gene co-expression analysis (GCA). GCA is based on the understanding that genes encoding proteins involved in similar and/or related biological processes may exhibit comparable expression patterns over a range of experimental conditions, developmental stages and tissues. We present an open access database for the investigation of gene co-expression networks within the cultivated grapevine, Vitis vinifera. The new gene co-expression database, VTCdb (http://vtcdb.adelaide.edu.au/Home.aspx), offers an online platform for transcriptional regulatory inference in the cultivated grapevine. Using condition-independent and condition-dependent approaches, grapevine co-expression networks were constructed using the latest publicly available microarray datasets from diverse experimental series, utilising the Affymetrix Vitis vinifera GeneChip (16 K) and the NimbleGen Grape Whole-genome microarray chip (29 K), thus making it possible to profile approximately 29,000 genes (95% of the predicted grapevine transcriptome). Applications available with the online platform include the use of gene names, probesets, modules or biological processes to query the co-expression networks, with the option to choose between Affymetrix or Nimblegen datasets and between multiple co-expression measures. Alternatively, the user can browse existing network modules using interactive network visualisation and analysis via CytoscapeWeb. To demonstrate the utility of the database, we present examples from three fundamental biological processes (berry development, photosynthesis and flavonoid biosynthesis

  2. Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes

    Directory of Open Access Journals (Sweden)

    Paules Richard S

    2007-11-01

    Full Text Available Abstract Background A common observation in the analysis of gene expression data is that many genes display similarity in their expression patterns and therefore appear to be co-regulated. However, the variation associated with microarray data and the complexity of the experimental designs make the acquisition of co-expressed genes a challenge. We developed a novel method for Extracting microarray gene expression Patterns and Identifying co-expressed Genes, designated as EPIG. The approach utilizes the underlying structure of gene expression data to extract patterns and identify co-expressed genes that are responsive to experimental conditions. Results Through evaluation of the correlations among profiles, the magnitude of variation in gene expression profiles, and profile signal-to-noise ratio's, EPIG extracts a set of patterns representing co-expressed genes. The method is shown to work well with a simulated data set and microarray data obtained from time-series studies of dauer recovery and L1 starvation in C. elegans and after ultraviolet (UV or ionizing radiation (IR-induced DNA damage in diploid human fibroblasts. With the simulated data set, EPIG extracted the appropriate number of patterns which were more stable and homogeneous than the set of patterns that were determined using the CLICK or CAST clustering algorithms. However, CLICK performed better than EPIG and CAST with respect to the average correlation between clusters/patterns of the simulated data. With real biological data, EPIG extracted more dauer-specific patterns than CLICK. Furthermore, analysis of the IR/UV data revealed 18 unique patterns and 2661 genes out of approximately 17,000 that were identified as significantly expressed and categorized to the patterns by EPIG. The time-dependent patterns displayed similar and dissimilar responses between IR and UV treatments. Gene Ontology analysis applied to each pattern-related subset of co-expressed genes revealed underlying

  3. Local coexpression domains of two to four genes in the genome of Arabidopsis

    NARCIS (Netherlands)

    Ren, X.Y.; Fiers, M.W.E.J.; Stiekema, W.J.; Nap, J.P.H.

    2005-01-01

    Expression of genes in eukaryotic genomes is known to cluster, but cluster size is generally loosely defined and highly variable. We have here taken a very strict definition of cluster as sets of physically adjacent genes that are highly coexpressed and form so-called local coexpression domains. The

  4. Evaluation of gene association methods for coexpression network construction and biological knowledge discovery.

    Directory of Open Access Journals (Sweden)

    Sapna Kumari

    Full Text Available BACKGROUND: Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly evaluated. Since different methods lead to structurally different coexpression networks and provide different information, selecting the optimal gene association method is critical. METHODS AND RESULTS: In this study, we compared eight gene association methods - Spearman rank correlation, Weighted Rank Correlation, Kendall, Hoeffding's D measure, Theil-Sen, Rank Theil-Sen, Distance Covariance, and Pearson - and focused on their true knowledge discovery rates in associating pathway genes and construction coordination networks of regulatory genes. We also examined the behaviors of different methods to microarray data with different properties, and whether the biological processes affect the efficiency of different methods. CONCLUSIONS: We found that the Spearman, Hoeffding and Kendall methods are effective in identifying coexpressed pathway genes, whereas the Theil-sen, Rank Theil-Sen, Spearman, and Weighted Rank methods perform well in identifying coordinated transcription factors that control the same biological processes and traits. Surprisingly, the widely used Pearson method is generally less efficient, and so is the Distance Covariance method that can find gene pairs of multiple relationships. Some analyses we did clearly show Pearson and Distance Covariance methods have distinct behaviors as compared to all other six methods. The efficiencies of different methods vary with the data properties to some degree and are largely contingent upon the biological processes, which necessitates the pre-analysis to identify the best performing method for gene association and coexpression network construction.

  5. G-NEST: A gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Science.gov (United States)

    In previous studies, gene neighborhoods--spatial clusters of co-expressed genes in the genome--have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Sc...

  6. FastGCN: a GPU accelerated tool for fast gene co-expression networks.

    Directory of Open Access Journals (Sweden)

    Meimei Liang

    Full Text Available Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.

  7. Building gene co-expression networks using transcriptomics data for systems biology investigations

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Watson-Haigh, Nathan S.

    2012-01-01

    Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four......) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT...... (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended....

  8. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean) Cattle.

    Science.gov (United States)

    Lim, Dajeong; Lee, Seung-Hwan; Kim, Nam-Kuk; Cho, Yong-Min; Chai, Han-Ha; Seong, Hwan-Hoo; Kim, Heebal

    2013-01-01

    Marbling (intramuscular fat) is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the 'marbling score' trait and systemically analyzed the network topology in Hanwoo (Korean cattle). As a result, we determined 3 modules (gene groups) that showed statistically significant results for marbling score. In particular, one module (denoted as red) has a statistically significant result for marbling score (p = 0.008) and intramuscular fat (p = 0.02) and water capacity (p = 0.006). From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA) have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  9. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean Cattle

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2013-01-01

    Full Text Available Marbling (intramuscular fat is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the ‘marbling score’ trait and systemically analyzed the network topology in Hanwoo (Korean cattle. As a result, we determined 3 modules (gene groups that showed statistically significant results for marbling score. In particular, one module (denoted as red has a statistically significant result for marbling score (p = 0.008 and intramuscular fat (p = 0.02 and water capacity (p = 0.006. From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  10. Using gene co-expression network analysis to predict biomarkers for chronic lymphocytic leukemia

    Directory of Open Access Journals (Sweden)

    Borlawsky Tara B

    2010-10-01

    Full Text Available Abstract Background Chronic lymphocytic leukemia (CLL is the most common adult leukemia. It is a highly heterogeneous disease, and can be divided roughly into indolent and progressive stages based on classic clinical markers. Immunoglobin heavy chain variable region (IgVH mutational status was found to be associated with patient survival outcome, and biomarkers linked to the IgVH status has been a focus in the CLL prognosis research field. However, biomarkers highly correlated with IgVH mutational status which can accurately predict the survival outcome are yet to be discovered. Results In this paper, we investigate the use of gene co-expression network analysis to identify potential biomarkers for CLL. Specifically we focused on the co-expression network involving ZAP70, a well characterized biomarker for CLL. We selected 23 microarray datasets corresponding to multiple types of cancer from the Gene Expression Omnibus (GEO and used the frequent network mining algorithm CODENSE to identify highly connected gene co-expression networks spanning the entire genome, then evaluated the genes in the co-expression network in which ZAP70 is involved. We then applied a set of feature selection methods to further select genes which are capable of predicting IgVH mutation status from the ZAP70 co-expression network. Conclusions We have identified a set of genes that are potential CLL prognostic biomarkers IL2RB, CD8A, CD247, LAG3 and KLRK1, which can predict CLL patient IgVH mutational status with high accuracies. Their prognostic capabilities were cross-validated by applying these biomarker candidates to classify patients into different outcome groups using a CLL microarray datasets with clinical information.

  11. A powerful nonparametric method for detecting differentially co-expressed genes: distance correlation screening and edge-count test.

    Science.gov (United States)

    Zhang, Qingyang

    2018-05-16

    Differential co-expression analysis, as a complement of differential expression analysis, offers significant insights into the changes in molecular mechanism of different phenotypes. A prevailing approach to detecting differentially co-expressed genes is to compare Pearson's correlation coefficients in two phenotypes. However, due to the limitations of Pearson's correlation measure, this approach lacks the power to detect nonlinear changes in gene co-expression which is common in gene regulatory networks. In this work, a new nonparametric procedure is proposed to search differentially co-expressed gene pairs in different phenotypes from large-scale data. Our computational pipeline consisted of two main steps, a screening step and a testing step. The screening step is to reduce the search space by filtering out all the independent gene pairs using distance correlation measure. In the testing step, we compare the gene co-expression patterns in different phenotypes by a recently developed edge-count test. Both steps are distribution-free and targeting nonlinear relations. We illustrate the promise of the new approach by analyzing the Cancer Genome Atlas data and the METABRIC data for breast cancer subtypes. Compared with some existing methods, the new method is more powerful in detecting nonlinear type of differential co-expressions. The distance correlation screening can greatly improve computational efficiency, facilitating its application to large data sets.

  12. Using the 2A Protein Coexpression System: Multicistronic 2A Vectors Expressing Gene(s) of Interest and Reporter Proteins.

    Science.gov (United States)

    Luke, Garry A; Ryan, Martin D

    2018-01-01

    To date, a huge range of different proteins-many with cotranslational and posttranslational subcellular localization signals-have been coexpressed together with various reporter proteins in vitro and in vivo using 2A peptides. The pros and cons of 2A co-expression technology are considered below, followed by a simple example of a "how to" protocol to concatenate multiple genes of interest, together with a reporter gene, into a single gene linked via 2As for easy identification or selection of transduced cells.

  13. Network statistics of genetically-driven gene co-expression modules in mouse crosses

    Directory of Open Access Journals (Sweden)

    Marie-Pier eScott-Boyer

    2013-12-01

    Full Text Available In biology, networks are used in different contexts as ways to represent relationships between entities, such as for instance interactions between genes, proteins or metabolites. Despite progress in the analysis of such networks and their potential to better understand the collective impact of genes on complex traits, one remaining challenge is to establish the biologic validity of gene co-expression networks and to determine what governs their organization. We used WGCNA to construct and analyze seven gene expression datasets from several tissues of mouse recombinant inbred strains (RIS. For six out of the 7 networks, we found that linkage to module QTLs (mQTLs could be established for 29.3% of gene co-expression modules detected in the several mouse RIS. For about 74.6% of such genetically-linked modules, the mQTL was on the same chromosome as the one contributing most genes to the module, with genes originating from that chromosome showing higher connectivity than other genes in the modules. Such modules (that we considered as genetically-driven had network statistic properties (density, centralization and heterogeneity that set them apart from other modules in the network. Altogether, a sizeable portion of gene co-expression modules detected in mouse RIS panels had genetic determinants as their main organizing principle. In addition to providing a biologic interpretation validation for these modules, these genetic determinants imparted on them particular properties that set them apart from other modules in the network, to the point that they can be predicted to a large extent on the basis of their network statistics.

  14. The Arabidopsis co-expression tool (act): a WWW-based tool and database for microarray-based gene expression analysis

    DEFF Research Database (Denmark)

    Jen, C. H.; Manfield, I. W.; Michalopoulos, D. W.

    2006-01-01

    be examined using the novel clique finder tool to determine the sets of genes most likely to be regulated in a similar manner. In combination, these tools offer three levels of analysis: creation of correlation lists of co-expressed genes, refinement of these lists using two-dimensional scatter plots......We present a new WWW-based tool for plant gene analysis, the Arabidopsis Co-Expression Tool (act) , based on a large Arabidopsis thaliana microarray data set obtained from the Nottingham Arabidopsis Stock Centre. The co-expression analysis tool allows users to identify genes whose expression...

  15. Identification of Transcriptional Modules and Key Genes in Chickens Infected with Salmonella enterica Serovar Pullorum Using Integrated Coexpression Analyses

    Directory of Open Access Journals (Sweden)

    Bao-Hong Liu

    2017-01-01

    Full Text Available Salmonella enterica Pullorum is one of the leading causes of mortality in poultry. Understanding the molecular response in chickens in response to the infection by S. enterica is important in revealing the mechanisms of pathogenesis and disease progress. There have been studies on identifying genes associated with Salmonella infection by differential expression analysis, but the relationships among regulated genes have not been investigated. In this study, we employed weighted gene coexpression network analysis (WGCNA and differential coexpression analysis (DCEA to identify coexpression modules by exploring microarray data derived from chicken splenic tissues in response to the S. enterica infection. A total of 19 modules from 13,538 genes were associated with the Jak-STAT signaling pathway, the extracellular matrix, cytoskeleton organization, the regulation of the actin cytoskeleton, G-protein coupled receptor activity, Toll-like receptor signaling pathways, and immune system processes; among them, 14 differentially coexpressed modules (DCMs and 2,856 differentially coexpressed genes (DCGs were identified. The global expression of module genes between infected and uninfected chickens showed slight differences but considerable changes for global coexpression. Furthermore, DCGs were consistently linked to the hubs of the modules. These results will help prioritize candidate genes for future studies of Salmonella infection.

  16. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    Science.gov (United States)

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  17. A stochastic model for identifying differential gene pair co-expression patterns in prostate cancer progression

    Directory of Open Access Journals (Sweden)

    Mao Yu

    2009-07-01

    Full Text Available Abstract Background The identification of gene differential co-expression patterns between cancer stages is a newly developing method to reveal the underlying molecular mechanisms of carcinogenesis. Most researches of this subject lack an algorithm useful for performing a statistical significance assessment involving cancer progression. Lacking this specific algorithm is apparently absent in identifying precise gene pairs correlating to cancer progression. Results In this investigation we studied gene pair co-expression change by using a stochastic process model for approximating the underlying dynamic procedure of the co-expression change during cancer progression. Also, we presented a novel analytical method named 'Stochastic process model for Identifying differentially co-expressed Gene pair' (SIG method. This method has been applied to two well known prostate cancer data sets: hormone sensitive versus hormone resistant, and healthy versus cancerous. From these data sets, 428,582 gene pairs and 303,992 gene pairs were identified respectively. Afterwards, we used two different current statistical methods to the same data sets, which were developed to identify gene pair differential co-expression and did not consider cancer progression in algorithm. We then compared these results from three different perspectives: progression analysis, gene pair identification effectiveness analysis, and pathway enrichment analysis. Statistical methods were used to quantify the quality and performance of these different perspectives. They included: Re-identification Scale (RS and Progression Score (PS in progression analysis, True Positive Rate (TPR in gene pair analysis, and Pathway Enrichment Score (PES in pathway analysis. Our results show small values of RS and large values of PS, TPR, and PES; thus, suggesting that gene pairs identified by the SIG method are highly correlated with cancer progression, and highly enriched in disease-specific pathways. From

  18. In-silico gene co-expression network analysis in Paracoccidioides brasiliensis with reference to haloacid dehalogenase superfamily hydrolase gene

    Directory of Open Access Journals (Sweden)

    Raghunath Satpathy

    2015-01-01

    Full Text Available Context: Paracoccidioides brasiliensis, a dimorphic fungus is the causative agent of paracoccidioidomycosis, a disease globally affecting millions of people. The haloacid dehalogenase (HAD superfamily hydrolases enzyme in the fungi, in particular, is known to be responsible in the pathogenesis by adhering to the tissue. Hence, identification of novel drug targets is essential. Aims: In-silico based identification of co-expressed genes along with HAD superfamily hydrolase in P. brasiliensis during the morphogenesis from mycelium to yeast to identify possible genes as drug targets. Materials and Methods: In total, four datasets were retrieved from the NCBI-gene expression omnibus (GEO database, each containing 4340 genes, followed by gene filtration expression of the data set. Further co-expression (CE study was performed individually and then a combination these genes were visualized in the Cytoscape 2. 8.3. Statistical Analysis Used: Mean and standard deviation value of the HAD superfamily hydrolase gene was obtained from the expression data and this value was subsequently used for the CE calculation purpose by selecting specific correlation power and filtering threshold. Results: The 23 genes that were thus obtained are common with respect to the HAD superfamily hydrolase gene. A significant network was selected from the Cytoscape network visualization that contains total 7 genes out of which 5 genes, which do not have significant protein hits, obtained from gene annotation of the expressed sequence tags by BLAST X. For all the protein PSI-BLAST was performed against human genome to find the homology. Conclusions: The gene co-expression network was obtained with respect to HAD superfamily dehalogenase gene in P. Brasiliensis.

  19. Novel Approach for Coexpression Analysis of E2F1–3 and MYC Target Genes in Chronic Myelogenous Leukemia

    Directory of Open Access Journals (Sweden)

    Fengfeng Wang

    2014-01-01

    Full Text Available Background. Chronic myelogenous leukemia (CML is characterized by tremendous amount of immature myeloid cells in the blood circulation. E2F1–3 and MYC are important transcription factors that form positive feedback loops by reciprocal regulation in their own transcription processes. Since genes regulated by E2F1–3 or MYC are related to cell proliferation and apoptosis, we wonder if there exists difference in the coexpression patterns of genes regulated concurrently by E2F1–3 and MYC between the normal and the CML states. Results. We proposed a method to explore the difference in the coexpression patterns of those candidate target genes between the normal and the CML groups. A disease-specific cutoff point for coexpression levels that classified the coexpressed gene pairs into strong and weak coexpression classes was identified. Our developed method effectively identified the coexpression pattern differences from the overall structure. Moreover, we found that genes related to the cell adhesion and angiogenesis properties were more likely to be coexpressed in the normal group when compared to the CML group. Conclusion. Our findings may be helpful in exploring the underlying mechanisms of CML and provide useful information in cancer treatment.

  20. Uncovering co-expression gene network modules regulating fruit acidity in diverse apples.

    Science.gov (United States)

    Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Zhong, Gan-Yuan; Xu, Kenong

    2015-08-16

    Acidity is a major contributor to fruit quality. Several organic acids are present in apple fruit, but malic acid is predominant and determines fruit acidity. The trait is largely controlled by the Malic acid (Ma) locus, underpinning which Ma1 that putatively encodes a vacuolar aluminum-activated malate transporter1 (ALMT1)-like protein is a strong candidate gene. We hypothesize that fruit acidity is governed by a gene network in which Ma1 is key member. The goal of this study is to identify the gene network and the potential mechanisms through which the network operates. Guided by Ma1, we analyzed the transcriptomes of mature fruit of contrasting acidity from six apple accessions of genotype Ma_ (MaMa or Mama) and four of mama using RNA-seq and identified 1301 fruit acidity associated genes, among which 18 were most significant acidity genes (MSAGs). Network inferring using weighted gene co-expression network analysis (WGCNA) revealed five co-expression gene network modules of significant (P acidity. Overall, this study provides important insight into the Ma1-mediated gene network controlling acidity in mature apple fruit of diverse genetic background.

  1. Exploring Plant Co-Expression and Gene-Gene Interactions with CORNET 3.0.

    Science.gov (United States)

    Van Bel, Michiel; Coppens, Frederik

    2017-01-01

    Selecting and filtering a reference expression and interaction dataset when studying specific pathways and regulatory interactions can be a very time-consuming and error-prone task. In order to reduce the duplicated efforts required to amass such datasets, we have created the CORNET (CORrelation NETworks) platform which allows for easy access to a wide variety of data types: coexpression data, protein-protein interactions, regulatory interactions, and functional annotations. The CORNET platform outputs its results in either text format or through the Cytoscape framework, which is automatically launched by the CORNET website.CORNET 3.0 is the third iteration of the web platform designed for the user exploration of the coexpression space of plant genomes, with a focus on the model species Arabidopsis thaliana. Here we describe the platform: the tools, data, and best practices when using the platform. We indicate how the platform can be used to infer networks from a set of input genes, such as upregulated genes from an expression experiment. By exploring the network, new target and regulator genes can be discovered, allowing for follow-up experiments and more in-depth study. We also indicate how to avoid common pitfalls when evaluating the networks and how to avoid over interpretation of the results.All CORNET versions are available at http://bioinformatics.psb.ugent.be/cornet/ .

  2. Modular organization of the white spruce (Picea glauca) transcriptome reveals functional organization and evolutionary signatures.

    Science.gov (United States)

    Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J

    2015-07-01

    Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  3. Gene Coexpression Analysis Reveals Complex Metabolism of the Monoterpene Alcohol Linalool in Arabidopsis FlowersW

    NARCIS (Netherlands)

    Ginglinger, J.F.; Boachon, B.; Hofer, R.; Paetz, C.; Kollner, T.G.; Miesch, L.; Lugan, R.; Baltenweck, R.; Mutterer, J.; Ullman, P.; Verstappen, F.W.A.; Bouwmeester, H.J.

    2013-01-01

    The cytochrome P450 family encompasses the largest family of enzymes in plant metabolism, and the functions of many of its members in Arabidopsis thaliana are still unknown. Gene coexpression analysis pointed to two P450s that were coexpressed with two monoterpene synthases in flowers and were thus

  4. Meta-analysis of inter-species liver co-expression networks elucidates traits associated with common human diseases.

    Directory of Open Access Journals (Sweden)

    Kai Wang

    2009-12-01

    Full Text Available Co-expression networks are routinely used to study human diseases like obesity and diabetes. Systematic comparison of these networks between species has the potential to elucidate common mechanisms that are conserved between human and rodent species, as well as those that are species-specific characterizing evolutionary plasticity. We developed a semi-parametric meta-analysis approach for combining gene-gene co-expression relationships across expression profile datasets from multiple species. The simulation results showed that the semi-parametric method is robust against noise. When applied to human, mouse, and rat liver co-expression networks, our method out-performed existing methods in identifying gene pairs with coherent biological functions. We identified a network conserved across species that highlighted cell-cell signaling, cell-adhesion and sterol biosynthesis as main biological processes represented in genome-wide association study candidate gene sets for blood lipid levels. We further developed a heterogeneity statistic to test for network differences among multiple datasets, and demonstrated that genes with species-specific interactions tend to be under positive selection throughout evolution. Finally, we identified a human-specific sub-network regulated by RXRG, which has been validated to play a different role in hyperlipidemia and Type 2 diabetes between human and mouse. Taken together, our approach represents a novel step forward in integrating gene co-expression networks from multiple large scale datasets to leverage not only common information but also differences that are dataset-specific.

  5. Characterization of Genes for Beef Marbling Based on Applying Gene Coexpression Network

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2014-01-01

    Full Text Available Marbling is an important trait in characterization beef quality and a major factor for determining the price of beef in the Korean beef market. In particular, marbling is a complex trait and needs a system-level approach for identifying candidate genes related to the trait. To find the candidate gene associated with marbling, we used a weighted gene coexpression network analysis from the expression value of bovine genes. Hub genes were identified; they were topologically centered with large degree and BC values in the global network. We performed gene expression analysis to detect candidate genes in M. longissimus with divergent marbling phenotype (marbling scores 2 to 7 using qRT-PCR. The results demonstrate that transmembrane protein 60 (TMEM60 and dihydropyrimidine dehydrogenase (DPYD are associated with increasing marbling fat. We suggest that the network-based approach in livestock may be an important method for analyzing the complex effects of candidate genes associated with complex traits like marbling or tenderness.

  6. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. Effects of threshold on the topology of gene co-expression networks.

    Science.gov (United States)

    Couto, Cynthia Martins Villar; Comin, César Henrique; Costa, Luciano da Fontoura

    2017-09-26

    Several developments regarding the analysis of gene co-expression profiles using complex network theory have been reported recently. Such approaches usually start with the construction of an unweighted gene co-expression network, therefore requiring the selection of a suitable threshold defining which pairs of vertices will be connected. We aimed at addressing such an important problem by suggesting and comparing five different approaches for threshold selection. Each of the methods considers a respective biologically-motivated criterion for electing a potentially suitable threshold. A set of 21 microarray experiments from different biological groups was used to investigate the effect of applying the five proposed criteria to several biological situations. For each experiment, we used the Pearson correlation coefficient to measure the relationship between each gene pair, and the resulting weight matrices were thresholded considering several values, generating respective adjacency matrices (co-expression networks). Each of the five proposed criteria was then applied in order to select the respective threshold value. The effects of these thresholding approaches on the topology of the resulting networks were compared by using several measurements, and we verified that, depending on the database, the impact on the topological properties can be large. However, a group of databases was verified to be similarly affected by most of the considered criteria. Based on such results, it can be suggested that when the generated networks present similar measurements, the thresholding method can be chosen with greater freedom. If the generated networks are markedly different, the thresholding method that better suits the interests of each specific research study represents a reasonable choice.

  8. Two novel antimicrobial defensins from rice identified by gene coexpression network analyses.

    Science.gov (United States)

    Tantong, Supaluk; Pringsulaka, Onanong; Weerawanich, Kamonwan; Meeprasert, Arthitaya; Rungrotmongkol, Thanyada; Sarnthima, Rakrudee; Roytrakul, Sittiruk; Sirikantaramas, Supaart

    2016-10-01

    Defensins form an antimicrobial peptides (AMP) family, and have been widely studied in various plants because of their considerable inhibitory functions. However, their roles in rice (Oryza sativa L.) have not been characterized, even though rice is one of the most important staple crops that is susceptible to damaging infections. Additionally, a previous study identified 598 rice genes encoding cysteine-rich peptides, suggesting there are several uncharacterized AMPs in rice. We performed in silico gene expression and coexpression network analyses of all genes encoding defensin and defensin-like peptides, and determined that OsDEF7 and OsDEF8 are coexpressed with pathogen-responsive genes. Recombinant OsDEF7 and OsDEF8 could form homodimers. They inhibited the growth of the bacteria Xanthomonas oryzae pv. oryzae, X. oryzae pv. oryzicola, and Erwinia carotovora subsp. atroseptica with minimum inhibitory concentration (MIC) ranging from 0.6 to 63μg/mL. However, these OsDEFs are weakly active against the phytopathogenic fungi Helminthosporium oryzae and Fusarium oxysporum f.sp. cubense. This study describes a useful method for identifying potential plant AMPs with biological activities. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Gene co-expression analysis identifies gene clusters associated with isotropic and polarized growth in Aspergillus fumigatus conidia.

    Science.gov (United States)

    Baltussen, Tim J H; Coolen, Jordy P M; Zoll, Jan; Verweij, Paul E; Melchers, Willem J G

    2018-04-26

    Aspergillus fumigatus is a saprophytic fungus that extensively produces conidia. These microscopic asexually reproductive structures are small enough to reach the lungs. Germination of conidia followed by hyphal growth inside human lungs is a key step in the establishment of infection in immunocompromised patients. RNA-Seq was used to analyze the transcriptome of dormant and germinating A. fumigatus conidia. Construction of a gene co-expression network revealed four gene clusters (modules) correlated with a growth phase (dormant, isotropic growth, polarized growth). Transcripts levels of genes encoding for secondary metabolites were high in dormant conidia. During isotropic growth, transcript levels of genes involved in cell wall modifications increased. Two modules encoding for growth and cell cycle/DNA processing were associated with polarized growth. In addition, the co-expression network was used to identify highly connected intermodular hub genes. These genes may have a pivotal role in the respective module and could therefore be compelling therapeutic targets. Generally, cell wall remodeling is an important process during isotropic and polarized growth, characterized by an increase of transcripts coding for hyphal growth and cell cycle/DNA processing when polarized growth is initiated. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  10. An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.

    Science.gov (United States)

    Botía, Juan A; Vandrovcova, Jana; Forabosco, Paola; Guelfi, Sebastian; D'Sa, Karishma; Hardy, John; Lewis, Cathryn M; Ryten, Mina; Weale, Michael E

    2017-04-12

    Weighted Gene Co-expression Network Analysis (WGCNA) is a widely used R software package for the generation of gene co-expression networks (GCN). WGCNA generates both a GCN and a derived partitioning of clusters of genes (modules). We propose k-means clustering as an additional processing step to conventional WGCNA, which we have implemented in the R package km2gcn (k-means to gene co-expression network, https://github.com/juanbot/km2gcn ). We assessed our method on networks created from UKBEC data (10 different human brain tissues), on networks created from GTEx data (42 human tissues, including 13 brain tissues), and on simulated networks derived from GTEx data. We observed substantially improved module properties, including: (1) few or zero misplaced genes; (2) increased counts of replicable clusters in alternate tissues (x3.1 on average); (3) improved enrichment of Gene Ontology terms (seen in 48/52 GCNs) (4) improved cell type enrichment signals (seen in 21/23 brain GCNs); and (5) more accurate partitions in simulated data according to a range of similarity indices. The results obtained from our investigations indicate that our k-means method, applied as an adjunct to standard WGCNA, results in better network partitions. These improved partitions enable more fruitful downstream analyses, as gene modules are more biologically meaningful.

  11. The drug target genes show higher evolutionary conservation than non-target genes.

    Science.gov (United States)

    Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie

    2016-01-26

    Although evidence indicates that drug target genes share some common evolutionary features, there have been few studies analyzing evolutionary features of drug targets from an overall level. Therefore, we conducted an analysis which aimed to investigate the evolutionary characteristics of drug target genes. We compared the evolutionary conservation between human drug target genes and non-target genes by combining both the evolutionary features and network topological properties in human protein-protein interaction network. The evolution rate, conservation score and the percentage of orthologous genes of 21 species were included in our study. Meanwhile, four topological features including the average shortest path length, betweenness centrality, clustering coefficient and degree were considered for comparison analysis. Then we got four results as following: compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure including higher degrees, betweenness centrality, clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.

  12. Chronic ethanol exposure produces time- and brain region-dependent changes in gene coexpression networks.

    Directory of Open Access Journals (Sweden)

    Elizabeth A Osterndorff-Kahanek

    Full Text Available Repeated ethanol exposure and withdrawal in mice increases voluntary drinking and represents an animal model of physical dependence. We examined time- and brain region-dependent changes in gene coexpression networks in amygdala (AMY, nucleus accumbens (NAC, prefrontal cortex (PFC, and liver after four weekly cycles of chronic intermittent ethanol (CIE vapor exposure in C57BL/6J mice. Microarrays were used to compare gene expression profiles at 0-, 8-, and 120-hours following the last ethanol exposure. Each brain region exhibited a large number of differentially expressed genes (2,000-3,000 at the 0- and 8-hour time points, but fewer changes were detected at the 120-hour time point (400-600. Within each region, there was little gene overlap across time (~20%. All brain regions were significantly enriched with differentially expressed immune-related genes at the 8-hour time point. Weighted gene correlation network analysis identified modules that were highly enriched with differentially expressed genes at the 0- and 8-hour time points with virtually no enrichment at 120 hours. Modules enriched for both ethanol-responsive and cell-specific genes were identified in each brain region. These results indicate that chronic alcohol exposure causes global 'rewiring' of coexpression systems involving glial and immune signaling as well as neuronal genes.

  13. Weighted gene co-expression network analysis of the peripheral blood from Amyotrophic Lateral Sclerosis patients

    Directory of Open Access Journals (Sweden)

    DeYoung Joseph

    2009-08-01

    Full Text Available Abstract Background Amyotrophic Lateral Sclerosis (ALS is a lethal disorder characterized by progressive degeneration of motor neurons in the brain and spinal cord. Diagnosis is mainly based on clinical symptoms, and there is currently no therapy to stop the disease or slow its progression. Since access to spinal cord tissue is not possible at disease onset, we investigated changes in gene expression profiles in whole blood of ALS patients. Results Our transcriptional study showed dramatic changes in blood of ALS patients; 2,300 probes (9.4% showed significant differential expression in a discovery dataset consisting of 30 ALS patients and 30 healthy controls. Weighted gene co-expression network analysis (WGCNA was used to find disease-related networks (modules and disease related hub genes. Two large co-expression modules were found to be associated with ALS. Our findings were replicated in a second (30 patients and 30 controls and third dataset (63 patients and 63 controls, thereby demonstrating a highly significant and consistent association of two large co-expression modules with ALS disease status. Ingenuity Pathway Analysis of the ALS related module genes implicates enrichment of functional categories related to genetic disorders, neurodegeneration of the nervous system and inflammatory disease. The ALS related modules contain a number of candidate genes possibly involved in pathogenesis of ALS. Conclusion This first large-scale blood gene expression study in ALS observed distinct patterns between cases and controls which may provide opportunities for biomarker development as well as new insights into the molecular mechanisms of the disease.

  14. Gene co-expression networks shed light into diseases of brain iron accumulation.

    Science.gov (United States)

    Bettencourt, Conceição; Forabosco, Paola; Wiethoff, Sarah; Heidari, Moones; Johnstone, Daniel M; Botía, Juan A; Collingwood, Joanna F; Hardy, John; Milward, Elizabeth A; Ryten, Mina; Houlden, Henry

    2016-03-01

    Aberrant brain iron deposition is observed in both common and rare neurodegenerative disorders, including those categorized as Neurodegeneration with Brain Iron Accumulation (NBIA), which are characterized by focal iron accumulation in the basal ganglia. Two NBIA genes are directly involved in iron metabolism, but whether other NBIA-related genes also regulate iron homeostasis in the human brain, and whether aberrant iron deposition contributes to neurodegenerative processes remains largely unknown. This study aims to expand our understanding of these iron overload diseases and identify relationships between known NBIA genes and their main interacting partners by using a systems biology approach. We used whole-transcriptome gene expression data from human brain samples originating from 101 neuropathologically normal individuals (10 brain regions) to generate weighted gene co-expression networks and cluster the 10 known NBIA genes in an unsupervised manner. We investigated NBIA-enriched networks for relevant cell types and pathways, and whether they are disrupted by iron loading in NBIA diseased tissue and in an in vivo mouse model. We identified two basal ganglia gene co-expression modules significantly enriched for NBIA genes, which resemble neuronal and oligodendrocytic signatures. These NBIA gene networks are enriched for iron-related genes, and implicate synapse and lipid metabolism related pathways. Our data also indicates that these networks are disrupted by excessive brain iron loading. We identified multiple cell types in the origin of NBIA disorders. We also found unforeseen links between NBIA networks and iron-related processes, and demonstrate convergent pathways connecting NBIAs and phenotypically overlapping diseases. Our results are of further relevance for these diseases by providing candidates for new causative genes and possible points for therapeutic intervention. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  15. A contribution to the study of plant development evolution based on gene co-expression networks

    Directory of Open Access Journals (Sweden)

    Francisco J. Romero-Campero

    2013-08-01

    Full Text Available Phototrophic eukaryotes are among the most successful organisms on Earth due to their unparalleled efficiency at capturing light energy and fixing carbon dioxide to produce organic molecules. A conserved and efficient network of light-dependent regulatory modules could be at the bases of this success. This regulatory system conferred early advantages to phototrophic eukaryotes that allowed for specialization, complex developmental processes and modern plant characteristics. We have studied light-dependent gene regulatory modules from algae to plants employing integrative-omics approaches based on gene co-expression networks. Our study reveals some remarkably conserved ways in which eukaryotic phototrophs deal with day length and light signaling. Here we describe how a family of Arabidopsis transcription factors involved in photoperiod response has evolved from a single algal gene according to the innovation, amplification and divergence theory of gene evolution by duplication. These modifications of the gene co-expression networks from the ancient unicellular green algae Chlamydomonas reinhardtii to the modern brassica Arabidopsis thaliana may hint on the evolution and specialization of plants and other organisms.

  16. A Predictive Coexpression Network Identifies Novel Genes Controlling the Seed-to-Seedling Phase Transition in Arabidopsis thaliana.

    Science.gov (United States)

    Silva, Anderson Tadeu; Ribone, Pamela A; Chan, Raquel L; Ligterink, Wilco; Hilhorst, Henk W M

    2016-04-01

    The transition from a quiescent dry seed to an actively growing photoautotrophic seedling is a complex and crucial trait for plant propagation. This study provides a detailed description of global gene expression in seven successive developmental stages of seedling establishment in Arabidopsis (Arabidopsis thaliana). Using the transcriptome signature from these developmental stages, we obtained a coexpression gene network that highlights interactions between known regulators of the seed-to-seedling transition and predicts the functions of uncharacterized genes in seedling establishment. The coexpressed gene data sets together with the transcriptional module indicate biological functions related to seedling establishment. Characterization of the homeodomain leucine zipper I transcription factor AtHB13, which is expressed during the seed-to-seedling transition, demonstrated that this gene regulates some of the network nodes and affects late seedling establishment. Knockout mutants for athb13 showed increased primary root length as compared with wild-type (Columbia-0) seedlings, suggesting that this transcription factor is a negative regulator of early root growth, possibly repressing cell division and/or cell elongation or the length of time that cells elongate. The signal transduction pathways present during the early phases of the seed-to-seedling transition anticipate the control of important events for a vigorous seedling, such as root growth. This study demonstrates that a gene coexpression network together with transcriptional modules can provide insights that are not derived from comparative transcript profiling alone. © 2016 American Society of Plant Biologists. All Rights Reserved.

  17. Genes and co-expression modules common to drought and bacterial stress responses in Arabidopsis and rice.

    Directory of Open Access Journals (Sweden)

    Rafi Shaik

    Full Text Available Plants are simultaneously exposed to multiple stresses resulting in enormous changes in the molecular landscape within the cell. Identification and characterization of the synergistic and antagonistic components of stress response mechanisms contributing to the cross talk between stresses is of high priority to explore and enhance multiple stress responses. To this end, we performed meta-analysis of drought (abiotic, bacterial (biotic stress response in rice and Arabidopsis by analyzing a total of 386 microarray samples belonging to 20 microarray studies and identified approximately 3100 and 900 DEGs in rice and Arabidopsis, respectively. About 38.5% (1214 and 28.7% (272 DEGs were common to drought and bacterial stresses in rice and Arabidopsis, respectively. A majority of these common DEGs showed conserved expression status in both stresses. Gene ontology enrichment analysis clearly demarcated the response and regulation of various plant hormones and related biological processes. Fatty acid metabolism and biosynthesis of alkaloids were upregulated and, nitrogen metabolism and photosynthesis was downregulated in both stress conditions. WRKY transcription family genes were highly enriched in all upregulated gene sets while 'CO-like' TF family showed inverse relationship of expression between drought and bacterial stresses. Weighted gene co-expression network analysis divided DEG sets into multiple modules that show high co-expression and identified stress specific hub genes with high connectivity. Detection of consensus modules based on DEGs common to drought and bacterial stress revealed 9 and 4 modules in rice and Arabidopsis, respectively, with conserved and reversed co-expression patterns.

  18. Evolutionary rate patterns of the Gibberellin pathway genes

    Directory of Open Access Journals (Sweden)

    Zhang Fu-min

    2009-08-01

    Full Text Available Abstract Background Analysis of molecular evolutionary patterns of different genes within metabolic pathways allows us to determine whether these genes are subject to equivalent evolutionary forces and how natural selection shapes the evolution of proteins in an interacting system. Although previous studies found that upstream genes in the pathway evolved more slowly than downstream genes, the correlation between evolutionary rate and position of the genes in metabolic pathways as well as its implications in molecular evolution are still less understood. Results We sequenced and characterized 7 core structural genes of the gibberellin biosynthetic pathway from 8 representative species of the rice tribe (Oryzeae to address alternative hypotheses regarding evolutionary rates and patterns of metabolic pathway genes. We have detected significant rate heterogeneity among 7 GA pathway genes for both synonymous and nonsynonymous sites. Such rate variation is mostly likely attributed to differences of selection intensity rather than differential mutation pressures on the genes. Unlike previous argument that downstream genes in metabolic pathways would evolve more slowly than upstream genes, the downstream genes in the GA pathway did not exhibited the elevated substitution rate and instead, the genes that encode either the enzyme at the branch point (GA20ox or enzymes catalyzing multiple steps (KO, KAO and GA3ox in the pathway had the lowest evolutionary rates due to strong purifying selection. Our branch and codon models failed to detect signature of positive selection for any lineage and codon of the GA pathway genes. Conclusion This study suggests that significant heterogeneity of evolutionary rate of the GA pathway genes is mainly ascribed to differential constraint relaxation rather than the positive selection and supports the pathway flux theory that predicts that natural selection primarily targets enzymes that have the greatest control on fluxes.

  19. Gene co-expression networks and profiles reveal potential biomarkers of boar taint in pigs

    DEFF Research Database (Denmark)

    Drag, Markus; Skinkyté-Juskiené, R.; Do, Duy Ngoc

    synthesis. In testis, >80 DE genes were functionally classified by the PANTHER tool to “Gonadotropin releasing hormone receptor” and “Wnt signaling” pathways which play a role in reproductive maturation and proliferation of spermatogonia, respectively. WGCNA was used to build co-expression modules...

  20. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery.

    Science.gov (United States)

    Jia, Zhilong; Liu, Ying; Guan, Naiyang; Bo, Xiaochen; Luo, Zhigang; Barnes, Michael R

    2016-05-27

    Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and

  1. Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex

    Science.gov (United States)

    Hulsman, Marc; Lelieveldt, Boudewijn P. F.; de Ridder, Jeroen; Reinders, Marcel

    2015-01-01

    The three dimensional conformation of the genome in the cell nucleus influences important biological processes such as gene expression regulation. Recent studies have shown a strong correlation between chromatin interactions and gene co-expression. However, predicting gene co-expression from frequent long-range chromatin interactions remains challenging. We address this by characterizing the topology of the cortical chromatin interaction network using scale-aware topological measures. We demonstrate that based on these characterizations it is possible to accurately predict spatial co-expression between genes in the mouse cortex. Consistent with previous findings, we find that the chromatin interaction profile of a gene-pair is a good predictor of their spatial co-expression. However, the accuracy of the prediction can be substantially improved when chromatin interactions are described using scale-aware topological measures of the multi-resolution chromatin interaction network. We conclude that, for co-expression prediction, it is necessary to take into account different levels of chromatin interactions ranging from direct interaction between genes (i.e. small-scale) to chromatin compartment interactions (i.e. large-scale). PMID:25965262

  2. RNA Sequencing and Coexpression Analysis Reveal Key Genes Involved in α-Linolenic Acid Biosynthesis in Perilla frutescens Seed

    Directory of Open Access Journals (Sweden)

    Tianyuan Zhang

    2017-11-01

    Full Text Available Perilla frutescen is used as traditional food and medicine in East Asia. Its seeds contain high levels of α-linolenic acid (ALA, which is important for health, but is scarce in our daily meals. Previous reports on RNA-seq of perilla seed had identified fatty acid (FA and triacylglycerol (TAG synthesis genes, but the underlying mechanism of ALA biosynthesis and its regulation still need to be further explored. So we conducted Illumina RNA-sequencing in seven temporal developmental stages of perilla seeds. Sequencing generated a total of 127 million clean reads, containing 15.88 Gb of valid data. The de novo assembly of sequence reads yielded 64,156 unigenes with an average length of 777 bp. A total of 39,760 unigenes were annotated and 11,693 unigenes were found to be differentially expressed in all samples. According to Kyoto Encyclopedia of Genes and Genomes (KEGG pathway analysis, 486 unigenes were annotated in the “lipid metabolism” pathway. Of these, 150 unigenes were found to be involved in fatty acid (FA biosynthesis and triacylglycerol (TAG assembly in perilla seeds. A coexpression analysis showed that a total of 104 genes were highly coexpressed (r > 0.95. The coexpression network could be divided into two main subnetworks showing over expression in the medium or earlier and late phases, respectively. In order to identify the putative regulatory genes, a transcription factor (TF analysis was performed. This led to the identification of 45 gene families, mainly including the AP2-EREBP, bHLH, MYB, and NAC families, etc. After coexpression analysis of TFs with highly expression of FAD2 and FAD3 genes, 162 TFs were found to be significantly associated with two FAD genes (r > 0.95. Those TFs were predicted to be the key regulatory factors in ALA biosynthesis in perilla seed. The qRT-PCR analysis also verified the relevance of expression pattern between two FAD genes and partial candidate TFs. Although it has been reported that some TFs

  3. [Weighted gene co-expression network analysis in biomedicine research].

    Science.gov (United States)

    Liu, Wei; Li, Li; Ye, Hua; Tu, Wei

    2017-11-25

    High-throughput biological technologies are now widely applied in biology and medicine, allowing scientists to monitor thousands of parameters simultaneously in a specific sample. However, it is still an enormous challenge to mine useful information from high-throughput data. The emergence of network biology provides deeper insights into complex bio-system and reveals the modularity in tissue/cellular networks. Correlation networks are increasingly used in bioinformatics applications. Weighted gene co-expression network analysis (WGCNA) tool can detect clusters of highly correlated genes. Therefore, we systematically reviewed the application of WGCNA in the study of disease diagnosis, pathogenesis and other related fields. First, we introduced principle, workflow, advantages and disadvantages of WGCNA. Second, we presented the application of WGCNA in disease, physiology, drug, evolution and genome annotation. Then, we indicated the application of WGCNA in newly developed high-throughput methods. We hope this review will help to promote the application of WGCNA in biomedicine research.

  4. Dissection of regulatory networks that are altered in disease via differential co-expression.

    Directory of Open Access Journals (Sweden)

    David Amar

    Full Text Available Comparing the gene-expression profiles of sick and healthy individuals can help in understanding disease. Such differential expression analysis is a well-established way to find gene sets whose expression is altered in the disease. Recent approaches to gene-expression analysis go a step further and seek differential co-expression patterns, wherein the level of co-expression of a set of genes differs markedly between disease and control samples. Such patterns can arise from a disease-related change in the regulatory mechanism governing that set of genes, and pinpoint dysfunctional regulatory networks. Here we present DICER, a new method for detecting differentially co-expressed gene sets using a novel probabilistic score for differential correlation. DICER goes beyond standard differential co-expression and detects pairs of modules showing differential co-expression. The expression profiles of genes within each module of the pair are correlated across all samples. The correlation between the two modules, however, differs markedly between the disease and normal samples. We show that DICER outperforms the state of the art in terms of significance and interpretability of the detected gene sets. Moreover, the gene sets discovered by DICER manifest regulation by disease-specific microRNA families. In a case study on Alzheimer's disease, DICER dissected biological processes and protein complexes into functional subunits that are differentially co-expressed, thereby revealing inner structures in disease regulatory networks.

  5. MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters.

    Science.gov (United States)

    Gonzalez-Dominguez, Jorge; Martin, Maria J

    2017-10-10

    In this work we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. Source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.

  6. Evolutionary dynamics of human autoimmune disease genes and malfunctioned immunological genes

    Directory of Open Access Journals (Sweden)

    Podder Soumita

    2012-01-01

    Full Text Available Abstract Background One of the main issues of molecular evolution is to divulge the principles in dictating the evolutionary rate differences among various gene classes. Immunological genes have received considerable attention in evolutionary biology as candidates for local adaptation and for studying functionally important polymorphisms. The normal structure and function of immunological genes will be distorted when they experience mutations leading to immunological dysfunctions. Results Here, we examined the fundamental differences between the genes which on mutation give rise to autoimmune or other immune system related diseases and the immunological genes that do not cause any disease phenotypes. Although the disease genes examined are analogous to non-disease genes in product, expression, function, and pathway affiliation, a statistically significant decrease in evolutionary rate has been found in autoimmune disease genes relative to all other immune related diseases and non-disease genes. Possible ways of accumulation of mutation in the three steps of the central dogma (DNA-mRNA-Protein have been studied to trace the mutational effects predisposed to disease consequence and acquiring higher selection pressure. Principal Component Analysis and Multivariate Regression Analysis have established the predominant role of single nucleotide polymorphisms in guiding the evolutionary rate of immunological disease and non-disease genes followed by m-RNA abundance, paralogs number, fraction of phosphorylation residue, alternatively spliced exon, protein residue burial and protein disorder. Conclusions Our study provides an empirical insight into the etiology of autoimmune disease genes and other immunological diseases. The immediate utility of our study is to help in disease gene identification and may also help in medicinal improvement of immune related disease.

  7. Analysis of the relationship between coexpression domains and chromatin 3D organization.

    Directory of Open Access Journals (Sweden)

    María E Soler-Oliva

    2017-09-01

    Full Text Available Gene order is not random in eukaryotic chromosomes, and co-regulated genes tend to be clustered. The mechanisms that determine co-regulation of large regions of the genome and its connection with chromatin three-dimensional (3D organization are still unclear however. Here we have adapted a recently described method for identifying chromatin topologically associating domains (TADs to identify coexpression domains (which we term "CODs". Using human normal breast and breast cancer RNA-seq data, we have identified approximately 500 CODs. CODs in the normal and breast cancer genomes share similar characteristics but differ in their gene composition. COD genes have a greater tendency to be coexpressed with genes that reside in other CODs than with non-COD genes. Such inter-COD coexpression is maintained over large chromosomal distances in the normal genome but is partially lost in the cancer genome. Analyzing the relationship between CODs and chromatin 3D organization using Hi-C contact data, we find that CODs do not correspond to TADs. In fact, intra-TAD gene coexpression is the same as random for most chromosomes. However, the contact profile is similar between gene pairs that reside either in the same COD or in coexpressed CODs. These data indicate that co-regulated genes in the genome present similar patterns of contacts irrespective of the frequency of physical chromatin contacts between them.

  8. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    Science.gov (United States)

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  9. Bioinformatics Data Mining Approach Suggests Coexpression of AGTPBP1 with an ALS-linked Gene C9orf72

    Directory of Open Access Journals (Sweden)

    Shouta Kitano

    2015-01-01

    Full Text Available Background Expanded GGGGCC hexanucleotide repeats located in the noncoding region of the chromosome 9 open reading frame 72 ( C9orf72 gene represent the most common genetic abnormality for familial and sporadic amyotrophic lateral sclerosis (ALS and frontotemporal dementia (FTD. Formation of nuclear RNA foci, accumulation of repeat-associated non-ATG-translated dipeptide-repeat proteins, and haploinsufficiency of C9orf72 are proposed for pathological mechanisms of C9ALS/FTD. However, at present, the physiological function of C9orf72 remains largely unknown. Methods By searching on a bioinformatics database named COXPRESdb composed of the comprehensive gene coexpression data, we studied potential C9orf72 interactors. Results We identified the ATP/GTP binding protein 1 ( AGTPBP1 gene alternatively named NNA1 encoding a cytosolic carboxypeptidase whose mutation is causative of the degeneration of Purkinje cells and motor neurons as the most significant gene coexpressed with C9orf72. We verified coexpression and interaction of AGTPBP1 and C9orf72 in transfected cells by immunoprecipitation and in neurons of the human brain by double-labeling immunohistochemistry. Furthermore, we found a positive correlation between AGTPBP1 and C9orf72 mRNA expression levels in the set of 21 human brains examined. Conclusions These results suggest that AGTPBP1 serves as a C9orf72 interacting partner that plays a role in the regulation of neuronal function in a coordinated manner within the central nervous system.

  10. Discovery of cis-elements between sorghum and rice using co-expression and evolutionary conservation

    Directory of Open Access Journals (Sweden)

    Haberer Georg

    2009-06-01

    Full Text Available Abstract Background The spatiotemporal regulation of gene expression largely depends on the presence and absence of cis-regulatory sites in the promoter. In the economically highly important grass family, our knowledge of transcription factor binding sites and transcriptional networks is still very limited. With the completion of the sorghum genome and the available rice genome sequence, comparative promoter analyses now allow genome-scale detection of conserved cis-elements. Results In this study, we identified thousands of phylogenetic footprints conserved between orthologous rice and sorghum upstream regions that are supported by co-expression information derived from three different rice expression data sets. In a complementary approach, cis-motifs were discovered by their highly conserved co-occurrence in syntenic promoter pairs. Sequence conservation and matches to known plant motifs support our findings. Expression similarities of gene pairs positively correlate with the number of motifs that are shared by gene pairs and corroborate the importance of similar promoter architectures for concerted regulation. This strongly suggests that these motifs function in the regulation of transcript levels in rice and, presumably also in sorghum. Conclusion Our work provides the first large-scale collection of cis-elements for rice and sorghum and can serve as a paradigm for cis-element analysis through comparative genomics in grasses in general.

  11. Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model.

    Science.gov (United States)

    Omony, Jimmy; de Jong, Anne; Krawczyk, Antonina O; Eijlander, Robyn T; Kuipers, Oscar P

    2018-02-09

    Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.

  12. Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses.

    Science.gov (United States)

    Luo, Jie; Xu, Pei; Cao, Peijian; Wan, Hongjian; Lv, Xiaonan; Xu, Shengchun; Wang, Gangjun; Cook, Melloni N; Jones, Byron C; Lu, Lu; Wang, Xusheng

    2018-01-01

    Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE) but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1), down-regulation in NOE but rescue in RSE (pattern 2), up-regulation in both restraint stress followed by a saline injection (RSS) and NOE, and further amplification in RSE (pattern 3), and up-regulation in RSS but reduction in both NOE and RSE (pattern 4). We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs) to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA) signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses.

  13. Integrating Genetic and Gene Co-expression Analysis Identifies Gene Networks Involved in Alcohol and Stress Responses

    Directory of Open Access Journals (Sweden)

    Jie Luo

    2018-04-01

    Full Text Available Although the link between stress and alcohol is well recognized, the underlying mechanisms of how they interplay at the molecular level remain unclear. The purpose of this study is to identify molecular networks underlying the effects of alcohol and stress responses, as well as their interaction on anxiety behaviors in the hippocampus of mice using a systems genetics approach. Here, we applied a gene co-expression network approach to transcriptomes of 41 BXD mouse strains under four conditions: stress, alcohol, stress-induced alcohol and control. The co-expression analysis identified 14 modules and characterized four expression patterns across the four conditions. The four expression patterns include up-regulation in no restraint stress and given an ethanol injection (NOE but restoration in restraint stress followed by an ethanol injection (RSE; pattern 1, down-regulation in NOE but rescue in RSE (pattern 2, up-regulation in both restraint stress followed by a saline injection (RSS and NOE, and further amplification in RSE (pattern 3, and up-regulation in RSS but reduction in both NOE and RSE (pattern 4. We further identified four functional subnetworks by superimposing protein-protein interactions (PPIs to the 14 co-expression modules, including γ-aminobutyric acid receptor (GABA signaling, glutamate signaling, neuropeptide signaling, cAMP-dependent signaling. We further performed module specificity analysis to identify modules that are specific to stress, alcohol, or stress-induced alcohol responses. Finally, we conducted causality analysis to link genetic variation to these identified modules, and anxiety behaviors after stress and alcohol treatments. This study underscores the importance of integrative analysis and offers new insights into the molecular networks underlying stress and alcohol responses.

  14. Identifying gene coexpression networks underlying the dynamic regulation of wood-forming tissues in Populus under diverse environmental conditions.

    Science.gov (United States)

    Zinkgraf, Matthew; Liu, Lijun; Groover, Andrew; Filkov, Vladimir

    2017-06-01

    Trees modify wood formation through integration of environmental and developmental signals in complex but poorly defined transcriptional networks, allowing trees to produce woody tissues appropriate to diverse environmental conditions. In order to identify relationships among genes expressed during wood formation, we integrated data from new and publically available datasets in Populus. These datasets were generated from woody tissue and include transcriptome profiling, transcription factor binding, DNA accessibility and genome-wide association mapping experiments. Coexpression modules were calculated, each of which contains genes showing similar expression patterns across experimental conditions, genotypes and treatments. Conserved gene coexpression modules (four modules totaling 8398 genes) were identified that were highly preserved across diverse environmental conditions and genetic backgrounds. Functional annotations as well as correlations with specific experimental treatments associated individual conserved modules with distinct biological processes underlying wood formation, such as cell-wall biosynthesis, meristem development and epigenetic pathways. Module genes were also enriched for DNase I hypersensitivity footprints and binding from four transcription factors associated with wood formation. The conserved modules are excellent candidates for modeling core developmental pathways common to wood formation in diverse environments and genotypes, and serve as testbeds for hypothesis generation and testing for future studies. No claim to original US government works. New Phytologist © 2017 New Phytologist Trust.

  15. Matrix factorization reveals aging-specific co-expression gene modules in the fat and muscle tissues in nonhuman primates

    Science.gov (United States)

    Wang, Yongcui; Zhao, Weiling; Zhou, Xiaobo

    2016-10-01

    Accurate identification of coherent transcriptional modules (subnetworks) in adipose and muscle tissues is important for revealing the related mechanisms and co-regulated pathways involved in the development of aging-related diseases. Here, we proposed a systematically computational approach, called ICEGM, to Identify the Co-Expression Gene Modules through a novel mathematical framework of Higher-Order Generalized Singular Value Decomposition (HO-GSVD). ICEGM was applied on the adipose, and heart and skeletal muscle tissues in old and young female African green vervet monkeys. The genes associated with the development of inflammation, cardiovascular and skeletal disorder diseases, and cancer were revealed by the ICEGM. Meanwhile, genes in the ICEGM modules were also enriched in the adipocytes, smooth muscle cells, cardiac myocytes, and immune cells. Comprehensive disease annotation and canonical pathway analysis indicated that immune cells, adipocytes, cardiomyocytes, and smooth muscle cells played a synergistic role in cardiac and physical functions in the aged monkeys by regulation of the biological processes associated with metabolism, inflammation, and atherosclerosis. In conclusion, the ICEGM provides an efficiently systematic framework for decoding the co-expression gene modules in multiple tissues. Analysis of genes in the ICEGM module yielded important insights on the cooperative role of multiple tissues in the development of diseases.

  16. Step-by-Step Construction of Gene Co-expression Networks from High-Throughput Arabidopsis RNA Sequencing Data.

    Science.gov (United States)

    Contreras-López, Orlando; Moyano, Tomás C; Soto, Daniela C; Gutiérrez, Rodrigo A

    2018-01-01

    The rapid increase in the availability of transcriptomics data generated by RNA sequencing represents both a challenge and an opportunity for biologists without bioinformatics training. The challenge is handling, integrating, and interpreting these data sets. The opportunity is to use this information to generate testable hypothesis to understand molecular mechanisms controlling gene expression and biological processes (Fig. 1). A successful strategy to generate tractable hypotheses from transcriptomics data has been to build undirected network graphs based on patterns of gene co-expression. Many examples of new hypothesis derived from network analyses can be found in the literature, spanning different organisms including plants and specific fields such as root developmental biology.In order to make the process of constructing a gene co-expression network more accessible to biologists, here we provide step-by-step instructions using published RNA-seq experimental data obtained from a public database. Similar strategies have been used in previous studies to advance root developmental biology. This guide includes basic instructions for the operation of widely used open source platforms such as Bio-Linux, R, and Cytoscape. Even though the data we used in this example was obtained from Arabidopsis thaliana, the workflow developed in this guide can be easily adapted to work with RNA-seq data from any organism.

  17. A comprehensive analysis on preservation patterns of gene co-expression networks during Alzheimer's disease progression.

    Science.gov (United States)

    Ray, Sumanta; Hossain, Sk Md Mosaddek; Khatun, Lutfunnesa; Mukhopadhyay, Anirban

    2017-12-20

    Alzheimer's disease (AD) is a chronic neuro-degenerative disruption of the brain which involves in large scale transcriptomic variation. The disease does not impact every regions of the brain at the same time, instead it progresses slowly involving somewhat sequential interaction with different regions. Analysis of the expression patterns of the genes in different regions of the brain influenced in AD surely contribute for a enhanced comprehension of AD pathogenesis and shed light on the early characterization of the disease. Here, we have proposed a framework to identify perturbation and preservation characteristics of gene expression patterns across six distinct regions of the brain ("EC", "HIP", "PC", "MTG", "SFG", and "VCX") affected in AD. Co-expression modules were discovered considering a couple of regions at once. These are then analyzed to know the preservation and perturbation characteristics. Different module preservation statistics and a rank aggregation mechanism have been adopted to detect the changes of expression patterns across brain regions. Gene ontology (GO) and pathway based analysis were also carried out to know the biological meaning of preserved and perturbed modules. In this article, we have extensively studied the preservation patterns of co-expressed modules in six distinct brain regions affected in AD. Some modules are emerged as the most preserved while some others are detected as perturbed between a pair of brain regions. Further investigation on the topological properties of preserved and non-preserved modules reveals a substantial association amongst "betweenness centrality" and "degree" of the involved genes. Our findings may render a deeper realization of the preservation characteristics of gene expression patterns in discrete brain regions affected by AD.

  18. Evolutionary relationships between miRNA genes and their activity.

    Science.gov (United States)

    Zhu, Yan; Skogerbø, Geir; Ning, Qianqian; Wang, Zhen; Li, Biqing; Yang, Shuang; Sun, Hong; Li, Yixue

    2012-12-22

    The emergence of vertebrates is characterized by a strong increase in miRNA families. MicroRNAs interact broadly with many transcripts, and the evolution of such a system is intriguing. However, evolutionary questions concerning the origin of miRNA genes and their subsequent evolution remain unexplained. In order to systematically understand the evolutionary relationship between miRNAs gene and their function, we classified human known miRNAs into eight groups based on their evolutionary ages estimated by maximum parsimony method. New miRNA genes with new functional sequences accumulated more dynamically in vertebrates than that observed in Drosophila. Different levels of evolutionary selection were observed over miRNA gene sequences with different time of origin. Most genic miRNAs differ from their host genes in time of origin, there is no particular relationship between the age of a miRNA and the age of its host genes, genic miRNAs are mostly younger than the corresponding host genes. MicroRNAs originated over different time-scales are often predicted/verified to target the same or overlapping sets of genes, opening the possibility of substantial functional redundancy among miRNAs of different ages. Higher degree of tissue specificity and lower expression level was found in young miRNAs. Our data showed that compared with protein coding genes, miRNA genes are more dynamic in terms of emergence and decay. Evolution patterns are quite different between miRNAs of different ages. MicroRNAs activity is under tight control with well-regulated expression increased and targeting decreased over time. Our work calls attention to the study of miRNA activity with a consideration of their origin time.

  19. Coexpression of nuclear receptors and histone methylation modifying genes in the testis: implications for endocrine disruptor modes of action.

    Directory of Open Access Journals (Sweden)

    Alison M Anderson

    Full Text Available BACKGROUND: Endocrine disruptor chemicals elicit adverse health effects by perturbing nuclear receptor signalling systems. It has been speculated that these compounds may also perturb epigenetic mechanisms and thus contribute to the early origin of adult onset disease. We hypothesised that histone methylation may be a component of the epigenome that is susceptible to perturbation. We used coexpression analysis of publicly available data to investigate the combinatorial actions of nuclear receptors and genes involved in histone methylation in normal testis and when faced with endocrine disruptor compounds. METHODOLOGY/PRINCIPAL FINDINGS: The expression patterns of a set of genes were profiled across testis tissue in human, rat and mouse, plus control and exposed samples from four toxicity experiments in the rat. Our results indicate that histone methylation events are a more general component of nuclear receptor mediated transcriptional regulation in the testis than previously appreciated. Coexpression patterns support the role of a gatekeeper mechanism involving the histone methylation modifiers Kdm1, Prdm2, and Ehmt1 and indicate that this mechanism is a common determinant of transcriptional integrity for genes critical to diverse physiological endpoints relevant to endocrine disruption. Coexpression patterns following exposure to vinclozolin and dibutyl phthalate suggest that coactivity of the demethylase Kdm1 in particular warrants further investigation in relation to endocrine disruptor mode of action. CONCLUSIONS/SIGNIFICANCE: This study provides proof of concept that a bioinformatics approach that profiles genes related to a specific hypothesis across multiple biological settings can provide powerful insight into coregulatory activity that would be difficult to discern at an individual experiment level or by traditional differential expression analysis methods.

  20. Weighted gene co-expression network analysis reveals potential genes involved in early metamorphosis process in sea cucumber Apostichopus japonicus.

    Science.gov (United States)

    Li, Yongxin; Kikuchi, Mani; Li, Xueyan; Gao, Qionghua; Xiong, Zijun; Ren, Yandong; Zhao, Ruoping; Mao, Bingyu; Kondo, Mariko; Irie, Naoki; Wang, Wen

    2018-01-01

    Sea cucumbers, one main class of Echinoderms, have a very fast and drastic metamorphosis process during their development. However, the molecular basis under this process remains largely unknown. Here we systematically examined the gene expression profiles of Japanese common sea cucumber (Apostichopus japonicus) for the first time by RNA sequencing across 16 developmental time points from fertilized egg to juvenile stage. Based on the weighted gene co-expression network analysis (WGCNA), we identified 21 modules. Among them, MEdarkmagenta was highly expressed and correlated with the early metamorphosis process from late auricularia to doliolaria larva. Furthermore, gene enrichment and differentially expressed gene analysis identified several genes in the module that may play key roles in the metamorphosis process. Our results not only provide a molecular basis for experimentally studying the development and morphological complexity of sea cucumber, but also lay a foundation for improving its emergence rate. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon.

    Science.gov (United States)

    Sibout, Richard; Proost, Sebastian; Hansen, Bjoern Oest; Vaid, Neha; Giorgi, Federico M; Ho-Yue-Kuang, Severine; Legée, Frédéric; Cézart, Laurent; Bouchabké-Coussa, Oumaya; Soulhat, Camille; Provart, Nicholas; Pasha, Asher; Le Bris, Philippe; Roujol, David; Hofte, Herman; Jamet, Elisabeth; Lapierre, Catherine; Persson, Staffan; Mutwil, Marek

    2017-08-01

    While Brachypodium distachyon (Brachypodium) is an emerging model for grasses, no expression atlas or gene coexpression network is available. Such tools are of high importance to provide insights into the function of Brachypodium genes. We present a detailed Brachypodium expression atlas, capturing gene expression in its major organs at different developmental stages. The data were integrated into a large-scale coexpression database ( www.gene2function.de), enabling identification of duplicated pathways and conserved processes across 10 plant species, thus allowing genome-wide inference of gene function. We highlight the importance of the atlas and the platform through the identification of duplicated cell wall modules, and show that a lignin biosynthesis module is conserved across angiosperms. We identified and functionally characterised a putative ferulate 5-hydroxylase gene through overexpression of it in Brachypodium, which resulted in an increase in lignin syringyl units and reduced lignin content of mature stems, and led to improved saccharification of the stem biomass. Our Brachypodium expression atlas thus provides a powerful resource to reveal functionally related genes, which may advance our understanding of important biological processes in grasses. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  2. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis

    Directory of Open Access Journals (Sweden)

    Saurav Mallik

    2017-12-01

    Full Text Available For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures—weighted rank-based Jaccard and Cosine measures—and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm—RANWAR—was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.

  3. ConGEMs: Condensed Gene Co-Expression Module Discovery Through Rule-Based Clustering and Its Application to Carcinogenesis.

    Science.gov (United States)

    Mallik, Saurav; Zhao, Zhongming

    2017-12-28

    For transcriptomic analysis, there are numerous microarray-based genomic data, especially those generated for cancer research. The typical analysis measures the difference between a cancer sample-group and a matched control group for each transcript or gene. Association rule mining is used to discover interesting item sets through rule-based methodology. Thus, it has advantages to find causal effect relationships between the transcripts. In this work, we introduce two new rule-based similarity measures-weighted rank-based Jaccard and Cosine measures-and then propose a novel computational framework to detect condensed gene co-expression modules ( C o n G E M s) through the association rule-based learning system and the weighted similarity scores. In practice, the list of evolved condensed markers that consists of both singular and complex markers in nature depends on the corresponding condensed gene sets in either antecedent or consequent of the rules of the resultant modules. In our evaluation, these markers could be supported by literature evidence, KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway and Gene Ontology annotations. Specifically, we preliminarily identified differentially expressed genes using an empirical Bayes test. A recently developed algorithm-RANWAR-was then utilized to determine the association rules from these genes. Based on that, we computed the integrated similarity scores of these rule-based similarity measures between each rule-pair, and the resultant scores were used for clustering to identify the co-expressed rule-modules. We applied our method to a gene expression dataset for lung squamous cell carcinoma and a genome methylation dataset for uterine cervical carcinogenesis. Our proposed module discovery method produced better results than the traditional gene-module discovery measures. In summary, our proposed rule-based method is useful for exploring biomarker modules from transcriptomic data.

  4. Genes with stable DNA methylation levels show higher evolutionary conservation than genes with fluctuant DNA methylation levels.

    Science.gov (United States)

    Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai

    2015-11-24

    Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.

  5. Mutational and Evolutionary Analyses of Bovine Reprimo Gene ...

    African Journals Online (AJOL)

    It can therefore be concluded that bovine RPRM gene contained 4 transition mutations and 5 indels that can be used in marker assisted selection. Evolutionary findings also demonstrated the existence of a divergent evolution between bovine RPRM gene and RPRM gene of fishes and frog. Keywords: Identity, phylogeny ...

  6. Evolutionary change and phylogenetic relationships in light of horizontal gene transfer.

    Science.gov (United States)

    Boto, Luis

    2015-06-01

    Horizontal gene transfer has, over the past 25 years, become a part of evolutionary thinking. In the present paper I discuss horizontal gene transfer (HGT) in relation to contingency, natural selection, evolutionary change speed and the Tree-of-Life endeavour, with the aim of contributing to the understanding of the role of HGT in evolutionary processes. In addition, the challenges that HGT imposes on the current view of evolution are emphasized.

  7. Genes from scratch--the evolutionary fate of de novo genes.

    Science.gov (United States)

    Schlötterer, Christian

    2015-04-01

    Although considered an extremely unlikely event, many genes emerge from previously noncoding genomic regions. This review covers the entire life cycle of such de novo genes. Two competing hypotheses about the process of de novo gene birth are discussed as well as the high death rate of de novo genes. Despite the high death rate, some de novo genes are retained and remain functional, even in distantly related species, through their integration into gene networks. Further studies combining gene expression with ribosome profiling in multiple populations across different species will be instrumental for an improved understanding of the evolutionary processes operating on de novo genes. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.

  8. Selective modes determine evolutionary rates, gene compactness and expression patterns in Brassica.

    Science.gov (United States)

    Guo, Yue; Liu, Jing; Zhang, Jiefu; Liu, Shengyi; Du, Jianchang

    2017-07-01

    It has been well documented that most nuclear protein-coding genes in organisms can be classified into two categories: positively selected genes (PSGs) and negatively selected genes (NSGs). The characteristics and evolutionary fates of different types of genes, however, have been poorly understood. In this study, the rates of nonsynonymous substitution (K a ) and the rates of synonymous substitution (K s ) were investigated by comparing the orthologs between the two sequenced Brassica species, Brassica rapa and Brassica oleracea, and the evolutionary rates, gene structures, expression patterns, and codon bias were compared between PSGs and NSGs. The resulting data show that PSGs have higher protein evolutionary rates, lower synonymous substitution rates, shorter gene length, fewer exons, higher functional specificity, lower expression level, higher tissue-specific expression and stronger codon bias than NSGs. Although the quantities and values are different, the relative features of PSGs and NSGs have been largely verified in the model species Arabidopsis. These data suggest that PSGs and NSGs differ not only under selective pressure (K a /K s ), but also in their evolutionary, structural and functional properties, indicating that selective modes may serve as a determinant factor for measuring evolutionary rates, gene compactness and expression patterns in Brassica. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  9. Evolutionary Fates and Dynamic Functionalization of Young Duplicate Genes in Arabidopsis Genomes.

    Science.gov (United States)

    Wang, Jun; Tao, Feng; Marowsky, Nicholas C; Fan, Chuanzhu

    2016-09-01

    Gene duplication is a primary means to generate genomic novelties, playing an essential role in speciation and adaptation. Particularly in plants, a high abundance of duplicate genes has been maintained for significantly long periods of evolutionary time. To address the manner in which young duplicate genes were derived primarily from small-scale gene duplication and preserved in plant genomes and to determine the underlying driving mechanisms, we generated transcriptomes to produce the expression profiles of five tissues in Arabidopsis thaliana and the closely related species Arabidopsis lyrata and Capsella rubella Based on the quantitative analysis metrics, we investigated the evolutionary processes of young duplicate genes in Arabidopsis. We determined that conservation, neofunctionalization, and specialization are three main evolutionary processes for Arabidopsis young duplicate genes. We explicitly demonstrated the dynamic functionalization of duplicate genes along the evolutionary time scale. Upon origination, duplicates tend to maintain their ancestral functions; but as they survive longer, they might be likely to develop distinct and novel functions. The temporal evolutionary processes and functionalization of plant duplicate genes are associated with their ancestral functions, dynamic DNA methylation levels, and histone modification abundances. Furthermore, duplicate genes tend to be initially expressed in pollen and then to gain more interaction partners over time. Altogether, our study provides novel insights into the dynamic retention processes of young duplicate genes in plant genomes. © 2016 American Society of Plant Biologists. All rights reserved.

  10. The evolutionary dynamics of variant antigen genes in Babesia reveal a history of genomic innovation underlying host-parasite interaction

    KAUST Repository

    Jackson, Andrew P.

    2014-05-05

    Babesia spp. are tick-borne, intraerythrocytic hemoparasites that use antigenic variation to resist host immunity, through sequential modification of the parasite-derived variant erythrocyte surface antigen (VESA) expressed on the infected red blood cell surface. We identified the genomic processes driving antigenic diversity in genes encoding VESA (ves1) through comparative analysis within and between three Babesia species, (B. bigemina, B. divergens and B. bovis). Ves1 structure diverges rapidly after speciation, notably through the evolution of shortened forms (ves2) from 5? ends of canonical ves1 genes. Phylogenetic analyses show that ves1 genes are transposed between loci routinely, whereas ves2 genes are not. Similarly, analysis of sequence mosaicism shows that recombination drives variation in ves1 sequences, but less so for ves2, indicating the adoption of different mechanisms for variation of the two families. Proteomic analysis of the B. bigemina PR isolate shows that two dominant VESA1 proteins are expressed in the population, whereas numerous VESA2 proteins are co-expressed, consistent with differential transcriptional regulation of each family. Hence, VESA2 proteins are abundant and previously unrecognized elements of Babesia biology, with evolutionary dynamics consistently different to those of VESA1, suggesting that their functions are distinct. 2014 The Author(s) 2014.

  11. The evolutionary dynamics of variant antigen genes in Babesia reveal a history of genomic innovation underlying host-parasite interaction

    KAUST Repository

    Jackson, Andrew P.; Otto, Thomas D.; Darby, Alistair; Ramaprasad, Abhinay; Xia, Dong; Echaide, Ignacio Eduardo; Farber, Marisa; Gahlot, Sunayna; Gamble, John; Gupta, Dinesh; Gupta, Yask; Jackson, Louise; Malandrin, Laurence; Malas, Tareq B.; Moussa, Ehab; Nair, Mridul; Reid, Adam J.; Sanders, Mandy; Sharma, Jyotsna; Tracey, Alan; Quail, Mike A.; Weir, William; Wastling, Jonathan M.; Hall, Neil; Willadsen, Peter; Lingelbach, Klaus; Shiels, Brian; Tait, Andy; Berriman, Matt; Allred, David R.; Pain, Arnab

    2014-01-01

    Babesia spp. are tick-borne, intraerythrocytic hemoparasites that use antigenic variation to resist host immunity, through sequential modification of the parasite-derived variant erythrocyte surface antigen (VESA) expressed on the infected red blood cell surface. We identified the genomic processes driving antigenic diversity in genes encoding VESA (ves1) through comparative analysis within and between three Babesia species, (B. bigemina, B. divergens and B. bovis). Ves1 structure diverges rapidly after speciation, notably through the evolution of shortened forms (ves2) from 5? ends of canonical ves1 genes. Phylogenetic analyses show that ves1 genes are transposed between loci routinely, whereas ves2 genes are not. Similarly, analysis of sequence mosaicism shows that recombination drives variation in ves1 sequences, but less so for ves2, indicating the adoption of different mechanisms for variation of the two families. Proteomic analysis of the B. bigemina PR isolate shows that two dominant VESA1 proteins are expressed in the population, whereas numerous VESA2 proteins are co-expressed, consistent with differential transcriptional regulation of each family. Hence, VESA2 proteins are abundant and previously unrecognized elements of Babesia biology, with evolutionary dynamics consistently different to those of VESA1, suggesting that their functions are distinct. 2014 The Author(s) 2014.

  12. The evolutionary dynamics of variant antigen genes in Babesia reveal a history of genomic innovation underlying host–parasite interaction

    Science.gov (United States)

    Jackson, Andrew P.; Otto, Thomas D.; Darby, Alistair; Ramaprasad, Abhinay; Xia, Dong; Echaide, Ignacio Eduardo; Farber, Marisa; Gahlot, Sunayna; Gamble, John; Gupta, Dinesh; Gupta, Yask; Jackson, Louise; Malandrin, Laurence; Malas, Tareq B.; Moussa, Ehab; Nair, Mridul; Reid, Adam J.; Sanders, Mandy; Sharma, Jyotsna; Tracey, Alan; Quail, Mike A.; Weir, William; Wastling, Jonathan M.; Hall, Neil; Willadsen, Peter; Lingelbach, Klaus; Shiels, Brian; Tait, Andy; Berriman, Matt; Allred, David R.; Pain, Arnab

    2014-01-01

    Babesia spp. are tick-borne, intraerythrocytic hemoparasites that use antigenic variation to resist host immunity, through sequential modification of the parasite-derived variant erythrocyte surface antigen (VESA) expressed on the infected red blood cell surface. We identified the genomic processes driving antigenic diversity in genes encoding VESA (ves1) through comparative analysis within and between three Babesia species, (B. bigemina, B. divergens and B. bovis). Ves1 structure diverges rapidly after speciation, notably through the evolution of shortened forms (ves2) from 5′ ends of canonical ves1 genes. Phylogenetic analyses show that ves1 genes are transposed between loci routinely, whereas ves2 genes are not. Similarly, analysis of sequence mosaicism shows that recombination drives variation in ves1 sequences, but less so for ves2, indicating the adoption of different mechanisms for variation of the two families. Proteomic analysis of the B. bigemina PR isolate shows that two dominant VESA1 proteins are expressed in the population, whereas numerous VESA2 proteins are co-expressed, consistent with differential transcriptional regulation of each family. Hence, VESA2 proteins are abundant and previously unrecognized elements of Babesia biology, with evolutionary dynamics consistently different to those of VESA1, suggesting that their functions are distinct. PMID:24799432

  13. Gene genealogies indicates abundant gene conversions and independent evolutionary histories of the mating-type chromosomes in the evolutionary history of Neurospora tetrasperma

    Directory of Open Access Journals (Sweden)

    Whittle Carrie A

    2010-07-01

    Full Text Available Abstract Background The self-fertile filamentous ascomycete Neurospora tetrasperma contains a large (~7 Mbp and young (mat chromosomes. The objective of the present study is to reveal the evolutionary history, including key genomic events, associated with the various regions of the mat chromosomes among ten strains representing all the nine known species (lineages contained within the N. tetrasperma species complex. Results Comparative analysis of sequence divergence among alleles of 24 mat-linked genes (mat A and mat a indicates that a large region of suppressed recombination exists within the mat chromosome for each of nine lineages of N. tetrasperma sensu latu. The recombinationally suppressed region varies in size and gene composition among lineages, and is flanked on both ends by normally recombining regions. Genealogical analyses among lineages reveals that eight gene conversion events have occurred between homologous mat A and mat a-linked alleles of genes located within the region of restricted recombination during the evolutionary history of N. tetrasperma. Conclusions We conclude that the region of suppressed recombination in the mat chromosomes has likely been subjected to independent contraction and/or expansion during the evolutionary history of the N. tetrasperma species complex. Furthermore, we infer that gene conversion events are likely a common phenomenon within this recombinationally suppressed genomic region. We argue that gene conversions might provide an efficient mechanism of adaptive editing of functional genes, including the removal of deleterious mutations, within the young recombinationally suppressed region of the mat chromosomes.

  14. Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

    Science.gov (United States)

    Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

    2017-11-13

    The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly

  15. Identification of co-expression gene networks, regulatory genes and pathways for obesity based on adipose tissue RNA Sequencing in a porcine model

    DEFF Research Database (Denmark)

    Kogelman, Lisette; Cirera Salicio, Susanna; Zhernakova, Daria V.

    2014-01-01

    interactions. Identification of co-expressed and regulatory genes in RNA extracted from relevant tissues representing lean and obese individuals provides an entry point for the identification of genes and pathways of importance to the development of obesity. The pig, an omnivorous animal, is an excellent model...... (modules). Additionally, regulator genes were detected using Lemon-Tree algorithms. Results WGCNA revealed five modules which were strongly correlated with at least one obesity-related phenotype (correlations ranging from -0.54 to 0.72, P ... the association between obesity and other diseases, like osteoporosis (osteoclast differentiation, P = 1.4E-7), and immune-related complications (e.g. Natural killer cell mediated cytotoxity, P = 3.8E-5; B cell receptor signaling pathway, P = 7.2E-5). Lemon-Tree identified three potential regulator genes, using...

  16. Genome-Scale Co-Expression Network Comparison across Escherichia coli and Salmonella enterica Serovar Typhimurium Reveals Significant Conservation at the Regulon Level of Local Regulators Despite Their Dissimilar Lifestyles

    Science.gov (United States)

    Zarrineh, Peyman; Sánchez-Rodríguez, Aminael; Hosseinkhan, Nazanin; Narimani, Zahra; Marchal, Kathleen; Masoudi-Nejad, Ali

    2014-01-01

    Availability of genome-wide gene expression datasets provides the opportunity to study gene expression across different organisms under a plethora of experimental conditions. In our previous work, we developed an algorithm called COMODO (COnserved MODules across Organisms) that identifies conserved expression modules between two species. In the present study, we expanded COMODO to detect the co-expression conservation across three organisms by adapting the statistics behind it. We applied COMODO to study expression conservation/divergence between Escherichia coli, Salmonella enterica, and Bacillus subtilis. We observed that some parts of the regulatory interaction networks were conserved between E. coli and S. enterica especially in the regulon of local regulators. However, such conservation was not observed between the regulatory interaction networks of B. subtilis and the two other species. We found co-expression conservation on a number of genes involved in quorum sensing, but almost no conservation for genes involved in pathogenicity across E. coli and S. enterica which could partially explain their different lifestyles. We concluded that despite their different lifestyles, no significant rewiring have occurred at the level of local regulons involved for instance, and notable conservation can be detected in signaling pathways and stress sensing in the phylogenetically close species S. enterica and E. coli. Moreover, conservation of local regulons seems to depend on the evolutionary time of divergence across species disappearing at larger distances as shown by the comparison with B. subtilis. Global regulons follow a different trend and show major rewiring even at the limited evolutionary distance that separates E. coli and S. enterica. PMID:25101984

  17. Comparison of co-expression measures: mutual information, correlation, and model based indices.

    Science.gov (United States)

    Song, Lin; Langfelder, Peter; Horvath, Steve

    2012-12-09

    Co-expression measures are often used to define networks among genes. Mutual information (MI) is often used as a generalized correlation measure. It is not clear how much MI adds beyond standard (robust) correlation measures or regression model based association measures. Further, it is important to assess what transformations of these and other co-expression measures lead to biologically meaningful modules (clusters of genes). We provide a comprehensive comparison between mutual information and several correlation measures in 8 empirical data sets and in simulations. We also study different approaches for transforming an adjacency matrix, e.g. using the topological overlap measure. Overall, we confirm close relationships between MI and correlation in all data sets which reflects the fact that most gene pairs satisfy linear or monotonic relationships. We discuss rare situations when the two measures disagree. We also compare correlation and MI based approaches when it comes to defining co-expression network modules. We show that a robust measure of correlation (the biweight midcorrelation transformed via the topological overlap transformation) leads to modules that are superior to MI based modules and maximal information coefficient (MIC) based modules in terms of gene ontology enrichment. We present a function that relates correlation to mutual information which can be used to approximate the mutual information from the corresponding correlation coefficient. We propose the use of polynomial or spline regression models as an alternative to MI for capturing non-linear relationships between quantitative variables. The biweight midcorrelation outperforms MI in terms of elucidating gene pairwise relationships. Coupled with the topological overlap matrix transformation, it often leads to more significantly enriched co-expression modules. Spline and polynomial networks form attractive alternatives to MI in case of non-linear relationships. Our results indicate that MI

  18. Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

    Science.gov (United States)

    Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

    2018-05-09

    Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.

  19. The arabidopsis wall associated kinase-like 10 gene encodes a functional guanylyl cyclase and is co-expressed with pathogen defense related genes

    KAUST Repository

    Meier, Stuart; Ruzvidzo, Oziniel; Morse, Monique; Donaldson, Lara; Kwezi, Lusisizwe; Gehring, Christoph A

    2010-01-01

    Background: Second messengers have a key role in linking environmental stimuli to physiological responses. One such messenger, guanosine 3?,5?-cyclic monophosphate (cGMP), has long been known to be an essential signaling molecule in many different physiological processes in higher plants, including biotic stress responses. To date, however, the guanylyl cyclase (GC) enzymes that catalyze the formation of cGMP from GTP have largely remained elusive in higher plants. Principal Findings: We have identified an Arabidopsis receptor type wall associated kinase-like molecule (AtWAKL10) as a candidate GC and provide experimental evidence to show that the intracellular domain of AtWAKL10431-700 can generate cGMP in vitro. Further, we also demonstrate that the molecule has kinase activity indicating that AtWAKL10 is a twin-domain catalytic protein. A co-expression and stimulus-specific expression analysis revealed that AtWAKL10 is consistently coexpressed with well characterized pathogen defense related genes and along with these genes is induced early and sharply in response to a range of pathogens and their elicitors. Conclusions: We demonstrate that AtWAKL10 is a twin-domain, kinase-GC signaling molecule that may function in biotic stress responses that are critically dependent on the second messenger cGMP. © 2010 Meier et al.

  20. The arabidopsis wall associated kinase-like 10 gene encodes a functional guanylyl cyclase and is co-expressed with pathogen defense related genes

    KAUST Repository

    Meier, Stuart

    2010-01-26

    Background: Second messengers have a key role in linking environmental stimuli to physiological responses. One such messenger, guanosine 3?,5?-cyclic monophosphate (cGMP), has long been known to be an essential signaling molecule in many different physiological processes in higher plants, including biotic stress responses. To date, however, the guanylyl cyclase (GC) enzymes that catalyze the formation of cGMP from GTP have largely remained elusive in higher plants. Principal Findings: We have identified an Arabidopsis receptor type wall associated kinase-like molecule (AtWAKL10) as a candidate GC and provide experimental evidence to show that the intracellular domain of AtWAKL10431-700 can generate cGMP in vitro. Further, we also demonstrate that the molecule has kinase activity indicating that AtWAKL10 is a twin-domain catalytic protein. A co-expression and stimulus-specific expression analysis revealed that AtWAKL10 is consistently coexpressed with well characterized pathogen defense related genes and along with these genes is induced early and sharply in response to a range of pathogens and their elicitors. Conclusions: We demonstrate that AtWAKL10 is a twin-domain, kinase-GC signaling molecule that may function in biotic stress responses that are critically dependent on the second messenger cGMP. © 2010 Meier et al.

  1. Evolutionary origins of Brassicaceae specific genes in Arabidopsis thaliana

    Science.gov (United States)

    2011-01-01

    Background All sequenced genomes contain a proportion of lineage-specific genes, which exhibit no sequence similarity to any genes outside the lineage. Despite their prevalence, the origins and functions of most lineage-specific genes remain largely unknown. As more genomes are sequenced opportunities for understanding evolutionary origins and functions of lineage-specific genes are increasing. Results This study provides a comprehensive analysis of the origins of lineage-specific genes (LSGs) in Arabidopsis thaliana that are restricted to the Brassicaceae family. In this study, lineage-specific genes within the nuclear (1761 genes) and mitochondrial (28 genes) genomes are identified. The evolutionary origins of two thirds of the lineage-specific genes within the Arabidopsis thaliana genome are also identified. Almost a quarter of lineage-specific genes originate from non-lineage-specific paralogs, while the origins of ~10% of lineage-specific genes are partly derived from DNA exapted from transposable elements (twice the proportion observed for non-lineage-specific genes). Lineage-specific genes are also enriched in genes that have overlapping CDS, which is consistent with such novel genes arising from overprinting. Over half of the subset of the 958 lineage-specific genes found only in Arabidopsis thaliana have alignments to intergenic regions in Arabidopsis lyrata, consistent with either de novo origination or differential gene loss and retention, with both evolutionary scenarios explaining the lineage-specific status of these genes. A smaller number of lineage-specific genes with an incomplete open reading frame across different Arabidopsis thaliana accessions are further identified as accession-specific genes, most likely of recent origin in Arabidopsis thaliana. Putative de novo origination for two of the Arabidopsis thaliana-only genes is identified via additional sequencing across accessions of Arabidopsis thaliana and closely related sister species

  2. Co-expression Network Approach to Studying the Effects of Botulinum Neurotoxin-A.

    Science.gov (United States)

    Mukund, Kavitha; Ward, Samuel R; Lieber, Richard L; Subramaniam, Shankar

    2017-10-16

    Botulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules, further hierarchically clustered into 5 groups. Quantifying average expression and co-expression patterns across these groups revealed temporal aspects of muscle's response to BoNT-A. Functional analysis revealed enrichment of group 1 with metabolism; group 5 with contradictory functions of atrophy and cellular recovery; and groups 2 and 3 with extracellular matrix (ECM) and non-fast fiber isoforms. Topological positioning of two highly ranked, significantly expressed genes- Dclk1 and Ostalpha within group 5 suggested possible mechanistic roles in recovery from BoNT-A induced atrophy. Phenotypic correlations of groups with titin and myosin protein content further emphasized the effect of BoNT-A on the sarcomeric contraction machinery in early phase of chemodenervation. In summary, our approach revealed a hierarchical functional response to BoNT-A induced paralysis with early metabolic and later ECM responses and identified putative biomarkers associated with chemodenervation. Additionally, our results provide an unbiased validation of the response documented in our previous workBotulinum Neurotoxin A (BoNT-A) is a potent neurotoxin with several clinical applications.The goal of this study was to utilize co-expression network theory to analyze temporal transcriptional data from skeletal muscle after BoNT-A treatment. Expression data for 2000 genes (extracted using a ranking heuristic) served as the basis for this analysis. Using weighted gene co-expression network analysis (WGCNA), we identified 19 co-expressed modules

  3. Why is the correlation between gene importance and gene evolutionary rate so weak?

    Science.gov (United States)

    Wang, Zhi; Zhang, Jianzhi

    2009-01-01

    One of the few commonly believed principles of molecular evolution is that functionally more important genes (or DNA sequences) evolve more slowly than less important ones. This principle is widely used by molecular biologists in daily practice. However, recent genomic analysis of a diverse array of organisms found only weak, negative correlations between the evolutionary rate of a gene and its functional importance, typically measured under a single benign lab condition. A frequently suggested cause of the above finding is that gene importance determined in the lab differs from that in an organism's natural environment. Here, we test this hypothesis in yeast using gene importance values experimentally determined in 418 lab conditions or computationally predicted for 10,000 nutritional conditions. In no single condition or combination of conditions did we find a much stronger negative correlation, which is explainable by our subsequent finding that always-essential (enzyme) genes do not evolve significantly more slowly than sometimes-essential or always-nonessential ones. Furthermore, we verified that functional density, approximated by the fraction of amino acid sites within protein domains, is uncorrelated with gene importance. Thus, neither the lab-nature mismatch nor a potentially biased among-gene distribution of functional density explains the observed weakness of the correlation between gene importance and evolutionary rate. We conclude that the weakness is factual, rather than artifactual. In addition to being weakened by population genetic reasons, the correlation is likely to have been further weakened by the presence of multiple nontrivial rate determinants that are independent from gene importance. These findings notwithstanding, we show that the principle of slower evolution of more important genes does have some predictive power when genes with vastly different evolutionary rates are compared, explaining why the principle can be practically useful

  4. Evolutionary Transitions of MicroRNA-Target Pairs

    KAUST Repository

    Nozawa, Masafumi; Fujimi, Mai; Iwamoto, Chie; Onizuka, Kanako; Fukuda, Nana; Ikeo, Kazuho; Gojobori, Takashi

    2016-01-01

    How newly generated microRNA (miRNA) genes are integrated into gene regulatory networks during evolution is fundamental in understanding the molecular and evolutionary bases of robustness and plasticity in gene regulation. A recent model proposed that after the birth of a miRNA, the miRNA is generally integrated into the network by decreasing the number of target genes during evolution. However, this decreasing model remains to be carefully examined by considering in vivo conditions. In this study, we therefore compared the number of target genes among miRNAs with different ages, combining experiments with bioinformatics predictions. First, we focused on three Drosophila miRNAs with different ages. As a result, we found that an older miRNA has a greater number of target genes than a younger miRNA, suggesting the increasing number of targets for each miRNA during evolution (increasing model). To further confirm our results, we also predicted all target genes for all miRNAs in D. melanogaster, considering co-expression of miRNAs and mRNAs in vivo. The results obtained also do not support the decreasing model but are reasonably consistent with the increasing model of miRNA-target pairs. Furthermore, our large-scale analyses of currently available experimental data of miRNA-target pairs also showed a weak but the same trend in humans. These results indicate that the current decreasing model of miRNA-target pairs should be reconsidered and the increasing model may be more appropriate to explain the evolutionary transitions of miRNA-target pairs in many organisms.

  5. Evolutionary Transitions of MicroRNA-Target Pairs

    KAUST Repository

    Nozawa, Masafumi

    2016-04-27

    How newly generated microRNA (miRNA) genes are integrated into gene regulatory networks during evolution is fundamental in understanding the molecular and evolutionary bases of robustness and plasticity in gene regulation. A recent model proposed that after the birth of a miRNA, the miRNA is generally integrated into the network by decreasing the number of target genes during evolution. However, this decreasing model remains to be carefully examined by considering in vivo conditions. In this study, we therefore compared the number of target genes among miRNAs with different ages, combining experiments with bioinformatics predictions. First, we focused on three Drosophila miRNAs with different ages. As a result, we found that an older miRNA has a greater number of target genes than a younger miRNA, suggesting the increasing number of targets for each miRNA during evolution (increasing model). To further confirm our results, we also predicted all target genes for all miRNAs in D. melanogaster, considering co-expression of miRNAs and mRNAs in vivo. The results obtained also do not support the decreasing model but are reasonably consistent with the increasing model of miRNA-target pairs. Furthermore, our large-scale analyses of currently available experimental data of miRNA-target pairs also showed a weak but the same trend in humans. These results indicate that the current decreasing model of miRNA-target pairs should be reconsidered and the increasing model may be more appropriate to explain the evolutionary transitions of miRNA-target pairs in many organisms.

  6. Coexpression of bile salt hydrolase gene and catalase gene remarkably improves oxidative stress and bile salt resistance in Lactobacillus casei.

    Science.gov (United States)

    Wang, Guohong; Yin, Sheng; An, Haoran; Chen, Shangwu; Hao, Yanling

    2011-08-01

    Lactic acid bacteria (LAB) encounter various types of stress during industrial processes and gastrointestinal transit. Catalase (CAT) and bile salt hydrolase (BSH) can protect bacteria from oxidative stress or damage caused by bile salts by decomposing hydrogen peroxide (H(2)O(2)) or deconjugating the bile salts, respectively. Lactobacillus casei is a valuable probiotic strain and is often deficient in both CAT and BSH. In order to improve the resistance of L. casei to both oxidative and bile salts stress, the catalase gene katA from L. sakei and the bile salt hydrolase gene bsh1 from L. plantarum were coexpressed in L. casei HX01. The enzyme activities of CAT and BSH were 2.41 μmol H(2)O(2)/min/10(8) colony-forming units (CFU) and 2.11 μmol glycine/min/ml in the recombinant L. casei CB, respectively. After incubation with 8 mM H(2)O(2), survival ratio of L. casei CB was 40-fold higher than that of L. casei CK. Treatment of L. casei CB with various concentrations of sodium glycodeoxycholate (GDCA) showed that ~10(5) CFU/ml cells survived after incubation with 0.5% GDCA, whereas almost all the L. casei CK cells were killed when treaded with 0.4% GDCA. These results indicate that the coexpression of CAT and BSH confers high-level resistance to both oxidative and bile salts stress conditions in L. casei HX01.

  7. Gene Coexpression Analysis Reveals Complex Metabolism of the Monoterpene Alcohol Linalool in Arabidopsis Flowers[W][OPEN

    Science.gov (United States)

    Ginglinger, Jean-François; Boachon, Benoit; Höfer, René; Paetz, Christian; Köllner, Tobias G.; Miesch, Laurence; Lugan, Raphael; Baltenweck, Raymonde; Mutterer, Jérôme; Ullmann, Pascaline; Beran, Franziska; Claudel, Patricia; Verstappen, Francel; Fischer, Marc J.C.; Karst, Francis; Bouwmeester, Harro; Miesch, Michel; Schneider, Bernd; Gershenzon, Jonathan; Ehlting, Jürgen; Werck-Reichhart, Danièle

    2013-01-01

    The cytochrome P450 family encompasses the largest family of enzymes in plant metabolism, and the functions of many of its members in Arabidopsis thaliana are still unknown. Gene coexpression analysis pointed to two P450s that were coexpressed with two monoterpene synthases in flowers and were thus predicted to be involved in monoterpenoid metabolism. We show that all four selected genes, the two terpene synthases (TPS10 and TPS14) and the two cytochrome P450s (CYP71B31 and CYP76C3), are simultaneously expressed at anthesis, mainly in upper anther filaments and in petals. Upon transient expression in Nicotiana benthamiana, the TPS enzymes colocalize in vesicular structures associated with the plastid surface, whereas the P450 proteins were detected in the endoplasmic reticulum. Whether they were expressed in Saccharomyces cerevisiae or in N. benthamiana, the TPS enzymes formed two different enantiomers of linalool: (−)-(R)-linalool for TPS10 and (+)-(S)-linalool for TPS14. Both P450 enzymes metabolize the two linalool enantiomers to form different but overlapping sets of hydroxylated or epoxidized products. These oxygenated products are not emitted into the floral headspace, but accumulate in floral tissues as further converted or conjugated metabolites. This work reveals complex linalool metabolism in Arabidopsis flowers, the ecological role of which remains to be determined. PMID:24285789

  8. Predicting targeted drug combinations based on Pareto optimal patterns of coexpression network connectivity.

    Science.gov (United States)

    Penrod, Nadia M; Greene, Casey S; Moore, Jason H

    2014-01-01

    Molecularly targeted drugs promise a safer and more effective treatment modality than conventional chemotherapy for cancer patients. However, tumors are dynamic systems that readily adapt to these agents activating alternative survival pathways as they evolve resistant phenotypes. Combination therapies can overcome resistance but finding the optimal combinations efficiently presents a formidable challenge. Here we introduce a new paradigm for the design of combination therapy treatment strategies that exploits the tumor adaptive process to identify context-dependent essential genes as druggable targets. We have developed a framework to mine high-throughput transcriptomic data, based on differential coexpression and Pareto optimization, to investigate drug-induced tumor adaptation. We use this approach to identify tumor-essential genes as druggable candidates. We apply our method to a set of ER(+) breast tumor samples, collected before (n = 58) and after (n = 60) neoadjuvant treatment with the aromatase inhibitor letrozole, to prioritize genes as targets for combination therapy with letrozole treatment. We validate letrozole-induced tumor adaptation through coexpression and pathway analyses in an independent data set (n = 18). We find pervasive differential coexpression between the untreated and letrozole-treated tumor samples as evidence of letrozole-induced tumor adaptation. Based on patterns of coexpression, we identify ten genes as potential candidates for combination therapy with letrozole including EPCAM, a letrozole-induced essential gene and a target to which drugs have already been developed as cancer therapeutics. Through replication, we validate six letrozole-induced coexpression relationships and confirm the epithelial-to-mesenchymal transition as a process that is upregulated in the residual tumor samples following letrozole treatment. To derive the greatest benefit from molecularly targeted drugs it is critical to design combination

  9. Age gene expression and coexpression progressive signatures in peripheral blood leukocytes.

    Science.gov (United States)

    Irizar, Haritz; Goñi, Joaquín; Alzualde, Ainhoa; Castillo-Triviño, Tamara; Olascoaga, Javier; Lopez de Munain, Adolfo; Otaegui, David

    2015-12-01

    Both cellular senescence and organismic aging are known to be dynamic processes that start early in life and progress constantly during the whole life of the individual. In this work, with the objective of identifying signatures of age-related progressive change at the transcriptomic level, we have performed a whole-genome gene expression analysis of peripheral blood leukocytes in a group of healthy individuals with ages ranging from 14 to 93 years. A set of genes with progressively changing gene expression (either increase or decrease with age) has been identified and contextualized in a coexpression network. A modularity analysis has been performed on this network and biological-term and pathway enrichment analyses have been used for biological interpretation of each module. In summary, the results of the present work reveal the existence of a transcriptomic component that shows progressive expression changes associated to age in peripheral blood leukocytes, highlighting both the dynamic nature of the process and the need to complement young vs. elder studies with longitudinal studies that include middle aged individuals. From the transcriptional point of view, immunosenescence seems to be occurring from a relatively early age, at least from the late 20s/early 30s, and the 49-56 year old age-range appears to be critical. In general, the genes that, according to our results, show progressive expression changes with aging are involved in pathogenic/cellular processes that have classically been linked to aging in humans: cancer, immune processes and cellular growth vs. maintenance. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Gene family size conservation is a good indicator of evolutionary rates.

    Science.gov (United States)

    Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen

    2010-08-01

    The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.

  11. The evolutionary dynamics of variant antigen genes in Babesia reveal a history of genomic innovation underlying host-parasite interaction.

    Science.gov (United States)

    Jackson, Andrew P; Otto, Thomas D; Darby, Alistair; Ramaprasad, Abhinay; Xia, Dong; Echaide, Ignacio Eduardo; Farber, Marisa; Gahlot, Sunayna; Gamble, John; Gupta, Dinesh; Gupta, Yask; Jackson, Louise; Malandrin, Laurence; Malas, Tareq B; Moussa, Ehab; Nair, Mridul; Reid, Adam J; Sanders, Mandy; Sharma, Jyotsna; Tracey, Alan; Quail, Mike A; Weir, William; Wastling, Jonathan M; Hall, Neil; Willadsen, Peter; Lingelbach, Klaus; Shiels, Brian; Tait, Andy; Berriman, Matt; Allred, David R; Pain, Arnab

    2014-06-01

    Babesia spp. are tick-borne, intraerythrocytic hemoparasites that use antigenic variation to resist host immunity, through sequential modification of the parasite-derived variant erythrocyte surface antigen (VESA) expressed on the infected red blood cell surface. We identified the genomic processes driving antigenic diversity in genes encoding VESA (ves1) through comparative analysis within and between three Babesia species, (B. bigemina, B. divergens and B. bovis). Ves1 structure diverges rapidly after speciation, notably through the evolution of shortened forms (ves2) from 5' ends of canonical ves1 genes. Phylogenetic analyses show that ves1 genes are transposed between loci routinely, whereas ves2 genes are not. Similarly, analysis of sequence mosaicism shows that recombination drives variation in ves1 sequences, but less so for ves2, indicating the adoption of different mechanisms for variation of the two families. Proteomic analysis of the B. bigemina PR isolate shows that two dominant VESA1 proteins are expressed in the population, whereas numerous VESA2 proteins are co-expressed, consistent with differential transcriptional regulation of each family. Hence, VESA2 proteins are abundant and previously unrecognized elements of Babesia biology, with evolutionary dynamics consistently different to those of VESA1, suggesting that their functions are distinct. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Evolutionary signatures amongst disease genes permit novel methods for gene prioritization and construction of informative gene-based networks.

    Directory of Open Access Journals (Sweden)

    Nolan Priedigkeit

    2015-02-01

    Full Text Available Genes involved in the same function tend to have similar evolutionary histories, in that their rates of evolution covary over time. This coevolutionary signature, termed Evolutionary Rate Covariation (ERC, is calculated using only gene sequences from a set of closely related species and has demonstrated potential as a computational tool for inferring functional relationships between genes. To further define applications of ERC, we first established that roughly 55% of genetic diseases posses an ERC signature between their contributing genes. At a false discovery rate of 5% we report 40 such diseases including cancers, developmental disorders and mitochondrial diseases. Given these coevolutionary signatures between disease genes, we then assessed ERC's ability to prioritize known disease genes out of a list of unrelated candidates. We found that in the presence of an ERC signature, the true disease gene is effectively prioritized to the top 6% of candidates on average. We then apply this strategy to a melanoma-associated region on chromosome 1 and identify MCL1 as a potential causative gene. Furthermore, to gain global insight into disease mechanisms, we used ERC to predict molecular connections between 310 nominally distinct diseases. The resulting "disease map" network associates several diseases with related pathogenic mechanisms and unveils many novel relationships between clinically distinct diseases, such as between Hirschsprung's disease and melanoma. Taken together, these results demonstrate the utility of molecular evolution as a gene discovery platform and show that evolutionary signatures can be used to build informative gene-based networks.

  13. Guidance for RNA-seq co-expression network construction and analysis: safety in numbers.

    Science.gov (United States)

    Ballouz, S; Verleyen, W; Gillis, J

    2015-07-01

    RNA-seq co-expression analysis is in its infancy and reasonable practices remain poorly defined. We assessed a variety of RNA-seq expression data to determine factors affecting functional connectivity and topology in co-expression networks. We examine RNA-seq co-expression data generated from 1970 RNA-seq samples using a Guilt-By-Association framework, in which genes are assessed for the tendency of co-expression to reflect shared function. Minimal experimental criteria to obtain performance on par with microarrays were >20 samples with read depth >10 M per sample. While the aggregate network constructed shows good performance (area under the receiver operator characteristic curve ∼0.71), the dependency on number of experiments used is nearly identical to that present in microarrays, suggesting thousands of samples are required to obtain 'gold-standard' co-expression. We find a major topological difference between RNA-seq and microarray co-expression in the form of low overlaps between hub-like genes from each network due to changes in the correlation of expression noise within each technology. jgillis@cshl.edu or sballouz@cshl.edu Networks are available at: http://gillislab.labsites.cshl.edu/supplements/rna-seq-networks/ and supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. Construction and identification of double-gene co-expression vector with radiation-inducible human TRAIL and endostatin

    International Nuclear Information System (INIS)

    Li Yanbo; Guo Caixia; Gong Pingsheng; Liu Yang; Liangshuo; Wang Hongfang; Wang Jianfeng; Gong Shouliang

    2010-01-01

    Objective: To construct a recombinant plasmid pshuttle-Egr1-shTRAIL-shES containing tumor necrosis factor-related apoptosis-inducing ligand (TRAIL) and endostatin double genes. Methods: The secretary endostatin gene (shES) fragment was amplified from the pMD19T-endostatin vector by PCR. The shES gene was ligated to pMD19Tand sequenced. Finally, using the gene recombinant technique, the recombinant plasmid pshuttle-Egr1- shTRAIL-shES with radiation-inducible Egr1 promoter, secretary TRAIL and endostatin double-gene was constructed. Results: The sequence of the shES gene was in concordance with that anticipated indicating shES gene was acquired successfully.Moreover, the results acquired by PCR and restrictive digestion identification of the recombinant plasmid pshuttle-Egr1-shTRAIL-shES and all the vectors refered to its construction confirmed that pshuttle-Egr1-shTRAIL-shES was constructed correctly. Conclusion: The radiation-inducible double-gene co-expression vector pshuttle-Egr1-shTRAIL-shES is constructed successfully, which would set the experimental foundation for further study on the anti-tumor effect of TRAIL and endostatin double-gene-radiotherapy and its related mechanisms. (authors)

  15. The Roles and Evolutionary Patterns of Intronless Genes in Deuterostomes

    Directory of Open Access Journals (Sweden)

    Ming Zou

    2011-01-01

    Full Text Available Genes without introns are a characteristic feature of prokaryotes, but there are still a number of intronless genes in eukaryotes. To study these eukaryotic genes that have prokaryotic architecture could help to understand the evolutionary patterns of related genes and genomes. Our analyses revealed a number of intronless genes that reside in 6 deuterostomes (sea urchin, sea squirt, zebrafish, chicken, platypus, and human. We also determined the conservation for each intronless gene in archaea, bacteria, fungi, plants, metazoans, and other eukaryotes. Proportions of intronless genes that are inherited from the common ancestor of archaea, bacteria, and eukaryotes in these species were consistent with their phylogenetic positions, with more proportions of ancient intronless genes residing in more primitive species. In these species, intronless genes belong to different cellular roles and gene ontology (GO categories, and some of these functions are very basic. Part of intronless genes is derived from other intronless genes or multiexon genes in each species. In conclusion, we showed that a varying number and proportion of intronless genes reside in these 6 deuterostomes, and some of them function importantly. These genes are good candidates for subsequent functional and evolutionary analyses specifically.

  16. Evolutionary Fates and Dynamic Functionalization of Young Duplicate Genes in Arabidopsis Genomes1[OPEN

    Science.gov (United States)

    Wang, Jun; Tao, Feng; Marowsky, Nicholas C.; Fan, Chuanzhu

    2016-01-01

    Gene duplication is a primary means to generate genomic novelties, playing an essential role in speciation and adaptation. Particularly in plants, a high abundance of duplicate genes has been maintained for significantly long periods of evolutionary time. To address the manner in which young duplicate genes were derived primarily from small-scale gene duplication and preserved in plant genomes and to determine the underlying driving mechanisms, we generated transcriptomes to produce the expression profiles of five tissues in Arabidopsis thaliana and the closely related species Arabidopsis lyrata and Capsella rubella. Based on the quantitative analysis metrics, we investigated the evolutionary processes of young duplicate genes in Arabidopsis. We determined that conservation, neofunctionalization, and specialization are three main evolutionary processes for Arabidopsis young duplicate genes. We explicitly demonstrated the dynamic functionalization of duplicate genes along the evolutionary time scale. Upon origination, duplicates tend to maintain their ancestral functions; but as they survive longer, they might be likely to develop distinct and novel functions. The temporal evolutionary processes and functionalization of plant duplicate genes are associated with their ancestral functions, dynamic DNA methylation levels, and histone modification abundances. Furthermore, duplicate genes tend to be initially expressed in pollen and then to gain more interaction partners over time. Altogether, our study provides novel insights into the dynamic retention processes of young duplicate genes in plant genomes. PMID:27485883

  17. Co-expression of interleukin 12 enhances antitumor effects of a novel chimeric promoter-mediated suicide gene therapy in an immunocompetent mouse model

    International Nuclear Information System (INIS)

    Xu, Yu; Liu, Zhengchun; Kong, Haiyan; Sun, Wenjie; Liao, Zhengkai; Zhou, Fuxiang; Xie, Conghua

    2011-01-01

    Highlights: → A novel chimeric promoter consisting of CArG element and hTERT promoter was developed. → The promoter was characterized with radiation-inducibility and tumor-specificity. → Suicide gene system driven by the promoter showed remarkable cytotoxicity in vitro. → Co-expression of IL12 enhanced the promoter mediated suicide gene therapy in vivo. -- Abstract: The human telomerase reverse transcriptase (hTERT) promoter has been widely used in target gene therapy of cancer. However, low transcriptional activity limited its clinical application. Here, we designed a novel dual radiation-inducible and tumor-specific promoter system consisting of CArG elements and the hTERT promoter, resulting in increased expression of reporter genes after gamma-irradiation. Therapeutic and side effects of adenovirus-mediated horseradish peroxidase (HRP)/indole-3-acetic (IAA) system downstream of the chimeric promoter were evaluated in mice bearing Lewis lung carcinoma, combining with or without adenovirus-mediated interleukin 12 (IL12) gene driven by the cytomegalovirus promoter. The combination treatment showed more effective suppression of tumor growth than those with single agent alone, being associated with pronounced intratumoral T-lymphocyte infiltration and minor side effects. Our results suggest that the combination treatment with HRP/IAA system driven by the novel chimeric promoter and the co-expression of IL12 might be an effective and safe target gene therapy strategy of cancer.

  18. Co-expression of interleukin 12 enhances antitumor effects of a novel chimeric promoter-mediated suicide gene therapy in an immunocompetent mouse model

    Energy Technology Data Exchange (ETDEWEB)

    Xu, Yu, E-mail: xuyu1001@gmail.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Liu, Zhengchun, E-mail: l135027@126.com [Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Kong, Haiyan, E-mail: suppleant@163.com [Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Sun, Wenjie, E-mail: wendy11240325@163.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Liao, Zhengkai, E-mail: fastbeta@gmail.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Zhou, Fuxiang, E-mail: happyzhoufx@sina.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); Xie, Conghua, E-mail: chxie_65@hotmail.com [Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, 169 Donghu Road, Wuhan 430071 (China); Hubei Key Laboratory of Tumor Biological Behaviors and Hubei Cancer Clinical Study Center, 169 Donghu Road, Wuhan 430071 (China); and others

    2011-09-09

    Highlights: {yields} A novel chimeric promoter consisting of CArG element and hTERT promoter was developed. {yields} The promoter was characterized with radiation-inducibility and tumor-specificity. {yields} Suicide gene system driven by the promoter showed remarkable cytotoxicity in vitro. {yields} Co-expression of IL12 enhanced the promoter mediated suicide gene therapy in vivo. -- Abstract: The human telomerase reverse transcriptase (hTERT) promoter has been widely used in target gene therapy of cancer. However, low transcriptional activity limited its clinical application. Here, we designed a novel dual radiation-inducible and tumor-specific promoter system consisting of CArG elements and the hTERT promoter, resulting in increased expression of reporter genes after gamma-irradiation. Therapeutic and side effects of adenovirus-mediated horseradish peroxidase (HRP)/indole-3-acetic (IAA) system downstream of the chimeric promoter were evaluated in mice bearing Lewis lung carcinoma, combining with or without adenovirus-mediated interleukin 12 (IL12) gene driven by the cytomegalovirus promoter. The combination treatment showed more effective suppression of tumor growth than those with single agent alone, being associated with pronounced intratumoral T-lymphocyte infiltration and minor side effects. Our results suggest that the combination treatment with HRP/IAA system driven by the novel chimeric promoter and the co-expression of IL12 might be an effective and safe target gene therapy strategy of cancer.

  19. A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways

    OpenAIRE

    Musungu, Bryan M.; Bhatnagar, Deepak; Brown, Robert L.; Payne, Gary A.; OBrian, Greg; Fakhoury, Ahmad M.; Geisler, Matt

    2016-01-01

    A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flav...

  20. Codon usage is associated with the evolutionary age of genes in metazoan genomes

    Directory of Open Access Journals (Sweden)

    Linial Nathan

    2009-12-01

    Full Text Available Abstract Background Codon usage may vary significantly between different organisms and between genes within the same organism. Several evolutionary processes have been postulated to be the predominant determinants of codon usage: selection, mutation, and genetic drift. However, the relative contribution of each of these factors in different species remains debatable. The availability of complete genomes for tens of multicellular organisms provides an opportunity to inspect the relationship between codon usage and the evolutionary age of genes. Results We assign an evolutionary age to a gene based on the relative positions of its identified homologues in a standard phylogenetic tree. This yields a classification of all genes in a genome to several evolutionary age classes. The present study starts from the observation that each age class of genes has a unique codon usage and proceeds to provide a quantitative analysis of the codon usage in these classes. This observation is made for the genomes of Homo sapiens, Mus musculus, and Drosophila melanogaster. It is even more remarkable that the differences between codon usages in different age groups exhibit similar and consistent behavior in various organisms. While we find that GC content and gene length are also associated with the evolutionary age of genes, they can provide only a partial explanation for the observed codon usage. Conclusion While factors such as GC content, mutational bias, and selection shape the codon usage in a genome, the evolutionary history of an organism over hundreds of millions of years is an overlooked property that is strongly linked to GC content, protein length, and, even more significantly, to the codon usage of metazoan genomes.

  1. Evolutionary constraints shape caste-specific gene expression across 15 ant species.

    Science.gov (United States)

    Morandin, Claire; Mikheyev, Alexander S; Pedersen, Jes Søe; Helanterä, Heikki

    2017-05-01

    Development of polymorphic phenotypes from similar genomes requires gene expression differences. However, little is known about how morph-specific gene expression patterns vary on a broad phylogenetic scale. We hypothesize that evolution of morph-specific gene expression, and consequently morph-specific phenotypic evolution, may be constrained by gene essentiality and the amount of pleiotropic constraints. Here, we use comparative transcriptomics of queen and worker morphs, that is, castes, from 15 ant species to understand the constraints of morph-biased gene expression. In particular, we investigate how measures of evolutionary constraints at the sequence level (expression level, connectivity, and number of gene ontology [GO] terms) correlate with morph-biased expression. Our results show that genes indeed vary in their potential to become morph-biased. The existence of genes that are constrained in becoming caste-biased potentially limits the evolutionary decoupling of the caste phenotypes, that is, it might result in "caste load" occasioning from antagonistic fitness variation, similarly to sexually antagonistic fitness variation between males and females. On the other hand, we suggest that genes under low constraints are released from antagonistic variation and thus more likely to be co-opted for morph specific use. Overall, our results suggest that the factors that affect sequence evolutionary rates and evolution of plastic expression may largely overlap. © 2017 The Author(s). Evolution © 2017 The Society for the Study of Evolution.

  2. Evolutionary Rate Heterogeneity of Primary and Secondary Metabolic Pathway Genes in Arabidopsis thaliana.

    Science.gov (United States)

    Mukherjee, Dola; Mukherjee, Ashutosh; Ghosh, Tapash Chandra

    2015-11-10

    Primary metabolism is essential to plants for growth and development, and secondary metabolism helps plants to interact with the environment. Many plant metabolites are industrially important. These metabolites are produced by plants through complex metabolic pathways. Lack of knowledge about these pathways is hindering the successful breeding practices for these metabolites. For a better knowledge of the metabolism in plants as a whole, evolutionary rate variation of primary and secondary metabolic pathway genes is a prerequisite. In this study, evolutionary rate variation of primary and secondary metabolic pathway genes has been analyzed in the model plant Arabidopsis thaliana. Primary metabolic pathway genes were found to be more conserved than secondary metabolic pathway genes. Several factors such as gene structure, expression level, tissue specificity, multifunctionality, and domain number are the key factors behind this evolutionary rate variation. This study will help to better understand the evolutionary dynamics of plant metabolism. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  3. Inferring the transcriptional landscape of bovine skeletal muscle by integrating co-expression networks.

    Directory of Open Access Journals (Sweden)

    Nicholas J Hudson

    Full Text Available BACKGROUND: Despite modern technologies and novel computational approaches, decoding causal transcriptional regulation remains challenging. This is particularly true for less well studied organisms and when only gene expression data is available. In muscle a small number of well characterised transcription factors are proposed to regulate development. Therefore, muscle appears to be a tractable system for proposing new computational approaches. METHODOLOGY/PRINCIPAL FINDINGS: Here we report a simple algorithm that asks "which transcriptional regulator has the highest average absolute co-expression correlation to the genes in a co-expression module?" It correctly infers a number of known causal regulators of fundamental biological processes, including cell cycle activity (E2F1, glycolysis (HLF, mitochondrial transcription (TFB2M, adipogenesis (PIAS1, neuronal development (TLX3, immune function (IRF1 and vasculogenesis (SOX17, within a skeletal muscle context. However, none of the canonical pro-myogenic transcription factors (MYOD1, MYOG, MYF5, MYF6 and MEF2C were linked to muscle structural gene expression modules. Co-expression values were computed using developing bovine muscle from 60 days post conception (early foetal to 30 months post natal (adulthood for two breeds of cattle, in addition to a nutritional comparison with a third breed. A number of transcriptional landscapes were constructed and integrated into an always correlated landscape. One notable feature was a 'metabolic axis' formed from glycolysis genes at one end, nuclear-encoded mitochondrial protein genes at the other, and centrally tethered by mitochondrially-encoded mitochondrial protein genes. CONCLUSIONS/SIGNIFICANCE: The new module-to-regulator algorithm complements our recently described Regulatory Impact Factor analysis. Together with a simple examination of a co-expression module's contents, these three gene expression approaches are starting to illuminate the in vivo

  4. Synonymous genes explore different evolutionary landscapes.

    Directory of Open Access Journals (Sweden)

    Guillaume Cambray

    2008-11-01

    Full Text Available The evolutionary potential of a gene is constrained not only by the amino acid sequence of its product, but by its DNA sequence as well. The topology of the genetic code is such that half of the amino acids exhibit synonymous codons that can reach different subsets of amino acids from each other through single mutation. Thus, synonymous DNA sequences should access different regions of the protein sequence space through a limited number of mutations, and this may deeply influence the evolution of natural proteins. Here, we demonstrate that this feature can be of value for manipulating protein evolvability. We designed an algorithm that, starting from an input gene, constructs a synonymous sequence that systematically includes the codons with the most different evolutionary perspectives; i.e., codons that maximize accessibility to amino acids previously unreachable from the template by point mutation. A synonymous version of a bacterial antibiotic resistance gene was computed and synthesized. When concurrently submitted to identical directed evolution protocols, both the wild type and the recoded sequence led to the isolation of specific, advantageous phenotypic variants. Simulations based on a mutation isolated only from the synthetic gene libraries were conducted to assess the impact of sub-functional selective constraints, such as codon usage, on natural adaptation. Our data demonstrate that rational design of synonymous synthetic genes stands as an affordable improvement to any directed evolution protocol. We show that using two synonymous DNA sequences improves the overall yield of the procedure by increasing the diversity of mutants generated. These results provide conclusive evidence that synonymous coding sequences do experience different areas of the corresponding protein adaptive landscape, and that a sequence's codon usage effectively constrains the evolution of the encoded protein.

  5. Prevalent Role of Gene Features in Determining Evolutionary Fates of Whole-Genome Duplication Duplicated Genes in Flowering Plants1[W][OA

    Science.gov (United States)

    Jiang, Wen-kai; Liu, Yun-long; Xia, En-hua; Gao, Li-zhi

    2013-01-01

    The evolution of genes and genomes after polyploidization has been the subject of extensive studies in evolutionary biology and plant sciences. While a significant number of duplicated genes are rapidly removed during a process called fractionation, which operates after the whole-genome duplication (WGD), another considerable number of genes are retained preferentially, leading to the phenomenon of biased gene retention. However, the evolutionary mechanisms underlying gene retention after WGD remain largely unknown. Through genome-wide analyses of sequence and functional data, we comprehensively investigated the relationships between gene features and the retention probability of duplicated genes after WGDs in six plant genomes, Arabidopsis (Arabidopsis thaliana), poplar (Populus trichocarpa), soybean (Glycine max), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays). The results showed that multiple gene features were correlated with the probability of gene retention. Using a logistic regression model based on principal component analysis, we resolved evolutionary rate, structural complexity, and GC3 content as the three major contributors to gene retention. Cluster analysis of these features further classified retained genes into three distinct groups in terms of gene features and evolutionary behaviors. Type I genes are more prone to be selected by dosage balance; type II genes are possibly subject to subfunctionalization; and type III genes may serve as potential targets for neofunctionalization. This study highlights that gene features are able to act jointly as primary forces when determining the retention and evolution of WGD-derived duplicated genes in flowering plants. These findings thus may help to provide a resolution to the debate on different evolutionary models of gene fates after WGDs. PMID:23396833

  6. ERC analysis: web-based inference of gene function via evolutionary rate covariation.

    Science.gov (United States)

    Wolfe, Nicholas W; Clark, Nathan L

    2015-12-01

    The recent explosion of comparative genomics data presents an unprecedented opportunity to construct gene networks via the evolutionary rate covariation (ERC) signature. ERC is used to identify genes that experienced similar evolutionary histories, and thereby draws functional associations between them. The ERC Analysis website allows researchers to exploit genome-wide datasets to infer novel genes in any biological function and to explore deep evolutionary connections between distinct pathways and complexes. The website provides five analytical methods, graphical output, statistical support and access to an increasing number of taxonomic groups. Analyses and data at http://csb.pitt.edu/erc_analysis/ nclark@pitt.edu. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  7. Construction and comparison of gene co-expression networks shows complex plant immune responses

    Directory of Open Access Journals (Sweden)

    Luis Guillermo Leal

    2014-10-01

    Full Text Available Gene co-expression networks (GCNs are graphic representations that depict the coordinated transcription of genes in response to certain stimuli. GCNs provide functional annotations of genes whose function is unknown and are further used in studies of translational functional genomics among species. In this work, a methodology for the reconstruction and comparison of GCNs is presented. This approach was applied using gene expression data that were obtained from immunity experiments in Arabidopsis thaliana, rice, soybean, tomato and cassava. After the evaluation of diverse similarity metrics for the GCN reconstruction, we recommended the mutual information coefficient measurement and a clustering coefficient-based method for similarity threshold selection. To compare GCNs, we proposed a multivariate approach based on the Principal Component Analysis (PCA. Branches of plant immunity that were exemplified by each experiment were analyzed in conjunction with the PCA results, suggesting both the robustness and the dynamic nature of the cellular responses. The dynamic of molecular plant responses produced networks with different characteristics that are differentiable using our methodology. The comparison of GCNs from plant pathosystems, showed that in response to similar pathogens plants could activate conserved signaling pathways. The results confirmed that the closeness of GCNs projected on the principal component space is an indicative of similarity among GCNs. This also can be used to understand global patterns of events triggered during plant immune responses.

  8. Comprehensive analysis of the flowering genes in Chinese cabbage and examination of evolutionary pattern of CO-like genes in plant kingdom

    Science.gov (United States)

    Song, Xiaoming; Duan, Weike; Huang, Zhinan; Liu, Gaofeng; Wu, Peng; Liu, Tongkun; Li, Ying; Hou, Xilin

    2015-09-01

    In plants, flowering is the most important transition from vegetative to reproductive growth. The flowering patterns of monocots and eudicots are distinctly different, but few studies have described the evolutionary patterns of the flowering genes in them. In this study, we analysed the evolutionary pattern, duplication and expression level of these genes. The main results were as follows: (i) characterization of flowering genes in monocots and eudicots, including the identification of family-specific, orthologous and collinear genes; (ii) full characterization of CONSTANS-like genes in Brassica rapa (BraCOL genes), the key flowering genes; (iii) exploration of the evolution of COL genes in plant kingdom and construction of the evolutionary pattern of COL genes; (iv) comparative analysis of CO and FT genes between Brassicaceae and Grass, which identified several family-specific amino acids, and revealed that CO and FT protein structures were similar in B. rapa and Arabidopsis but different in rice; and (v) expression analysis of photoperiod pathway-related genes in B. rapa under different photoperiod treatments by RT-qPCR. This analysis will provide resources for understanding the flowering mechanisms and evolutionary pattern of COL genes. In addition, this genome-wide comparative study of COL genes may also provide clues for evolution of other flowering genes.

  9. Evolutionary Trails of Plant Group II Pyridoxal Phosphate-Dependent Decarboxylase Genes.

    Science.gov (United States)

    Kumar, Rahul

    2016-01-01

    Type II pyridoxal phosphate-dependent decarboxylase (PLP_deC) enzymes play important metabolic roles during nitrogen metabolism. Recent evolutionary profiling of these genes revealed a sharp expansion of histidine decarboxylase genes in the members of Solanaceae family. In spite of the high sequence homology shared by PLP_deC orthologs, these enzymes display remarkable differences in their substrate specificities. Currently, limited information is available on the gene repertoires and substrate specificities of PLP_deCs which renders their precise annotation challenging and offers technical challenges in the immediate identification and biochemical characterization of their full gene complements in plants. Herein, we explored their evolutionary trails in a comprehensive manner by taking advantage of high-throughput data accessibility and computational approaches. We discussed the premise that has enabled an improved reconstruction of their evolutionary lineage and evaluated the factors offering constraints in their rapid functional characterization, till date. We envisage that the synthesized information herein would act as a catalyst for the rapid exploration of their biochemical specificity and physiological roles in more plant species.

  10. Evolutionary Origins of Cancer Driver Genes and Implications for Cancer Prognosis.

    Science.gov (United States)

    Chu, Xin-Yi; Jiang, Ling-Han; Zhou, Xiong-Hui; Cui, Ze-Jia; Zhang, Hong-Yu

    2017-07-14

    The cancer atavistic theory suggests that carcinogenesis is a reverse evolution process. It is thus of great interest to explore the evolutionary origins of cancer driver genes and the relevant mechanisms underlying the carcinogenesis. Moreover, the evolutionary features of cancer driver genes could be helpful in selecting cancer biomarkers from high-throughput data. In this study, through analyzing the cancer endogenous molecular networks, we revealed that the subnetwork originating from eukaryota could control the unlimited proliferation of cancer cells, and the subnetwork originating from eumetazoa could recapitulate the other hallmarks of cancer. In addition, investigations based on multiple datasets revealed that cancer driver genes were enriched in genes originating from eukaryota, opisthokonta, and eumetazoa. These results have important implications for enhancing the robustness of cancer prognosis models through selecting the gene signatures by the gene age information.

  11. Co-expression modules construction by WGCNA and identify potential prognostic markers of uveal melanoma.

    Science.gov (United States)

    Wan, Qi; Tang, Jing; Han, Yu; Wang, Dan

    2018-01-01

    Uveal melanoma is an aggressive cancer which has a high percentage recurrence and with a worse prognosis. Identify the potential prognostic markers of uveal melanoma may provide information for early detection of recurrence and treatment. RNA sequence data of uveal melanoma and patient clinic traits were obtained from The Cancer Genome Atlas (TCGA) database. Co-expression modules were built by weighted gene co -expression network analysis (WGCNA) and applied to investigate the relationship underlying modules and clinic traits. Besides, functional enrichment analysis was performed on these co-expression genes from interested modules. First, using WGCNA, identified 21 co-expression modules were constructed by the 10975 genes from the 80 human uveal melanoma samples. The number of genes in these modules ranged from 42 to 5091. Found four co -expression modules significantly correlated with three clinic traits (status, recurrence and recurrence Time). Module red, and purple positively correlated with patient's life status and recurrence Time. Module green positively correlates with recurrence. The result of functional enrichment analysis showed that the module magenta was mainly enriched genetic material assemble processes, the purple module was mainly enriched in tissue homeostasis and melanosome membrane and the module red was mainly enriched metastasis of cell, suggesting its critical role in the recurrence and development of the disease. Additionally, identified the hug gene (top connectivity with other genes) in each module. The hub gene SLC17A7, NTRK2, ABTB1 and ADPRHL1 might play a vital role in recurrence of uveal melanoma. Our findings provided the framework of co-expression gene modules of uveal melanoma and identified some prognostic markers might be detection of recurrence and treatment for uveal melanoma. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Changes in transcriptional orientation are associated with increases in evolutionary rates of enterobacterial genes

    Directory of Open Access Journals (Sweden)

    Hsiung Chao

    2011-10-01

    Full Text Available Abstract Background Changes in transcriptional orientation (“CTOs” occur frequently in prokaryotic genomes. Such changes usually result from genomic inversions, which may cause a conflict between the directions of replication and transcription and an increase in mutation rate. However, CTOs do not always lead to the replication-transcription confrontation. Furthermore, CTOs may cause deleterious disruptions of operon structure and/or gene regulations. The currently existing CTOs may indicate relaxation of selection pressure. Therefore, it is of interest to investigate whether CTOs have an independent effect on the evolutionary rates of the affected genes, and whether these genes are subject to any type of selection pressure in prokaryotes. Methods Three closely related enterbacteria, Escherichia coli, Klebsiella pneumoniae and Salmonella enterica serovar Typhimurium, were selected for comparisons of synonymous (dS and nonsynonymous (dN substitution rate between the genes that have experienced changes in transcriptional orientation (changed-orientation genes, “COGs” and those that do not (same-orientation genes, “SOGs”. The dN/dS ratio was also derived to evaluate the selection pressure on the analyzed genes. Confounding factors in the estimation of evolutionary rates, such as gene essentiality, gene expression level, replication-transcription confrontation, and decreased dS at gene terminals were controlled in the COG-SOG comparisons. Results We demonstrate that COGs have significantly higher dN and dS than SOGs when a series of confounding factors are controlled. However, the dN/dS ratios are similar between the two gene groups, suggesting that the increase in dS can sufficiently explain the increase in dN in COGs. Therefore, the increases in evolutionary rates in COGs may be mainly mutation-driven. Conclusions Here we show that CTOs can increase the evolutionary rates of the affected genes. This effect is independent of the

  13. Functional Module Analysis for Gene Coexpression Networks with Network Integration.

    Science.gov (United States)

    Zhang, Shuqin; Zhao, Hongyu; Ng, Michael K

    2015-01-01

    Network has been a general tool for studying the complex interactions between different genes, proteins, and other small molecules. Module as a fundamental property of many biological networks has been widely studied and many computational methods have been proposed to identify the modules in an individual network. However, in many cases, a single network is insufficient for module analysis due to the noise in the data or the tuning of parameters when building the biological network. The availability of a large amount of biological networks makes network integration study possible. By integrating such networks, more informative modules for some specific disease can be derived from the networks constructed from different tissues, and consistent factors for different diseases can be inferred. In this paper, we have developed an effective method for module identification from multiple networks under different conditions. The problem is formulated as an optimization model, which combines the module identification in each individual network and alignment of the modules from different networks together. An approximation algorithm based on eigenvector computation is proposed. Our method outperforms the existing methods, especially when the underlying modules in multiple networks are different in simulation studies. We also applied our method to two groups of gene coexpression networks for humans, which include one for three different cancers, and one for three tissues from the morbidly obese patients. We identified 13 modules with three complete subgraphs, and 11 modules with two complete subgraphs, respectively. The modules were validated through Gene Ontology enrichment and KEGG pathway enrichment analysis. We also showed that the main functions of most modules for the corresponding disease have been addressed by other researchers, which may provide the theoretical basis for further studying the modules experimentally.

  14. Identifying miRNA and gene modules of colon cancer associated with pathological stage by weighted gene co-expression network analysis

    Directory of Open Access Journals (Sweden)

    Zhou X

    2018-05-01

    Full Text Available Xian-guo Zhou,1,2,* Xiao-liang Huang,1,2,* Si-yuan Liang,1–3 Shao-mei Tang,1,2 Si-kao Wu,1,2 Tong-tong Huang,1,2 Zeng-nan Mo,1,2,4 Qiu-yan Wang1,2,5 1Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 2Guangxi Key Laboratory for Genomic and Personalized Medicine, Guangxi Collaborative Innovation Center for Genomic and Personalized Medicine, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 3Department of Colorectal Surgery, First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 4Department of Urology and Nephrology, The First Affiliated Hospital of Guangxi, Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China; 5Guangxi Colleges and Universities Key Laboratory of Biological Molecular Medicine Research, Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, People’s Republic of China *These authors contributed equally to this work Introduction: Colorectal cancer (CRC is the fourth most common cause of cancer-related mortality worldwide. The tumor, node, metastasis (TNM stage remains the standard for CRC prognostication. Identification of meaningful microRNA (miRNA and gene modules or representative biomarkers related to the pathological stage of colon cancer helps to predict prognosis and reveal the mechanisms behind cancer progression.Materials and methods: We applied a systems biology approach by combining differential expression analysis and weighted gene co-expression network analysis (WGCNA to detect the pathological stage-related miRNA and gene modules and construct a miRNA–gene network. The Cancer Genome Atlas (TCGA colon adenocarcinoma (CAC RNA-sequencing data and miRNA-sequencing data were subjected to WGCNA analysis, and the GSE29623, GSE35602 and GSE39396 were utilized to validate and

  15. Weighted Gene Co-expression Network Analysis of the Dioscin Rich Medicinal Plant Dioscorea nipponica

    Directory of Open Access Journals (Sweden)

    Wei Sun

    2017-06-01

    Full Text Available Dioscorea contains critically important species which can be used as staple foods or sources of bioactive substances, including Dioscorea nipponica, which has been used to develop highly successful drugs to treat cardiovascular disease. Its major active ingredients are thought to be sterol compounds such as diosgenin, which has been called “medicinal gold” because of its valuable properties. However, reliance on naturally growing plants as a production system limits the potential use of D. nipponica, raising interest in engineering metabolic pathways to enhance the production of secondary metabolites. However, the biosynthetic pathway of diosgenin is still poorly understood, and D. nipponica is poorly characterized at a molecular level, hindering in-depth investigation. In the present work, the RNAs from five organs and seven methyl jasmonate treated D. nipponica rhizomes were sequenced using the Illumina high-throughput sequencing platform, yielding 52 gigabases of data, which were pooled and assembled into a reference transcriptome. Four hundred and eighty two genes were found to be highly expressed in the rhizomes, and these genes are mainly involved in stress response and transcriptional regulation. Based on their expression patterns, 36 genes were selected for further investigation as candidate genes involved in dioscin biosynthesis. Constructing co-expression networks based on significant changes in gene expression revealed 15 gene modules. Of these, four modules with properties correlating to dioscin regulation and biosynthesis, consisting of 4,665 genes in total, were selected for further functional investigation. These results improve our understanding of dioscin biosynthesis in this important medicinal plant and will help guide more intensive investigations.

  16. Evolutionary Inference across Eukaryotes Identifies Specific Pressures Favoring Mitochondrial Gene Retention.

    Science.gov (United States)

    Johnston, Iain G; Williams, Ben P

    2016-02-24

    Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modeling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondrial genomes, we inferred evolutionary trajectories of mtDNA gene loss across the eukaryotic tree of life. We find that proteins comprising the structural cores of the electron transport chain are preferentially encoded within mitochondrial genomes across eukaryotes. A combination of high GC content and high protein hydrophobicity is required to explain patterns of mtDNA gene retention; a model that accounts for these selective pressures can also predict the success of artificial gene transfer experiments in vivo. This work provides a general method for data-driven inference of the ordering of evolutionary and progressive events, here identifying the distinct features shaping mitochondrial genomes of present-day species. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Evolutionary relationship and structural characterization of the EPF/EPFL gene family.

    Science.gov (United States)

    Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu

    2013-01-01

    EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.

  18. Production of natural fragrance aromatic acids by coexpression of trans-anethole oxygenase and p-anisaldehyde dehydrogenase genes of Pseudomonas putida JYR-1 in Escherichia coli.

    Science.gov (United States)

    Han, Dongfei; Kurusarttra, Somwang; Ryu, Ji-Young; Kanaly, Robert A; Hur, Hor-Gil

    2012-12-05

    A gene encoding p-anisaldehyde dehydrogenase (PAADH), which catalyzes the oxidation of p-anisaldehyde to p-anisic acid, was identified to be clustered with the trans-anethole oxygenase (tao) gene in Pseudomonas putida JYR-1. Heterologously expressed PAADH in Escherichia coli catalyzed the oxidation of vanillin, veratraldehyde, and piperonal to the corresponding aromatic acids vanillic acid, veratric acid, and piperonylic acid, respectively. Coexpression of trans-anethole oxygenase (TAO) and PAADH in E. coli also resulted in the successful transformation of trans-anethole, isoeugenol, O-methyl isoeugenol, and isosafrole to p-anisic acid, vanillic acid, veratric acid, and piperonylic acid, respectively, which are compounds found in plants as secondary metabolites. Because of the relaxed substrate specificity and high transformation rates by coexpressed TAO and PAADH in E. coli , the engineered strain has potential to be applied in the fragrance industry.

  19. Tracing evolutionary relicts of positive selection on eight malaria-related immune genes in mammals.

    Science.gov (United States)

    Huang, Bing-Hong; Liao, Pei-Chun

    2015-07-01

    Plasmodium-induced malaria widely infects primates and other mammals. Multiple past studies have revealed that positive selection could be the main evolutionary force triggering the genetic diversity of anti-malaria resistance-associated genes in human or primates. However, researchers focused most of their attention on the infra-generic and intra-specific genome evolution rather than analyzing the complete evolutionary history of mammals. Here we extend previous research by testing the evolutionary link of natural selection on eight candidate genes associated with malaria resistance in mammals. Three of the eight genes were detected to be affected by recombination, including TNF-α, iNOS and DARC. Positive selection was detected in the rest five immunogenes multiple times in different ancestral lineages of extant species throughout the mammalian evolution. Signals of positive selection were exposed in four malaria-related immunogenes in primates: CCL2, IL-10, HO1 and CD36. However, selection signals of G6PD have only been detected in non-primate eutherians. Significantly higher evolutionary rates and more radical amino acid replacement were also detected in primate CD36, suggesting its functional divergence from other eutherians. Prevalent positive selection throughout the evolutionary trajectory of mammalian malaria-related genes supports the arms race evolutionary hypothesis of host genetic response of mammalian immunogenes to infectious pathogens. © The Author(s) 2014 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.

  20. Comparative transcriptome and gene co-expression network analysis reveal genes and signaling pathways adaptively responsive to varied adverse stresses in the insect fungal pathogen, Beauveria bassiana.

    Science.gov (United States)

    He, Zhangjiang; Zhao, Xin; Lu, Zhuoyue; Wang, Huifang; Liu, Pengfei; Zeng, Fanqin; Zhang, Yongjun

    2018-01-01

    Sensing, responding, and adapting to the surrounding environment are crucial for all living organisms to survive, proliferate, and differentiate in their biological niches. Beauveria bassiana is an economically important insect-pathogenic fungus which is widely used as a biocontrol agent to control a variety of insect pests. The fungal pathogen unavoidably encounters a variety of adverse environmental stresses and defense response from the host insects during application of the fungal agents. However, few are known about the transcription response of the fungus to respond or adapt varied adverse stresses. Here, we comparatively analyzed the transcriptome of B. bassiana in globe genome under the varied stationary-phase stresses including osmotic agent (0.8 M NaCl), high temperature (32 °C), cell wall-perturbing agent (Congo red), and oxidative agents (H 2 O 2 or menadione). Total of 12,412 reads were obtained, and mapped to the 6767 genes of the B. bassiana. All of these stresses caused transcription responses involved in basal metabolism, cell wall construction, stress response or cell rescue/detoxification, signaling transduction and gene transcription regulation, and likely other cellular processes. An array of genes displayed similar transcription patterns in response to at least two of the five stresses, suggesting a shared transcription response to varied adverse stresses. Gene co-expression network analysis revealed that mTOR signaling pathway, but not HOG1 MAP kinase pathway, played a central role in regulation the varied adverse stress responses, which was verified by RNAi-mediated knockdown of TOR1. Our findings provided an insight of transcription response and gene co-expression network of B. bassiana in adaptation to varied environments. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Application of Weighted Gene Co-expression Network Analysis for Data from Paired Design.

    Science.gov (United States)

    Li, Jianqiang; Zhou, Doudou; Qiu, Weiliang; Shi, Yuliang; Yang, Ji-Jiang; Chen, Shi; Wang, Qing; Pan, Hui

    2018-01-12

    Investigating how genes jointly affect complex human diseases is important, yet challenging. The network approach (e.g., weighted gene co-expression network analysis (WGCNA)) is a powerful tool. However, genomic data usually contain substantial batch effects, which could mask true genomic signals. Paired design is a powerful tool that can reduce batch effects. However, it is currently unclear how to appropriately apply WGCNA to genomic data from paired design. In this paper, we modified the current WGCNA pipeline to analyse high-throughput genomic data from paired design. We illustrated the modified WGCNA pipeline by analysing the miRNA dataset provided by Shiah et al. (2014), which contains forty oral squamous cell carcinoma (OSCC) specimens and their matched non-tumourous epithelial counterparts. OSCC is the sixth most common cancer worldwide. The modified WGCNA pipeline identified two sets of novel miRNAs associated with OSCC, in addition to the existing miRNAs reported by Shiah et al. (2014). Thus, this work will be of great interest to readers of various scientific disciplines, in particular, genetic and genomic scientists as well as medical scientists working on cancer.

  2. Co-expression analysis identifies CRC and AP1 the regulator of Arabidopsis fatty acid biosynthesis.

    Science.gov (United States)

    Han, Xinxin; Yin, Linlin; Xue, Hongwei

    2012-07-01

    Fatty acids (FAs) play crucial rules in signal transduction and plant development, however, the regulation of FA metabolism is still poorly understood. To study the relevant regulatory network, fifty-eight FA biosynthesis genes including de novo synthases, desaturases and elongases were selected as "guide genes" to construct the co-expression network. Calculation of the correlation between all Arabidopsis thaliana (L.) genes with each guide gene by Arabidopsis co-expression dating mining tools (ACT) identifies 797 candidate FA-correlated genes. Gene ontology (GO) analysis of these co-expressed genes showed they are tightly correlated to photosynthesis and carbohydrate metabolism, and function in many processes. Interestingly, 63 transcription factors (TFs) were identified as candidate FA biosynthesis regulators and 8 TF families are enriched. Two TF genes, CRC and AP1, both correlating with 8 FA guide genes, were further characterized. Analyses of the ap1 and crc mutant showed the altered total FA composition of mature seeds. The contents of palmitoleic acid, stearic acid, arachidic acid and eicosadienoic acid are decreased, whereas that of oleic acid is increased in ap1 and crc seeds, which is consistent with the qRT-PCR analysis revealing the suppressed expression of the corresponding guide genes. In addition, yeast one-hybrid analysis and electrophoretic mobility shift assay (EMSA) revealed that CRC can bind to the promoter regions of KCS7 and KCS15, indicating that CRC may directly regulate FA biosynthesis. © 2012 Institute of Botany, Chinese Academy of Sciences.

  3. Evolutionary relationship and structural characterization of the EPF/EPFL gene family.

    Directory of Open Access Journals (Sweden)

    Naoki Takata

    Full Text Available EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.

  4. A gene co-expression network in whole blood of schizophrenia patients is independent of antipsychotic-use and enriched for brain-expressed genes.

    Directory of Open Access Journals (Sweden)

    Simone de Jong

    Full Text Available Despite large-scale genome-wide association studies (GWAS, the underlying genes for schizophrenia are largely unknown. Additional approaches are therefore required to identify the genetic background of this disorder. Here we report findings from a large gene expression study in peripheral blood of schizophrenia patients and controls. We applied a systems biology approach to genome-wide expression data from whole blood of 92 medicated and 29 antipsychotic-free schizophrenia patients and 118 healthy controls. We show that gene expression profiling in whole blood can identify twelve large gene co-expression modules associated with schizophrenia. Several of these disease related modules are likely to reflect expression changes due to antipsychotic medication. However, two of the disease modules could be replicated in an independent second data set involving antipsychotic-free patients and controls. One of these robustly defined disease modules is significantly enriched with brain-expressed genes and with genetic variants that were implicated in a GWAS study, which could imply a causal role in schizophrenia etiology. The most highly connected intramodular hub gene in this module (ABCF1, is located in, and regulated by the major histocompatibility (MHC complex, which is intriguing in light of the fact that common allelic variants from the MHC region have been implicated in schizophrenia. This suggests that the MHC increases schizophrenia susceptibility via altered gene expression of regulatory genes in this network.

  5. A Comprehensive Classification and Evolutionary Analysis of Plant Homeobox Genes

    OpenAIRE

    Mukherjee, Krishanu; Brocchieri, Luciano; B?rglin, Thomas R.

    2009-01-01

    The full complement of homeobox transcription factor sequences, including genes and pseudogenes, was determined from the analysis of 10 complete genomes from flowering plants, moss, Selaginella, unicellular green algae, and red algae. Our exhaustive genome-wide searches resulted in the discovery in each class of a greater number of homeobox genes than previously reported. All homeobox genes can be unambiguously classified by sequence evolutionary analysis into 14 distinct classes also charact...

  6. Orthoscape: a cytoscape application for grouping and visualization KEGG based gene networks by taxonomy and homology principles.

    Science.gov (United States)

    Mustafin, Zakhar Sergeevich; Lashin, Sergey Alexandrovich; Matushkin, Yury Georgievich; Gunbin, Konstantin Vladimirovich; Afonnikov, Dmitry Arkadievich

    2017-01-27

    There are many available software tools for visualization and analysis of biological networks. Among them, Cytoscape ( http://cytoscape.org/ ) is one of the most comprehensive packages, with many plugins and applications which extends its functionality by providing analysis of protein-protein interaction, gene regulatory and gene co-expression networks, metabolic, signaling, neural as well as ecological-type networks including food webs, communities networks etc. Nevertheless, only three plugins tagged 'network evolution' found in Cytoscape official app store and in literature. We have developed a new Cytoscape 3.0 application Orthoscape aimed to facilitate evolutionary analysis of gene networks and visualize the results. Orthoscape aids in analysis of evolutionary information available for gene sets and networks by highlighting: (1) the orthology relationships between genes; (2) the evolutionary origin of gene network components; (3) the evolutionary pressure mode (diversifying or stabilizing, negative or positive selection) of orthologous groups in general and/or branch-oriented mode. The distinctive feature of Orthoscape is the ability to control all data analysis steps via user-friendly interface. Orthoscape allows its users to analyze gene networks or separated gene sets in the context of evolution. At each step of data analysis, Orthoscape also provides for convenient visualization and data manipulation.

  7. Assessing pathogenicity of MLH1 variants by co-expression of human MLH1 and PMS2 genes in yeast

    Energy Technology Data Exchange (ETDEWEB)

    Vogelsang, Matjaz; Comino, Aleksandra; Zupanec, Neja [Department for Biosynthesis and Biotransformation, National Institute of Chemistry, Hajdrihova 19, SI-1001 Ljubljana (Slovenia); Hudler, Petra [Medical Center for Molecular Biology, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1000 Ljubljana (Slovenia); Komel, Radovan [Department for Biosynthesis and Biotransformation, National Institute of Chemistry, Hajdrihova 19, SI-1001 Ljubljana (Slovenia); Medical Center for Molecular Biology, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1000 Ljubljana (Slovenia)

    2009-10-28

    Loss of DNA mismatch repair (MMR) in humans, mainly due to mutations in the hMLH1 gene, is linked to hereditary nonpolyposis colorectal cancer (HNPCC). Because not all MLH1 alterations result in loss of MMR function, accurate characterization of variants and their classification in terms of their effect on MMR function is essential for reliable genetic testing and effective treatment. To date, in vivo assays for functional characterization of MLH1 mutations performed in various model systems have used episomal expression of the modified MMR genes. We describe here a novel approach to determine accurately the functional significance of hMLH1 mutations in vivo, based on co-expression of human MLH1 and PMS2 in yeast cells. Yeast MLH1 and PMS1 genes, whose protein products form the MutLα complex, were replaced by human orthologs directly on yeast chromosomes by homologous recombination, and the resulting MMR activity was tested. The yeast strain co-expressing hMLH1 and hPMS2 exhibited the same mutation rate as the wild-type. Eight cancer-related MLH1 variants were introduced, using the same approach, into the prepared yeast model, and their effect on MMR function was determined. Five variants (A92P, S93G, I219V, K618R and K618T) were classified as non-pathogenic, whereas variants T117M, Y646C and R659Q were characterized as pathogenic. Results of our in vivo yeast-based approach correlate well with clinical data in five out of seven hMLH1 variants and the described model was thus shown to be useful for functional characterization of MLH1 variants in cancer patients found throughout the entire coding region of the gene.

  8. Assessing pathogenicity of MLH1 variants by co-expression of human MLH1 and PMS2 genes in yeast

    Directory of Open Access Journals (Sweden)

    Hudler Petra

    2009-10-01

    Full Text Available Abstract Background Loss of DNA mismatch repair (MMR in humans, mainly due to mutations in the hMLH1 gene, is linked to hereditary nonpolyposis colorectal cancer (HNPCC. Because not all MLH1 alterations result in loss of MMR function, accurate characterization of variants and their classification in terms of their effect on MMR function is essential for reliable genetic testing and effective treatment. To date, in vivo assays for functional characterization of MLH1 mutations performed in various model systems have used episomal expression of the modified MMR genes. We describe here a novel approach to determine accurately the functional significance of hMLH1 mutations in vivo, based on co-expression of human MLH1 and PMS2 in yeast cells. Methods Yeast MLH1 and PMS1 genes, whose protein products form the MutLα complex, were replaced by human orthologs directly on yeast chromosomes by homologous recombination, and the resulting MMR activity was tested. Results The yeast strain co-expressing hMLH1 and hPMS2 exhibited the same mutation rate as the wild-type. Eight cancer-related MLH1 variants were introduced, using the same approach, into the prepared yeast model, and their effect on MMR function was determined. Five variants (A92P, S93G, I219V, K618R and K618T were classified as non-pathogenic, whereas variants T117M, Y646C and R659Q were characterized as pathogenic. Conclusion Results of our in vivo yeast-based approach correlate well with clinical data in five out of seven hMLH1 variants and the described model was thus shown to be useful for functional characterization of MLH1 variants in cancer patients found throughout the entire coding region of the gene.

  9. Assessing pathogenicity of MLH1 variants by co-expression of human MLH1 and PMS2 genes in yeast

    International Nuclear Information System (INIS)

    Vogelsang, Matjaz; Comino, Aleksandra; Zupanec, Neja; Hudler, Petra; Komel, Radovan

    2009-01-01

    Loss of DNA mismatch repair (MMR) in humans, mainly due to mutations in the hMLH1 gene, is linked to hereditary nonpolyposis colorectal cancer (HNPCC). Because not all MLH1 alterations result in loss of MMR function, accurate characterization of variants and their classification in terms of their effect on MMR function is essential for reliable genetic testing and effective treatment. To date, in vivo assays for functional characterization of MLH1 mutations performed in various model systems have used episomal expression of the modified MMR genes. We describe here a novel approach to determine accurately the functional significance of hMLH1 mutations in vivo, based on co-expression of human MLH1 and PMS2 in yeast cells. Yeast MLH1 and PMS1 genes, whose protein products form the MutLα complex, were replaced by human orthologs directly on yeast chromosomes by homologous recombination, and the resulting MMR activity was tested. The yeast strain co-expressing hMLH1 and hPMS2 exhibited the same mutation rate as the wild-type. Eight cancer-related MLH1 variants were introduced, using the same approach, into the prepared yeast model, and their effect on MMR function was determined. Five variants (A92P, S93G, I219V, K618R and K618T) were classified as non-pathogenic, whereas variants T117M, Y646C and R659Q were characterized as pathogenic. Results of our in vivo yeast-based approach correlate well with clinical data in five out of seven hMLH1 variants and the described model was thus shown to be useful for functional characterization of MLH1 variants in cancer patients found throughout the entire coding region of the gene

  10. Comparison of evolutionary algorithms in gene regulatory network model inference.

    LENUS (Irish Health Repository)

    2010-01-01

    ABSTRACT: BACKGROUND: The evolution of high throughput technologies that measure gene expression levels has created a data base for inferring GRNs (a process also known as reverse engineering of GRNs). However, the nature of these data has made this process very difficult. At the moment, several methods of discovering qualitative causal relationships between genes with high accuracy from microarray data exist, but large scale quantitative analysis on real biological datasets cannot be performed, to date, as existing approaches are not suitable for real microarray data which are noisy and insufficient. RESULTS: This paper performs an analysis of several existing evolutionary algorithms for quantitative gene regulatory network modelling. The aim is to present the techniques used and offer a comprehensive comparison of approaches, under a common framework. Algorithms are applied to both synthetic and real gene expression data from DNA microarrays, and ability to reproduce biological behaviour, scalability and robustness to noise are assessed and compared. CONCLUSIONS: Presented is a comparison framework for assessment of evolutionary algorithms, used to infer gene regulatory networks. Promising methods are identified and a platform for development of appropriate model formalisms is established.

  11. Divergent and convergent modes of interaction between wheat and Puccinia graminis f. sp. tritici isolates revealed by the comparative gene co-expression network and genome analyses.

    Science.gov (United States)

    Rutter, William B; Salcedo, Andres; Akhunova, Alina; He, Fei; Wang, Shichen; Liang, Hanquan; Bowden, Robert L; Akhunov, Eduard

    2017-04-12

    Two opposing evolutionary constraints exert pressure on plant pathogens: one to diversify virulence factors in order to evade plant defenses, and the other to retain virulence factors critical for maintaining a compatible interaction with the plant host. To better understand how the diversified arsenals of fungal genes promote interaction with the same compatible wheat line, we performed a comparative genomic analysis of two North American isolates of Puccinia graminis f. sp. tritici (Pgt). The patterns of inter-isolate divergence in the secreted candidate effector genes were compared with the levels of conservation and divergence of plant-pathogen gene co-expression networks (GCN) developed for each isolate. Comprative genomic analyses revealed substantial level of interisolate divergence in effector gene complement and sequence divergence. Gene Ontology (GO) analyses of the conserved and unique parts of the isolate-specific GCNs identified a number of conserved host pathways targeted by both isolates. Interestingly, the degree of inter-isolate sub-network conservation varied widely for the different host pathways and was positively associated with the proportion of conserved effector candidates associated with each sub-network. While different Pgt isolates tended to exploit similar wheat pathways for infection, the mode of plant-pathogen interaction varied for different pathways with some pathways being associated with the conserved set of effectors and others being linked with the diverged or isolate-specific effectors. Our data suggest that at the intra-species level pathogen populations likely maintain divergent sets of effectors capable of targeting the same plant host pathways. This functional redundancy may play an important role in the dynamic of the "arms-race" between host and pathogen serving as the basis for diverse virulence strategies and creating conditions where mutations in certain effector groups will not have a major effect on the pathogen

  12. An extensive (co-expression analysis tool for the cytochrome P450 superfamily in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Provart Nicholas J

    2008-04-01

    Full Text Available Abstract Background Sequencing of the first plant genomes has revealed that cytochromes P450 have evolved to become the largest family of enzymes in secondary metabolism. The proportion of P450 enzymes with characterized biochemical function(s is however very small. If P450 diversification mirrors evolution of chemical diversity, this points to an unexpectedly poor understanding of plant metabolism. We assumed that extensive analysis of gene expression might guide towards the function of P450 enzymes, and highlight overlooked aspects of plant metabolism. Results We have created a comprehensive database, 'CYPedia', describing P450 gene expression in four data sets: organs and tissues, stress response, hormone response, and mutants of Arabidopsis thaliana, based on public Affymetrix ATH1 microarray expression data. P450 expression was then combined with the expression of 4,130 re-annotated genes, predicted to act in plant metabolism, for co-expression analyses. Based on the annotation of co-expressed genes from diverse pathway annotation databases, co-expressed pathways were identified. Predictions were validated for most P450s with known functions. As examples, co-expression results for P450s related to plastidial functions/photosynthesis, and to phenylpropanoid, triterpenoid and jasmonate metabolism are highlighted here. Conclusion The large scale hypothesis generation tools presented here provide leads to new pathways, unexpected functions, and regulatory networks for many P450s in plant metabolism. These can now be exploited by the community to validate the proposed functions experimentally using reverse genetics, biochemistry, and metabolic profiling.

  13. Co-expression of G2-EPSPS and glyphosate acetyltransferase GAT genes conferring high tolerance to glyphosate in soybean

    Directory of Open Access Journals (Sweden)

    Bingfu eGuo

    2015-10-01

    Full Text Available Glyphosate is a widely used non-selective herbicide with broad spectrum of weed control around the world. At present, most of the commercial glyphosate tolerant soybeans utilize glyphosate tolerant gene CP4-EPSPS or glyphosate acetyltransferase gene GAT separately. In this study, both glyphosate tolerant gene G2-EPSPS and glyphosate degraded gene GAT were co-transferred into soybean and transgenic plants showed high tolerance to glyphosate. Molecular analysis including PCR, Sothern blot, qRT-PCR and Western blot revealed that target genes have been integrated into genome and expressed effectively at both mRNA and protein levels. Furthermore, the glyphosate tolerance analysis showed that no typical symptom was observed when compared with a glyphosate tolerant line HJ06-698 derived from GR1 transgenic soybean even at four-fold labeled rate of Roundup. Chlorophyll and shikimic acid content analysis of transgenic plant also revealed that these two indexes were not significantly altered after glyphosate application. These results indicated that co-expression of G2-EPSPS and GAT conferred high tolerance to the herbicide glyphosate in soybean. Therefore, combination of tolerant and degraded genes provides a new strategy for developing glyphosate tolerant transgenic crops.

  14. Neutralization of Bacterial YoeBSpn Toxicity and Enhanced Plant Growth in Arabidopsis thaliana via Co-Expression of the Toxin-Antitoxin Genes

    Science.gov (United States)

    Abu Bakar, Fauziah; Yeo, Chew Chieng; Harikrishna, Jennifer Ann

    2016-01-01

    Bacterial toxin-antitoxin (TA) systems have various cellular functions, including as part of the general stress response. The genome of the Gram-positive human pathogen Streptococcus pneumoniae harbors several putative TA systems, including yefM-yoeBSpn, which is one of four systems that had been demonstrated to be biologically functional. Overexpression of the yoeBSpn toxin gene resulted in cell stasis and eventually cell death in its native host, as well as in Escherichia coli. Our previous work showed that induced expression of a yoeBSpn toxin-Green Fluorescent Protein (GFP) fusion gene apparently triggered apoptosis and was lethal in the model plant, Arabidopsis thaliana. In this study, we investigated the effects of co-expression of the yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic A. thaliana. When co-expressed in Arabidopsis, the YefMSpn antitoxin was found to neutralize the toxicity of YoeBSpn-GFP. Interestingly, the inducible expression of both yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic hybrid Arabidopsis resulted in larger rosette leaves and taller plants with a higher number of inflorescence stems and increased silique production. To our knowledge, this is the first demonstration of a prokaryotic antitoxin neutralizing its cognate toxin in plant cells. PMID:27104531

  15. Neutralization of Bacterial YoeBSpn Toxicity and Enhanced Plant Growth in Arabidopsis thaliana via Co-Expression of the Toxin-Antitoxin Genes

    Directory of Open Access Journals (Sweden)

    Fauziah Abu Bakar

    2016-04-01

    Full Text Available Bacterial toxin-antitoxin (TA systems have various cellular functions, including as part of the general stress response. The genome of the Gram-positive human pathogen Streptococcus pneumoniae harbors several putative TA systems, including yefM-yoeBSpn, which is one of four systems that had been demonstrated to be biologically functional. Overexpression of the yoeBSpn toxin gene resulted in cell stasis and eventually cell death in its native host, as well as in Escherichia coli. Our previous work showed that induced expression of a yoeBSpn toxin-Green Fluorescent Protein (GFP fusion gene apparently triggered apoptosis and was lethal in the model plant, Arabidopsis thaliana. In this study, we investigated the effects of co-expression of the yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic A. thaliana. When co-expressed in Arabidopsis, the YefMSpn antitoxin was found to neutralize the toxicity of YoeBSpn-GFP. Interestingly, the inducible expression of both yefMSpn antitoxin and yoeBSpn toxin-GFP fusion in transgenic hybrid Arabidopsis resulted in larger rosette leaves and taller plants with a higher number of inflorescence stems and increased silique production. To our knowledge, this is the first demonstration of a prokaryotic antitoxin neutralizing its cognate toxin in plant cells.

  16. Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

    Science.gov (United States)

    Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

    2010-10-07

    PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out

  17. Evolutionary and genetic analysis of the VP2 gene of canine parvovirus.

    Science.gov (United States)

    Li, Gairu; Ji, Senlin; Zhai, Xiaofeng; Zhang, Yuxiang; Liu, Jie; Zhu, Mengyan; Zhou, Jiyong; Su, Shuo

    2017-07-17

    Canine parvovirus (CPV) type 2 emerged in 1978 in the USA and quickly spread among dog populations all over the world with high morbidity. Although CPV is a DNA virus, its genomic substitution rate is similar to some RNA viruses. Therefore, it is important to trace the evolution of CPV to monitor the appearance of mutations that might affect vaccine effectiveness. Our analysis shows that the VP2 genes of CPV isolated from 1979 to 2016 are divided into six groups: GI, GII, GIII, GIV, GV, and GVI. Amino acid mutation analysis revealed several undiscovered important mutation sites: F267Y, Y324I, and T440A. Of note, the evolutionary rate of the CPV VP2 gene from Asia and Europe decreased. Codon usage analysis showed that the VP2 gene of CPV exhibits high bias with an ENC ranging from 34.93 to 36.7. Furthermore, we demonstrate that natural selection plays a major role compared to mutation pressure driving CPV evolution. There are few studies on the codon usage of CPV. Here, we comprehensively studied the genetic evolution, codon usage pattern, and evolutionary characterization of the VP2 gene of CPV. The novel findings revealing the evolutionary process of CPV will greatly serve future CPV research.

  18. Characterization of the avian Trojan gene family reveals contrasting evolutionary constraints.

    Science.gov (United States)

    Petrov, Petar; Syrjänen, Riikka; Smith, Jacqueline; Gutowska, Maria Weronika; Uchida, Tatsuya; Vainio, Olli; Burt, David W

    2015-01-01

    "Trojan" is a leukocyte-specific, cell surface protein originally identified in the chicken. Its molecular function has been hypothesized to be related to anti-apoptosis and the proliferation of immune cells. The Trojan gene has been localized onto the Z sex chromosome. The adjacent two genes also show significant homology to Trojan, suggesting the existence of a novel gene/protein family. Here, we characterize this Trojan family, identify homologues in other species and predict evolutionary constraints on these genes. The two Trojan-related proteins in chicken were predicted as a receptor-type tyrosine phosphatase and a transmembrane protein, bearing a cytoplasmic immuno-receptor tyrosine-based activation motif. We identified the Trojan gene family in ten other bird species and found related genes in three reptiles and a fish species. The phylogenetic analysis of the homologues revealed a gradual diversification among the family members. Evolutionary analyzes of the avian genes predicted that the extracellular regions of the proteins have been subjected to positive selection. Such selection was possibly a response to evolving interacting partners or to pathogen challenges. We also observed an almost complete lack of intracellular positively selected sites, suggesting a conserved signaling mechanism of the molecules. Therefore, the contrasting patterns of selection likely correlate with the interaction and signaling potential of the molecules.

  19. The evolutionary history of the SAL1 gene family in eutherian mammals

    Directory of Open Access Journals (Sweden)

    Callebaut Isabelle

    2011-05-01

    Full Text Available Abstract Background SAL1 (salivary lipocalin is a member of the OBP (Odorant Binding Protein family and is involved in chemical sexual communication in pig. SAL1 and its relatives may be involved in pheromone and olfactory receptor binding and in pre-mating behaviour. The evolutionary history and the selective pressures acting on SAL1 and its orthologous genes have not yet been exhaustively described. The aim of the present work was to study the evolution of these genes, to elucidate the role of selective pressures in their evolution and the consequences for their functions. Results Here, we present the evolutionary history of SAL1 gene and its orthologous genes in mammals. We found that (1 SAL1 and its related genes arose in eutherian mammals with lineage-specific duplications in rodents, horse and cow and are lost in human, mouse lemur, bushbaby and orangutan, (2 the evolution of duplicated genes of horse, rat, mouse and guinea pig is driven by concerted evolution with extensive gene conversion events in mouse and guinea pig and by positive selection mainly acting on paralogous genes in horse and guinea pig, (3 positive selection was detected for amino acids involved in pheromone binding and amino acids putatively involved in olfactory receptor binding, (4 positive selection was also found for lineage, indicating a species-specific strategy for amino acid selection. Conclusions This work provides new insights into the evolutionary history of SAL1 and its orthologs. On one hand, some genes are subject to concerted evolution and to an increase in dosage, suggesting the need for homogeneity of sequence and function in certain species. On the other hand, positive selection plays a role in the diversification of the functions of the family and in lineage, suggesting adaptive evolution, with possible consequences for speciation and for the reinforcement of prezygotic barriers.

  20. A co-expression gene network associated with developmental regulation of apple fruit acidity.

    Science.gov (United States)

    Bai, Yang; Dougherty, Laura; Cheng, Lailiang; Xu, Kenong

    2015-08-01

    Apple fruit acidity, which affects the fruit's overall taste and flavor to a large extent, is primarily determined by the concentration of malic acid. Previous studies demonstrated that the major QTL malic acid (Ma) on chromosome 16 is largely responsible for fruit acidity variations in apple. Recent advances suggested that a natural mutation that gives rise to a premature stop codon in one of the two aluminum-activated malate transporter (ALMT)-like genes (called Ma1) is the genetic causal element underlying Ma. However, the natural mutation does not explain the developmental changes of fruit malate levels in a given genotype. Using RNA-seq data from the fruit of 'Golden Delicious' taken at 14 developmental stages from 1 week after full-bloom (WAF01) to harvest (WAF20), we characterized their transcriptomes in groups of high (12.2 ± 1.6 mg/g fw, WAF03-WAF08), mid (7.4 ± 0.5 mg/g fw, WAF01-WAF02 and WAF10-WAF14) and low (5.4 ± 0.4 mg/g fw, WAF16-WAF20) malate concentrations. Detailed analyses showed that a set of 3,066 genes (including Ma1) were expressed not only differentially (P FDR < 0.05) between the high and low malate groups (or between the early and late developmental stages) but also in significant (P < 0.05) correlation with malate concentrations. The 3,066 genes fell in 648 MapMan (sub-) bins or functional classes, and 19 of them were significantly (P FDR < 0.05) co-enriched or co-suppressed in a malate dependent manner. Network inferring using the 363 genes encompassed in the 19 (sub-) bins, identified a major co-expression network of 239 genes. Since the 239 genes were also differentially expressed between the early (WAF03-WAF08) and late (WAF16-WAF20) developmental stages, the major network was considered to be associated with developmental regulation of apple fruit acidity in 'Golden Delicious'.

  1. Functional conservation of the Drosophila gooseberry gene and its evolutionary alleles.

    Directory of Open Access Journals (Sweden)

    Wei Liu

    Full Text Available The Drosophila Pax gene gooseberry (gsb is required for development of the larval cuticle and CNS, survival to adulthood, and male fertility. These functions can be rescued in gsb mutants by two gsb evolutionary alleles, gsb-Prd and gsb-Pax3, which express the Drosophila Paired and mouse Pax3 proteins under the control of gooseberry cis-regulatory region. Therefore, both Paired and Pax3 proteins have conserved all the Gsb functions that are required for survival of embryos to fertile adults, despite the divergent primary sequences in their C-terminal halves. As gsb-Prd and gsb-Pax3 uncover a gsb function involved in male fertility, construction of evolutionary alleles may provide a powerful strategy to dissect hitherto unknown gene functions. Our results provide further evidence for the essential role of cis-regulatory regions in the functional diversification of duplicated genes during evolution.

  2. Divergent Evolutionary Patterns of NAC Transcription Factors Are Associated with Diversification and Gene Duplications in Angiosperm

    Directory of Open Access Journals (Sweden)

    Xiaoli Jin

    2017-06-01

    Full Text Available NAC (NAM/ATAF/CUC proteins constitute one of the biggest plant-specific transcription factor (TF families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1 uneven constitution of Clusters of Orthologous Groups (COGs and contrasting birth/death rates among subfamilies, and (2 two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses.

  3. Evolutionary maintenance of filovirus-like genes in bat genomes

    Directory of Open Access Journals (Sweden)

    Taylor Derek J

    2011-11-01

    Full Text Available Abstract Background Little is known of the biological significance and evolutionary maintenance of integrated non-retroviral RNA virus genes in eukaryotic host genomes. Here, we isolated novel filovirus-like genes from bat genomes and tested for evolutionary maintenance. We also estimated the age of filovirus VP35-like gene integrations and tested the phylogenetic hypotheses that there is a eutherian mammal clade and a marsupial/ebolavirus/Marburgvirus dichotomy for filoviruses. Results We detected homologous copies of VP35-like and NP-like gene integrations in both Old World and New World species of Myotis (bats. We also detected previously unknown VP35-like genes in rodents that are positionally homologous. Comprehensive phylogenetic estimates for filovirus NP-like and VP35-like loci support two main clades with a marsupial and a rodent grouping within the ebolavirus/Lloviu virus/Marburgvirus clade. The concordance of VP35-like, NP-like and mitochondrial gene trees with the expected species tree supports the notion that the copies we examined are orthologs that predate the global spread and radiation of the genus Myotis. Parametric simulations were consistent with selective maintenance for the open reading frame (ORF of VP35-like genes in Myotis. The ORF of the filovirus-like VP35 gene has been maintained in bat genomes for an estimated 13. 4 MY. ORFs were disrupted for the NP-like genes in Myotis. Likelihood ratio tests revealed that a model that accommodates positive selection is a significantly better fit to the data than a model that does not allow for positive selection for VP35-like sequences. Moreover, site-by-site analysis of selection using two methods indicated at least 25 sites in the VP35-like alignment are under positive selection in Myotis. Conclusions Our results indicate that filovirus-like elements have significance beyond genomic imprints of prior infection. That is, there appears to be, or have been, functionally maintained

  4. Characterization of the avian Trojan gene family reveals contrasting evolutionary constraints.

    Directory of Open Access Journals (Sweden)

    Petar Petrov

    Full Text Available "Trojan" is a leukocyte-specific, cell surface protein originally identified in the chicken. Its molecular function has been hypothesized to be related to anti-apoptosis and the proliferation of immune cells. The Trojan gene has been localized onto the Z sex chromosome. The adjacent two genes also show significant homology to Trojan, suggesting the existence of a novel gene/protein family. Here, we characterize this Trojan family, identify homologues in other species and predict evolutionary constraints on these genes. The two Trojan-related proteins in chicken were predicted as a receptor-type tyrosine phosphatase and a transmembrane protein, bearing a cytoplasmic immuno-receptor tyrosine-based activation motif. We identified the Trojan gene family in ten other bird species and found related genes in three reptiles and a fish species. The phylogenetic analysis of the homologues revealed a gradual diversification among the family members. Evolutionary analyzes of the avian genes predicted that the extracellular regions of the proteins have been subjected to positive selection. Such selection was possibly a response to evolving interacting partners or to pathogen challenges. We also observed an almost complete lack of intracellular positively selected sites, suggesting a conserved signaling mechanism of the molecules. Therefore, the contrasting patterns of selection likely correlate with the interaction and signaling potential of the molecules.

  5. The evolutionary history of bears is characterized by gene flow across species

    Science.gov (United States)

    Kumar, Vikas; Lammers, Fritjof; Bidon, Tobias; Pfenninger, Markus; Kolter, Lydia; Nilsson, Maria A.; Janke, Axel

    2017-01-01

    Bears are iconic mammals with a complex evolutionary history. Natural bear hybrids and studies of few nuclear genes indicate that gene flow among bears may be more common than expected and not limited to polar and brown bears. Here we present a genome analysis of the bear family with representatives of all living species. Phylogenomic analyses of 869 mega base pairs divided into 18,621 genome fragments yielded a well-resolved coalescent species tree despite signals for extensive gene flow across species. However, genome analyses using different statistical methods show that gene flow is not limited to closely related species pairs. Strong ancestral gene flow between the Asiatic black bear and the ancestor to polar, brown and American black bear explains uncertainties in reconstructing the bear phylogeny. Gene flow across the bear clade may be mediated by intermediate species such as the geographically wide-spread brown bears leading to large amounts of phylogenetic conflict. Genome-scale analyses lead to a more complete understanding of complex evolutionary processes. Evidence for extensive inter-specific gene flow, found also in other animal species, necessitates shifting the attention from speciation processes achieving genome-wide reproductive isolation to the selective processes that maintain species divergence in the face of gene flow. PMID:28422140

  6. The evolutionary history of bears is characterized by gene flow across species.

    Science.gov (United States)

    Kumar, Vikas; Lammers, Fritjof; Bidon, Tobias; Pfenninger, Markus; Kolter, Lydia; Nilsson, Maria A; Janke, Axel

    2017-04-19

    Bears are iconic mammals with a complex evolutionary history. Natural bear hybrids and studies of few nuclear genes indicate that gene flow among bears may be more common than expected and not limited to polar and brown bears. Here we present a genome analysis of the bear family with representatives of all living species. Phylogenomic analyses of 869 mega base pairs divided into 18,621 genome fragments yielded a well-resolved coalescent species tree despite signals for extensive gene flow across species. However, genome analyses using different statistical methods show that gene flow is not limited to closely related species pairs. Strong ancestral gene flow between the Asiatic black bear and the ancestor to polar, brown and American black bear explains uncertainties in reconstructing the bear phylogeny. Gene flow across the bear clade may be mediated by intermediate species such as the geographically wide-spread brown bears leading to large amounts of phylogenetic conflict. Genome-scale analyses lead to a more complete understanding of complex evolutionary processes. Evidence for extensive inter-specific gene flow, found also in other animal species, necessitates shifting the attention from speciation processes achieving genome-wide reproductive isolation to the selective processes that maintain species divergence in the face of gene flow.

  7. Identification of PEG-induced water stress responsive transcripts using co-expression network in Eucalyptus grandis.

    Science.gov (United States)

    Ghosh Dasgupta, Modhumita; Dharanishanthi, Veeramuthu

    2017-09-05

    Ecophysiological studies in Eucalyptus have shown that water is the principal factor limiting stem growth. Effect of water deficit conditions on physiological and biochemical parameters has been extensively reported in Eucalyptus. The present study was conducted to identify major polyethylene glycol induced water stress responsive transcripts in Eucalyptus grandis using gene co-expression network. A customized array representing 3359 water stress responsive genes was designed to document their expression in leaves of E. grandis cuttings subjected to -0.225MPa of PEG treatment. The differentially expressed transcripts were documented and significantly co-expressed transcripts were used for construction of network. The co-expression network was constructed with 915 nodes and 3454 edges with degree ranging from 2 to 45. Ninety four GO categories and 117 functional pathways were identified in the network. MCODE analysis generated 27 modules and module 6 with 479 nodes and 1005 edges was identified as the biologically relevant network. The major water responsive transcripts represented in the module included dehydrin, osmotin, LEA protein, expansin, arabinogalactans, heat shock proteins, major facilitator proteins, ARM repeat proteins, raffinose synthase, tonoplast intrinsic protein and transcription factors like DREB2A, ARF9, AGL24, UNE12, WLIM1 and MYB66, MYB70, MYB 55, MYB 16 and MYB 103. The coordinated analysis of gene expression patterns and coexpression networks developed in this study identified an array of transcripts that may regulate PEG induced water stress responses in E. grandis. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Enhancement of γ-aminobutyric acid production in recombinant Corynebacterium glutamicum by co-expressing two glutamate decarboxylase genes from Lactobacillus brevis.

    Science.gov (United States)

    Shi, Feng; Jiang, Junjun; Li, Yongfu; Li, Youxin; Xie, Yilong

    2013-11-01

    γ-Aminobutyric acid (GABA), a non-protein amino acid, is a bioactive component in the food, feed and pharmaceutical fields. To establish an effective single-step production system for GABA, a recombinant Corynebacterium glutamicum strain co-expressing two glutamate decarboxylase (GAD) genes (gadB1 and gadB2) derived from Lactobacillus brevis Lb85 was constructed. Compared with the GABA production of the gadB1 or gadB2 single-expressing strains, GABA production by the gadB1-gadB2 co-expressing strain increased more than twofold. By optimising urea supplementation, the total production of L-glutamate and GABA increased from 22.57 ± 1.24 to 30.18 ± 1.33 g L⁻¹, and GABA production increased from 4.02 ± 0.95 to 18.66 ± 2.11 g L⁻¹ after 84-h cultivation. Under optimal urea supplementation, L-glutamate continued to be consumed, GABA continued to accumulate after 36 h of fermentation, and the pH level fluctuated. GABA production increased to a maximum level of 27.13 ± 0.54 g L⁻¹ after 120-h flask cultivation and 26.32 g L⁻¹ after 60-h fed-batch fermentation. The conversion ratio of L-glutamate to GABA reached 0.60-0.74 mol mol⁻¹. By co-expressing gadB1 and gadB2 and optimising the urea addition method, C. glutamicum was genetically improved for de novo biosynthesis of GABA from its own accumulated L-glutamate.

  9. Time warping of evolutionary distant temporal gene expression data based on noise suppression

    Directory of Open Access Journals (Sweden)

    Papatsenko Dmitri

    2009-10-01

    Full Text Available Abstract Background Comparative analysis of genome wide temporal gene expression data has a broad potential area of application, including evolutionary biology, developmental biology, and medicine. However, at large evolutionary distances, the construction of global alignments and the consequent comparison of the time-series data are difficult. The main reason is the accumulation of variability in expression profiles of orthologous genes, in the course of evolution. Results We applied Pearson distance matrices, in combination with other noise-suppression techniques and data filtering to improve alignments. This novel framework enhanced the capacity to capture the similarities between the temporal gene expression datasets separated by large evolutionary distances. We aligned and compared the temporal gene expression data in budding (Saccharomyces cerevisiae and fission (Schizosaccharomyces pombe yeast, which are separated by more then ~400 myr of evolution. We found that the global alignment (time warping properly matched the duration of cell cycle phases in these distant organisms, which was measured in prior studies. At the same time, when applied to individual ortholog pairs, this alignment procedure revealed groups of genes with distinct alignments, different from the global alignment. Conclusion Our alignment-based predictions of differences in the cell cycle phases between the two yeast species were in a good agreement with the existing data, thus supporting the computational strategy adopted in this study. We propose that the existence of the alternative alignments, specific to distinct groups of genes, suggests presence of different synchronization modes between the two organisms and possible functional decoupling of particular physiological gene networks in the course of evolution.

  10. Systematic comparison of co-expression of multiple recombinant thermophilic enzymes in Escherichia coli BL21(DE3).

    Science.gov (United States)

    Chen, Hui; Huang, Rui; Zhang, Y-H Percival

    2017-06-01

    The precise control of multiple heterologous enzyme expression levels in one Escherichia coli strain is important for cascade biocatalysis, metabolic engineering, synthetic biology, natural product synthesis, and studies of complexed proteins. We systematically investigated the co-expression of up to four thermophilic enzymes (i.e., α-glucan phosphorylase (αGP), phosphoglucomutase (PGM), glucose 6-phosphate dehydrogenase (G6PDH), and 6-phosphogluconate dehydrogenase (6PGDH)) in E. coli BL21(DE3) by adding T7 promoter or T7 terminator of each gene for multiple genes in tandem, changing gene alignment, and comparing one or two plasmid systems. It was found that the addition of T7 terminator after each gene was useful to decrease the influence of the upstream gene. The co-expression of the four enzymes in E. coli BL21(DE3) was demonstrated to generate two NADPH molecules from one glucose unit of maltodextrin, where NADPH was oxidized to convert xylose to xylitol. The best four-gene co-expression system was based on two plasmids (pET and pACYC) which harbored two genes. As a result, apparent enzymatic activities of the four enzymes were regulated to be at similar levels and the overall four-enzyme activity was the highest based on the formation of xylitol. This study provides useful information for the precise control of multi-enzyme-coordinated expression in E. coli BL21(DE3).

  11. Gene coexpression network analysis of fruit transcriptomes uncovers a possible mechanistically distinct class of sugar/acid ratio-associated genes in sweet orange.

    Science.gov (United States)

    Qiao, Liang; Cao, Minghao; Zheng, Jian; Zhao, Yihong; Zheng, Zhi-Liang

    2017-10-30

    The ratio of sugars to organic acids, two of the major metabolites in fleshy fruits, has been considered the most important contributor to fruit sweetness. Although accumulation of sugars and acids have been extensively studied, whether plants evolve a mechanism to maintain, sense or respond to the fruit sugar/acid ratio remains a mystery. In a prior study, we used an integrated systems biology tool to identify a group of 39 acid-associated genes from the fruit transcriptomes in four sweet orange varieties (Citrus sinensis L. Osbeck) with varying fruit acidity, Succari (acidless), Bingtang (low acid), and Newhall and Xinhui (normal acid). We reanalyzed the prior sweet orange fruit transcriptome data, leading to the identification of 72 genes highly correlated with the fruit sugar/acid ratio. The majority of these sugar/acid ratio-related genes are predicted to be involved in regulatory functions such as transport, signaling and transcription or encode enzymes involved in metabolism. Surprisingly, only three of these sugar/acid ratio-correlated genes are weakly correlated with sugar level and none of them overlaps with the acid-associated genes. Weighted Gene Coexpression Network Analysis (WGCNA) has revealed that these genes belong to four modules, Blue, Grey, Brown and Turquoise, with the former two modules being unique to the sugar/acid ratio control. Our results indicate that orange fruits contain a possible mechanistically distinct class of genes that may potentially be involved in maintaining fruit sugar/acid ratios and/or responding to the cellular sugar/acid ratio status. Therefore, our analysis of orange transcriptomes provides an intriguing insight into the potentially novel genetic or molecular mechanisms controlling the sugar/acid ratio in fruits.

  12. Characterization and evolutionary analysis of tributyltin-binding protein and pufferfish saxitoxin and tetrodotoxin-binding protein genes in toxic and nontoxic pufferfishes.

    Science.gov (United States)

    Hashiguchi, Y; Lee, J M; Shiraishi, M; Komatsu, S; Miki, S; Shimasaki, Y; Mochioka, N; Kusakabe, T; Oshima, Y

    2015-05-01

    Understanding the evolutionary mechanisms of toxin accumulation in pufferfishes has been long-standing problem in toxicology and evolutionary biology. Pufferfish saxitoxin and tetrodotoxin-binding protein (PSTBP) is involved in the transport and accumulation of tetrodotoxin and is one of the most intriguing proteins related to the toxicity of pufferfishes. PSTBPs are fusion proteins consisting of two tandem repeated tributyltin-binding protein type 2 (TBT-bp2) domains. In this study, we examined the evolutionary dynamics of TBT-bp2 and PSTBP genes to understand the evolution of toxin accumulation in pufferfishes. Database searches and/or PCR-based cDNA cloning in nine pufferfish species (6 toxic and 3 nontoxic) revealed that all species possessed one or more TBT-bp2 genes, but PSTBP genes were found only in 5 toxic species belonging to genus Takifugu. These toxic Takifugu species possessed two or three copies of PSTBP genes. Phylogenetic analysis of TBT-bp2 and PSTBP genes suggested that PSTBPs evolved in the common ancestor of Takifugu species by repeated duplications and fusions of TBT-bp2 genes. In addition, a detailed comparison of Takifugu TBT-bp2 and PSTBP gene sequences detected a signature of positive selection under the pressure of gene conversion. The complicated evolutionary dynamics of TBT-bp2 and PSTBP genes may reflect the diversity of toxicity in pufferfishes. © 2015 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2015 European Society For Evolutionary Biology.

  13. Co-expression of Exo-inulinase and Endo-inulinase Genes in the Oleaginous Yeast Yarrowia lipolytica for Efficient Single Cell Oil Production from Inulin.

    Science.gov (United States)

    Shi, Nianci; Mao, Weian; He, Xiaoxia; Chi, Zhe; Chi, Zhenming; Liu, Guanglei

    2018-05-01

    Yarrowia lipolytica is a promising platform for the single cell oil (SCO) production. In this study, a transformant X+N8 in which exo- and endo-inulinase genes were co-expressed could produce an inulinase activity of 124.33 U/mL within 72 h. However, the inulinase activity of a transformant X2 carrying a single exo-inulinase gene was only 47.33 U/mL within 72 h. Moreover, the transformant X+N8 could accumulate 48.13% (w/w) SCO from inulin and the cell dry weight reached 13.63 g/L within 78 h, which were significantly higher than those of the transformant X2 (41.87% (w/w) and 11.23 g/L) under the same conditions. In addition, inulin hydrolysis and utilization of the transformant X+N8 were also more efficient than those of the transformant X2 during the fermentation process. These results demonstrated that the co-expression of the exo- and endo-inulinase genes significantly enhanced the SCO production from inulin due to the improvement of the inulinase activity and the synergistic action of exo- and endo-inulinase. Besides, over 95.01% of the fatty acids from the transformant X+N8 were C16-C18, especially C18:1 (53.10%), suggesting that the fatty acids could be used as feedstock for biodiesel production.

  14. Evolutionary Analysis of Minor Histocompatibility Genes In Hydra

    KAUST Repository

    Aalismail, Nojood

    2016-05-01

    Hydra is a simple freshwater solitary polyp used as a model system to study evolutionary aspects. The immune response of this organism has not been studied extensively and the immune response genes have not been identified and characterized. On the other hand, immune response has been investigated and genetic analysis has been initiated in other lower invertebrates. In the present study we took initiative to study the self/nonself recognition in hydra and its relation to the immune response. Moreover, performing phylogenetic analysis to look for annotated immune genes in hydra gave us a potential to analyze the expression of minor histocompatibility genes that have been shown to play a major role in grafting and transplantation in mammals. Here we obtained the cDNA library that shows expression of minor histocompatibility genes and confirmed that the annotated sequences in databases are actually present. In addition, grafting experiments suggested, although still preliminary, that homograft showed less rejection response than in heterograft. Involvement of possible minor histocompatibility gene orthologous in immune response was examined by qPCR.

  15. A global evolutionary and metabolic analysis of human obesity gene risk variants.

    Science.gov (United States)

    Castillo, Joseph J; Hazlett, Zachary S; Orlando, Robert A; Garver, William S

    2017-09-05

    It is generally accepted that the selection of gene variants during human evolution optimized energy metabolism that now interacts with our obesogenic environment to increase the prevalence of obesity. The purpose of this study was to perform a global evolutionary and metabolic analysis of human obesity gene risk variants (110 human obesity genes with 127 nearest gene risk variants) identified using genome-wide association studies (GWAS) to enhance our knowledge of early and late genotypes. As a result of determining the mean frequency of these obesity gene risk variants in 13 available populations from around the world our results provide evidence for the early selection of ancestral risk variants (defined as selection before migration from Africa) and late selection of derived risk variants (defined as selection after migration from Africa). Our results also provide novel information for association of these obesity genes or encoded proteins with diverse metabolic pathways and other human diseases. The overall results indicate a significant differential evolutionary pattern for the selection of obesity gene ancestral and derived risk variants proposed to optimize energy metabolism in varying global environments and complex association with metabolic pathways and other human diseases. These results are consistent with obesity genes that encode proteins possessing a fundamental role in maintaining energy metabolism and survival during the course of human evolution. Copyright © 2017. Published by Elsevier B.V.

  16. Whole brain and brain regional coexpression network interactions associated with predisposition to alcohol consumption.

    Directory of Open Access Journals (Sweden)

    Lauren A Vanderlinden

    Full Text Available To identify brain transcriptional networks that may predispose an animal to consume alcohol, we used weighted gene coexpression network analysis (WGCNA. Candidate coexpression modules are those with an eigengene expression level that correlates significantly with the level of alcohol consumption across a panel of BXD recombinant inbred mouse strains, and that share a genomic region that regulates the module transcript expression levels (mQTL with a genomic region that regulates alcohol consumption (bQTL. To address a controversy regarding utility of gene expression profiles from whole brain, vs specific brain regions, as indicators of the relationship of gene expression to phenotype, we compared candidate coexpression modules from whole brain gene expression data (gathered with Affymetrix 430 v2 arrays in the Colorado laboratories and from gene expression data from 6 brain regions (nucleus accumbens (NA; prefrontal cortex (PFC; ventral tegmental area (VTA; striatum (ST; hippocampus (HP; cerebellum (CB available from GeneNetwork. The candidate modules were used to construct candidate eigengene networks across brain regions, resulting in three "meta-modules", composed of candidate modules from two or more brain regions (NA, PFC, ST, VTA and whole brain. To mitigate the potential influence of chromosomal location of transcripts and cis-eQTLs in linkage disequilibrium, we calculated a semi-partial correlation of the transcripts in the meta-modules with alcohol consumption conditional on the transcripts' cis-eQTLs. The function of transcripts that retained the correlation with the phenotype after correction for the strong genetic influence, implicates processes of protein metabolism in the ER and Golgi as influencing susceptibility to variation in alcohol consumption. Integration of these data with human GWAS provides further information on the function of polymorphisms associated with alcohol-related traits.

  17. Evolutionary Relationship and Structural Characterization of the EPF/EPFL Gene Family

    OpenAIRE

    Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu

    2013-01-01

    EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that...

  18. Cis-Natural Antisense Transcripts Are Mainly Co-expressed with Their Sense Transcripts and Primarily Related to Energy Metabolic Pathways during Muscle Development.

    Science.gov (United States)

    Zhao, Yunxia; Hou, Ye; Zhao, Changzhi; Liu, Fei; Luan, Yu; Jing, Lu; Li, Xinyun; Zhu, Mengjin; Zhao, Shuhong

    2016-01-01

    Cis-natural antisense transcripts (cis-NATs) are a new class of RNAs identified in various species. However, the biological functions of cis-NATs are largely unknown. In this study, we investigated the transcriptional characteristics and functions of cis-NATs in the muscle tissue of lean Landrace and indigenous fatty Lantang pigs. In total, 3,306 cis-NATs of 2,469 annotated genes were identified in the muscle tissue of pigs. More than 1,300 cis-NATs correlated with their sense genes at the transcriptional level, and approximately 80% of them were co-expressed in the two breeds. Furthermore, over 1,200 differentially expressed cis-NATs were identified during muscle development. Function annotation showed that the cis-NATs participated in muscle development mainly by co-expressing with genes involved in energy metabolic pathways, including citrate cycle (TCA cycle), glycolysis or gluconeogenesis, mitochondrial activation and so on. Moreover, these cis-NATs and their sense genes abruptly increased at the transition from the late fetal stages to the early postnatal stages and then decreased along with muscle development. In conclusion, the cis-NATs in the muscle tissue of pigs were identified and determined to be mainly co-expressed with their sense genes. The co-expressed cis-NATs and their sense gene were primarily related to energy metabolic pathways during muscle development in pigs. Our results offered novel evidence on the roles of cis-NATs during the muscle development of pigs.

  19. Evolutionary history of chordate PAX genes: dynamics of change in a complex gene family.

    Directory of Open Access Journals (Sweden)

    Vanessa Rodrigues Paixão-Côrtes

    Full Text Available Paired box (PAX genes are transcription factors that play important roles in embryonic development. Although the PAX gene family occurs in animals only, it is widely distributed. Among the vertebrates, its 9 genes appear to be the product of complete duplication of an original set of 4 genes, followed by an additional partial duplication. Although some studies of PAX genes have been conducted, no comprehensive survey of these genes across the entire taxonomic unit has yet been attempted. In this study, we conducted a detailed comparison of PAX sequences from 188 chordates, which revealed restricted variation. The absence of PAX4 and PAX8 among some species of reptiles and birds was notable; however, all 9 genes were present in all 74 mammalian genomes investigated. A search for signatures of selection indicated that all genes are subject to purifying selection, with a possible constraint relaxation in PAX4, PAX7, and PAX8. This result indicates asymmetric evolution of PAX family genes, which can be associated with the emergence of adaptive novelties in the chordate evolutionary trajectory.

  20. Transient Co-Expression of Post-Transcriptional Gene Silencing Suppressors for Increased in Planta Expression of a Recombinant Anthrax Receptor Fusion Protein

    Directory of Open Access Journals (Sweden)

    Kittipong Rattanaporn

    2011-08-01

    Full Text Available Potential epidemics of infectious diseases and the constant threat of bioterrorism demand rapid, scalable, and cost-efficient manufacturing of therapeutic proteins. Molecular farming of tobacco plants provides an alternative for the recombinant production of therapeutics. We have developed a transient production platform that uses Agrobacterium infiltration of Nicotiana benthamiana plants to express a novel anthrax receptor decoy protein (immunoadhesin, CMG2-Fc. This chimeric fusion protein, designed to protect against the deadly anthrax toxins, is composed of the von Willebrand factor A (VWA domain of human capillary morphogenesis 2 (CMG2, an effective anthrax toxin receptor, and the Fc region of human immunoglobulin G (IgG. We evaluated, in N. benthamiana intact plants and detached leaves, the expression of CMG2-Fc under the control of the constitutive CaMV 35S promoter, and the co-expression of CMG2-Fc with nine different viral suppressors of post-transcriptional gene silencing (PTGS: p1, p10, p19, p21, p24, p25, p38, 2b, and HCPro. Overall, transient CMG2-Fc expression was higher on intact plants than detached leaves. Maximum expression was observed with p1 co-expression at 3.5 days post-infiltration (DPI, with a level of 0.56 g CMG2-Fc per kg of leaf fresh weight and 1.5% of the total soluble protein, a ten-fold increase in expression when compared to absence of suppression. Co-expression with the p25 PTGS suppressor also significantly increased the CMG2-Fc expression level after just 3.5 DPI.

  1. Transient co-expression of post-transcriptional gene silencing suppressors for increased in planta expression of a recombinant anthrax receptor fusion protein.

    Science.gov (United States)

    Arzola, Lucas; Chen, Junxing; Rattanaporn, Kittipong; Maclean, James M; McDonald, Karen A

    2011-01-01

    Potential epidemics of infectious diseases and the constant threat of bioterrorism demand rapid, scalable, and cost-efficient manufacturing of therapeutic proteins. Molecular farming of tobacco plants provides an alternative for the recombinant production of therapeutics. We have developed a transient production platform that uses Agrobacterium infiltration of Nicotiana benthamiana plants to express a novel anthrax receptor decoy protein (immunoadhesin), CMG2-Fc. This chimeric fusion protein, designed to protect against the deadly anthrax toxins, is composed of the von Willebrand factor A (VWA) domain of human capillary morphogenesis 2 (CMG2), an effective anthrax toxin receptor, and the Fc region of human immunoglobulin G (IgG). We evaluated, in N. benthamiana intact plants and detached leaves, the expression of CMG2-Fc under the control of the constitutive CaMV 35S promoter, and the co-expression of CMG2-Fc with nine different viral suppressors of post-transcriptional gene silencing (PTGS): p1, p10, p19, p21, p24, p25, p38, 2b, and HCPro. Overall, transient CMG2-Fc expression was higher on intact plants than detached leaves. Maximum expression was observed with p1 co-expression at 3.5 days post-infiltration (DPI), with a level of 0.56 g CMG2-Fc per kg of leaf fresh weight and 1.5% of the total soluble protein, a ten-fold increase in expression when compared to absence of suppression. Co-expression with the p25 PTGS suppressor also significantly increased the CMG2-Fc expression level after just 3.5 DPI.

  2. Integration of liver gene co-expression networks and eGWAs analyses highlighted candidate regulators implicated in lipid metabolism in pigs.

    Science.gov (United States)

    Ballester, Maria; Ramayo-Caldas, Yuliaxis; Revilla, Manuel; Corominas, Jordi; Castelló, Anna; Estellé, Jordi; Fernández, Ana I; Folch, Josep M

    2017-04-19

    In the present study, liver co-expression networks and expression Genome Wide Association Study (eGWAS) were performed to identify DNA variants and molecular pathways implicated in the functional regulatory mechanisms of meat quality traits in pigs. With this purpose, the liver mRNA expression of 44 candidates genes related with lipid metabolism was analysed in 111 Iberian x Landrace backcross animals. The eGWAS identified 92 eSNPs located in seven chromosomal regions and associated with eight genes: CROT, CYP2U1, DGAT1, EGF, FABP1, FABP5, PLA2G12A, and PPARA. Remarkably, cis-eSNPs associated with FABP1 gene expression which may be determining the C18:2(n-6)/C18:3(n-3) ratio in backfat through the multiple interaction of DNA variants and genes were identified. Furthermore, a hotspot on SSC8 associated with the gene expression of eight genes was identified and the TBCK gene was pointed out as candidate gene regulating it. Our results also suggested that the PI3K-Akt-mTOR pathway plays an important role in the control of the analysed genes highlighting nuclear receptors as the NR3C1 or PPARA. Finally, sex-dimorphism associated with hepatic lipid metabolism was identified with over-representation of female-biased genes. These results increase our knowledge of the genetic architecture underlying fat composition traits.

  3. Designing a parallel evolutionary algorithm for inferring gene networks on the cloud computing environment.

    Science.gov (United States)

    Lee, Wei-Po; Hsiao, Yu-Ting; Hwang, Wei-Che

    2014-01-16

    To improve the tedious task of reconstructing gene networks through testing experimentally the possible interactions between genes, it becomes a trend to adopt the automated reverse engineering procedure instead. Some evolutionary algorithms have been suggested for deriving network parameters. However, to infer large networks by the evolutionary algorithm, it is necessary to address two important issues: premature convergence and high computational cost. To tackle the former problem and to enhance the performance of traditional evolutionary algorithms, it is advisable to use parallel model evolutionary algorithms. To overcome the latter and to speed up the computation, it is advocated to adopt the mechanism of cloud computing as a promising solution: most popular is the method of MapReduce programming model, a fault-tolerant framework to implement parallel algorithms for inferring large gene networks. This work presents a practical framework to infer large gene networks, by developing and parallelizing a hybrid GA-PSO optimization method. Our parallel method is extended to work with the Hadoop MapReduce programming model and is executed in different cloud computing environments. To evaluate the proposed approach, we use a well-known open-source software GeneNetWeaver to create several yeast S. cerevisiae sub-networks and use them to produce gene profiles. Experiments have been conducted and the results have been analyzed. They show that our parallel approach can be successfully used to infer networks with desired behaviors and the computation time can be largely reduced. Parallel population-based algorithms can effectively determine network parameters and they perform better than the widely-used sequential algorithms in gene network inference. These parallel algorithms can be distributed to the cloud computing environment to speed up the computation. By coupling the parallel model population-based optimization method and the parallel computational framework, high

  4. Upon Accounting for the Impact of Isoenzyme Loss, Gene Deletion Costs Anticorrelate with Their Evolutionary Rates.

    Directory of Open Access Journals (Sweden)

    Christopher Jacobs

    Full Text Available System-level metabolic network models enable the computation of growth and metabolic phenotypes from an organism's genome. In particular, flux balance approaches have been used to estimate the contribution of individual metabolic genes to organismal fitness, offering the opportunity to test whether such contributions carry information about the evolutionary pressure on the corresponding genes. Previous failure to identify the expected negative correlation between such computed gene-loss cost and sequence-derived evolutionary rates in Saccharomyces cerevisiae has been ascribed to a real biological gap between a gene's fitness contribution to an organism "here and now" and the same gene's historical importance as evidenced by its accumulated mutations over millions of years of evolution. Here we show that this negative correlation does exist, and can be exposed by revisiting a broadly employed assumption of flux balance models. In particular, we introduce a new metric that we call "function-loss cost", which estimates the cost of a gene loss event as the total potential functional impairment caused by that loss. This new metric displays significant negative correlation with evolutionary rate, across several thousand minimal environments. We demonstrate that the improvement gained using function-loss cost over gene-loss cost is explained by replacing the base assumption that isoenzymes provide unlimited capacity for backup with the assumption that isoenzymes are completely non-redundant. We further show that this change of the assumption regarding isoenzymes increases the recall of epistatic interactions predicted by the flux balance model at the cost of a reduction in the precision of the predictions. In addition to suggesting that the gene-to-reaction mapping in genome-scale flux balance models should be used with caution, our analysis provides new evidence that evolutionary gene importance captures much more than strict essentiality.

  5. Upon Accounting for the Impact of Isoenzyme Loss, Gene Deletion Costs Anticorrelate with Their Evolutionary Rates.

    Science.gov (United States)

    Jacobs, Christopher; Lambourne, Luke; Xia, Yu; Segrè, Daniel

    2017-01-01

    System-level metabolic network models enable the computation of growth and metabolic phenotypes from an organism's genome. In particular, flux balance approaches have been used to estimate the contribution of individual metabolic genes to organismal fitness, offering the opportunity to test whether such contributions carry information about the evolutionary pressure on the corresponding genes. Previous failure to identify the expected negative correlation between such computed gene-loss cost and sequence-derived evolutionary rates in Saccharomyces cerevisiae has been ascribed to a real biological gap between a gene's fitness contribution to an organism "here and now" and the same gene's historical importance as evidenced by its accumulated mutations over millions of years of evolution. Here we show that this negative correlation does exist, and can be exposed by revisiting a broadly employed assumption of flux balance models. In particular, we introduce a new metric that we call "function-loss cost", which estimates the cost of a gene loss event as the total potential functional impairment caused by that loss. This new metric displays significant negative correlation with evolutionary rate, across several thousand minimal environments. We demonstrate that the improvement gained using function-loss cost over gene-loss cost is explained by replacing the base assumption that isoenzymes provide unlimited capacity for backup with the assumption that isoenzymes are completely non-redundant. We further show that this change of the assumption regarding isoenzymes increases the recall of epistatic interactions predicted by the flux balance model at the cost of a reduction in the precision of the predictions. In addition to suggesting that the gene-to-reaction mapping in genome-scale flux balance models should be used with caution, our analysis provides new evidence that evolutionary gene importance captures much more than strict essentiality.

  6. Module discovery by exhaustive search for densely connected, co-expressed regions in biomolecular interaction networks.

    Directory of Open Access Journals (Sweden)

    Recep Colak

    2010-10-01

    Full Text Available Computational prediction of functionally related groups of genes (functional modules from large-scale data is an important issue in computational biology. Gene expression experiments and interaction networks are well studied large-scale data sources, available for many not yet exhaustively annotated organisms. It has been well established, when analyzing these two data sources jointly, modules are often reflected by highly interconnected (dense regions in the interaction networks whose participating genes are co-expressed. However, the tractability of the problem had remained unclear and methods by which to exhaustively search for such constellations had not been presented.We provide an algorithmic framework, referred to as Densely Connected Biclustering (DECOB, by which the aforementioned search problem becomes tractable. To benchmark the predictive power inherent to the approach, we computed all co-expressed, dense regions in physical protein and genetic interaction networks from human and yeast. An automatized filtering procedure reduces our output which results in smaller collections of modules, comparable to state-of-the-art approaches. Our results performed favorably in a fair benchmarking competition which adheres to standard criteria. We demonstrate the usefulness of an exhaustive module search, by using the unreduced output to more quickly perform GO term related function prediction tasks. We point out the advantages of our exhaustive output by predicting functional relationships using two examples.We demonstrate that the computation of all densely connected and co-expressed regions in interaction networks is an approach to module discovery of considerable value. Beyond confirming the well settled hypothesis that such co-expressed, densely connected interaction network regions reflect functional modules, we open up novel computational ways to comprehensively analyze the modular organization of an organism based on prevalent and largely

  7. Module discovery by exhaustive search for densely connected, co-expressed regions in biomolecular interaction networks.

    Science.gov (United States)

    Colak, Recep; Moser, Flavia; Chu, Jeffrey Shih-Chieh; Schönhuth, Alexander; Chen, Nansheng; Ester, Martin

    2010-10-25

    Computational prediction of functionally related groups of genes (functional modules) from large-scale data is an important issue in computational biology. Gene expression experiments and interaction networks are well studied large-scale data sources, available for many not yet exhaustively annotated organisms. It has been well established, when analyzing these two data sources jointly, modules are often reflected by highly interconnected (dense) regions in the interaction networks whose participating genes are co-expressed. However, the tractability of the problem had remained unclear and methods by which to exhaustively search for such constellations had not been presented. We provide an algorithmic framework, referred to as Densely Connected Biclustering (DECOB), by which the aforementioned search problem becomes tractable. To benchmark the predictive power inherent to the approach, we computed all co-expressed, dense regions in physical protein and genetic interaction networks from human and yeast. An automatized filtering procedure reduces our output which results in smaller collections of modules, comparable to state-of-the-art approaches. Our results performed favorably in a fair benchmarking competition which adheres to standard criteria. We demonstrate the usefulness of an exhaustive module search, by using the unreduced output to more quickly perform GO term related function prediction tasks. We point out the advantages of our exhaustive output by predicting functional relationships using two examples. We demonstrate that the computation of all densely connected and co-expressed regions in interaction networks is an approach to module discovery of considerable value. Beyond confirming the well settled hypothesis that such co-expressed, densely connected interaction network regions reflect functional modules, we open up novel computational ways to comprehensively analyze the modular organization of an organism based on prevalent and largely available large

  8. ESR1 Is Co-Expressed with Closely Adjacent Uncharacterised Genes Spanning a Breast Cancer Susceptibility Locus at 6q25.1

    Science.gov (United States)

    Dunbier, Anita K.; Anderson, Helen; Ghazoui, Zara; Lopez-Knowles, Elena; Pancholi, Sunil; Ribas, Ricardo; Drury, Suzanne; Sidhu, Kally; Leary, Alexandra; Martin, Lesley-Ann; Dowsett, Mitch

    2011-01-01

    Approximately 80% of human breast carcinomas present as oestrogen receptor α-positive (ER+ve) disease, and ER status is a critical factor in treatment decision-making. Recently, single nucleotide polymorphisms (SNPs) in the region immediately upstream of the ER gene (ESR1) on 6q25.1 have been associated with breast cancer risk. Our investigation of factors associated with the level of expression of ESR1 in ER+ve tumours has revealed unexpected associations between genes in this region and ESR1 expression that are important to consider in studies of the genetic causes of breast cancer risk. RNA from tumour biopsies taken from 104 postmenopausal women before and after 2 weeks treatment with an aromatase (oestrogen synthase) inhibitor was analyzed on Illumina 48K microarrays. Multiple-testing corrected Spearman correlation revealed that three previously uncharacterized open reading frames (ORFs) located immediately upstream of ESR1, C6ORF96, C6ORF97, and C6ORF211 were highly correlated with ESR1 (Rs = 0.67, 0.64, and 0.55 respectively, FDRaccount for the correlations. The correlations were maintained in cultured cells. An ERα antagonist did not affect the ORFs' expression or their correlation with ESR1, suggesting their transcriptional co-activation is not directly mediated by ERα. siRNA inhibition of C6ORF211 suppressed proliferation in MCF7 cells, and C6ORF211 positively correlated with a proliferation metagene in tumours. In contrast, C6ORF97 expression correlated negatively with the metagene and predicted for improved disease-free survival in a tamoxifen-treated published dataset, independently of ESR1. Our observations suggest that some of the biological effects previously attributed to ER could be mediated and/or modified by these co-expressed genes. The co-expression and function of these genes may be important influences on the recently identified relationship between SNPs in this region and breast cancer risk. PMID:21552322

  9. Evolution, functional differentiation, and co-expression of the RLK gene family revealed in Jilin ginseng, Panax ginseng C.A. Meyer.

    Science.gov (United States)

    Lin, Yanping; Wang, Kangyu; Li, Xiangyu; Sun, Chunyu; Yin, Rui; Wang, Yanfang; Wang, Yi; Zhang, Meiping

    2018-02-21

    Most genes in a genome exist in the form of a gene family; therefore, it is necessary to have knowledge of how a gene family functions to comprehensively understand organismal biology. The receptor-like kinase (RLK)-encoding gene family is one of the most important gene families in plants. It plays important roles in biotic and abiotic stress tolerances, and growth and development. However, little is known about the functional differentiation and relationships among the gene members within a gene family in plants. This study has isolated 563 RLK genes (designated as PgRLK genes) expressed in Jilin ginseng (Panax ginseng C.A. Meyer), investigated their evolution, and deciphered their functional diversification and relationships. The PgRLK gene family is highly diverged and formed into eight types. The LRR type is the earliest and most prevalent, while only the Lec type originated after P. ginseng evolved. Furthermore, although the members of the PgRLK gene family all encode receptor-like protein kinases and share conservative domains, they are functionally very diverse, participating in numerous biological processes. The expressions of different members of the PgRLK gene family are extremely variable within a tissue, at a developmental stage and in the same cultivar, but most of the genes tend to express correlatively, forming a co-expression network. These results not only provide a deeper and comprehensive understanding of the evolution, functional differentiation and correlation of a gene family in plants, but also an RLK genic resource useful for enhanced ginseng genetic improvement.

  10. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  11. Diverse evolutionary trajectories for small RNA biogenesis genes in the oomycete genus Phytophthora

    Directory of Open Access Journals (Sweden)

    Stephanie eBollmann

    2016-03-01

    Full Text Available Gene regulation by small RNA pathways is ubiquitous among eukaryotes, but little is known about small RNA pathways in the Stramenopile kingdom. Phytophthora, a genus of filamentous oomycetes, contains many devastating plant pathogens, causing multibillion-dollar damage to crops, ornamental plants, and natural environments. The genomes of several oomycetes including Phytophthora species such as the soybean pathogen P. sojae, have been sequenced, allowing evolutionary analysis of small RNA-processing enzymes. This study examined the evolutionary origins of the oomycete small RNA-related genes Dicer-like (DCL, and RNA-dependent RNA polymerase (RDR through broad phylogenetic analyses of the key domains. Two Dicer gene homologs, DCL1 and DCL2, and one RDR homolog were cloned and analyzed from P. sojae. Gene expression analysis revealed only minor changes in transcript levels among different life stages. Oomycete DCL1 homologs clustered with animal and plant Dicer homologs in evolutionary trees, whereas oomycete DCL2 homologs clustered basally to the tree along with Drosha homologs. Phylogenetic analysis of the RDR homologs confirmed a previous study that suggested the last common eukaryote ancestor possessed three RDR homologs, which were selectively retained or lost in later lineages. Our analysis clarifies the position of some Unikont and Chromalveolate RDR lineages within the tree, including oomycete homologs. Finally, we analyzed alterations in the domain structure of oomycete Dicer and RDR homologs, specifically focusing on the proposed domain transfer of the DEAD-box helicase domain from Dicer to RDR. Implications of the oomycete domain structure are discussed, and possible roles of the two oomycete Dicer homologs are proposed.

  12. Co-expression of NCED and ALO improves vitamin C level and tolerance to drought and chilling in transgenic tobacco and stylo plants.

    Science.gov (United States)

    Bao, Gegen; Zhuo, Chunliu; Qian, Chunmei; Xiao, Ting; Guo, Zhenfei; Lu, Shaoyun

    2016-01-01

    Abscisic acid (ABA) regulates plant adaptive responses to various environmental stresses, while L-ascorbic acid (AsA) that is also named vitamin C is an important antioxidant and involves in plant stress tolerance and the immune system in domestic animals. Transgenic tobacco (Nicotiana tabacum L.) and stylo [Stylosanthes guianensis (Aublet) Swartz], a forage legume, plants co-expressing stylo 9-cis-epoxycarotenoid dioxygenase (SgNCED1) and yeast D-arabinono-1,4-lactone oxidase (ALO) genes were generated in this study, and tolerance to drought and chilling was analysed in comparison with transgenic tobacco overexpressing SgNCED1 or ALO and the wild-type plants. Compared to the SgNCED1 or ALO transgenic plants, in which only ABA or AsA levels were increased, both ABA and AsA levels were increased in transgenic tobacco and stylo plants co-expressing SgNCED1 and ALO genes. Compared to the wild type, an enhanced drought tolerance was observed in SgNCED1 transgenic tobacco plants with induced expression of drought-responsive genes, but not in ALO plants, while an enhanced chilling tolerance was observed in ALO transgenic tobaccos with induced expression of cold-responsive genes, but not in SgNCED1 plants. Co-expression of SgNCED1 and ALO genes resulted in elevated tolerance to both drought and chilling in transgenic tobacco and stylo plants with induced expression of both drought and cold-responsive genes. Our result suggests that co-expression of SgNCED1 and ALO genes is an effective way for use in forage plant improvement for increased tolerance to drought and chilling and nutrition quality. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  13. Reconstructing the Evolutionary History of Paralogous APETALA1/FRUITFULL-Like Genes in Grasses (Poaceae)

    Science.gov (United States)

    Preston, Jill C.; Kellogg, Elizabeth A.

    2006-01-01

    Gene duplication is an important mechanism for the generation of evolutionary novelty. Paralogous genes that are not silenced may evolve new functions (neofunctionalization) that will alter the developmental outcome of preexisting genetic pathways, partition ancestral functions (subfunctionalization) into divergent developmental modules, or function redundantly. Functional divergence can occur by changes in the spatio-temporal patterns of gene expression and/or by changes in the activities of their protein products. We reconstructed the evolutionary history of two paralogous monocot MADS-box transcription factors, FUL1 and FUL2, and determined the evolution of sequence and gene expression in grass AP1/FUL-like genes. Monocot AP1/FUL-like genes duplicated at the base of Poaceae and codon substitutions occurred under relaxed selection mostly along the branch leading to FUL2. Following the duplication, FUL1 was apparently lost from early diverging taxa, a pattern consistent with major changes in grass floral morphology. Overlapping gene expression patterns in leaves and spikelets indicate that FUL1 and FUL2 probably share some redundant functions, but that FUL2 may have become temporally restricted under partial subfunctionalization to particular stages of floret development. These data have allowed us to reconstruct the history of AP1/FUL-like genes in Poaceae and to hypothesize a role for this gene duplication in the evolution of the grass spikelet. PMID:16816429

  14. Genetic architecture of wood properties based on association analysis and co-expression networks in white spruce.

    Science.gov (United States)

    Lamara, Mebarek; Raherison, Elie; Lenz, Patrick; Beaulieu, Jean; Bousquet, Jean; MacKay, John

    2016-04-01

    Association studies are widely utilized to analyze complex traits but their ability to disclose genetic architectures is often limited by statistical constraints, and functional insights are usually minimal in nonmodel organisms like forest trees. We developed an approach to integrate association mapping results with co-expression networks. We tested single nucleotide polymorphisms (SNPs) in 2652 candidate genes for statistical associations with wood density, stiffness, microfibril angle and ring width in a population of 1694 white spruce trees (Picea glauca). Associations mapping identified 229-292 genes per wood trait using a statistical significance level of P wood associated genes and several known MYB and NAC regulators were identified as network hubs. The network revealed a link between the gene PgNAC8, wood stiffness and microfibril angle, as well as considerable within-season variation for both genetic control of wood traits and gene expression. Trait associations were distributed throughout the network suggesting complex interactions and pleiotropic effects. Our findings indicate that integration of association mapping and co-expression networks enhances our understanding of complex wood traits. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  15. Co-expression of an Erwinia chrysanthemi pectate lyase-encoding gene (pelE) and an E. carotovora polygalacturonase-encoding gene (peh1) in Saccharomyces cerevisiae.

    Science.gov (United States)

    Laing, E; Pretorius, I S

    1993-05-01

    A pectate lyase (PL)-encoding gene (pelE) from Erwinia chrysanthemi and a polygalacturonase (PG)-encoding gene (peh1) from E. carotovora were each inserted between a novel yeast expression-secretion cassette and a yeast gene terminator, and cloned separately into a yeast-centromeric shuttle vector (YCp50), generating recombinant plasmids pAMS12 and pAMS13. Transcription initiation signals present in the expression-secretion cassette were derived from the yeast alcohol dehydrogenase gene promoter (ADC1P), whereas the transcription termination signals were derived from the yeast tryptophan synthase gene terminator (TRP5T). Secretion of PL and PG was directed by the signal sequence of the yeast mating pheromone alpha-factor (MF alpha 1s). A pectinase cassette comprising ADC1P-MF alpha 1s-pelE-TRP5T and ADC1P-MF alpha 1s-peh1-TRP5T was subcloned into YCp50, generating plasmid pAMS14. Subsequently, the dominant selectable Geneticin G418-resistance (GtR) marker, APH1, inserted between the yeast uridine diphosphoglucose 4-epimerase gene promoter (GAL10P) and yeast orotidine-5'-phosphate carboxylase gene terminator (URA3T), was cloned into pAMS14, resulting in plasmid pAMS15. Plasmids pAMS12, pAMS13 and pAMS14 were transformed into a laboratory strain of Saccharomyces cerevisiae, whereas pAMS15 was stably introduced into two commercial wine yeast strains. DNA-DNA and DNA-RNA hybridization analyses revealed the presence of these plasmids, and the pelE and peh1 transcripts in the yeast transformants, respectively. A polypectate agarose assay indicated the extracellular production of biologically active PL and PG by the S. cerevisiae transformants and confirmed that co-expression of the pelE and peh1 genes synergistically enhanced pectate degradation.

  16. Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea

    OpenAIRE

    Wolf Yuri I; Novichkov Pavel S; Sorokin Alexander V; Makarova Kira S; Koonin Eugene V

    2007-01-01

    Abstract Background An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs ...

  17. Evolutionary Transition of Promoter and Gene Body DNA Methylation across Invertebrate-Vertebrate Boundary.

    Science.gov (United States)

    Keller, Thomas E; Han, Priscilla; Yi, Soojin V

    2016-04-01

    Genomes of invertebrates and vertebrates exhibit highly divergent patterns of DNA methylation. Invertebrate genomes tend to be sparsely methylated, and DNA methylation is mostly targeted to a subset of transcription units (gene bodies). In a drastic contrast, vertebrate genomes are generally globally and heavily methylated, punctuated by the limited local hypo-methylation of putative regulatory regions such as promoters. These genomic differences also translate into functional differences in DNA methylation and gene regulation. Although promoter DNA methylation is an important regulatory component of vertebrate gene expression, its role in invertebrate gene regulation has been little explored. Instead, gene body DNA methylation is associated with expression of invertebrate genes. However, the evolutionary steps leading to the differentiation of invertebrate and vertebrate genomic DNA methylation remain unresolved. Here we analyzed experimentally determined DNA methylation maps of several species across the invertebrate-vertebrate boundary, to elucidate how vertebrate gene methylation has evolved. We show that, in contrast to the prevailing idea, a substantial number of promoters in an invertebrate basal chordate Ciona intestinalis are methylated. Moreover, gene expression data indicate significant, epigenomic context-dependent associations between promoter methylation and expression in C. intestinalis. However, there is no evidence that promoter methylation in invertebrate chordate has been evolutionarily maintained across the invertebrate-vertebrate boundary. Rather, body-methylated invertebrate genes preferentially obtain hypo-methylated promoters among vertebrates. Conversely, promoter methylation is preferentially found in lineage- and tissue-specific vertebrate genes. These results provide important insights into the evolutionary origin of epigenetic regulation of vertebrate gene expression. © The Author(s) 2015. Published by Oxford University Press on behalf

  18. In silico identification of miRNAs and their target genes and analysis of gene co-expression network in saffron (Crocus sativus L.) stigma

    Science.gov (United States)

    Zinati, Zahra; Shamloo-Dashtpagerdi, Roohollah; Behpouri, Ali

    2016-01-01

    As an aromatic and colorful plant of substantive taste, saffron (Crocus sativus L.) owes such properties of matter to growing class of the secondary metabolites derived from the carotenoids, apocarotenoids. Regarding the critical role of microRNAs in secondary metabolic synthesis and the limited number of identified miRNAs in C. sativus, on the other hand, one may see the point how the characterization of miRNAs along with the corresponding target genes in C. sativus might expand our perspectives on the roles of miRNAs in carotenoid/apocarotenoid biosynthetic pathway. A computational analysis was used to identify miRNAs and their targets using EST (Expressed Sequence Tag) library from mature saffron stigmas. Then, a gene co- expression network was constructed to identify genes which are potentially involved in carotenoid/apocarotenoid biosynthetic pathways. EST analysis led to the identification of two putative miRNAs (miR414 and miR837-5p) along with the corresponding stem- looped precursors. To our knowledge, this is the first report on miR414 and miR837-5p in C. sativus. Co-expression network analysis indicated that miR414 and miR837-5p may play roles in C. sativus metabolic pathways and led to identification of candidate genes including six transcription factors and one protein kinase probably involved in carotenoid/apocarotenoid biosynthetic pathway. Presence of transcription factors, miRNAs and protein kinase in the network indicated multiple layers of regulation in saffron stigma. The candidate genes from this study may help unraveling regulatory networks underlying the carotenoid/apocarotenoid biosynthesis in saffron and designing metabolic engineering for enhanced secondary metabolites. PMID:28261627

  19. ESR1 is co-expressed with closely adjacent uncharacterised genes spanning a breast cancer susceptibility locus at 6q25.1.

    Directory of Open Access Journals (Sweden)

    Anita K Dunbier

    2011-04-01

    Full Text Available Approximately 80% of human breast carcinomas present as oestrogen receptor α-positive (ER+ve disease, and ER status is a critical factor in treatment decision-making. Recently, single nucleotide polymorphisms (SNPs in the region immediately upstream of the ER gene (ESR1 on 6q25.1 have been associated with breast cancer risk. Our investigation of factors associated with the level of expression of ESR1 in ER+ve tumours has revealed unexpected associations between genes in this region and ESR1 expression that are important to consider in studies of the genetic causes of breast cancer risk. RNA from tumour biopsies taken from 104 postmenopausal women before and after 2 weeks treatment with an aromatase (oestrogen synthase inhibitor was analyzed on Illumina 48K microarrays. Multiple-testing corrected Spearman correlation revealed that three previously uncharacterized open reading frames (ORFs located immediately upstream of ESR1, C6ORF96, C6ORF97, and C6ORF211 were highly correlated with ESR1 (Rs =  0.67, 0.64, and 0.55 respectively, FDR<1 × 10(-7. Publicly available datasets confirmed this relationship in other groups of ER+ve tumours. DNA copy number changes did not account for the correlations. The correlations were maintained in cultured cells. An ERα antagonist did not affect the ORFs' expression or their correlation with ESR1, suggesting their transcriptional co-activation is not directly mediated by ERα. siRNA inhibition of C6ORF211 suppressed proliferation in MCF7 cells, and C6ORF211 positively correlated with a proliferation metagene in tumours. In contrast, C6ORF97 expression correlated negatively with the metagene and predicted for improved disease-free survival in a tamoxifen-treated published dataset, independently of ESR1. Our observations suggest that some of the biological effects previously attributed to ER could be mediated and/or modified by these co-expressed genes. The co-expression and function of these genes may be

  20. Estimation of the proteomic cancer co-expression sub networks by using association estimators.

    Directory of Open Access Journals (Sweden)

    Cihat Erdoğan

    Full Text Available In this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA. Correlation and mutual information (MI based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators' performance, a multi-layer data integration platform on gene-disease associations (DisGeNET and the Molecular Signatures Database (MSigDB was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink and 64% for Schurmann-Grassberger (SG association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists.

  1. Evolutionary changes of Hox genes and relevant regulatory factors provide novel insights into mammalian morphological modifications.

    Science.gov (United States)

    Li, Kui; Sun, Xiaohui; Chen, Meixiu; Sun, Yingying; Tian, Ran; Wang, Zhengfei; Xu, Shixia; Yang, Guang

    2018-01-01

    The diversity of body plans of mammals accelerates the innovation of lifestyles and the extensive adaptation to different habitats, including terrestrial, aerial and aquatic habitats. However, the genetic basis of those phenotypic modifications, which have occurred during mammalian evolution, remains poorly explored. In the present study, we synthetically surveyed the evolutionary pattern of Hox clusters that played a powerful role in the morphogenesis along the head-tail axis of animal embryos and the main regulatory factors (Mll, Bmi1 and E2f6) that control the expression of Hox genes. A deflected density of repetitive elements and lineage-specific radical mutations of Mll have been determined in marine mammals with morphological changes, suggesting that evolutionary changes may alter Hox gene expression in these lineages, leading to the morphological modification of these lineages. Although no positive selection was detected at certain ancestor nodes of lineages, the increased ω values of Hox genes implied the relaxation of functional constraints of these genes during the mammalian evolutionary process. More importantly, 49 positively-selected sites were identified in mammalian lineages with phenotypic modifications, indicating adaptive evolution acting on Hox genes and regulatory factors. In addition, 3 parallel amino acid substitutions in some Hox genes were examined in marine mammals, which might be responsible for their streamlined body. © 2017 The Authors. Integrative Zoology published by International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and John Wiley & Sons Australia, Ltd.

  2. Evolutionary conservation and network structure characterize genes of phenotypic relevance for mitosis in human.

    Directory of Open Access Journals (Sweden)

    Marek Ostaszewski

    Full Text Available The impact of gene silencing on cellular phenotypes is difficult to establish due to the complexity of interactions in the associated biological processes and pathways. A recent genome-wide RNA knock-down study both identified and phenotypically characterized a set of important genes for the cell cycle in HeLa cells. Here, we combine a molecular interaction network analysis, based on physical and functional protein interactions, in conjunction with evolutionary information, to elucidate the common biological and topological properties of these key genes. Our results show that these genes tend to be conserved with their corresponding protein interactions across several species and are key constituents of the evolutionary conserved molecular interaction network. Moreover, a group of bistable network motifs is found to be conserved within this network, which are likely to influence the network stability and therefore the robustness of cellular functioning. They form a cluster, which displays functional homogeneity and is significantly enriched in genes phenotypically relevant for mitosis. Additional results reveal a relationship between specific cellular processes and the phenotypic outcomes induced by gene silencing. This study introduces new ideas regarding the relationship between genotype and phenotype in the context of the cell cycle. We show that the analysis of molecular interaction networks can result in the identification of genes relevant to cellular processes, which is a promising avenue for future research.

  3. rbcL gene sequences provide evidence for the evolutionary lineages of leptosporangiate ferns.

    OpenAIRE

    Hasebe, M; Omori, T; Nakazawa, M; Sano, T; Kato, M; Iwatsuki, K

    1994-01-01

    Pteriodophytes have a longer evolutionary history than any other vascular land plant and, therefore, have endured greater loss of phylogenetically informative information. This factor has resulted in substantial disagreements in evaluating characters and, thus, controversy in establishing a stable classification. To compare competing classifications, we obtained DNA sequences of a chloroplast gene. The sequence of 1206 nt of the large subunit of the ribulose-bisphosphate carboxylase gene (rbc...

  4. Different substrate regimes determine transcriptional profiles and gene co-expression in Methanosarcina barkeri (DSM 800)

    Czech Academy of Sciences Publication Activity Database

    Lin, Qiang; Fang, X.; Ho, A.; Li, J.; Yan, X.; Tu, B.; Li, Ch.; Li, J.; Yao, M.; Li, X.

    2017-01-01

    Roč. 101, č. 19 (2017), s. 7303-7316 ISSN 0175-7598 Institutional support: RVO:60077344 Keywords : Methanosarcina barkeri * substrate regimes * diversity * co-expression * ecological strategies Subject RIV: EH - Ecology, Behaviour OBOR OECD: Ecology Impact factor: 3.420, year: 2016

  5. A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways

    Science.gov (United States)

    Musungu, Bryan M.; Bhatnagar, Deepak; Brown, Robert L.; Payne, Gary A.; OBrian, Greg; Fakhoury, Ahmad M.; Geisler, Matt

    2016-01-01

    A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus, a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays, and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays, there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus. Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus. PMID:27917194

  6. A Network Approach of Gene Co-expression in the Zea mays/Aspergillus flavus Pathosystem to Map Host/Pathogen Interaction Pathways.

    Science.gov (United States)

    Musungu, Bryan M; Bhatnagar, Deepak; Brown, Robert L; Payne, Gary A; OBrian, Greg; Fakhoury, Ahmad M; Geisler, Matt

    2016-01-01

    A gene co-expression network (GEN) was generated using a dual RNA-seq study with the fungal pathogen Aspergillus flavus and its plant host Zea mays during the initial 3 days of infection. The analysis deciphered novel pathways and mapped genes of interest in both organisms during the infection. This network revealed a high degree of connectivity in many of the previously recognized pathways in Z. mays such as jasmonic acid, ethylene, and reactive oxygen species (ROS). For the pathogen A. flavus , a link between aflatoxin production and vesicular transport was identified within the network. There was significant interspecies correlation of expression between Z. mays and A. flavus for a subset of 104 Z. mays , and 1942 A. flavus genes. This resulted in an interspecies subnetwork enriched in multiple Z. mays genes involved in the production of ROS. In addition to the ROS from Z. mays , there was enrichment in the vesicular transport pathways and the aflatoxin pathway for A. flavus . Included in these genes, a key aflatoxin cluster regulator, AflS, was found to be co-regulated with multiple Z. mays ROS producing genes within the network, suggesting AflS may be monitoring host ROS levels. The entire GEN for both host and pathogen, and the subset of interspecies correlations, is presented as a tool for hypothesis generation and discovery for events in the early stages of fungal infection of Z. mays by A. flavus .

  7. Evolutionary maintenance of selfish homing endonuclease genes in the absence of horizontal transfer.

    Science.gov (United States)

    Yahara, Koji; Fukuyo, Masaki; Sasaki, Akira; Kobayashi, Ichizo

    2009-11-03

    Homing endonuclease genes are "selfish" mobile genetic elements whose endonuclease promotes the spread of its own gene by creating a break at a specific target site and using the host machinery to repair the break by copying and inserting the gene at this site. Horizontal transfer across the boundary of a species or population within which mating takes place has been thought to be necessary for their evolutionary persistence. This is based on the assumption that they will become fixed in a host population, where opportunities of homing will disappear, and become susceptible to degeneration. To test this hypothesis, we modeled behavior of a homing endonuclease gene that moves during meiosis through double-strand break repair. We mathematically explored conditions for persistence of the homing endonuclease gene and elucidated their parameter dependence as phase diagrams. We found that, if the cost of the pseudogene is lower than that of the homing endonuclease gene, the 2 forms can persist in a population through autonomous periodic oscillation. If the cost of the pseudogene is higher, 2 types of dynamics appear that enable evolutionary persistence: bistability dependent on initial frequency or fixation irrespective of initial frequency. The prediction of long persistence in the absence of horizontal transfer was confirmed by stochastic simulations in finite populations. The average time to extinction of the endonuclease gene was found to be thousands of meiotic generations or more based on realistic parameter values. These results provide a solid theoretical basis for an understanding of these and other extremely selfish elements.

  8. Co-expression analysis and identification of fecundity-related long non-coding RNAs in sheep ovaries.

    Science.gov (United States)

    Miao, Xiangyang; Luo, Qingmiao; Zhao, Huijing; Qin, Xiaoyu

    2016-12-16

    Small Tail Han sheep, including the FecB B FecB B (Han BB) and FecB + FecB + (Han++) genotypes, and Dorset sheep exhibit different fecundities. To identify novel long non-coding RNAs (lncRNAs) associated with sheep fecundity to better understand their molecular mechanisms, a genome-wide analysis of mRNAs and lncRNAs from Han BB, Han++ and Dorset sheep was performed. After the identification of differentially expressed mRNAs and lncRNAs, 16 significant modules were explored by using weighted gene coexpression network analysis (WGCNA) followed by functional enrichment analysis of the genes and lncRNAs in significant modules. Among these selected modules, the yellow and brown modules were significantly related to sheep fecundity. lncRNAs (e.g., NR0B1, XLOC_041882, and MYH15) in the yellow module were mainly involved in the TGF-β signalling pathway, and NYAP1 and BCORL1 were significantly associated with the oxytocin signalling pathway, which regulates several genes in the coexpression network of the brown module. Overall, we identified several gene modules associated with sheep fecundity, as well as networks consisting of hub genes and lncRNAs that may contribute to sheep prolificacy by regulating the target mRNAs related to the TGF-β and oxytocin signalling pathways. This study provides an alternative strategy for the identification of potential candidate regulatory lncRNAs.

  9. Evolutionary genomics and adaptive evolution of the Hedgehog gene family (Shh, Ihh and Dhh in vertebrates.

    Directory of Open Access Journals (Sweden)

    Joana Pereira

    Full Text Available The Hedgehog (Hh gene family codes for a class of secreted proteins composed of two active domains that act as signalling molecules during embryo development, namely for the development of the nervous and skeletal systems and the formation of the testis cord. While only one Hh gene is found typically in invertebrate genomes, most vertebrates species have three (Sonic hedgehog--Shh; Indian hedgehog--Ihh; and Desert hedgehog--Dhh, each with different expression patterns and functions, which likely helped promote the increasing complexity of vertebrates and their successful diversification. In this study, we used comparative genomic and adaptive evolutionary analyses to characterize the evolution of the Hh genes in vertebrates following the two major whole genome duplication (WGD events. To overcome the lack of Hh-coding sequences on avian publicly available databases, we used an extensive dataset of 45 avian and three non-avian reptilian genomes to show that birds have all three Hh paralogs. We find suggestions that following the WGD events, vertebrate Hh paralogous genes evolved independently within similar linkage groups and under different evolutionary rates, especially within the catalytic domain. The structural regions around the ion-binding site were identified to be under positive selection in the signaling domain. These findings contrast with those observed in invertebrates, where different lineages that experienced gene duplication retained similar selective constraints in the Hh orthologs. Our results provide new insights on the evolutionary history of the Hh gene family, the functional roles of these paralogs in vertebrate species, and on the location of mutational hotspots.

  10. Evolutionary genomics and adaptive evolution of the Hedgehog gene family (Shh, Ihh and Dhh) in vertebrates.

    Science.gov (United States)

    Pereira, Joana; Johnson, Warren E; O'Brien, Stephen J; Jarvis, Erich D; Zhang, Guojie; Gilbert, M Thomas P; Vasconcelos, Vitor; Antunes, Agostinho

    2014-01-01

    The Hedgehog (Hh) gene family codes for a class of secreted proteins composed of two active domains that act as signalling molecules during embryo development, namely for the development of the nervous and skeletal systems and the formation of the testis cord. While only one Hh gene is found typically in invertebrate genomes, most vertebrates species have three (Sonic hedgehog--Shh; Indian hedgehog--Ihh; and Desert hedgehog--Dhh), each with different expression patterns and functions, which likely helped promote the increasing complexity of vertebrates and their successful diversification. In this study, we used comparative genomic and adaptive evolutionary analyses to characterize the evolution of the Hh genes in vertebrates following the two major whole genome duplication (WGD) events. To overcome the lack of Hh-coding sequences on avian publicly available databases, we used an extensive dataset of 45 avian and three non-avian reptilian genomes to show that birds have all three Hh paralogs. We find suggestions that following the WGD events, vertebrate Hh paralogous genes evolved independently within similar linkage groups and under different evolutionary rates, especially within the catalytic domain. The structural regions around the ion-binding site were identified to be under positive selection in the signaling domain. These findings contrast with those observed in invertebrates, where different lineages that experienced gene duplication retained similar selective constraints in the Hh orthologs. Our results provide new insights on the evolutionary history of the Hh gene family, the functional roles of these paralogs in vertebrate species, and on the location of mutational hotspots.

  11. Genome-wide expression of transcriptomes and their co-expression pattern in subtropical maize (Zea mays L. under waterlogging stress.

    Directory of Open Access Journals (Sweden)

    Nepolean Thirunavukkarasu

    Full Text Available Waterlogging causes extensive damage to maize crops in tropical and subtropical regions. The identification of tolerance genes and their interactions at the molecular level will be helpful to engineer tolerant genotypes. A whole-genome transcriptome assay revealed the specific role of genes in response to waterlogging stress in susceptible and tolerant genotypes. Genes involved in the synthesis of ethylene and auxin, cell wall metabolism, activation of G-proteins and formation of aerenchyma and adventitious roots, were upregulated in the tolerant genotype. Many transcription factors, particularly ERFs, MYB, HSPs, MAPK, and LOB-domain protein were involved in regulation of these traits. Genes responsible for scavenging of ROS generated under stress were expressed along with those involved in carbohydrate metabolism. The physical locations of 21 genes expressed in the tolerant genotype were found to correspond with the marker intervals of known QTLs responsible for development of adaptive traits. Among the candidate genes, most showed synteny with genes of sorghum and foxtail millet. Co-expression analysis of 528 microarray samples including 16 samples from the present study generated seven functional modules each in the two genotypes, with differing characteristics. In the tolerant genotype, stress genes were co-expressed along with peroxidase and fermentation pathway genes.

  12. A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family

    Science.gov (United States)

    Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.

    2015-01-01

    Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two – rowed and 288 six – rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the

  13. Proteomics in evolutionary ecology.

    Science.gov (United States)

    Baer, B; Millar, A H

    2016-03-01

    Evolutionary ecologists are traditionally gene-focused, as genes propagate phenotypic traits across generations and mutations and recombination in the DNA generate genetic diversity required for evolutionary processes. As a consequence, the inheritance of changed DNA provides a molecular explanation for the functional changes associated with natural selection. A direct focus on proteins on the other hand, the actual molecular agents responsible for the expression of a phenotypic trait, receives far less interest from ecologists and evolutionary biologists. This is partially due to the central dogma of molecular biology that appears to define proteins as the 'dead-end of molecular information flow' as well as technical limitations in identifying and studying proteins and their diversity in the field and in many of the more exotic genera often favored in ecological studies. Here we provide an overview of a newly forming field of research that we refer to as 'Evolutionary Proteomics'. We point out that the origins of cellular function are related to the properties of polypeptide and RNA and their interactions with the environment, rather than DNA descent, and that the critical role of horizontal gene transfer in evolution is more about coopting new proteins to impact cellular processes than it is about modifying gene function. Furthermore, post-transcriptional and post-translational processes generate a remarkable diversity of mature proteins from a single gene, and the properties of these mature proteins can also influence inheritance through genetic and perhaps epigenetic mechanisms. The influence of post-transcriptional diversification on evolutionary processes could provide a novel mechanistic underpinning for elements of rapid, directed evolutionary changes and adaptations as observed for a variety of evolutionary processes. Modern state-of the art technologies based on mass spectrometry are now available to identify and quantify peptides, proteins, protein

  14. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

    Science.gov (United States)

    Catania, Francesco; Lynch, Michael

    2010-05-04

    In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  15. The functional landscape of mouse gene expression

    Directory of Open Access Journals (Sweden)

    Zhang Wen

    2004-12-01

    Full Text Available Abstract Background Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has not been objectively tested nor compared with the related but distinct strategy of correlating gene co-expression as a means to predict gene function. Results We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes are in which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis. Conclusions We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.

  16. Co-expression of the Thermotoga neapolitana aglB gene with an upstream 3'-coding fragment of the malG gene improves enzymatic characteristics of recombinant AglB cyclomaltodextrinase.

    Science.gov (United States)

    Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A

    2007-07-01

    A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.

  17. Evolutionary inference across eukaryotes identifies specific pressures favoring mitochondrial gene retention

    OpenAIRE

    Williams, Ben; Johnston, Iain

    2016-01-01

    Since their endosymbiotic origin, mitochondria have lost most of their genes. Although many selective mechanisms underlying the evolution of mitochondrial genomes have been proposed, a data-driven exploration of these hypotheses is lacking, and a quantitatively supported consensus remains absent. We developed HyperTraPS, a methodology coupling stochastic modelling with Bayesian inference, to identify the ordering of evolutionary events and suggest their causes. Using 2015 complete mitochondri...

  18. Evolutionary genomics of plant genes encoding N-terminal-TM-C2 domain proteins and the similar FAM62 genes and synaptotagmin genes of metazoans

    Directory of Open Access Journals (Sweden)

    Craxton Molly

    2007-07-01

    Full Text Available Abstract Background Synaptotagmin genes are found in animal genomes and are known to function in the nervous system. Genes with a similar domain architecture as well as sequence similarity to synaptotagmin C2 domains have also been found in plant genomes. The plant genes share an additional region of sequence similarity with a group of animal genes named FAM62. FAM62 genes also have a similar domain architecture. Little is known about the functions of the plant genes and animal FAM62 genes. Indeed, many members of the large and diverse Syt gene family await functional characterization. Understanding the evolutionary relationships among these genes will help to realize the full implications of functional studies and lead to improved genome annotation. Results I collected and compared plant Syt-like sequences from the primary nucleotide sequence databases at NCBI. The collection comprises six groups of plant genes conserved in embryophytes: NTMC2Type1 to NTMC2Type6. I collected and compared metazoan FAM62 sequences and identified some similar sequences from other eukaryotic lineages. I found evidence of RNA editing and alternative splicing. I compared the intron patterns of Syt genes. I also compared Rabphilin and Doc2 genes. Conclusion Genes encoding proteins with N-terminal-transmembrane-C2 domain architectures resembling synaptotagmins, are widespread in eukaryotes. A collection of these genes is presented here. The collection provides a resource for studies of intron evolution. I have classified the collection into homologous gene families according to distinctive patterns of sequence conservation and intron position. The evolutionary histories of these gene families are traceable through the appearance of family members in different eukaryotic lineages. Assuming an intron-rich eukaryotic ancestor, the conserved intron patterns distinctive of individual gene families, indicate independent origins of Syt, FAM62 and NTMC2 genes. Resemblances

  19. Gene expression during Drosophila melanogaster egg development before and after reproductive diapause

    Directory of Open Access Journals (Sweden)

    Baker Dean A

    2009-05-01

    Full Text Available Abstract Background Despite the importance of egg development to the female life cycle in Drosophila, global patterns of gene expression have not been examined in detail, primarily due to the difficulty in isolating synchronised developmental stages in sufficient quantities for gene expression profiling. Entry into vitellogenesis is a key stage of oogenesis and by forcing females into reproductive diapause we are able to arrest oogenesis at the pre-vitellogenic stages. Releasing females from diapause allows collection of relatively synchronous developing egg populations and an investigation of some of the transcriptional dynamics apparent before and after reproductive diapause. Results Focusing on gender-biased transcription, we identified mechanisms of egg development suppressed during reproductive dormancy as well as other molecular changes unique to the diapausing female. A microarray based analysis generated a set of 3565 transcripts with at least 2-fold greater expression in females as compared to control males, 1392 such changes were biased during reproductive dormancy. In addition, we also detect 1922 up-regulated transcriptional changes after entry into vitellogenesis, which were classified into discrete blocks of co-expression. We discuss some of the regulatory aspects apparent after re-initiation of egg development, exploring the underlying functions, maternal contribution and evolutionary conservation of co-expression patterns involved in egg production. Conclusion Although much of the work we present is descriptive, fundamental aspects of egg development and gender-biased transcription can be derived from our time-series experiment. We believe that our dataset will facilitate further exploration of the developmental and evolutionary characteristics of oogenesis as well as the nature of reproductive arrest in Drosophila.

  20. Genetic architecture of gene expression in the chicken

    Directory of Open Access Journals (Sweden)

    Stanley Dragana

    2013-01-01

    Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.

  1. Reconstruction of Oomycete Genome Evolution Identifies Differences in Evolutionary Trajectories Leading to Present-Day Large Gene Families

    NARCIS (Netherlands)

    Seidl, M.F.; Ackerveken, van den G.; Govers, F.; Snel, B.

    2012-01-01

    The taxonomic class of oomycetes contains numerous pathogens of plants and animals but is related to nonpathogenic diatoms and brown algae. Oomycetes have flexible genomes comprising large gene families that play roles in pathogenicity. The evolutionary processes that shaped the gene content have

  2. Differential co-expression and regulation analyses reveal different mechanisms underlying major depressive disorder and subsyndromal symptomatic depression.

    Science.gov (United States)

    Xu, Fan; Yang, Jing; Chen, Jin; Wu, Qingyuan; Gong, Wei; Zhang, Jianguo; Shao, Weihua; Mu, Jun; Yang, Deyu; Yang, Yongtao; Li, Zhiwei; Xie, Peng

    2015-04-03

    Recent depression research has revealed a growing awareness of how to best classify depression into depressive subtypes. Appropriately subtyping depression can lead to identification of subtypes that are more responsive to current pharmacological treatment and aid in separating out depressed patients in which current antidepressants are not particularly effective. Differential co-expression analysis (DCEA) and differential regulation analysis (DRA) were applied to compare the transcriptomic profiles of peripheral blood lymphocytes from patients with two depressive subtypes: major depressive disorder (MDD) and subsyndromal symptomatic depression (SSD). Six differentially regulated genes (DRGs) (FOSL1, SRF, JUN, TFAP4, SOX9, and HLF) and 16 transcription factor-to-target differentially co-expressed gene links or pairs (TF2target DCLs) appear to be the key differential factors in MDD; in contrast, one DRG (PATZ1) and eight TF2target DCLs appear to be the key differential factors in SSD. There was no overlap between the MDD target genes and SSD target genes. Venlafaxine (Efexor™, Effexor™) appears to have a significant effect on the gene expression profile of MDD patients but no significant effect on the gene expression profile of SSD patients. DCEA and DRA revealed no apparent similarities between the differential regulatory processes underlying MDD and SSD. This bioinformatic analysis may provide novel insights that can support future antidepressant R&D efforts.

  3. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members

    Directory of Open Access Journals (Sweden)

    Tianyu Zhou

    2015-01-01

    Full Text Available Peroxisome proliferators-activated receptor (PPAR gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3′ UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3′ UTR are essential for PPARs evolution and diversity functions acquired.

  4. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members.

    Science.gov (United States)

    Zhou, Tianyu; Yan, Xiping; Wang, Guosong; Liu, Hehe; Gan, Xiang; Zhang, Tao; Wang, Jiwen; Li, Liang

    2015-01-01

    Peroxisome proliferators-activated receptor (PPAR) gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors) domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors) are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3' UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3' UTR are essential for PPARs evolution and diversity functions acquired.

  5. Gene-Transformation-Induced Changes in Chemical Functional Group Features and Molecular Structure Conformation in Alfalfa Plants Co-Expressing Lc-bHLH and C1-MYB Transcriptive Flavanoid Regulatory Genes: Effects of Single-Gene and Two-Gene Insertion.

    Science.gov (United States)

    Heendeniya, Ravindra G; Yu, Peiqiang

    2017-03-20

    Alfalfa ( Medicago sativa L.) genotypes transformed with Lc-bHLH and Lc transcription genes were developed with the intention of stimulating proanthocyanidin synthesis in the aerial parts of the plant. To our knowledge, there are no studies on the effect of single-gene and two-gene transformation on chemical functional groups and molecular structure changes in these plants. The objective of this study was to use advanced molecular spectroscopy with multivariate chemometrics to determine chemical functional group intensity and molecular structure changes in alfalfa plants when co-expressing Lc-bHLH and C1-MYB transcriptive flavanoid regulatory genes in comparison with non-transgenic (NT) and AC Grazeland (ACGL) genotypes. The results showed that compared to NT genotype, the presence of double genes ( Lc and C1 ) increased ratios of both the area and peak height of protein structural Amide I/II and the height ratio of α-helix to β-sheet. In carbohydrate-related spectral analysis, the double gene-transformed alfalfa genotypes exhibited lower peak heights at 1370, 1240, 1153, and 1020 cm -1 compared to the NT genotype. Furthermore, the effect of double gene transformation on carbohydrate molecular structure was clearly revealed in the principal component analysis of the spectra. In conclusion, single or double transformation of Lc and C1 genes resulted in changing functional groups and molecular structure related to proteins and carbohydrates compared to the NT alfalfa genotype. The current study provided molecular structural information on the transgenic alfalfa plants and provided an insight into the impact of transgenes on protein and carbohydrate properties and their molecular structure's changes.

  6. Gene Expression Contributes to the Recent Evolution of Host Resistance in a Model Host Parasite System

    Directory of Open Access Journals (Sweden)

    Brian K. Lohman

    2017-09-01

    Full Text Available Heritable population differences in immune gene expression following infection can reveal mechanisms of host immune evolution. We compared gene expression in infected and uninfected threespine stickleback (Gasterosteus aculeatus from two natural populations that differ in resistance to a native cestode parasite, Schistocephalus solidus. Genes in both the innate and adaptive immune system were differentially expressed as a function of host population, infection status, and their interaction. These genes were enriched for loci controlling immune functions known to differ between host populations or in response to infection. Coexpression network analysis identified two distinct processes contributing to resistance: parasite survival and suppression of growth. Comparing networks between populations showed resistant fish have a dynamic expression profile while susceptible fish are static. In summary, recent evolutionary divergence between two vertebrate populations has generated population-specific gene expression responses to parasite infection, affecting parasite establishment and growth.

  7. Patterns of evolutionary conservation of essential genes correlate with their compensability.

    Directory of Open Access Journals (Sweden)

    Tobias Bergmiller

    2012-06-01

    Full Text Available Essential genes code for fundamental cellular functions required for the viability of an organism. For this reason, essential genes are often highly conserved across organisms. However, this is not always the case: orthologues of genes that are essential in one organism are sometimes not essential in other organisms or are absent from their genomes. This suggests that, in the course of evolution, essential genes can be rendered nonessential. How can a gene become non-essential? Here we used genetic manipulation to deplete the products of 26 different essential genes in Escherichia coli. This depletion results in a lethal phenotype, which could often be rescued by the overexpression of a non-homologous, non-essential gene, most likely through replacement of the essential function. We also show that, in a smaller number of cases, the essential genes can be fully deleted from the genome, suggesting that complete functional replacement is possible. Finally, we show that essential genes whose function can be replaced in the laboratory are more likely to be non-essential or not present in other taxa. These results are consistent with the notion that patterns of evolutionary conservation of essential genes are influenced by their compensability-that is, by how easily they can be functionally replaced, for example through increased expression of other genes.

  8. Signatures of selection acting on the innate immunity gene Toll-like receptor 2 (TLR2) during the evolutionary history of rodents.

    Science.gov (United States)

    Tschirren, B; Råberg, L; Westerdahl, H

    2011-06-01

    Patterns of selection acting on immune defence genes have recently been the focus of considerable interest. Yet, when it comes to vertebrates, studies have mainly focused on the acquired branch of the immune system. Consequently, the direction and strength of selection acting on genes of the vertebrate innate immune defence remain poorly understood. Here, we present a molecular analysis of selection on an important receptor of the innate immune system of vertebrates, the Toll-like receptor 2 (TLR2), across 17 rodent species. Although purifying selection was the prevalent evolutionary force acting on most parts of the rodent TLR2, we found that codons in close proximity to pathogen-binding and TLR2-TLR1 heterodimerization sites have been subject to positive selection. This indicates that parasite-mediated selection is not restricted to acquired immune system genes like the major histocompatibility complex, but also affects innate defence genes. To obtain a comprehensive understanding of evolutionary processes in host-parasite systems, both innate and acquired immunity thus need to be considered. © 2011 The Authors. Journal of Evolutionary Biology © 2011 European Society For Evolutionary Biology.

  9. Comparative genomic analysis of the Lipase3 gene family in five plant species reveals distinct evolutionary origins.

    Science.gov (United States)

    Wang, Dan; Zhang, Lin; Hu, JunFeng; Gao, Dianshuai; Liu, Xin; Sha, Yan

    2018-04-01

    Lipases are physiologically important and ubiquitous enzymes that share a conserved domain and are classified into eight different families based on their amino acid sequences and fundamental biological properties. The Lipase3 family of lipases was reported to possess a canonical fold typical of α/β hydrolases and a typical catalytic triad, suggesting a distinct evolutionary origin for this family. Genes in the Lipase3 family do not have the same functions, but maintain the conserved Lipase3 domain. There have been extensive studies of Lipase3 structures and functions, but little is known about their evolutionary histories. In this study, all lipases within five plant species were identified, and their phylogenetic relationships and genetic properties were analyzed and used to group them into distinct evolutionary families. Each identified lipase family contained at least one dicot and monocot Lipase3 protein, indicating that the gene family was established before the split of dicots and monocots. Similar intron/exon numbers and predicted protein sequence lengths were found within individual groups. Twenty-four tandem Lipase3 gene duplications were identified, implying that the distinctive function of Lipase3 genes appears to be a consequence of translocation and neofunctionalization after gene duplication. The functional genes EDS1, PAD4, and SAG101 that are reportedly involved in pathogen response were all located in the same group. The nucleotide diversity (Dxy) and the ratio of nonsynonymous to synonymous nucleotide substitutions rates (Ka/Ks) of the three genes were significantly greater than the average across the genomes. We further observed evidence for selection maintaining diversity on three genes in the Toll-Interleukin-1 receptor type of nucleotide binding/leucine-rich repeat immune receptor (TIR-NBS LRR) immunity-response signaling pathway, indicating that they could be vulnerable to pathogen effectors.

  10. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

    Directory of Open Access Journals (Sweden)

    Lynch Michael

    2010-05-01

    Full Text Available Abstract Background In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa remains a virtually unexplored issue. Results By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions Our observations 1 shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2 are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3 reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  11. Evolutionary diversification of plant shikimate kinase gene duplicates.

    Directory of Open Access Journals (Sweden)

    Geoffrey Fucile

    2008-12-01

    Full Text Available Shikimate kinase (SK; EC 2.7.1.71 catalyzes the fifth reaction of the shikimate pathway, which directs carbon from the central metabolism pool to a broad range of secondary metabolites involved in plant development, growth, and stress responses. In this study, we demonstrate the role of plant SK gene duplicate evolution in the diversification of metabolic regulation and the acquisition of novel and physiologically essential function. Phylogenetic analysis of plant SK homologs resolves an orthologous cluster of plant SKs and two functionally distinct orthologous clusters. These previously undescribed genes, shikimate kinase-like 1 (SKL1 and -2 (SKL2, do not encode SK activity, are present in all major plant lineages, and apparently evolved under positive selection following SK gene duplication over 400 MYA. This is supported by functional assays using recombinant SK, SKL1, and SKL2 from Arabidopsis thaliana (At and evolutionary analyses of the diversification of SK-catalytic and -substrate binding sites based on theoretical structure models. AtSKL1 mutants yield albino and novel variegated phenotypes, which indicate SKL1 is required for chloroplast biogenesis. Extant SKL2 sequences show a strong genetic signature of positive selection, which is enriched in a protein-protein interaction module not found in other SK homologs. We also report the first kinetic characterization of plant SKs and show that gene expression diversification among the AtSK inparalogs is correlated with developmental processes and stress responses. This study examines the functional diversification of ancient and recent plant SK gene duplicates and highlights the utility of SKs as scaffolds for functional innovation.

  12. Population genetics of non-genetic traits: Evolutionary roles of stochasticity in gene expression

    KAUST Repository

    Mineta, Katsuhiko

    2015-05-01

    The role of stochasticity in evolutionary genetics has long been debated. To date, however, the potential roles of non-genetic traits in evolutionary processes have been largely neglected. In molecular biology, growing evidence suggests that stochasticity in gene expression (SGE) is common and that SGE has major impacts on phenotypes and fitness. Here, we provide a general overview of the potential effects of SGE on population genetic parameters, arguing that SGE can indeed have a profound effect on evolutionary processes. Our analyses suggest that SGE potentially alters the fate of mutations by influencing effective population size and fixation probability. In addition, a genetic control of SGE magnitude could evolve under certain conditions, if the fitness of the less-fit individual increases due to SGE and environmental fluctuation. Although empirical evidence for our arguments is yet to come, methodological developments for precisely measuring SGE in living organisms will further advance our understanding of SGE-driven evolution.

  13. Population genetics of non-genetic traits: Evolutionary roles of stochasticity in gene expression

    KAUST Repository

    Mineta, Katsuhiko; Matsumoto, Tomotaka; Osada, Naoki; Araki, Hitoshi

    2015-01-01

    The role of stochasticity in evolutionary genetics has long been debated. To date, however, the potential roles of non-genetic traits in evolutionary processes have been largely neglected. In molecular biology, growing evidence suggests that stochasticity in gene expression (SGE) is common and that SGE has major impacts on phenotypes and fitness. Here, we provide a general overview of the potential effects of SGE on population genetic parameters, arguing that SGE can indeed have a profound effect on evolutionary processes. Our analyses suggest that SGE potentially alters the fate of mutations by influencing effective population size and fixation probability. In addition, a genetic control of SGE magnitude could evolve under certain conditions, if the fitness of the less-fit individual increases due to SGE and environmental fluctuation. Although empirical evidence for our arguments is yet to come, methodological developments for precisely measuring SGE in living organisms will further advance our understanding of SGE-driven evolution.

  14. Domestication rewired gene expression and nucleotide diversity patterns in tomato.

    Science.gov (United States)

    Sauvage, Christopher; Rau, Andrea; Aichholz, Charlotte; Chadoeuf, Joël; Sarah, Gautier; Ruiz, Manuel; Santoni, Sylvain; Causse, Mathilde; David, Jacques; Glémin, Sylvain

    2017-08-01

    Plant domestication has led to considerable phenotypic modifications from wild species to modern varieties. However, although changes in key traits have been well documented, less is known about the underlying molecular mechanisms, such as the reduction of molecular diversity or global gene co-expression patterns. In this study, we used a combination of gene expression and population genetics in wild and crop tomato to decipher the footprints of domestication. We found a set of 1729 differentially expressed genes (DEG) between the two genetic groups, belonging to 17 clusters of co-expressed DEG, suggesting that domestication affected not only individual genes but also regulatory networks. Five co-expression clusters were enriched in functional terms involving carbohydrate metabolism or epigenetic regulation of gene expression. We detected differences in nucleotide diversity between the crop and wild groups specific to DEG. Our study provides an extensive profiling of the rewiring of gene co-expression induced by the domestication syndrome in one of the main crop species. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  15. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    Energy Technology Data Exchange (ETDEWEB)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  16. Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea

    Directory of Open Access Journals (Sweden)

    Wolf Yuri I

    2007-11-01

    Full Text Available Abstract Background An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs. Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. Results New Archaeal Clusters of Orthologous Genes (arCOGs were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover ~88% of the genes in a genome compared to a ~76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; ~40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile

  17. rbcL gene sequences provide evidence for the evolutionary lineages of leptosporangiate ferns.

    Science.gov (United States)

    Hasebe, M; Omori, T; Nakazawa, M; Sano, T; Kato, M; Iwatsuki, K

    1994-06-07

    Pteriodophytes have a longer evolutionary history than any other vascular land plant and, therefore, have endured greater loss of phylogenetically informative information. This factor has resulted in substantial disagreements in evaluating characters and, thus, controversy in establishing a stable classification. To compare competing classifications, we obtained DNA sequences of a chloroplast gene. The sequence of 1206 nt of the large subunit of the ribulose-bisphosphate carboxylase gene (rbcL) was determined from 58 species, representing almost all families of leptosporangiate ferns. Phlogenetic trees were inferred by the neighbor-joining and the parsimony methods. The two methods produced almost identical phylogenetic trees that provided insights concerning major general evolutionary trends in the leptosporangiate ferns. Interesting findings were as follows: (i) two morphologically distinct heterosporous water ferns, Marsilea and Salvinia, are sister genera; (ii) the tree ferns (Cyatheaceae, Dicksoniaceae, and Metaxyaceae) are monophyletic; and (iii) polypodioids are distantly related to the gleichenioids in spite of the similarity of their exindusiate soral morphology and are close to the higher indusiate ferns. In addition, the affinities of several "problematic genera" were assessed.

  18. Evolutionary Conservation in Genes Underlying Human Psychiatric Disorders

    Directory of Open Access Journals (Sweden)

    Lisa Michelle Ogawa

    2014-05-01

    Full Text Available Many psychiatric diseases observed in humans have tenuous or absent analogs in other species. Most notable among these are schizophrenia and autism. One hypothesis has posited that these diseases have arisen as a consequence of human brain evolution, for example, that the same processes that led to advances in cognition, language, and executive function also resulted in novel diseases in humans when dysfunctional. Here, the molecular evolution of genes associated with these and other psychiatric disorders are compared among species. Genes associated with psychiatric disorders are drawn from the literature and orthologous sequences are collected from eleven primate species (human, chimpanzee, bonobo, gorilla, orangutan, gibbon, macaque, baboon, marmoset, squirrel monkey, and galago and thirty one non-primate mammalian species. Evolutionary parameters, including dN/dS, are calculated for each gene and compared between disease classes and among species, focusing on humans and primates compared to other mammals and on large-brained taxa (cetaceans, rhinoceros, walrus, bear, and elephant compared to their small-brained sister species. Evidence of differential selection in primates supports the hypothesis that schizophrenia and autism are a cost of higher brain function. Through this work a better understanding of the molecular evolution of the human brain, the pathophysiology of disease, and the genetic basis of human psychiatric disease is gained.

  19. Evolutionary approaches for the reverse-engineering of gene regulatory networks: A study on a biologically realistic dataset

    Directory of Open Access Journals (Sweden)

    Gidrol Xavier

    2008-02-01

    Full Text Available Abstract Background Inferring gene regulatory networks from data requires the development of algorithms devoted to structure extraction. When only static data are available, gene interactions may be modelled by a Bayesian Network (BN that represents the presence of direct interactions from regulators to regulees by conditional probability distributions. We used enhanced evolutionary algorithms to stochastically evolve a set of candidate BN structures and found the model that best fits data without prior knowledge. Results We proposed various evolutionary strategies suitable for the task and tested our choices using simulated data drawn from a given bio-realistic network of 35 nodes, the so-called insulin network, which has been used in the literature for benchmarking. We assessed the inferred models against this reference to obtain statistical performance results. We then compared performances of evolutionary algorithms using two kinds of recombination operators that operate at different scales in the graphs. We introduced a niching strategy that reinforces diversity through the population and avoided trapping of the algorithm in one local minimum in the early steps of learning. We show the limited effect of the mutation operator when niching is applied. Finally, we compared our best evolutionary approach with various well known learning algorithms (MCMC, K2, greedy search, TPDA, MMHC devoted to BN structure learning. Conclusion We studied the behaviour of an evolutionary approach enhanced by niching for the learning of gene regulatory networks with BN. We show that this approach outperforms classical structure learning methods in elucidating the original model. These results were obtained for the learning of a bio-realistic network and, more importantly, on various small datasets. This is a suitable approach for learning transcriptional regulatory networks from real datasets without prior knowledge.

  20. Genomewide Expression and Functional Interactions of Genes under Drought Stress in Maize

    Directory of Open Access Journals (Sweden)

    Nepolean Thirunavukkarasu

    2017-01-01

    Full Text Available A genomewide transcriptome assay of two subtropical genotypes of maize was used to observe the expression of genes at seedling stage of drought stress. The number of genes expressed differentially was greater in HKI1532 (a drought tolerant genotype than in PC3 (a drought sensitive genotype, indicating primary differences at the transcriptional level in stress tolerance. The global coexpression networks of the two genotypes differed significantly with respect to the number of modules and the coexpression pattern within the modules. A total of 174 drought-responsive genes were selected from HKI1532, and their coexpression network revealed key correlations between different adaptive pathways, each cluster of the network representing a specific biological function. Transcription factors related to ABA-dependent stomatal closure, signalling, and phosphoprotein cascades work in concert to compensate for reduced photosynthesis. Under stress, water balance was maintained by coexpression of the genes involved in osmotic adjustments and transporter proteins. Metabolism was maintained by the coexpression of genes involved in cell wall modification and protein and lipid metabolism. The interaction of genes involved in crucial biological functions during stress was identified and the results will be useful in targeting important gene interactions to understand drought tolerance in greater detail.

  1. Classification and evolutionary analysis of the basic helix-loop-helix gene family in the green anole lizard, Anolis carolinensis.

    Science.gov (United States)

    Liu, Ake; Wang, Yong; Zhang, Debao; Wang, Xuhua; Song, Huifang; Dang, Chunwang; Yao, Qin; Chen, Keping

    2013-08-01

    Helix-loop-helix (bHLH) proteins play essential regulatory roles in a variety of biological processes. These highly conserved proteins form a large transcription factor superfamily, and are commonly identified in large numbers within animal, plant, and fungal genomes. The bHLH domain has been well studied in many animal species, but has not yet been characterized in non-avian reptiles. In this study, we identified 102 putative bHLH genes in the genome of the green anole lizard, Anolis carolinensis. Based on phylogenetic analysis, these genes were classified into 43 families, with 43, 24, 16, 3, 10, and 3 members assigned into groups A, B, C, D, E, and F, respectively, and 3 members categorized as "orphans". Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with highly conserved patterns observed for introns and additional domains. Results from phylogenetic analysis of the H/E(spl) family suggest that genome and tandem gene duplications have contributed to this family's expansion. Our classification and evolutionary analysis has provided insights into the evolutionary diversification of animal bHLH genes, and should aid future studies on bHLH protein regulation of key growth and developmental processes.

  2. Evolutionary history of the recruitment of conserved developmental genes in association to the formation and diversification of a novel trait

    Directory of Open Access Journals (Sweden)

    Shirai Leila T

    2012-02-01

    Full Text Available Abstract Background The origin and modification of novel traits are important aspects of biological diversification. Studies combining concepts and approaches of developmental genetics and evolutionary biology have uncovered many examples of the recruitment, or co-option, of genes conserved across lineages for the formation of novel, lineage-restricted traits. However, little is known about the evolutionary history of the recruitment of those genes, and of the relationship between them -for example, whether the co-option involves whole or parts of existing networks, or whether it occurs by redeployment of individual genes with de novo rewiring. We use a model novel trait, color pattern elements on butterfly wings called eyespots, to explore these questions. Eyespots have greatly diversified under natural and sexual selection, and their formation involves genetic circuitries shared across insects. Results We investigated the evolutionary history of the recruitment and co-recruitment of four conserved transcription regulators to the larval wing disc region where circular pattern elements develop. The co-localization of Antennapedia, Notch, Distal-less, and Spalt with presumptive (eyespot organizers was examined in 13 butterfly species, providing the largest comparative dataset available for the system. We found variation between families, between subfamilies, and between tribes. Phylogenetic reconstructions by parsimony and maximum likelihood methods revealed an unambiguous evolutionary history only for Antennapedia, with a resolved single origin of eyespot-associated expression, and many homoplastic events for Notch, Distal-less, and Spalt. The flexibility in the (co-recruitment of the targeted genes includes cases where different gene combinations are associated with morphologically similar eyespots, as well as cases where identical protein combinations are associated with very different phenotypes. Conclusions The evolutionary history of gene

  3. Successful recombinant production of Allochromatium vinosum cytochrome c' requires coexpression of cmm genes in heme-rich Escherichia coli JCB712

    International Nuclear Information System (INIS)

    Evers, Toon H.; Merkx, Maarten

    2005-01-01

    Cytochrome c' from the purple photosynthetic bacterium Allochromatium vinosum (CCP) displays a unique, reversible dimer-to-monomer transition upon binding of NO, CO, and CN - . This small, four helix bundle protein represents an attractive model for the study of other heme protein biosensors, provided a recombinant expression system is available. Here we report the development of an efficient expression system for CCP that makes use of a maltose binding protein fusion strategy to enhance periplasmic expression and allow easy purification by affinity chromatography. Coexpression of cytochrome c maturase genes and the use of a heme-rich Escherichia coli strain were found to be necessary to obtain reasonable yields of cytochrome c'. Characterization using circular dichroism, UV-vis spectroscopy, and size-exclusion chromatography confirms the native-like properties of the recombinant protein, including its ligand-induced monomerization

  4. Genetic Network Inference: From Co-Expression Clustering to Reverse Engineering

    Science.gov (United States)

    Dhaeseleer, Patrik; Liang, Shoudan; Somogyi, Roland

    2000-01-01

    Advances in molecular biological, analytical, and computational technologies are enabling us to systematically investigate the complex molecular processes underlying biological systems. In particular, using high-throughput gene expression assays, we are able to measure the output of the gene regulatory network. We aim here to review datamining and modeling approaches for conceptualizing and unraveling the functional relationships implicit in these datasets. Clustering of co-expression profiles allows us to infer shared regulatory inputs and functional pathways. We discuss various aspects of clustering, ranging from distance measures to clustering algorithms and multiple-duster memberships. More advanced analysis aims to infer causal connections between genes directly, i.e., who is regulating whom and how. We discuss several approaches to the problem of reverse engineering of genetic networks, from discrete Boolean networks, to continuous linear and non-linear models. We conclude that the combination of predictive modeling with systematic experimental verification will be required to gain a deeper insight into living organisms, therapeutic targeting, and bioengineering.

  5. Origin of a function by tandem gene duplication limits the evolutionary capability of its sister copy.

    Science.gov (United States)

    Hasselmann, Martin; Lechner, Sarah; Schulte, Christina; Beye, Martin

    2010-07-27

    The most remarkable outcome of a gene duplication event is the evolution of a novel function. Little information exists on how the rise of a novel function affects the evolution of its paralogous sister gene copy, however. We studied the evolution of the feminizer (fem) gene from which the gene complementary sex determiner (csd) recently derived by tandem duplication within the honey bee (Apis) lineage. Previous studies showed that fem retained its sex determination function, whereas the rise of csd established a new primary signal of sex determination. We observed a specific reduction of nonsynonymous to synonymous substitution ratios in Apis to non-Apis fem. We found a contrasting pattern at two other genetically linked genes, suggesting that hitchhiking effects to csd, the locus under balancing selection, is not the cause of this evolutionary pattern. We also excluded higher synonymous substitution rates by relative rate testing. These results imply that stronger purifying selection is operating at the fem gene in the presence of csd. We propose that csd's new function interferes with the function of Fem protein, resulting in molecular constraints and limited evolvability of fem in the Apis lineage. Elevated silent nucleotide polymorphism in fem relative to the genome-wide average suggests that genetic linkage to the csd gene maintained more nucleotide variation in today's population. Our findings provide evidence that csd functionally and genetically interferes with fem, suggesting that a newly evolved gene and its functions can limit the evolutionary capability of other genes in the genome.

  6. Bcl-2 and N-Myc Coexpression Increases IGF-IR and Features of Malignant Growth in Neuroblastoma Cell Lines

    Directory of Open Access Journals (Sweden)

    Rama Jasty

    2001-01-01

    Full Text Available The bcl-2 and c-myc oncogenes cooperate to transform multiple cell types. In the pediatric malignancy NB2, Bcl2 is highly expressed. In tumors with a poor prognosis, N-Myc, a protein homologous to c-Myc, is overexpressed as a result of gene amplification. The present study was designed to determine whether Bcl-2 cooperates with N-Myc to bestow a tumorigenic phenotype to neuroblastoma (NB cells. NB cell lines that at baseline express neither Bcl-2 nor N-Myc were stably transfected to express these gene products. In this model, we found Bcl-2 rescues N-Myc-expressing cells from apoptosis induced by serum withdrawal. Coexpression of Bcl-2 and N-Myc supports growth in low serum conditions and anchorage-independent growth in soft agar. Similarly, in vivo tumorigenic and angiogenic activity was dependent on coexpression. Our data further suggests that the mechanism underlying these changes involves the receptor for insulin growth factor type I (IGF-IR.

  7. Enforcing Co-expression Within a Brain-Imaging Genomics Regression Framework.

    Science.gov (United States)

    Zille, Pascal; Calhoun, Vince D; Wang, Yu-Ping

    2017-06-28

    Among the challenges arising in brain imaging genetic studies, estimating the potential links between neurological and genetic variability within a population is key. In this work, we propose a multivariate, multimodal formulation for variable selection that leverages co-expression patterns across various data modalities. Our approach is based on an intuitive combination of two widely used statistical models: sparse regression and canonical correlation analysis (CCA). While the former seeks multivariate linear relationships between a given phenotype and associated observations, the latter searches to extract co-expression patterns between sets of variables belonging to different modalities. In the following, we propose to rely on a 'CCA-type' formulation in order to regularize the classical multimodal sparse regression problem (essentially incorporating both CCA and regression models within a unified formulation). The underlying motivation is to extract discriminative variables that are also co-expressed across modalities. We first show that the simplest formulation of such model can be expressed as a special case of collaborative learning methods. After discussing its limitation, we propose an extended, more flexible formulation, and introduce a simple and efficient alternating minimization algorithm to solve the associated optimization problem.We explore the parameter space and provide some guidelines regarding parameter selection. Both the original and extended versions are then compared on a simple toy dataset and a more advanced simulated imaging genomics dataset in order to illustrate the benefits of the latter. Finally, we validate the proposed formulation using single nucleotide polymorphisms (SNP) data and functional magnetic resonance imaging (fMRI) data from a population of adolescents (n = 362 subjects, age 16.9 ± 1.9 years from the Philadelphia Neurodevelopmental Cohort) for the study of learning ability. Furthermore, we carry out a significance

  8. Identifying modules of coexpressed transcript units and their organization of Saccharopolyspora erythraea from time series gene expression profiles.

    Directory of Open Access Journals (Sweden)

    Xiao Chang

    Full Text Available BACKGROUND: The Saccharopolyspora erythraea genome sequence was released in 2007. In order to look at the gene regulations at whole transcriptome level, an expression microarray was specifically designed on the S. erythraea strain NRRL 2338 genome sequence. Based on these data, we set out to investigate the potential transcriptional regulatory networks and their organization. METHODOLOGY/PRINCIPAL FINDINGS: In view of the hierarchical structure of bacterial transcriptional regulation, we constructed a hierarchical coexpression network at whole transcriptome level. A total of 27 modules were identified from 1255 differentially expressed transcript units (TUs across time course, which were further classified in to four groups. Functional enrichment analysis indicated the biological significance of our hierarchical network. It was indicated that primary metabolism is activated in the first rapid growth phase (phase A, and secondary metabolism is induced when the growth is slowed down (phase B. Among the 27 modules, two are highly correlated to erythromycin production. One contains all genes in the erythromycin-biosynthetic (ery gene cluster and the other seems to be associated with erythromycin production by sharing common intermediate metabolites. Non-concomitant correlation between production and expression regulation was observed. Especially, by calculating the partial correlation coefficients and building the network based on Gaussian graphical model, intrinsic associations between modules were found, and the association between those two erythromycin production-correlated modules was included as expected. CONCLUSIONS: This work created a hierarchical model clustering transcriptome data into coordinated modules, and modules into groups across the time course, giving insight into the concerted transcriptional regulations especially the regulation corresponding to erythromycin production of S. erythraea. This strategy may be extendable to studies

  9. Identifying modules of coexpressed transcript units and their organization of Saccharopolyspora erythraea from time series gene expression profiles.

    Science.gov (United States)

    Chang, Xiao; Liu, Shuai; Yu, Yong-Tao; Li, Yi-Xue; Li, Yuan-Yuan

    2010-08-12

    The Saccharopolyspora erythraea genome sequence was released in 2007. In order to look at the gene regulations at whole transcriptome level, an expression microarray was specifically designed on the S. erythraea strain NRRL 2338 genome sequence. Based on these data, we set out to investigate the potential transcriptional regulatory networks and their organization. In view of the hierarchical structure of bacterial transcriptional regulation, we constructed a hierarchical coexpression network at whole transcriptome level. A total of 27 modules were identified from 1255 differentially expressed transcript units (TUs) across time course, which were further classified in to four groups. Functional enrichment analysis indicated the biological significance of our hierarchical network. It was indicated that primary metabolism is activated in the first rapid growth phase (phase A), and secondary metabolism is induced when the growth is slowed down (phase B). Among the 27 modules, two are highly correlated to erythromycin production. One contains all genes in the erythromycin-biosynthetic (ery) gene cluster and the other seems to be associated with erythromycin production by sharing common intermediate metabolites. Non-concomitant correlation between production and expression regulation was observed. Especially, by calculating the partial correlation coefficients and building the network based on Gaussian graphical model, intrinsic associations between modules were found, and the association between those two erythromycin production-correlated modules was included as expected. This work created a hierarchical model clustering transcriptome data into coordinated modules, and modules into groups across the time course, giving insight into the concerted transcriptional regulations especially the regulation corresponding to erythromycin production of S. erythraea. This strategy may be extendable to studies on other prokaryotic microorganisms.

  10. Assessing the evolutionary rate of positional orthologous genes in prokaryotes using synteny data

    Directory of Open Access Journals (Sweden)

    Lespinet Olivier

    2007-11-01

    Full Text Available Abstract Background Comparison of completely sequenced microbial genomes has revealed how fluid these genomes are. Detecting synteny blocks requires reliable methods to determining the orthologs among the whole set of homologs detected by exhaustive comparisons between each pair of completely sequenced genomes. This is a complex and difficult problem in the field of comparative genomics but will help to better understand the way prokaryotic genomes are evolving. Results We have developed a suite of programs that automate three essential steps to study conservation of gene order, and validated them with a set of 107 bacteria and archaea that cover the majority of the prokaryotic taxonomic space. We identified the whole set of shared homologs between two or more species and computed the evolutionary distance separating each pair of homologs. We applied two strategies to extract from the set of homologs a collection of valid orthologs shared by at least two genomes. The first computes the Reciprocal Smallest Distance (RSD using the PAM distances separating pairs of homologs. The second method groups homologs in families and reconstructs each family's evolutionary tree, distinguishing bona fide orthologs as well as paralogs created after the last speciation event. Although the phylogenetic tree method often succeeds where RSD fails, the reverse could occasionally be true. Accordingly, we used the data obtained with either methods or their intersection to number the orthologs that are adjacent in for each pair of genomes, the Positional Orthologous Genes (POGs, and to further study their properties. Once all these synteny blocks have been detected, we showed that POGs are subject to more evolutionary constraints than orthologs outside synteny groups, whichever the taxonomic distance separating the compared organisms. Conclusion The suite of programs described in this paper allows a reliable detection of orthologs and is useful for evaluating gene

  11. Iron homeostasis in Arabidopsis thaliana: transcriptomic analyses reveal novel FIT-regulated genes, iron deficiency marker genes and functional gene networks.

    Science.gov (United States)

    Mai, Hans-Jörg; Pateyron, Stéphanie; Bauer, Petra

    2016-10-03

    FIT (FER-LIKE IRON DEFICIENCY-INDUCED TRANSCRIPTION FACTOR) is the central regulator of iron uptake in Arabidopsis thaliana roots. We performed transcriptome analyses of six day-old seedlings and roots of six week-old plants using wild type, a fit knock-out mutant and a FIT over-expression line grown under iron-sufficient or iron-deficient conditions. We compared genes regulated in a FIT-dependent manner depending on the developmental stage of the plants. We assembled a high likelihood dataset which we used to perform co-expression and functional analysis of the most stably iron deficiency-induced genes. 448 genes were found FIT-regulated. Out of these, 34 genes were robustly FIT-regulated in root and seedling samples and included 13 novel FIT-dependent genes. Three hundred thirty-one genes showed differential regulation in response to the presence and absence of FIT only in the root samples, while this was the case for 83 genes in the seedling samples. We assembled a virtual dataset of iron-regulated genes based on a total of 14 transcriptomic analyses of iron-deficient and iron-sufficient wild-type plants to pinpoint the best marker genes for iron deficiency and analyzed this dataset in depth. Co-expression analysis of this dataset revealed 13 distinct regulons part of which predominantly contained functionally related genes. We could enlarge the list of FIT-dependent genes and discriminate between genes that are robustly FIT-regulated in roots and seedlings or only in one of those. FIT-regulated genes were mostly induced, few of them were repressed by FIT. With the analysis of a virtual dataset we could filter out and pinpoint new candidates among the most reliable marker genes for iron deficiency. Moreover, co-expression and functional analysis of this virtual dataset revealed iron deficiency-induced and functionally distinct regulons.

  12. Rapid Evolution of Ovarian-Biased Genes in the Yellow Fever Mosquito (Aedes aegypti).

    Science.gov (United States)

    Whittle, Carrie A; Extavour, Cassandra G

    2017-08-01

    Males and females exhibit highly dimorphic phenotypes, particularly in their gonads, which is believed to be driven largely by differential gene expression. Typically, the protein sequences of genes upregulated in males, or male-biased genes, evolve rapidly as compared to female-biased and unbiased genes. To date, the specific study of gonad-biased genes remains uncommon in metazoans. Here, we identified and studied a total of 2927, 2013, and 4449 coding sequences (CDS) with ovary-biased, testis-biased, and unbiased expression, respectively, in the yellow fever mosquito Aedes aegypti The results showed that ovary-biased and unbiased CDS had higher nonsynonymous to synonymous substitution rates (dN/dS) and lower optimal codon usage (those codons that promote efficient translation) than testis-biased genes. Further, we observed higher dN/dS in ovary-biased genes than in testis-biased genes, even for genes coexpressed in nonsexual (embryo) tissues. Ovary-specific genes evolved exceptionally fast, as compared to testis- or embryo-specific genes, and exhibited higher frequency of positive selection. Genes with ovary expression were preferentially involved in olfactory binding and reception. We hypothesize that at least two potential mechanisms could explain rapid evolution of ovary-biased genes in this mosquito: (1) the evolutionary rate of ovary-biased genes may be accelerated by sexual selection (including female-female competition or male-mate choice) affecting olfactory genes during female swarming by males, and/or by adaptive evolution of olfactory signaling within the female reproductive system ( e.g. , sperm-ovary signaling); and/or (2) testis-biased genes may exhibit decelerated evolutionary rates due to the formation of mating plugs in the female after copulation, which limits male-male sperm competition. Copyright © 2017 by the Genetics Society of America.

  13. Analysis of the dynamic co-expression network of heart regeneration in the zebrafish

    Science.gov (United States)

    Rodius, Sophie; Androsova, Ganna; Götz, Lou; Liechti, Robin; Crespo, Isaac; Merz, Susanne; Nazarov, Petr V.; de Klein, Niek; Jeanty, Céline; González-Rosa, Juan M.; Muller, Arnaud; Bernardin, Francois; Niclou, Simone P.; Vallar, Laurent; Mercader, Nadia; Ibberson, Mark; Xenarios, Ioannis; Azuaje, Francisco

    2016-05-01

    The zebrafish has the capacity to regenerate its heart after severe injury. While the function of a few genes during this process has been studied, we are far from fully understanding how genes interact to coordinate heart regeneration. To enable systematic insights into this phenomenon, we generated and integrated a dynamic co-expression network of heart regeneration in the zebrafish and linked systems-level properties to the underlying molecular events. Across multiple post-injury time points, the network displays topological attributes of biological relevance. We show that regeneration steps are mediated by modules of transcriptionally coordinated genes, and by genes acting as network hubs. We also established direct associations between hubs and validated drivers of heart regeneration with murine and human orthologs. The resulting models and interactive analysis tools are available at http://infused.vital-it.ch. Using a worked example, we demonstrate the usefulness of this unique open resource for hypothesis generation and in silico screening for genes involved in heart regeneration.

  14. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    Directory of Open Access Journals (Sweden)

    Surendra K Prajapati

    Full Text Available The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75 from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years and long-term population history (79,235 to 104,008 of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes to that inferred from mitochondrial genome diversity.

  15. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    Science.gov (United States)

    Prajapati, Surendra K; Joshi, Hema; Carlton, Jane M; Rizvi, M Alam

    2013-01-01

    The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

  16. Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.

    Directory of Open Access Journals (Sweden)

    Xiaolin Xiao

    2014-01-01

    Full Text Available Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states. Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based and humans (mRNA-sequencing-based and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi

  17. Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.

    Science.gov (United States)

    Xiao, Xiaolin; Moreno-Moral, Aida; Rotival, Maxime; Bottolo, Leonardo; Petretto, Enrico

    2014-01-01

    Recent high-throughput efforts such as ENCODE have generated a large body of genome-scale transcriptional data in multiple conditions (e.g., cell-types and disease states). Leveraging these data is especially important for network-based approaches to human disease, for instance to identify coherent transcriptional modules (subnetworks) that can inform functional disease mechanisms and pathological pathways. Yet, genome-scale network analysis across conditions is significantly hampered by the paucity of robust and computationally-efficient methods. Building on the Higher-Order Generalized Singular Value Decomposition, we introduce a new algorithmic approach for efficient, parameter-free and reproducible identification of network-modules simultaneously across multiple conditions. Our method can accommodate weighted (and unweighted) networks of any size and can similarly use co-expression or raw gene expression input data, without hinging upon the definition and stability of the correlation used to assess gene co-expression. In simulation studies, we demonstrated distinctive advantages of our method over existing methods, which was able to recover accurately both common and condition-specific network-modules without entailing ad-hoc input parameters as required by other approaches. We applied our method to genome-scale and multi-tissue transcriptomic datasets from rats (microarray-based) and humans (mRNA-sequencing-based) and identified several common and tissue-specific subnetworks with functional significance, which were not detected by other methods. In humans we recapitulated the crosstalk between cell-cycle progression and cell-extracellular matrix interactions processes in ventricular zones during neocortex expansion and further, we uncovered pathways related to development of later cognitive functions in the cortical plate of the developing brain which were previously unappreciated. Analyses of seven rat tissues identified a multi-tissue subnetwork of co-expressed

  18. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene.

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  19. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Science.gov (United States)

    Casucci, Monica; Falcone, Laura; Camisa, Barbara; Norelli, Margherita; Porcellini, Simona; Stornaiuolo, Anna; Ciceri, Fabio; Traversari, Catia; Bordignon, Claudio; Bonini, Chiara; Bondanza, Attilio

    2018-01-01

    Chimeric antigen receptor (CAR)-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR). We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6). We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting. PMID:29619024

  20. Extracellular NGFR Spacers Allow Efficient Tracking and Enrichment of Fully Functional CAR-T Cells Co-Expressing a Suicide Gene

    Directory of Open Access Journals (Sweden)

    Monica Casucci

    2018-03-01

    Full Text Available Chimeric antigen receptor (CAR-T cell immunotherapy is at the forefront of innovative cancer therapeutics. However, lack of standardization of cellular products within the same clinical trial and lack of harmonization between different trials have hindered the clear identification of efficacy and safety determinants that should be unveiled in order to advance the field. With the aim of facilitating the isolation and in vivo tracking of CAR-T cells, we here propose the inclusion within the CAR molecule of a novel extracellular spacer based on the low-affinity nerve-growth-factor receptor (NGFR. We screened four different spacer designs using as target antigen the CD44 isoform variant 6 (CD44v6. We successfully generated NGFR-spaced CD44v6 CAR-T cells that could be efficiently enriched with clinical-grade immuno-magnetic beads without negative consequences on subsequent expansion, immuno-phenotype, in vitro antitumor reactivity, and conditional ablation when co-expressing a suicide gene. Most importantly, these cells could be tracked with anti-NGFR monoclonal antibodies in NSG mice, where they expanded, persisted, and exerted potent antitumor effects against both high leukemia and myeloma burdens. Similar results were obtained with NGFR-enriched CAR-T cells specific for CD19 or CEA, suggesting the universality of this strategy. In conclusion, we have demonstrated that the incorporation of the NGFR marker gene within the CAR sequence allows for a single molecule to simultaneously work as a therapeutic and selection/tracking gene. Looking ahead, NGFR spacer enrichment might allow good manufacturing procedures-manufacturing of standardized CAR-T cell products with high therapeutic potential, which could be harmonized in different clinical trials and used in combination with a suicide gene for future application in the allogeneic setting.

  1. Genome-wide evolutionary characterization and expression analyses of major latex protein (MLP) family genes in Vitis vinifera.

    Science.gov (United States)

    Zhang, Ningbo; Li, Ruimin; Shen, Wei; Jiao, Shuzhen; Zhang, Junxiang; Xu, Weirong

    2018-04-27

    The major latex protein/ripening-related protein (MLP/RRP) subfamily is known to be involved in a wide range of biological processes of plant development and various stress responses. However, the biological function of MLP/RRP proteins is still far from being clear and identification of them may provide important clues for understanding their roles. Here, we report a genome-wide evolutionary characterization and gene expression analysis of the MLP family in European Vitis species. A total of 14 members, was found in the grape genome, all of which are located on chromosome 1, where are predominantly arranged in tandem clusters. We have noticed, most surprisingly, promoter-sharing by several non-identical but highly similar gene members to a greater extent than expected by chance. Synteny analysis between the grape and Arabidopsis thaliana genomes suggested that 3 grape MLP genes arose before the divergence of the two species. Phylogenetic analysis provided further insights into the evolutionary relationship between the genes, as well as their putative functions, and tissue-specific expression analysis suggested distinct biological roles for different members. Our expression data suggested a couple of candidate genes involved in abiotic stresses and phytohormone responses. The present work provides new insight into the evolution and regulation of Vitis MLP genes, which represent targets for future studies and inclusion in tolerance-related molecular breeding programs.

  2. Genome-Wide Identification, Evolutionary and Expression Analyses of the GALACTINOL SYNTHASE Gene Family in Rapeseed and Tobacco

    Directory of Open Access Journals (Sweden)

    Yonghai Fan

    2017-12-01

    Full Text Available Galactinol synthase (GolS is a key enzyme in raffinose family oligosaccharide (RFO biosynthesis. The finding that GolS accumulates in plants exposed to abiotic stresses indicates RFOs function in environmental adaptation. However, the evolutionary relationships and biological functions of GolS family in rapeseed (Brassica napus and tobacco (Nicotiana tabacum remain unclear. In this study, we identified 20 BnGolS and 9 NtGolS genes. Subcellular localization predictions showed that most of the proteins are localized to the cytoplasm. Phylogenetic analysis identified a lost event of an ancient GolS copy in the Solanaceae and an ancient duplication event leading to evolution of GolS4/7 in the Brassicaceae. The three-dimensional structures of two GolS proteins were conserved, with an important DxD motif for binding to UDP-galactose (uridine diphosphate-galactose and inositol. Expression profile analysis indicated that BnGolS and NtGolS genes were expressed in most tissues and highly expressed in one or two specific tissues. Hormone treatments strongly induced the expression of most BnGolS genes and homologous genes in the same subfamilies exhibited divergent-induced expression. Our study provides a comprehensive evolutionary analysis of GolS genes among the Brassicaceae and Solanaceae as well as an insight into the biological function of GolS genes in hormone response in plants.

  3. Preservation Analysis of Macrophage Gene Coexpression Between Human and Mouse Identifies PARK2 as a Genetically Controlled Master Regulator of Oxidative Phosphorylation in Humans

    Directory of Open Access Journals (Sweden)

    Veronica Codoni

    2016-10-01

    Full Text Available Macrophages are key players involved in numerous pathophysiological pathways and an in-depth characterization of their gene regulatory networks can help in better understanding how their dysfunction may impact on human diseases. We here conducted a cross-species network analysis of macrophage gene expression data between human and mouse to identify conserved networks across both species, and assessed whether such networks could reveal new disease-associated regulatory mechanisms. From a sample of 684 individuals processed for genome-wide macrophage gene expression profiling, we identified 27 groups of coexpressed genes (modules. Six modules were found preserved (P < 10−4 in macrophages from 86 mice of the Hybrid Mouse Diversity Panel. One of these modules was significantly [false discovery rate (FDR = 8.9 × 10−11] enriched for genes belonging to the oxidative phosphorylation (OXPHOS pathway. This pathway was also found significantly (FDR < 10−4 enriched in susceptibility genes for Alzheimer, Parkinson, and Huntington diseases. We further conducted an expression quantitative trait loci analysis to identify SNP that could regulate macrophage OXPHOS gene expression in humans. This analysis identified the PARK2 rs192804963 as a trans-acting variant influencing (minimal P-value = 4.3 × 10−8 the expression of most OXPHOS genes in humans. Further experimental work demonstrated that PARK2 knockdown expression was associated with increased OXPHOS gene expression in THP1 human macrophages. This work provided strong new evidence that PARK2 participates to the regulatory networks associated with oxidative phosphorylation and suggested that PARK2 genetic variations could act as a trans regulator of OXPHOS gene macrophage expression in humans.

  4. Evolutionary changes of multiple visual pigment genes in the complete genome of Pacific bluefin tuna.

    Science.gov (United States)

    Nakamura, Yoji; Mori, Kazuki; Saitoh, Kenji; Oshima, Kenshiro; Mekuchi, Miyuki; Sugaya, Takuma; Shigenobu, Yuya; Ojima, Nobuhiko; Muta, Shigeru; Fujiwara, Atushi; Yasuike, Motoshige; Oohara, Ichiro; Hirakawa, Hideki; Chowdhury, Vishwajit Sur; Kobayashi, Takanori; Nakajima, Kazuhiro; Sano, Motohiko; Wada, Tokio; Tashiro, Kosuke; Ikeo, Kazuho; Hattori, Masahira; Kuhara, Satoru; Gojobori, Takashi; Inouye, Kiyoshi

    2013-07-02

    Tunas are migratory fishes in offshore habitats and top predators with unique features. Despite their ecological importance and high market values, the open-ocean lifestyle of tuna, in which effective sensing systems such as color vision are required for capture of prey, has been poorly understood. To elucidate the genetic and evolutionary basis of optic adaptation of tuna, we determined the genome sequence of the Pacific bluefin tuna (Thunnus orientalis), using next-generation sequencing technology. A total of 26,433 protein-coding genes were predicted from 16,802 assembled scaffolds. From these, we identified five common fish visual pigment genes: red-sensitive (middle/long-wavelength sensitive; M/LWS), UV-sensitive (short-wavelength sensitive 1; SWS1), blue-sensitive (SWS2), rhodopsin (RH1), and green-sensitive (RH2) opsin genes. Sequence comparison revealed that tuna's RH1 gene has an amino acid substitution that causes a short-wave shift in the absorption spectrum (i.e., blue shift). Pacific bluefin tuna has at least five RH2 paralogs, the most among studied fishes; four of the proteins encoded may be tuned to blue light at the amino acid level. Moreover, phylogenetic analysis suggested that gene conversions have occurred in each of the SWS2 and RH2 loci in a short period. Thus, Pacific bluefin tuna has undergone evolutionary changes in three genes (RH1, RH2, and SWS2), which may have contributed to detecting blue-green contrast and measuring the distance to prey in the blue-pelagic ocean. These findings provide basic information on behavioral traits of predatory fish and, thereby, could help to improve the technology to culture such fish in captivity for resource management.

  5. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations.

    Directory of Open Access Journals (Sweden)

    Sahra Uygun

    2016-12-01

    Full Text Available Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.

  6. Molecular phylogeny of the higher and lower taxonomy of the Fusarium genus and differences in the evolutionary histories of multiple genes

    Science.gov (United States)

    2011-01-01

    Background Species of the Fusarium genus are important fungi which is associated with health hazards in human and animals. The taxonomy of this genus has been a subject of controversy for many years. Although many researchers have applied molecular phylogenetic analysis to examine the taxonomy of Fusarium species, their phylogenetic relationships remain unclear only few comprehensive phylogenetic analyses of the Fusarium genus and a lack of suitable nucleotides and amino acid substitution rates. A previous stugy with whole genome comparison among Fusairum species revealed the possibility that each gene in Fusarium genomes has a unique evolutionary history, and such gene may bring difficulty to the reconstruction of phylogenetic tree of Fusarium. There is a need not only to check substitution rates of genes but also to perform the exact evaluation of each gene-evolution. Results We performed phylogenetic analyses based on the nucleotide sequences of the rDNA cluster region (rDNA cluster), and the β-tubulin gene (β-tub), the elongation factor 1α gene (EF-1α), and the aminoadipate reductase gene (lys2). Although incongruence of the tree topologies between lys2 and the other genes was detected, all genes supported the classification of Fusarium species into 7 major clades, I to VII. To obtain a reliable phylogeny for Fusarium species, we excluded the lys2 sequences from our dataset, and re-constructed a maximum likelihood (ML) tree based on the combined data of the rDNA cluster, β-tub, and EF-1α. Our ML tree indicated some interesting relationships in the higher and lower taxa of Fusarium species and related genera. Moreover, we observed a novel evolutionary history of lys2. We suggest that the unique tree topologies of lys2 are not due to an analytical artefact, but due to differences in the evolutionary history of genomes caused by positive selection of particular lineages. Conclusion This study showed the reliable species tree of the higher and lower taxonomy

  7. Evolutionary novelty in gravity sensing through horizontal gene transfer and high-order protein assembly.

    Directory of Open Access Journals (Sweden)

    Tu Anh Nguyen

    2018-04-01

    Full Text Available Horizontal gene transfer (HGT can promote evolutionary adaptation by transforming a species' relationship to the environment. In most well-understood cases of HGT, acquired and donor functions appear to remain closely related. Thus, the degree to which HGT can lead to evolutionary novelties remains unclear. Mucorales fungi sense gravity through the sedimentation of vacuolar protein crystals. Here, we identify the octahedral crystal matrix protein (OCTIN. Phylogenetic analysis strongly supports acquisition of octin by HGT from bacteria. A bacterial OCTIN forms high-order periplasmic oligomers, and inter-molecular disulphide bonds are formed by both fungal and bacterial OCTINs, suggesting that they share elements of a conserved assembly mechanism. However, estimated sedimentation velocities preclude a gravity-sensing function for the bacterial structures. Together, our data suggest that HGT from bacteria into the Mucorales allowed a dramatic increase in assembly scale and emergence of the gravity-sensing function. We conclude that HGT can lead to evolutionary novelties that emerge depending on the physiological and cellular context of protein assembly.

  8. Disease gene characterization through large-scale co-expression analysis.

    Directory of Open Access Journals (Sweden)

    Allen Day

    2009-12-01

    Full Text Available In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET.Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2 and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders.UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.

  9. Evolutionary history of the third chromosome gene arrangements of Drosophila pseudoobscura inferred from inversion breakpoints.

    Science.gov (United States)

    Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W

    2011-08-01

    The third chromosome of Drosophila pseudoobscura is polymorphic for numerous gene arrangements that form classical clines in North America. The polytene salivary chromosomes isolated from natural populations revealed changes in gene order that allowed the different gene arrangements to be linked together by paracentric inversions representing one of the first cases where genetic data were used to construct a phylogeny. Although the inversion phylogeny can be used to determine the relationships among the gene arrangements, the cytogenetic data are unable to infer the ancestral arrangement or the age of the different chromosome types. These are both important properties if one is to infer the evolutionary forces responsible for the spread and maintenance of the chromosomes. Here, we employ the nucleotide sequences of 18 regions distributed across the third chromosome in 80-100 D. pseudoobscura strains to test whether five gene arrangements are of unique or multiple origin, what the ancestral arrangement was, and what are the ages of the different arrangements. Each strain carried one of six commonly found gene arrangements and the sequences were used to infer their evolutionary relationships. Breakpoint regions in the center of the chromosome supported monophyly of the gene arrangements, whereas regions at the ends of the chromosome gave phylogenies that provided less support for monophyly of the chromosomes either because the individual markers did not have enough phylogenetically informative sites or genetic exchange scrambled information among the gene arrangements. A data set where the genetic markers were concatenated strongly supported a unique origin of the different gene arrangements. The inversion polymorphism of D. pseudoobscura is estimated to be about a million years old. We have also shown that the generated phylogeny is consistent with the cytological phylogeny of this species. In addition, the data presented here support hypothetical as the ancestral

  10. A moth pheromone brewery: production of (Z)-11-hexadecenol by heterologous co-expression of two biosynthetic genes from a noctuid moth in a yeast cell factory.

    Science.gov (United States)

    Hagström, Åsa K; Wang, Hong-Lei; Liénard, Marjorie A; Lassance, Jean-Marc; Johansson, Tomas; Löfstedt, Christer

    2013-12-13

    Moths (Lepidoptera) are highly dependent on chemical communication to find a mate. Compared to conventional unselective insecticides, synthetic pheromones have successfully served to lure male moths as a specific and environmentally friendly way to control important pest species. However, the chemical synthesis and purification of the sex pheromone components in large amounts is a difficult and costly task. The repertoire of enzymes involved in moth pheromone biosynthesis in insecta can be seen as a library of specific catalysts that can be used to facilitate the synthesis of a particular chemical component. In this study, we present a novel approach to effectively aid in the preparation of semi-synthetic pheromone components using an engineered vector co-expressing two key biosynthetic enzymes in a simple yeast cell factory. We first identified and functionally characterized a ∆11 Fatty-Acyl Desaturase and a Fatty-Acyl Reductase from the Turnip moth, Agrotis segetum. The ∆11-desaturase produced predominantly Z11-16:acyl, a common pheromone component precursor, from the abundant yeast palmitic acid and the FAR transformed a series of saturated and unsaturated fatty acids into their corresponding alcohols which may serve as pheromone components in many moth species. Secondly, when we co-expressed the genes in the Brewer's yeast Saccharomyces cerevisiae, a set of long-chain fatty acids and alcohols that are not naturally occurring in yeast were produced from inherent yeast fatty acids, and the presence of (Z)-11-hexadecenol (Z11-16:OH), demonstrated that both heterologous enzymes were active in concert. A 100 ml batch yeast culture produced on average 19.5 μg Z11-16:OH. Finally, we demonstrated that oxidized extracts from the yeast cells containing (Z)-11-hexadecenal and other aldehyde pheromone compounds elicited specific electrophysiological activity from male antennae of the Tobacco budworm, Heliothis virescens, supporting the idea that genes from different

  11. The gsdf gene locus harbors evolutionary conserved and clustered genes preferentially expressed in fish previtellogenic oocytes.

    Science.gov (United States)

    Gautier, Aude; Le Gac, Florence; Lareyre, Jean-Jacques

    2011-02-01

    The gonadal soma-derived factor (GSDF) belongs to the transforming growth factor-β superfamily and is conserved in teleostean fish species. Gsdf is specifically expressed in the gonads, and gene expression is restricted to the granulosa and Sertoli cells in trout and medaka. The gsdf gene expression is correlated to early testis differentiation in medaka and was shown to stimulate primordial germ cell and spermatogonia proliferation in trout. In the present study, we show that the gsdf gene localizes to a syntenic chromosomal fragment conserved among vertebrates although no gsdf-related gene is detected on the corresponding genomic region in tetrapods. We demonstrate using quantitative RT-PCR that most of the genes localized in the synteny are specifically expressed in medaka gonads. Gsdf is the only gene of the synteny with a much higher expression in the testis compared to the ovary. In contrast, gene expression pattern analysis of the gsdf surrounding genes (nup54, aff1, klhl8, sdad1, and ptpn13) indicates that these genes are preferentially expressed in the female gonads. The tissue distribution of these genes is highly similar in medaka and zebrafish, two teleostean species that have diverged more than 110 million years ago. The cellular localization of these genes was determined in medaka gonads using the whole-mount in situ hybridization technique. We confirm that gsdf gene expression is restricted to Sertoli and granulosa cells in contact with the premeiotic and meiotic cells. The nup54 gene is expressed in spermatocytes and previtellogenic oocytes. Transcripts corresponding to the ovary-specific genes (aff1, klhl8, and sdad1) are detected only in previtellogenic oocytes. No expression was detected in the gonocytes in 10 dpf embryos. In conclusion, we show that the gsdf gene localizes to a syntenic chromosomal fragment harboring evolutionary conserved genes in vertebrates. These genes are preferentially expressed in previtelloogenic oocytes, and thus, they

  12. The importance of immune gene variability (MHC in evolutionary ecology and conservation

    Directory of Open Access Journals (Sweden)

    Sommer Simone

    2005-10-01

    Full Text Available Abstract Genetic studies have typically inferred the effects of human impact by documenting patterns of genetic differentiation and levels of genetic diversity among potentially isolated populations using selective neutral markers such as mitochondrial control region sequences, microsatellites or single nucleotide polymorphism (SNPs. However, evolutionary relevant and adaptive processes within and between populations can only be reflected by coding genes. In vertebrates, growing evidence suggests that genetic diversity is particularly important at the level of the major histocompatibility complex (MHC. MHC variants influence many important biological traits, including immune recognition, susceptibility to infectious and autoimmune diseases, individual odours, mating preferences, kin recognition, cooperation and pregnancy outcome. These diverse functions and characteristics place genes of the MHC among the best candidates for studies of mechanisms and significance of molecular adaptation in vertebrates. MHC variability is believed to be maintained by pathogen-driven selection, mediated either through heterozygote advantage or frequency-dependent selection. Up to now, most of our knowledge has derived from studies in humans or from model organisms under experimental, laboratory conditions. Empirical support for selective mechanisms in free-ranging animal populations in their natural environment is rare. In this review, I first introduce general information about the structure and function of MHC genes, as well as current hypotheses and concepts concerning the role of selection in the maintenance of MHC polymorphism. The evolutionary forces acting on the genetic diversity in coding and non-coding markers are compared. Then, I summarise empirical support for the functional importance of MHC variability in parasite resistance with emphasis on the evidence derived from free-ranging animal populations investigated in their natural habitat. Finally, I

  13. Profiling and Co-expression Network Analysis of Learned Helplessness Regulated mRNAs and lncRNAs in the Mouse Hippocampus

    Directory of Open Access Journals (Sweden)

    Chaoqun Li

    2018-01-01

    Full Text Available Although studies provide insights into the neurobiology of stress and depression, the exact molecular mechanisms underlying their pathologies remain largely unknown. Long non-coding RNA (lncRNA has been implicated in brain functions and behavior. A potential link between lncRNA and psychiatric disorders has been proposed. However, it remains undetermined whether IncRNA regulation, in the brain, contributes to stress or depression pathologies. In this study, we used a valid animal model of depression-like symptoms; namely learned helplessness, RNA-seq, Gene Ontology and co-expression network analyses to profile the expression pattern of lncRNA and mRNA in the hippocampus of mice. We identified 6346 differentially expressed transcripts. Among them, 340 lncRNAs and 3559 protein coding mRNAs were differentially expressed in helpless mice in comparison with control and/or non-helpless mice (inescapable stress resilient mice. Gene Ontology and pathway enrichment analyses indicated that induction of helplessness altered expression of mRNAs enriched in fundamental biological functions implicated in stress/depression neurobiology such as synaptic, metabolic, cell survival and proliferation, developmental and chromatin modification functions. To explore the possible regulatory roles of the altered lncRNAs, we constructed co-expression networks composed of the lncRNAs and mRNAs. Among our differentially expressed lncRNAs, 17% showed significant correlation with genes. Functional co-expression analysis linked the identified lncRNAs to several cellular mechanisms implicated in stress/depression neurobiology. Importantly, 57% of the identified regulatory lncRNAs significantly correlated with 18 different synapse-related functions. Thus, the current study identifies for the first time distinct groups of lncRNAs regulated by induction of learned helplessness in the mouse brain. Our results suggest that lncRNA-directed regulatory mechanisms might contribute to

  14. Profiling and Co-expression Network Analysis of Learned Helplessness Regulated mRNAs and lncRNAs in the Mouse Hippocampus.

    Science.gov (United States)

    Li, Chaoqun; Cao, Feifei; Li, Shengli; Huang, Shenglin; Li, Wei; Abumaria, Nashat

    2017-01-01

    Although studies provide insights into the neurobiology of stress and depression, the exact molecular mechanisms underlying their pathologies remain largely unknown. Long non-coding RNA (lncRNA) has been implicated in brain functions and behavior. A potential link between lncRNA and psychiatric disorders has been proposed. However, it remains undetermined whether IncRNA regulation, in the brain, contributes to stress or depression pathologies. In this study, we used a valid animal model of depression-like symptoms; namely learned helplessness, RNA-seq, Gene Ontology and co-expression network analyses to profile the expression pattern of lncRNA and mRNA in the hippocampus of mice. We identified 6346 differentially expressed transcripts. Among them, 340 lncRNAs and 3559 protein coding mRNAs were differentially expressed in helpless mice in comparison with control and/or non-helpless mice (inescapable stress resilient mice). Gene Ontology and pathway enrichment analyses indicated that induction of helplessness altered expression of mRNAs enriched in fundamental biological functions implicated in stress/depression neurobiology such as synaptic, metabolic, cell survival and proliferation, developmental and chromatin modification functions. To explore the possible regulatory roles of the altered lncRNAs, we constructed co-expression networks composed of the lncRNAs and mRNAs. Among our differentially expressed lncRNAs, 17% showed significant correlation with genes. Functional co-expression analysis linked the identified lncRNAs to several cellular mechanisms implicated in stress/depression neurobiology. Importantly, 57% of the identified regulatory lncRNAs significantly correlated with 18 different synapse-related functions. Thus, the current study identifies for the first time distinct groups of lncRNAs regulated by induction of learned helplessness in the mouse brain. Our results suggest that lncRNA-directed regulatory mechanisms might contribute to stress

  15. Horizontal gene transfer: essentiality and evolvability in prokaryotes, and roles in evolutionary transitions [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Eugene V. Koonin

    2016-07-01

    Full Text Available The wide spread of gene exchange and loss in the prokaryotic world has prompted the concept of ‘lateral genomics’ to the point of an outright denial of the relevance of phylogenetic trees for evolution. However, the pronounced coherence congruence of the topologies of numerous gene trees, particularly those for (nearly universal genes, translates into the notion of a statistical tree of life (STOL, which reflects a central trend of vertical evolution. The STOL can be employed as a framework for reconstruction of the evolutionary processes in the prokaryotic world. Quantitatively, however, horizontal gene transfer (HGT dominates microbial evolution, with the rate of gene gain and loss being comparable to the rate of point mutations and much greater than the duplication rate. Theoretical models of evolution suggest that HGT is essential for the survival of microbial populations that otherwise deteriorate due to the Muller’s ratchet effect. Apparently, at least some bacteria and archaea evolved dedicated vehicles for gene transfer that evolved from selfish elements such as plasmids and viruses. Recent phylogenomic analyses suggest that episodes of massive HGT were pivotal for the emergence of major groups of organisms such as multiple archaeal phyla as well as eukaryotes. Similar analyses appear to indicate that, in addition to donating hundreds of genes to the emerging eukaryotic lineage, mitochondrial endosymbiosis severely curtailed HGT. These results shed new light on the routes of evolutionary transitions, but caution is due given the inherent uncertainty of deep phylogenies.

  16. Identifying human disease genes through cross-species gene mapping of evolutionary conserved processes.

    Directory of Open Access Journals (Sweden)

    Martin Poot

    2011-05-01

    Full Text Available Understanding complex networks that modulate development in humans is hampered by genetic and phenotypic heterogeneity within and between populations. Here we present a method that exploits natural variation in highly diverse mouse genetic reference panels in which genetic and environmental factors can be tightly controlled. The aim of our study is to test a cross-species genetic mapping strategy, which compares data of gene mapping in human patients with functional data obtained by QTL mapping in recombinant inbred mouse strains in order to prioritize human disease candidate genes.We exploit evolutionary conservation of developmental phenotypes to discover gene variants that influence brain development in humans. We studied corpus callosum volume in a recombinant inbred mouse panel (C57BL/6J×DBA/2J, BXD strains using high-field strength MRI technology. We aligned mouse mapping results for this neuro-anatomical phenotype with genetic data from patients with abnormal corpus callosum (ACC development.From the 61 syndromes which involve an ACC, 51 human candidate genes have been identified. Through interval mapping, we identified a single significant QTL on mouse chromosome 7 for corpus callosum volume with a QTL peak located between 25.5 and 26.7 Mb. Comparing the genes in this mouse QTL region with those associated with human syndromes (involving ACC and those covered by copy number variations (CNV yielded a single overlap, namely HNRPU in humans and Hnrpul1 in mice. Further analysis of corpus callosum volume in BXD strains revealed that the corpus callosum was significantly larger in BXD mice with a B genotype at the Hnrpul1 locus than in BXD mice with a D genotype at Hnrpul1 (F = 22.48, p<9.87*10(-5.This approach that exploits highly diverse mouse strains provides an efficient and effective translational bridge to study the etiology of human developmental disorders, such as autism and schizophrenia.

  17. Correlation-maximizing surrogate gene space for visual mining of gene expression patterns in developing barley endosperm tissue

    Directory of Open Access Journals (Sweden)

    Usadel Björn

    2007-05-01

    Full Text Available Abstract Background Micro- and macroarray technologies help acquire thousands of gene expression patterns covering important biological processes during plant ontogeny. Particularly, faithful visualization methods are beneficial for revealing interesting gene expression patterns and functional relationships of coexpressed genes. Such screening helps to gain deeper insights into regulatory behavior and cellular responses, as will be discussed for expression data of developing barley endosperm tissue. For that purpose, high-throughput multidimensional scaling (HiT-MDS, a recent method for similarity-preserving data embedding, is substantially refined and used for (a assessing the quality and reliability of centroid gene expression patterns, and for (b derivation of functional relationships of coexpressed genes of endosperm tissue during barley grain development (0–26 days after flowering. Results Temporal expression profiles of 4824 genes at 14 time points are faithfully embedded into two-dimensional displays. Thereby, similar shapes of coexpressed genes get closely grouped by a correlation-based similarity measure. As a main result, by using power transformation of correlation terms, a characteristic cloud of points with bipolar sandglass shape is obtained that is inherently connected to expression patterns of pre-storage, intermediate and storage phase of endosperm development. Conclusion The new HiT-MDS-2 method helps to create global views of expression patterns and to validate centroids obtained from clustering programs. Furthermore, functional gene annotation for developing endosperm barley tissue is successfully mapped to the visualization, making easy localization of major centroids of enriched functional categories possible.

  18. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    Science.gov (United States)

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Heterologous co-expression of accA, fabD, and thioesterase genes for improving long-chain fatty acid production in Pseudomonas aeruginosa and Escherichia coli.

    Science.gov (United States)

    Lee, Sunhee; Jeon, Eunyoung; Jung, Yeontae; Lee, Jinwon

    2012-05-01

    The goal of the present study was to increase the content of intracellular long-chain fatty acids in two bacterial strains, Pseudomonas aeruginosa PA14 and Escherichia coli K-12 MG1655, by co-overexpressing essential enzymes that are involved in the fatty acid synthesis metabolic pathway. Recently, microbial fatty acids and their derivatives have been receiving increasing attention as an alternative source of fuel. By introducing two genes (accA and fabD) of P. aeruginosa into the two bacterial strains and by co-expressing with them the fatty acyl-acyl carrier protein thioesterase gene of Streptococcus pyogenes (strain MGAS10270), we have engineered recombinant strains that are efficient producers of long-chain fatty acids (C16 and C18). The recombinant strains exhibit a 1.3-1.7-fold increase in the production of long-chain fatty acids over the wild-type strains. To enhance the production of total long-chain fatty acids, we researched the carbon sources for optimized culture conditions and results were used for post-culture incubation period. E. coli SGJS17 (containing the accA, fabD, and thioesterase genes) produced the highest content of intracellular total fatty acids; in particular, the unsaturated fatty acid content was about 20-fold higher than that in the wild-type E. coli.

  20. Inequalities and Duality in Gene Coexpression Networks of HIV-1 Infection Revealed by the Combination of the Double-Connectivity Approach and the Gini's Method

    Directory of Open Access Journals (Sweden)

    Chuang Ma

    2011-01-01

    Full Text Available The symbiosis (Sym and pathogenesis (Pat is a duality problem of microbial infection, including HIV/AIDS. Statistical analysis of inequalities and duality in gene coexpression networks (GCNs of HIV-1 infection may gain novel insights into AIDS. In this study, we focused on analysis of GCNs of uninfected subjects and HIV-1-infected patients at three different stages of viral infection based on data deposited in the GEO database of NCBI. The inequalities and duality in these GCNs were analyzed by the combination of the double-connectivity (DC approach and the Gini's method. DC analysis reveals that there are significant differences between positive and negative connectivity in HIV-1 stage-specific GCNs. The inequality measures of negative connectivity and edge weight are changed more significantly than those of positive connectivity and edge weight in GCNs from the HIV-1 uninfected to the AIDS stages. With the permutation test method, we identified a set of genes with significant changes in the inequality and duality measure of edge weight. Functional analysis shows that these genes are highly enriched for the immune system, which plays an essential role in the Sym-Pat duality (SPD of microbial infections. Understanding of the SPD problems of HIV-1 infection may provide novel intervention strategies for AIDS.

  1. Comprehensive analysis of differential co-expression patterns reveal transcriptional dysregulation mechanism and identify novel prognostic lncRNAs in esophageal squamous cell carcinoma

    Directory of Open Access Journals (Sweden)

    Li Z

    2017-06-01

    Full Text Available Zhen Li,1 Qianlan Yao,1 Songjian Zhao,1 Yin Wang,2,3 Yixue Li,1,4 Zhen Wang4 1School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 2Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, 3Collaborative Innovation Center for Genetics and Development, Fudan University, 4Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, People’s Republic of China Abstract: Esophageal squamous cell carcinoma (ESCC is one of the most common malignancies worldwide and occurs at a relatively high frequency in People’s Republic of China. However, the molecular mechanism underlying ESCC is still unclear. In this study, the mRNA and long non-coding RNA (lncRNA expression profiles of ESCC were downloaded from the Gene Expression Omnibus database, and then differential co-expression analysis was used to reveal the altered co-expression relationship of gene pairs in ESCC tumors. A total of 3,709 mRNAs and 923 lncRNAs were differentially co-expressed between normal and tumor tissues, and we found that most of the gene pairs lost associations in the tumor tissues. The differential regulatory networking approach deciphered that transcriptional dysregulation was ubiquitous in ESCC, and most of the differentially regulated links were modulated by 37 TFs. Our study also found that two novel lncRNAs (ADAMTS9-AS1 and AP000696.2 might be essential in the development of ectoderm and epithelial cells, which could significantly stratify ESCC patients into high-risk and low-risk groups, and were much better than traditional clinical tumor markers. Further inspection of two risk groups showed that the changes in TF-target regulation in the high-risk patients were significantly higher than those in the low-risk patients. In addition, four signal transduction-related DCmRNAs (ERBB3, ENSA, KCNK7, MFSD5

  2. Functional alterations due to amino acid changes and evolutionary comparative analysis of ARPKD and ADPKD genes

    Directory of Open Access Journals (Sweden)

    Burhan M. Edrees

    2016-12-01

    Full Text Available A targeted customized sequencing of genes implicated in autosomal recessive polycystic kidney disease (ARPKD phenotype was performed to identify candidate variants using the Ion torrent PGM next-generation sequencing. The results identified four potential pathogenic variants in PKHD1 gene [c.4870C>T, p.(Arg1624Trp, c.5725C>T, p.(Arg1909Trp, c.1736C>T, p.(Thr579Met and c.10628T>G, p.(Leu3543Trp] among 12 out of 18 samples. However, one variant c.4870C>T, p.(Arg1624Trp was common among eight patients. Some patient samples also showed few variants in autosomal dominant polycystic kidney disease (ADPKD disease causing genes PKD1 and PKD2 such as c.12433G>A, p.(Val4145Ile and c.1445T>G, p.(Phe482Cys, respectively. All causative variants were validated by capillary sequencing and confirmed the presence of a novel homozygous variant c.10628T>G, p.(Leu3543Trp in a male proband. We have recently published the results of these studies (Edrees et al., 2016. Here we report for the first time the effect of the common mutation p.(Arg1624Trp found in eight samples on the protein structure and function due to the specific amino acid changes of PKHD1 protein using molecular dynamics simulations. The computational approaches provide tool predict the phenotypic effect of variant on the structure and function of the altered protein. The structural analysis with the common mutation p.(Arg1624Trp in the native and mutant modeled protein were also studied for solvent accessibility, secondary structure and stabilizing residues to find out the stability of the protein between wild type and mutant forms. Furthermore, comparative genomics and evolutionary analyses of variants observed in PKHD1, PKD1, and PKD2 genes were also performed in some mammalian species including human to understand the complexity of genomes among closely related mammalian species. Taken together, the results revealed that the evolutionary comparative analyses and characterization of PKHD1, PKD1

  3. Identification and Evolutionary Analysis of Potential Candidate Genes in a Human Eating Disorder

    Directory of Open Access Journals (Sweden)

    Ubadah Sabbagh

    2016-01-01

    Full Text Available The purpose of this study was to find genes linked with eating disorders and associated with both metabolic and neural systems. Our operating hypothesis was that there are genetic factors underlying some eating disorders resting in both those pathways. Specifically, we are interested in disorders that may rest in both sleep and metabolic function, generally called Night Eating Syndrome (NES. A meta-analysis of the Gene Expression Omnibus targeting the mammalian nervous system, sleep, and obesity studies was performed, yielding numerous genes of interest. Through a text-based analysis of the results, a number of potential candidate genes were identified. VGF, in particular, appeared to be relevant both to obesity and, broadly, to brain or neural development. VGF is a highly connected protein that interacts with numerous targets via proteolytically digested peptides. We examined VGF from an evolutionary perspective to determine whether other available evidence supported a role for the gene in human disease. We conclude that some of the already identified variants in VGF from human polymorphism studies may contribute to eating disorders and obesity. Our data suggest that there is enough evidence to warrant eGWAS and GWAS analysis of these genes in NES patients in a case-control study.

  4. Identification and Evolutionary Analysis of Potential Candidate Genes in a Human Eating Disorder.

    Science.gov (United States)

    Sabbagh, Ubadah; Mullegama, Saman; Wyckoff, Gerald J

    2016-01-01

    The purpose of this study was to find genes linked with eating disorders and associated with both metabolic and neural systems. Our operating hypothesis was that there are genetic factors underlying some eating disorders resting in both those pathways. Specifically, we are interested in disorders that may rest in both sleep and metabolic function, generally called Night Eating Syndrome (NES). A meta-analysis of the Gene Expression Omnibus targeting the mammalian nervous system, sleep, and obesity studies was performed, yielding numerous genes of interest. Through a text-based analysis of the results, a number of potential candidate genes were identified. VGF, in particular, appeared to be relevant both to obesity and, broadly, to brain or neural development. VGF is a highly connected protein that interacts with numerous targets via proteolytically digested peptides. We examined VGF from an evolutionary perspective to determine whether other available evidence supported a role for the gene in human disease. We conclude that some of the already identified variants in VGF from human polymorphism studies may contribute to eating disorders and obesity. Our data suggest that there is enough evidence to warrant eGWAS and GWAS analysis of these genes in NES patients in a case-control study.

  5. LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes.

    Science.gov (United States)

    Wang, Dapeng; Zhang, Yubin; Fan, Zhonghua; Liu, Guiming; Yu, Jun

    2012-01-01

    ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase.

  6. Peeling back the evolutionary layers of molecular mechanisms responsive to exercise-stress in the skeletal muscle of the racing horse.

    Science.gov (United States)

    Kim, Hyeongmin; Lee, Taeheon; Park, Woncheoul; Lee, Jin Woo; Kim, Jaemin; Lee, Bo-Young; Ahn, Hyeonju; Moon, Sunjin; Cho, Seoae; Do, Kyoung-Tag; Kim, Heui-Soo; Lee, Hak-Kyo; Lee, Chang-Kyu; Kong, Hong-Sik; Yang, Young-Mok; Park, Jongsun; Kim, Hak-Min; Kim, Byung Chul; Hwang, Seungwoo; Bhak, Jong; Burt, Dave; Park, Kyoung-Do; Cho, Byung-Wook; Kim, Heebal

    2013-06-01

    The modern horse (Equus caballus) is the product of over 50 million yrs of evolution. The athletic abilities of the horse have been enhanced during the past 6000 yrs under domestication. Therefore, the horse serves as a valuable model to understand the physiology and molecular mechanisms of adaptive responses to exercise. The structure and function of skeletal muscle show remarkable plasticity to the physical and metabolic challenges following exercise. Here, we reveal an evolutionary layer of responsiveness to exercise-stress in the skeletal muscle of the racing horse. We analysed differentially expressed genes and their co-expression networks in a large-scale RNA-sequence dataset comparing expression before and after exercise. By estimating genome-wide dN/dS ratios using six mammalian genomes, and FST and iHS using re-sequencing data derived from 20 horses, we were able to peel back the evolutionary layers of adaptations to exercise-stress in the horse. We found that the oldest and thickest layer (dN/dS) consists of system-wide tissue and organ adaptations. We further find that, during the period of horse domestication, the older layer (FST) is mainly responsible for adaptations to inflammation and energy metabolism, and the most recent layer (iHS) for neurological system process, cell adhesion, and proteolysis.

  7. Evolutionary conservation of essential and highly expressed genes in Pseudomonas aeruginosa

    Directory of Open Access Journals (Sweden)

    Scharfe Maren

    2010-04-01

    Full Text Available Abstract Background The constant increase in development and spread of bacterial resistance to antibiotics poses a serious threat to human health. New sequencing technologies are now on the horizon that will yield massive increases in our capacity for DNA sequencing and will revolutionize the drug discovery process. Since essential genes are promising novel antibiotic targets, the prediction of gene essentiality based on genomic information has become a major focus. Results In this study we demonstrate that pooled sequencing is applicable for the analysis of sequence variations of strain collections with more than 10 individual isolates. Pooled sequencing of 36 clinical Pseudomonas aeruginosa isolates revealed that essential and highly expressed proteins evolve at lower rates, whereas extracellular proteins evolve at higher rates. We furthermore refined the list of experimentally essential P. aeruginosa genes, and identified 980 genes that show no sequence variation at all. Among the conserved nonessential genes we found several that are involved in regulation, motility and virulence, indicating that they represent factors of evolutionary importance for the lifestyle of a successful environmental bacterium and opportunistic pathogen. Conclusion The detailed analysis of a comprehensive set of P. aeruginosa genomes in this study clearly disclosed detailed information of the genomic makeup and revealed a large set of highly conserved genes that play an important role for the lifestyle of this microorganism. Sequencing strain collections enables for a detailed and extensive identification of sequence variations as potential bacterial adaptation processes, e.g., during the development of antibiotic resistance in the clinical setting and thus may be the basis to uncover putative targets for novel treatment strategies.

  8. Evolutionary dynamics of the Pgk1 gene in the polyploid genus Kengyilia (Triticeae: Poaceae and its diploid relatives.

    Directory of Open Access Journals (Sweden)

    Xing Fan

    Full Text Available The level and pattern of nucleotide variation in duplicate gene provide important information on the evolutionary history of polyploids and divergent process between homoeologous loci within lineages. Kengyilia is a group of allohexaploid species with the StYP genomic constitutions in the wheat tribe. To investigate the evolutionary dynamics of the Pgk1 gene in Kengyilia and its diploid relatives, three copies of Pgk1 homoeologues were isolated from all sampled hexaploid Kengyilia species and analyzed with the Pgk1 sequences from 47 diploid taxa representing 18 basic genomes in Triticeae. Sequence diversity patterns and genealogical analysis suggested that (1 Kengyilia species from the Central Asia and the Qinghai-Tibetan plateau have independent origins with geographically differentiated P genome donors and diverged levels of nucleotide diversity at Pgk1 locus; (2 a relatively long-time sweep event has allowed the Pgk1 gene within Agropyron to adapt to cold climate triggered by the recent uplifts of the Qinghai-Tibetan Plateau; (3 sweep event and population expansion might result in the difference in the d(N/d(S value of the Pgk1 gene in allopatric Agropyron populations, and this difference may be genetically transmitted to Kengyilia lineages via independent polyploidization events; (4 an 83 bp MITE element insertion has shaped the Pgk1 loci in the P genome lineage with different geographical regions; (5 the St and P genomes in Kengyilia were donated by Pseudoroegneria and Agropyron, respectively, and the Y genome is closely related to the Xp genome of Peridictyon sanctum. The interplay of evolutionary forces involving diverged natural selection, population expansion, and transposable events in geographically differentiated P genome donors could attribute to geographical differentiation of Kengyilia species via independent origins.

  9. Evolutionary trajectories of snake genes and genomes revealed by comparative analyses of five-pacer viper

    Science.gov (United States)

    Yin, Wei; Wang, Zong-ji; Li, Qi-ye; Lian, Jin-ming; Zhou, Yang; Lu, Bing-zheng; Jin, Li-jun; Qiu, Peng-xin; Zhang, Pei; Zhu, Wen-bo; Wen, Bo; Huang, Yi-jun; Lin, Zhi-long; Qiu, Bi-tao; Su, Xing-wen; Yang, Huan-ming; Zhang, Guo-jie; Yan, Guang-mei; Zhou, Qi

    2016-01-01

    Snakes have numerous features distinctive from other tetrapods and a rich history of genome evolution that is still obscure. Here, we report the high-quality genome of the five-pacer viper, Deinagkistrodon acutus, and comparative analyses with other representative snake and lizard genomes. We map the evolutionary trajectories of transposable elements (TEs), developmental genes and sex chromosomes onto the snake phylogeny. TEs exhibit dynamic lineage-specific expansion, and many viper TEs show brain-specific gene expression along with their nearby genes. We detect signatures of adaptive evolution in olfactory, venom and thermal-sensing genes and also functional degeneration of genes associated with vision and hearing. Lineage-specific relaxation of functional constraints on respective Hox and Tbx limb-patterning genes supports fossil evidence for a successive loss of forelimbs then hindlimbs during snake evolution. Finally, we infer that the ZW sex chromosome pair had undergone at least three recombination suppression events in the ancestor of advanced snakes. These results altogether forge a framework for our deep understanding into snakes' history of molecular evolution. PMID:27708285

  10. Gene Network Construction from Microarray Data Identifies a Key Network Module and Several Candidate Hub Genes in Age-Associated Spatial Learning Impairment.

    Science.gov (United States)

    Uddin, Raihan; Singh, Shiva M

    2017-01-01

    As humans age many suffer from a decrease in normal brain functions including spatial learning impairments. This study aimed to better understand the molecular mechanisms in age-associated spatial learning impairment (ASLI). We used a mathematical modeling approach implemented in Weighted Gene Co-expression Network Analysis (WGCNA) to create and compare gene network models of young (learning unimpaired) and aged (predominantly learning impaired) brains from a set of exploratory datasets in rats in the context of ASLI. The major goal was to overcome some of the limitations previously observed in the traditional meta- and pathway analysis using these data, and identify novel ASLI related genes and their networks based on co-expression relationship of genes. This analysis identified a set of network modules in the young, each of which is highly enriched with genes functioning in broad but distinct GO functional categories or biological pathways. Interestingly, the analysis pointed to a single module that was highly enriched with genes functioning in "learning and memory" related functions and pathways. Subsequent differential network analysis of this "learning and memory" module in the aged (predominantly learning impaired) rats compared to the young learning unimpaired rats allowed us to identify a set of novel ASLI candidate hub genes. Some of these genes show significant repeatability in networks generated from independent young and aged validation datasets. These hub genes are highly co-expressed with other genes in the network, which not only show differential expression but also differential co-expression and differential connectivity across age and learning impairment. The known function of these hub genes indicate that they play key roles in critical pathways, including kinase and phosphatase signaling, in functions related to various ion channels, and in maintaining neuronal integrity relating to synaptic plasticity and memory formation. Taken together, they

  11. Genome-Wide Identification, Evolutionary Analysis and Expression Profiles of LATERAL ORGAN BOUNDARIES DOMAIN Gene Family in Lotus japonicus and Medicago truncatula.

    Directory of Open Access Journals (Sweden)

    Tianquan Yang

    Full Text Available The LATERAL ORGAN BOUNDARIES DOMAIN (LBD gene family has been well-studied in Arabidopsis and play crucial roles in the diverse growth and development processes including establishment and maintenance of boundary of developmental lateral organs. In this study we identified and characterized 38 LBD genes in Lotus japonicus (LjLBD and 57 LBD genes in Medicago truncatula (MtLBD, both of which are model legume plants that have some specific development features absent in Arabidopsis. The phylogenetic relationships, their locations in the genome, genes structure and conserved motifs were examined. The results revealed that all LjLBD and MtLBD genes could be distinctly divided into two classes: Class I and II. The evolutionary analysis showed that Type I functional divergence with some significantly site-specific shifts may be the main force for the divergence between Class I and Class II. In addition, the expression patterns of LjLBD genes uncovered the diverse functions in plant development. Interestingly, we found that two LjLBD proteins that were highly expressed during compound leaf and pulvinus development, can interact via yeast two-hybrid assays. Taken together, our findings provide an evolutionary and genetic foundation in further understanding the molecular basis of LBD gene family in general, specifically in L. japonicus and M. truncatula.

  12. Deep evolutionary comparison of gene expression identifies parallel recruitment of trans-factors in two independent origins of C4 photosynthesis.

    Science.gov (United States)

    Aubry, Sylvain; Kelly, Steven; Kümpers, Britta M C; Smith-Unna, Richard D; Hibberd, Julian M

    2014-06-01

    With at least 60 independent origins spanning monocotyledons and dicotyledons, the C4 photosynthetic pathway represents one of the most remarkable examples of convergent evolution. The recurrent evolution of this highly complex trait involving alterations to leaf anatomy, cell biology and biochemistry allows an increase in productivity by ∼ 50% in tropical and subtropical areas. The extent to which separate lineages of C4 plants use the same genetic networks to maintain C4 photosynthesis is unknown. We developed a new informatics framework to enable deep evolutionary comparison of gene expression in species lacking reference genomes. We exploited this to compare gene expression in species representing two independent C4 lineages (Cleome gynandra and Zea mays) whose last common ancestor diverged ∼ 140 million years ago. We define a cohort of 3,335 genes that represent conserved components of leaf and photosynthetic development in these species. Furthermore, we show that genes encoding proteins of the C4 cycle are recruited into networks defined by photosynthesis-related genes. Despite the wide evolutionary separation and independent origins of the C4 phenotype, we report that these species use homologous transcription factors to both induce C4 photosynthesis and to maintain the cell specific gene expression required for the pathway to operate. We define a core molecular signature associated with leaf and photosynthetic maturation that is likely shared by angiosperm species derived from the last common ancestor of the monocotyledons and dicotyledons. We show that deep evolutionary comparisons of gene expression can reveal novel insight into the molecular convergence of highly complex phenotypes and that parallel evolution of trans-factors underpins the repeated appearance of C4 photosynthesis. Thus, exploitation of extant natural variation associated with complex traits can be used to identify regulators. Moreover, the transcription factors that are shared by

  13. Deep evolutionary comparison of gene expression identifies parallel recruitment of trans-factors in two independent origins of C4 photosynthesis.

    Directory of Open Access Journals (Sweden)

    Sylvain Aubry

    2014-06-01

    Full Text Available With at least 60 independent origins spanning monocotyledons and dicotyledons, the C4 photosynthetic pathway represents one of the most remarkable examples of convergent evolution. The recurrent evolution of this highly complex trait involving alterations to leaf anatomy, cell biology and biochemistry allows an increase in productivity by ∼ 50% in tropical and subtropical areas. The extent to which separate lineages of C4 plants use the same genetic networks to maintain C4 photosynthesis is unknown. We developed a new informatics framework to enable deep evolutionary comparison of gene expression in species lacking reference genomes. We exploited this to compare gene expression in species representing two independent C4 lineages (Cleome gynandra and Zea mays whose last common ancestor diverged ∼ 140 million years ago. We define a cohort of 3,335 genes that represent conserved components of leaf and photosynthetic development in these species. Furthermore, we show that genes encoding proteins of the C4 cycle are recruited into networks defined by photosynthesis-related genes. Despite the wide evolutionary separation and independent origins of the C4 phenotype, we report that these species use homologous transcription factors to both induce C4 photosynthesis and to maintain the cell specific gene expression required for the pathway to operate. We define a core molecular signature associated with leaf and photosynthetic maturation that is likely shared by angiosperm species derived from the last common ancestor of the monocotyledons and dicotyledons. We show that deep evolutionary comparisons of gene expression can reveal novel insight into the molecular convergence of highly complex phenotypes and that parallel evolution of trans-factors underpins the repeated appearance of C4 photosynthesis. Thus, exploitation of extant natural variation associated with complex traits can be used to identify regulators. Moreover, the transcription factors

  14. Clustering approaches to identifying gene expression patterns from DNA microarray data.

    Science.gov (United States)

    Do, Jin Hwan; Choi, Dong-Kug

    2008-04-30

    The analysis of microarray data is essential for large amounts of gene expression data. In this review we focus on clustering techniques. The biological rationale for this approach is the fact that many co-expressed genes are co-regulated, and identifying co-expressed genes could aid in functional annotation of novel genes, de novo identification of transcription factor binding sites and elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same datasets may vary considerably depending on the algorithms and metrics for dissimilarity measures used, as well as on user-selectable parameters such as desired number of clusters and initial values. Therefore, biologists who want to interpret microarray data should be aware of the weakness and strengths of the clustering methods used. In this review, we survey the basic principles of clustering of DNA microarray data from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps, to complex clustering algorithms like fuzzy clustering.

  15. Genome-wide investigation reveals high evolutionary rates in annual model plants.

    Science.gov (United States)

    Yue, Jia-Xing; Li, Jinpeng; Wang, Dan; Araki, Hitoshi; Tian, Dacheng; Yang, Sihai

    2010-11-09

    Rates of molecular evolution vary widely among species. While significant deviations from molecular clock have been found in many taxa, effects of life histories on molecular evolution are not fully understood. In plants, annual/perennial life history traits have long been suspected to influence the evolutionary rates at the molecular level. To date, however, the number of genes investigated on this subject is limited and the conclusions are mixed. To evaluate the possible heterogeneity in evolutionary rates between annual and perennial plants at the genomic level, we investigated 85 nuclear housekeeping genes, 10 non-housekeeping families, and 34 chloroplast genes using the genomic data from model plants including Arabidopsis thaliana and Medicago truncatula for annuals and grape (Vitis vinifera) and popular (Populus trichocarpa) for perennials. According to the cross-comparisons among the four species, 74-82% of the nuclear genes and 71-97% of the chloroplast genes suggested higher rates of molecular evolution in the two annuals than those in the two perennials. The significant heterogeneity in evolutionary rate between annuals and perennials was consistently found both in nonsynonymous sites and synonymous sites. While a linear correlation of evolutionary rates in orthologous genes between species was observed in nonsynonymous sites, the correlation was weak or invisible in synonymous sites. This tendency was clearer in nuclear genes than in chloroplast genes, in which the overall evolutionary rate was small. The slope of the regression line was consistently lower than unity, further confirming the higher evolutionary rate in annuals at the genomic level. The higher evolutionary rate in annuals than in perennials appears to be a universal phenomenon both in nuclear and chloroplast genomes in the four dicot model plants we investigated. Therefore, such heterogeneity in evolutionary rate should result from factors that have genome-wide influence, most likely those

  16. Comparison of gene co-networks reveals the molecular mechanisms of the rice (Oryza sativa L.) response to Rhizoctonia solani AG1 IA infection.

    Science.gov (United States)

    Zhang, Jinfeng; Zhao, Wenjuan; Fu, Rong; Fu, Chenglin; Wang, Lingxia; Liu, Huainian; Li, Shuangcheng; Deng, Qiming; Wang, Shiquan; Zhu, Jun; Liang, Yueyang; Li, Ping; Zheng, Aiping

    2018-05-05

    Rhizoctonia solani causes rice sheath blight, an important disease affecting the growth of rice (Oryza sativa L.). Attempts to control the disease have met with little success. Based on transcriptional profiling, we previously identified more than 11,947 common differentially expressed genes (TPM > 10) between the rice genotypes TeQing and Lemont. In the current study, we extended these findings by focusing on an analysis of gene co-expression in response to R. solani AG1 IA and identified gene modules within the networks through weighted gene co-expression network analysis (WGCNA). We compared the different genes assigned to each module and the biological interpretations of gene co-expression networks at early and later modules in the two rice genotypes to reveal differential responses to AG1 IA. Our results show that different changes occurred in the two rice genotypes and that the modules in the two groups contain a number of candidate genes possibly involved in pathogenesis, such as the VQ protein. Furthermore, these gene co-expression networks provide comprehensive transcriptional information regarding gene expression in rice in response to AG1 IA. The co-expression networks derived from our data offer ideas for follow-up experimentation that will help advance our understanding of the translational regulation of rice gene expression changes in response to AG1 IA.

  17. Horizontal gene transfer in silkworm, Bombyx mori

    Science.gov (United States)

    2011-01-01

    Background The domesticated silkworm, Bombyx mori, is the model insect for the order Lepidoptera, has economically important values, and has gained some representative behavioral characteristics compared to its wild ancestor. The genome of B. mori has been fully sequenced while function analysis of BmChi-h and BmSuc1 genes revealed that horizontal gene transfer (HGT) maybe bestow a clear selective advantage to B. mori. However, the role of HGT in the evolutionary history of B. mori is largely unexplored. In this study, we compare the whole genome of B. mori with those of 382 prokaryotic and eukaryotic species to investigate the potential HGTs. Results Ten candidate HGT events were defined in B. mori by comprehensive sequence analysis using Maximum Likelihood and Bayesian method combining with EST checking. Phylogenetic analysis of the candidate HGT genes suggested that one HGT was plant-to- B. mori transfer while nine were bacteria-to- B. mori transfer. Furthermore, functional analysis based on expression, coexpression and related literature searching revealed that several HGT candidate genes have added important characters, such as resistance to pathogen, to B. mori. Conclusions Results from this study clearly demonstrated that HGTs play an important role in the evolution of B. mori although the number of HGT events in B. mori is in general smaller than those of microbes and other insects. In particular, interdomain HGTs in B. mori may give rise to functional, persistent, and possibly evolutionarily significant new genes. PMID:21595916

  18. Evolutionary responses to a constructed niche: ancient Mesoamericans as a model of gene-culture coevolution.

    Directory of Open Access Journals (Sweden)

    Tábita Hünemeier

    Full Text Available Culture and genetics rely on two distinct but not isolated transmission systems. Cultural processes may change the human selective environment and thereby affect which individuals survive and reproduce. Here, we evaluated whether the modes of subsistence in Native American populations and the frequencies of the ABCA1*Arg230Cys polymorphism were correlated. Further, we examined whether the evolutionary consequences of the agriculturally constructed niche in Mesoamerica could be considered as a gene-culture coevolution model. For this purpose, we genotyped 229 individuals affiliated with 19 Native American populations and added data for 41 other Native American groups (n = 1905 to the analysis. In combination with the SNP cluster of a neutral region, this dataset was then used to unravel the scenario involved in 230Cys evolutionary history. The estimated age of 230Cys is compatible with its origin occurring in the American continent. The correlation of its frequencies with the archeological data on Zea pollen in Mesoamerica/Central America, the neutral coalescent simulations, and the F(ST-based natural selection analysis suggest that maize domestication was the driving force in the increase in the frequencies of 230Cys in this region. These results may represent the first example of a gene-culture coevolution involving an autochthonous American allele.

  19. Bioinformatics analysis of RNA-seq data revealed critical genes in colon adenocarcinoma.

    Science.gov (United States)

    Xi, W-D; Liu, Y-J; Sun, X-B; Shan, J; Yi, L; Zhang, T-T

    2017-07-01

    RNA-seq data of colon adenocarcinoma (COAD) were analyzed with bioinformatics tools to discover critical genes in the disease. Relevant small molecule drugs, transcription factors (TFs) and microRNAs (miRNAs) were also investigated. RNA-seq data of COAD were downloaded from The Cancer Genome Atlas (TCGA). Differential analysis was performed with package edgeR. False positive discovery (FDR) 1 were set as the cut-offs to screen out differentially expressed genes (DEGs). Gene coexpression network was constructed with package Ebcoexpress. GO enrichment analysis was performed for the DEGs in the gene coexpression network with DAVID. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was also performed for the genes with KOBASS 2.0. Modules were identified with MCODE of Cytoscape. Relevant small molecules drugs were predicted by Connectivity map. Relevant miRNAs and TFs were searched by WebGestalt. A total of 457 DEGs, including 255 up-regulated and 202 down-regulated genes, were identified from 437 COAD and 39 control samples. A gene coexpression network was constructed containing 40 DEGs and 101 edges. The genes were mainly associated with collagen fibril organization, extracellular matrix organization and translation. Two modules were identified from the gene coexpression network, which were implicated in muscle contraction and extracellular matrix organization, respectively. Several critical genes were disclosed, such as MYH11, COL5A2 and ribosomal proteins. Nine relevant small molecule drugs were identified, such as scriptaid and STOCK1N-35874. Accordingly, a total of 17 TFs and 10 miRNAs related to COAD were acquired, such as ETS2, NFAT, AP4, miR-124A, MiR-9, miR-96 and let-7. Several critical genes and relevant drugs, TFs and miRNAs were revealed in COAD. These findings could advance the understanding of the disease and benefit therapy development.

  20. Analyses of Evolutionary Characteristics of the Hemagglutinin-Esterase Gene of Influenza C Virus during a Period of 68 Years Reveals Evolutionary Patterns Different from Influenza A and B Viruses

    Directory of Open Access Journals (Sweden)

    Yuki Furuse

    2016-11-01

    Full Text Available Infections with the influenza C virus causing respiratory symptoms are common, particularly among children. Since isolation and detection of the virus are rarely performed, compared with influenza A and B viruses, the small number of available sequences of the virus makes it difficult to analyze its evolutionary dynamics. Recently, we reported the full genome sequence of 102 strains of the virus. Here, we exploited the data to elucidate the evolutionary characteristics and phylodynamics of the virus compared with influenza A and B viruses. Along with our data, we obtained public sequence data of the hemagglutinin-esterase gene of the virus; the dataset consists of 218 unique sequences of the virus collected from 14 countries between 1947 and 2014. Informatics analyses revealed that (1 multiple lineages have been circulating globally; (2 there have been weak and infrequent selective bottlenecks; (3 the evolutionary rate is low because of weak positive selection and a low capability to induce mutations; and (4 there is no significant positive selection although a few mutations affecting its antigenicity have been induced. The unique evolutionary dynamics of the influenza C virus must be shaped by multiple factors, including virological, immunological, and epidemiological characteristics.

  1. Recovery and evolutionary analysis of complete integron gene cassette arrays from Vibrio

    Directory of Open Access Journals (Sweden)

    Gillings Michael R

    2006-01-01

    Full Text Available Abstract Background Integrons are genetic elements capable of the acquisition, rearrangement and expression of genes contained in gene cassettes. Gene cassettes generally consist of a promoterless gene associated with a recombination site known as a 59-base element (59-be. Multiple insertion events can lead to the assembly of large integron-associated cassette arrays. The most striking examples are found in Vibrio, where such cassette arrays are widespread and can range from 30 kb to 150 kb. Besides those found in completely sequenced genomes, no such array has yet been recovered in its entirety. We describe an approach to systematically isolate, sequence and annotate large integron gene cassette arrays from bacterial strains. Results The complete Vibrio sp. DAT722 integron cassette array was determined through the streamlined approach described here. To place it in an evolutionary context, we compare the DAT722 array to known vibrio arrays and performed phylogenetic analyses for all of its components (integrase, 59-be sites, gene cassette encoded genes. It differs extensively in terms of genomic context as well as gene cassette content and organization. The phylogenetic tree of the 59-be sites collectively found in the Vibrio gene cassette pool suggests frequent transfer of cassettes within and between Vibrio species, with slower transfer rates between more phylogenetically distant relatives. We also identify multiple cases where non-integron chromosomal genes seem to have been assembled into gene cassettes and others where cassettes have been inserted into chromosomal locations outside integrons. Conclusion Our systematic approach greatly facilitates the isolation and annotation of large integrons gene cassette arrays. Comparative analysis of the Vibrio sp. DAT722 integron obtained through this approach to those found in other vibrios confirms the role of this genetic element in promoting lateral gene transfer and suggests a high rate of gene

  2. Evolutionary genetic analyses of MEF2C gene: implications for learning and memory in Homo sapiens.

    Science.gov (United States)

    Kalmady, Sunil V; Venkatasubramanian, Ganesan; Arasappa, Rashmi; Rao, Naren P

    2013-02-01

    MEF2C facilitates context-dependent fear conditioning (CFC) which is a salient aspect of hippocampus-dependent learning and memory. CFC might have played a crucial role in human evolution because of its advantageous influence on survival of species. In this study, we analyzed 23 orthologous mammalian gene sequences of MEF2C gene to examine the evidence for positive selection on this gene in Homo sapiens using Phylogenetic Analysis by Maximum Likelihood (PAML) and HyPhy software. Both PAML Bayes Empirical Bayes (BEB) and HyPhy Fixed Effects Likelihood (FEL) analyses supported significant positive selection on 4 codon sites in H. sapiens. Also, haplotter analysis revealed significant ongoing positive selection on this gene in Central European population. The study findings suggest that adaptive selective pressure on this gene might have influenced human evolution. Further research on this gene might unravel the potential role of this gene in learning and memory as well as its pathogenetic effect in certain hippocampal disorders with evolutionary basis like schizophrenia. Copyright © 2012 Elsevier B.V. All rights reserved.

  3. Co-evolutionary interactions between host resistance and pathogen avirulence genes in rice-Magnaporthe oryzae pathosystem.

    Science.gov (United States)

    Singh, Pankaj Kumar; Ray, Soham; Thakur, Shallu; Rathour, Rajeev; Sharma, Vinay; Sharma, Tilak Raj

    2018-06-01

    Rice and Magnaporthe oryzae constitutes an ideal pathosystem for studying host-pathogen interaction in cereals crops. There are two alternative hypotheses, viz. Arms race and Trench warfare, which explain the co-evolutionary dynamics of hosts and pathogens which are under continuous confrontation. Arms race proposes that both R- and Avr- genes of host and pathogen, respectively, undergo positive selection. Alternatively, trench warfare suggests that either R- or Avr- gene in the pathosystem is under balanced selection intending to stabilize the genetic advantage gained over the opposition. Here, we made an attempt to test the above-stated hypotheses in rice-M. oryzae pathosystem at loci of three R-Avr gene pairs, Piz-t-AvrPiz-t, Pi54-AvrPi54 and Pita-AvrPita using allele mining approach. Allele mining is an efficient way to capture allelic variants existing in the population and to study the selective forces imposed on the variants during evolution. Results of nucleotide diversity, neutrality statistics and phylogenetic analyses reveal that Piz-t, Pi54 and AvrPita are diversified and under positive selection at their corresponding loci, while their counterparts, AvrPiz-t, AvrPi54 and Pita are conserved and under balancing selection, in nature. These results imply that rice-M. oryzae populations are engaged in a trench warfare at least at the three R/Avr loci studied. It is a maiden attempt to study the co-evolution of three R-Avr gene pairs in this pathosystem. Knowledge gained from this study will help in understanding the evolutionary dynamics of host-pathogen interaction in a better way and will also aid in developing new durable blast resistant rice varieties in future. Copyright © 2018 Elsevier Inc. All rights reserved.

  4. Network-based identification of biomarkers coexpressed with multiple pathways.

    Science.gov (United States)

    Guo, Nancy Lan; Wan, Ying-Wooi

    2014-01-01

    Unraveling complex molecular interactions and networks and incorporating clinical information in modeling will present a paradigm shift in molecular medicine. Embedding biological relevance via modeling molecular networks and pathways has become increasingly important for biomarker identification in cancer susceptibility and metastasis studies. Here, we give a comprehensive overview of computational methods used for biomarker identification, and provide a performance comparison of several network models used in studies of cancer susceptibility, disease progression, and prognostication. Specifically, we evaluated implication networks, Boolean networks, Bayesian networks, and Pearson's correlation networks in constructing gene coexpression networks for identifying lung cancer diagnostic and prognostic biomarkers. The results show that implication networks, implemented in Genet package, identified sets of biomarkers that generated an accurate prediction of lung cancer risk and metastases; meanwhile, implication networks revealed more biologically relevant molecular interactions than Boolean networks, Bayesian networks, and Pearson's correlation networks when evaluated with MSigDB database.

  5. Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species.

    Science.gov (United States)

    Cao, Yunpeng; Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping

    2018-04-23

    The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice ( Oryza sativa ), maize ( Zea mays ), and Arabidopsis ( Arabidopsis thaliana ). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis , respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis , respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis . A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis , respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus , and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis .

  6. Dynamic functional modules in co-expressed protein interaction networks of dilated cardiomyopathy

    Directory of Open Access Journals (Sweden)

    Oyang Yen-Jen

    2010-10-01

    Full Text Available Abstract Background Molecular networks represent the backbone of molecular activity within cells and provide opportunities for understanding the mechanism of diseases. While protein-protein interaction data constitute static network maps, integration of condition-specific co-expression information provides clues to the dynamic features of these networks. Dilated cardiomyopathy is a leading cause of heart failure. Although previous studies have identified putative biomarkers or therapeutic targets for heart failure, the underlying molecular mechanism of dilated cardiomyopathy remains unclear. Results We developed a network-based comparative analysis approach that integrates protein-protein interactions with gene expression profiles and biological function annotations to reveal dynamic functional modules under different biological states. We found that hub proteins in condition-specific co-expressed protein interaction networks tended to be differentially expressed between biological states. Applying this method to a cohort of heart failure patients, we identified two functional modules that significantly emerged from the interaction networks. The dynamics of these modules between normal and disease states further suggest a potential molecular model of dilated cardiomyopathy. Conclusions We propose a novel framework to analyze the interaction networks in different biological states. It successfully reveals network modules closely related to heart failure; more importantly, these network dynamics provide new insights into the cause of dilated cardiomyopathy. The revealed molecular modules might be used as potential drug targets and provide new directions for heart failure therapy.

  7. Understanding the mind from an evolutionary perspective: an overview of evolutionary psychology.

    Science.gov (United States)

    Shackelford, Todd K; Liddle, James R

    2014-05-01

    The theory of evolution by natural selection provides the only scientific explanation for the existence of complex adaptations. The design features of the brain, like any organ, are the result of selection pressures operating over deep time. Evolutionary psychology posits that the human brain comprises a multitude of evolved psychological mechanisms, adaptations to specific and recurrent problems of survival and reproduction faced over human evolutionary history. Although some mistakenly view evolutionary psychology as promoting genetic determinism, evolutionary psychologists appreciate and emphasize the interactions between genes and environments. This approach to psychology has led to a richer understanding of a variety of psychological phenomena, and has provided a powerful foundation for generating novel hypotheses. Critics argue that evolutionary psychologists resort to storytelling, but as with any branch of science, empirical testing is a vital component of the field, with hypotheses standing or falling with the weight of the evidence. Evolutionary psychology is uniquely suited to provide a unifying theoretical framework for the disparate subdisciplines of psychology. An evolutionary perspective has provided insights into several subdisciplines of psychology, while simultaneously demonstrating the arbitrary nature of dividing psychological science into such subdisciplines. Evolutionary psychologists have amassed a substantial empirical and theoretical literature, but as a relatively new approach to psychology, many questions remain, with several promising directions for future research. For further resources related to this article, please visit the WIREs website. The authors have declared no conflicts of interest for this article. © 2014 John Wiley & Sons, Ltd.

  8. The order of expression is a key factor in the production of active transglutaminase in Escherichia coli by co-expression with its pro-peptide

    Directory of Open Access Journals (Sweden)

    Liu Song

    2011-12-01

    Full Text Available Abstract Background Streptomyces transglutaminase (TGase is naturally synthesized as zymogen (pro-TGase, which is then processed to produce active enzyme by the removal of its N-terminal pro-peptide. This pro-peptide is found to be essential for overexpression of soluble TGase in E. coli. However, expression of pro-TGase by E. coli requires protease-mediated activation in vitro. In this study, we developed a novel co- expression method for the direct production of active TGase in E. coli. Results A TGase from S. hygroscopicus was expressed in E. coli only after fusing with the pelB signal peptide, but fusion with the signal peptide induced insoluble enzyme. Therefore, alternative protocol was designed by co-expressing the TGase and its pro-peptide as independent polypeptides under a single T7 promoter using vector pET-22b(+. Although the pro-peptide was co-expressed, the TGase fused without the signal peptide was undetectable in both soluble and insoluble fractions of the recombinant cells. Similarly, when both genes were expressed in the order of the TGase and the pro-peptide, the solubility of TGase fused with the signal peptide was not improved by the co-expression with its pro-peptide. Interestingly, active TGase was only produced by the cells in which the pro-peptide and the TGase were fused with the signal peptide and sequentially expressed. The purified recombinant and native TGase shared the similar catalytic properties. Conclusions Our results indicated that the pro-peptide can assist correct folding of the TGase inter-molecularly in E. coli, and expression of pro-peptide prior to that of TGase was essential for the production of active TGase. The co-expression strategy based on optimizing the order of gene expression could be useful for the expression of other functional proteins that are synthesized as a precursor.

  9. GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response.

    Science.gov (United States)

    Zaag, Rim; Tamby, Jean Philippe; Guichard, Cécile; Tariq, Zakia; Rigaill, Guillem; Delannoy, Etienne; Renou, Jean-Pierre; Balzergue, Sandrine; Mary-Huard, Tristan; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Brunaud, Véronique

    2015-01-01

    CATdb (http://urgv.evry.inra.fr/CATdb) is a database providing a public access to a large collection of transcriptomic data, mainly for Arabidopsis but also for other plants. This resource has the rare advantage to contain several thousands of microarray experiments obtained with the same technical protocol and analyzed by the same statistical pipelines. In this paper, we present GEM2Net, a new module of CATdb that takes advantage of this homogeneous dataset to mine co-expression units and decipher Arabidopsis gene functions. GEM2Net explores 387 stress conditions organized into 18 biotic and abiotic stress categories. For each one, a model-based clustering is applied on expression differences to identify clusters of co-expressed genes. To characterize functions associated with these clusters, various resources are analyzed and integrated: Gene Ontology, subcellular localization of proteins, Hormone Families, Transcription Factor Families and a refined stress-related gene list associated to publications. Exploiting protein-protein interactions and transcription factors-targets interactions enables to display gene networks. GEM2Net presents the analysis of the 18 stress categories, in which 17,264 genes are involved and organized within 681 co-expression clusters. The meta-data analyses were stored and organized to compose a dynamic Web resource. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Genomic regression of claw keratin, taste receptor and light-associated genes provides insights into biology and evolutionary origins of snakes.

    Science.gov (United States)

    Emerling, Christopher A

    2017-10-01

    Regressive evolution of anatomical traits often corresponds with the regression of genomic loci underlying such characters. As such, studying patterns of gene loss can be instrumental in addressing questions of gene function, resolving conflicting results from anatomical studies, and understanding the evolutionary history of clades. The evolutionary origins of snakes involved the regression of a number of anatomical traits, including limbs, taste buds and the visual system, and by analyzing serpent genomes, I was able to test three hypotheses associated with the regression of these features. The first concerns two keratins that are putatively specific to claws. Both genes that encode these keratins are pseudogenized/deleted in snake genomes, providing additional evidence of claw-specificity. The second hypothesis is that snakes lack taste buds, an issue complicated by conflicting results in the literature. I found evidence that different snakes have lost one or more taste receptors, but all snakes examined retained at least one gustatory channel. The final hypothesis addressed is that the earliest snakes were adapted to a dim light niche. I found evidence of deleted and pseudogenized genes with light-associated functions in snakes, demonstrating a pattern of gene loss similar to other dim light-adapted clades. Molecular dating estimates suggest that dim light adaptation preceded the loss of limbs, providing some bearing on interpretations of the ecological origins of snakes. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Calculation of evolutionary correlation between individual genes and full-length genome: a method useful for choosing phylogenetic markers for molecular epidemiology.

    Directory of Open Access Journals (Sweden)

    Shuai Wang

    Full Text Available Individual genes or regions are still commonly used to estimate the phylogenetic relationships among viral isolates. The genomic regions that can faithfully provide assessments consistent with those predicted with full-length genome sequences would be preferable to serve as good candidates of the phylogenetic markers for molecular epidemiological studies of many viruses. Here we employed a statistical method to evaluate the evolutionary relationships between individual viral genes and full-length genomes without tree construction as a way to determine which gene can match the genome well in phylogenetic analyses. This method was performed by calculation of linear correlations between the genetic distance matrices of aligned individual gene sequences and aligned genome sequences. We applied this method to the phylogenetic analyses of porcine circovirus 2 (PCV2, measles virus (MV, hepatitis E virus (HEV and Japanese encephalitis virus (JEV. Phylogenetic trees were constructed for comparisons and the possible factors affecting the method accuracy were also discussed in the calculations. The results revealed that this method could produce results consistent with those of previous studies about the proper consensus sequences that could be successfully used as phylogenetic markers. And our results also suggested that these evolutionary correlations could provide useful information for identifying genes that could be used effectively to infer the genetic relationships.

  12. Diversity and evolutionary patterns of immune genes in free-ranging Namibian leopards (Panthera pardus pardus).

    Science.gov (United States)

    Castro-Prieto, Aines; Wachter, Bettina; Melzheimer, Joerg; Thalwitzer, Susanne; Sommer, Simone

    2011-01-01

    The genes of the major histocompatibility complex (MHC) are a key component of the mammalian immune system and have become important molecular markers for fitness-related genetic variation in wildlife populations. Currently, no information about the MHC sequence variation and constitution in African leopards exists. In this study, we isolated and characterized genetic variation at the adaptively most important region of MHC class I and MHC class II-DRB genes in 25 free-ranging African leopards from Namibia and investigated the mechanisms that generate and maintain MHC polymorphism in the species. Using single-stranded conformation polymorphism analysis and direct sequencing, we detected 6 MHC class I and 6 MHC class II-DRB sequences, which likely correspond to at least 3 MHC class I and 3 MHC class II-DRB loci. Amino acid sequence variation in both MHC classes was higher or similar in comparison to other reported felids. We found signatures of positive selection shaping the diversity of MHC class I and MHC class II-DRB loci during the evolutionary history of the species. A comparison of MHC class I and MHC class II-DRB sequences of the leopard to those of other felids revealed a trans-species mode of evolution. In addition, the evolutionary relationships of MHC class II-DRB sequences between African and Asian leopard subspecies are discussed.

  13. Revealing less derived nature of cartilaginous fish genomes with their evolutionary time scale inferred with nuclear genes.

    Directory of Open Access Journals (Sweden)

    Adina J Renz

    Full Text Available Cartilaginous fishes, divided into Holocephali (chimaeras and Elasmoblanchii (sharks, rays and skates, occupy a key phylogenetic position among extant vertebrates in reconstructing their evolutionary processes. Their accurate evolutionary time scale is indispensable for better understanding of the relationship between phenotypic and molecular evolution of cartilaginous fishes. However, our current knowledge on the time scale of cartilaginous fish evolution largely relies on estimates using mitochondrial DNA sequences. In this study, making the best use of the still partial, but large-scale sequencing data of cartilaginous fish species, we estimate the divergence times between the major cartilaginous fish lineages employing nuclear genes. By rigorous orthology assessment based on available genomic and transcriptomic sequence resources for cartilaginous fishes, we selected 20 protein-coding genes in the nuclear genome, spanning 2973 amino acid residues. Our analysis based on the Bayesian inference resulted in the mean divergence time of 421 Ma, the late Silurian, for the Holocephali-Elasmobranchii split, and 306 Ma, the late Carboniferous, for the split between sharks and rays/skates. By applying these results and other documented divergence times, we measured the relative evolutionary rate of the Hox A cluster sequences in the cartilaginous fish lineages, which resulted in a lower substitution rate with a factor of at least 2.4 in comparison to tetrapod lineages. The obtained time scale enables mapping phenotypic and molecular changes in a quantitative framework. It is of great interest to corroborate the less derived nature of cartilaginous fish at the molecular level as a genome-wide phenomenon.

  14. Algorithms for computing parsimonious evolutionary scenarios for genome evolution, the last universal common ancestor and dominance of horizontal gene transfer in the evolution of prokaryotes

    Directory of Open Access Journals (Sweden)

    Galperin Michael Y

    2003-01-01

    Full Text Available Abstract Background Comparative analysis of sequenced genomes reveals numerous instances of apparent horizontal gene transfer (HGT, at least in prokaryotes, and indicates that lineage-specific gene loss might have been even more common in evolution. This complicates the notion of a species tree, which needs to be re-interpreted as a prevailing evolutionary trend, rather than the full depiction of evolution, and makes reconstruction of ancestral genomes a non-trivial task. Results We addressed the problem of constructing parsimonious scenarios for individual sets of orthologous genes given a species tree. The orthologous sets were taken from the database of Clusters of Orthologous Groups of proteins (COGs. We show that the phyletic patterns (patterns of presence-absence in completely sequenced genomes of almost 90% of the COGs are inconsistent with the hypothetical species tree. Algorithms were developed to reconcile the phyletic patterns with the species tree by postulating gene loss, COG emergence and HGT (the latter two classes of events were collectively treated as gene gains. We prove that each of these algorithms produces a parsimonious evolutionary scenario, which can be represented as mapping of loss and gain events on the species tree. The distribution of the evolutionary events among the tree nodes substantially depends on the underlying assumptions of the reconciliation algorithm, e.g. whether or not independent gene gains (gain after loss after gain are permitted. Biological considerations suggest that, on average, gene loss might be a more likely event than gene gain. Therefore different gain penalties were used and the resulting series of reconstructed gene sets for the last universal common ancestor (LUCA of the extant life forms were analysed. The number of genes in the reconstructed LUCA gene sets grows as the gain penalty increases. However, qualitative examination of the LUCA versions reconstructed with different gain penalties

  15. Co-expression of the transcription factors CEH-14 and TTX-1 regulates AFD neuron-specific genes gcy-8 and gcy-18 in C. elegans.

    Science.gov (United States)

    Kagoshima, Hiroshi; Kohara, Yuji

    2015-03-15

    A wide variety of cells are generated by the expression of characteristic sets of genes, primarily those regulated by cell-specific transcription. To elucidate the mechanism regulating cell-specific gene expression in a highly specialized cell, AFD thermosensory neuron in Caenorhabditis elegans, we analyzed the promoter sequences of guanylyl cyclase genes, gcy-8 and gcy-18, exclusively expressed in AFD. In this study, we showed that AFD-specific expression of gcy-8 and gcy-18 requires the co-expression of homeodomain proteins, CEH-14/LHX3 and TTX-1/OTX1. We observed that mutation of ttx-1 or ceh-14 caused a reduction in the expression of gcy-8 and gcy-18 and that the expression was completely lost in double mutants. This synergy effect was also observed with other AFD marker genes, such as ntc-1, nlp-21and cng-3. Electrophoretic mobility shift assays revealed direct interaction of CEH-14 and TTX-1 proteins with gcy-8 and gcy-18 promoters in vitro. The binding sites of CEH-14 and TTX-1 proteins were confirmed to be essential for AFD-specific expression of gcy-8 and gcy-18 in vivo. We also demonstrated that forced expression of CEH-14 and TTX-1 in AWB chemosensory neurons induced ectopic expression of gcy-8 and gcy-18 reporters in this neuron. Finally, we showed that the regulation of gcy-8 and gcy-18 expression by ceh-14 and ttx-1 is evolutionally conserved in five Caenorhabditis species. Taken together, ceh-14 and ttx-1 expression determines the fate of AFD as terminal selector genes at the final step of cell specification. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Update of the human secretoglobin (SCGB gene superfamily and an example of 'evolutionary bloom' of androgen-binding protein genes within the mouse Scgb gene superfamily

    Directory of Open Access Journals (Sweden)

    Jackson Brian C

    2011-10-01

    Full Text Available Abstract The secretoglobins (SCGBs comprise a family of small, secreted proteins found in animals exclusively of mammalian lineage. There are 11 human SCGB genes and five pseudogenes. Interestingly, mice have 68 Scgb genes, four of which are highly orthologous to human SCGB genes; the remainder represent an 'evolutionary bloom' and make up a large gene family represented by only six counterparts in humans. SCGBs are found in high concentrations in many mammalian secretions, including fluids of the lung, lacrimal gland, salivary gland, prostate and uterus. Whereas the biological activities of most individual SCGBs have not been fully characterised, what already has been discovered suggests that this family has an important role in the modulation of inflammation, tissue repair and tumorigenesis. In mice, the large Scgb1b and Scgb2b gene families encode the androgen-binding proteins, which have been shown to play a role in mate selection. Although much has been learned about SCGBs in recent years, clearly more research remains to be done to allow a better understanding of the roles of these proteins in human health and disease. Such information is predicted to reveal valuable novel drug targets for the treatment of inflammation, as well as designing biomarkers that might identify tissue damage or cancer.

  17. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm

    Energy Technology Data Exchange (ETDEWEB)

    SacconePhD, Scott F [Washington University, St. Louis; Chesler, Elissa J [ORNL; Bierut, Laura J [Washington University, St. Louis; Kalivas, Peter J [Medical College of South Carolina, Charleston; Lerman, Caryn [University of Pennsylvania; Saccone, Nancy L [Washington University, St. Louis; Uhl, George R [Johns Hopkins University; Li, Chuan-Yun [Peking University; Philip, Vivek M [ORNL; Edenberg, Howard [Indiana University; Sherry, Steven [National Center for Biotechnology Information; Feolo, Michael [National Center for Biotechnology Information; Moyzis, Robert K [Johns Hopkins University; Rutter, Joni L [National Institute of Drug Abuse

    2009-01-01

    Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.

  18. Arabidopsis CPR5 is a senescence-regulatory gene with pleiotropic functions as predicted by the evolutionary theory of senescence

    NARCIS (Netherlands)

    Jing, Hai-Chun; Anderson, Lisa; Sturre, Marcel J. G.; Hille, Jacques; Dijkwel, Paul P.

    2007-01-01

    Arabidopsis CPR5 is a senescence-regulatory gene with pleiotropic functions as predicted by the evolutionary theory of senescence Hai-Chun Jing1,2, Lisa Anderson3, Marcel J.G. Sturre1, Jacques Hille1 and Paul P. Dijkwel1,* 1Molecular Biology of Plants, Groningen Biomolecular Sciences and

  19. TAS3 miR390-dependent loci in non-vascular land plants: towards a comprehensive reconstruction of the gene evolutionary history

    Directory of Open Access Journals (Sweden)

    Sergey Y. Morozov

    2018-04-01

    Full Text Available Trans-acting small interfering RNAs (ta-siRNAs are transcribed from protein non-coding genomic TAS loci and belong to a plant-specific class of endogenous small RNAs. These siRNAs have been found to regulate gene expression in most taxa including seed plants, gymnosperms, ferns and mosses. In this study, bioinformatic and experimental PCR-based approaches were used as tools to analyze TAS3 and TAS6 loci in transcriptomes and genomic DNAs from representatives of evolutionary distant non-vascular plant taxa such as Bryophyta, Marchantiophyta and Anthocerotophyta. We revealed previously undiscovered TAS3 loci in plant classes Sphagnopsida and Anthocerotopsida, as well as TAS6 loci in Bryophyta classes Tetraphidiopsida, Polytrichopsida, Andreaeopsida and Takakiopsida. These data further unveil the evolutionary pathway of the miR390-dependent TAS3 loci in land plants. We also identified charophyte alga sequences coding for SUPPRESSOR OF GENE SILENCING 3 (SGS3, which is required for generation of ta-siRNAs in plants, and hypothesized that the appearance of TAS3-related sequences could take place at a very early step in evolutionary transition from charophyte algae to an earliest common ancestor of land plants.

  20. TAS3 miR390-dependent loci in non-vascular land plants: towards a comprehensive reconstruction of the gene evolutionary history.

    Science.gov (United States)

    Morozov, Sergey Y; Milyutina, Irina A; Erokhina, Tatiana N; Ozerova, Liudmila V; Troitsky, Alexey V; Solovyev, Andrey G

    2018-01-01

    Trans-acting small interfering RNAs (ta-siRNAs) are transcribed from protein non-coding genomic TAS loci and belong to a plant-specific class of endogenous small RNAs. These siRNAs have been found to regulate gene expression in most taxa including seed plants, gymnosperms, ferns and mosses. In this study, bioinformatic and experimental PCR-based approaches were used as tools to analyze TAS3 and TAS6 loci in transcriptomes and genomic DNAs from representatives of evolutionary distant non-vascular plant taxa such as Bryophyta, Marchantiophyta and Anthocerotophyta. We revealed previously undiscovered TAS3 loci in plant classes Sphagnopsida and Anthocerotopsida, as well as TAS6 loci in Bryophyta classes Tetraphidiopsida, Polytrichopsida, Andreaeopsida and Takakiopsida. These data further unveil the evolutionary pathway of the miR390-dependent TAS3 loci in land plants. We also identified charophyte alga sequences coding for SUPPRESSOR OF GENE SILENCING 3 (SGS3), which is required for generation of ta-siRNAs in plants, and hypothesized that the appearance of TAS3-related sequences could take place at a very early step in evolutionary transition from charophyte algae to an earliest common ancestor of land plants.

  1. phyloXML: XML for evolutionary biology and comparative genomics.

    Science.gov (United States)

    Han, Mira V; Zmasek, Christian M

    2009-10-27

    Evolutionary trees are central to a wide range of biological studies. In many of these studies, tree nodes and branches need to be associated (or annotated) with various attributes. For example, in studies concerned with organismal relationships, tree nodes are associated with taxonomic names, whereas tree branches have lengths and oftentimes support values. Gene trees used in comparative genomics or phylogenomics are usually annotated with taxonomic information, genome-related data, such as gene names and functional annotations, as well as events such as gene duplications, speciations, or exon shufflings, combined with information related to the evolutionary tree itself. The data standards currently used for evolutionary trees have limited capacities to incorporate such annotations of different data types. We developed a XML language, named phyloXML, for describing evolutionary trees, as well as various associated data items. PhyloXML provides elements for commonly used items, such as branch lengths, support values, taxonomic names, and gene names and identifiers. By using "property" elements, phyloXML can be adapted to novel and unforeseen use cases. We also developed various software tools for reading, writing, conversion, and visualization of phyloXML formatted data. PhyloXML is an XML language defined by a complete schema in XSD that allows storing and exchanging the structures of evolutionary trees as well as associated data. More information about phyloXML itself, the XSD schema, as well as tools implementing and supporting phyloXML, is available at http://www.phyloxml.org.

  2. Eco-evolutionary spatial dynamics in the Glanville fritillary butterfly.

    Science.gov (United States)

    Hanski, Ilkka A

    2011-08-30

    Demographic population dynamics, gene flow, and local adaptation may influence each other and lead to coupling of ecological and evolutionary dynamics, especially in species inhabiting fragmented heterogeneous environments. Here, I review long-term research on eco-evolutionary spatial dynamics in the Glanville fritillary butterfly inhabiting a large network of approximately 4,000 meadows in Finland. The metapopulation persists in a balance between frequent local extinctions and recolonizations. The genetic spatial structure as defined by neutral markers is much more coarse-grained than the demographic spatial structure determined by the fragmented habitat, yet small-scale spatial structure has important consequences for the dynamics. I discuss three examples of eco-evolutionary spatial dynamics. (i) Extinction-colonization metapopulation dynamics influence allele frequency changes in the phosphoglucose isomerase (Pgi) gene, which leads to strong associations between genetic variation in Pgi and dispersal, recolonization, and local population dynamics. (ii) Inbreeding in local populations increases their risk for extinction, whereas reciprocal effects between inbreeding, population size, and emigration represent likely eco-evolutionary feedbacks. (iii) Genetically determined female oviposition preference for two host plant species exhibits a cline paralleling a gradient in host plant relative abundances, and host plant preference of dispersing females in relation to the host plant composition of habitat patches influences immigration (gene flow) and recolonization (founder events). Eco-evolutionary spatial dynamics in heterogeneous environments may not lead to directional evolutionary changes unless the environment itself changes, but eco-evolutionary dynamics may contribute to the maintenance of genetic variation attributable to fluctuating selection in space and time.

  3. Coexpression of multidrug resistance involve proteins: a flow cytometric analysis.

    Science.gov (United States)

    Boutonnat, J; Bonnefoix, T; Mousseau, M; Seigneurin, D; Ronot, X

    1998-01-01

    Cross resistance to multiple natural cytotoxic products represents a major obstacle in myeloblastic acute leukaemia (AML). Multidrug resistance (MDR) often involves overexpression of plasma membrane drug transporter P-glycoprotein (PGP) or the resistance associated protein (MRP). Recently, a protein overexpressed in a non-PGP MDR lung cancer cell line and termed lung resistance related protein (LRP) was identified. These proteins are known to be associated with a bad prognosis in AML. We have developed a triple indirect labelling analysed by flow cytometry to detect the coexpression of these proteins. Since no cell line expressing all three antigens is known, we mixed K562 cells (resistant to Adriblastine, PGP+, MRP-, LRP-) with GLC4 cells (resistant to Adriblastine, PGP-, MRP+, LRP+) to create a model system to test the method. The antibodies used were UIC2 for PGP, MRPm6 for MRP and LRP56 for LRP. They were revealed by Fab'2 coupled with Fluoresceine-isothiocyanate, Phycoerythrin or Tricolor with isotype specificity. Cells were fixed and permeabilized after PGP labelling because MRPm6 and LRP56 recognize intracellular epitopes. PGP and LRP were easily detected. MRP is expressed at relatively low levels and was more difficult to detect because in the triple labelling the non specific staining was higher than in a single labelling. Despite the increased background in the triple labelling we were able to detect coexpression of PGP, MRP, LRP by flow cytometry. This method appears to be very useful to detect coexpression of markers in AML. Such coexpression could modify the therapeutic approach with revertants.

  4. Ancient Origin of the Tryptophan Operon and the Dynamics of Evolutionary Change†

    Science.gov (United States)

    Xie, Gary; Keyhani, Nemat O.; Bonner; Jensen, Roy A.

    2003-01-01

    The seven conserved enzymatic domains required for tryptophan (Trp) biosynthesis are encoded in seven genetic regions that are organized differently (whole-pathway operons, multiple partial-pathway operons, and dispersed genes) in prokaryotes. A comparative bioinformatics evaluation of the conservation and organization of the genes of Trp biosynthesis in prokaryotic operons should serve as an excellent model for assessing the feasibility of predicting the evolutionary histories of genes and operons associated with other biochemical pathways. These comparisons should provide a better understanding of possible explanations for differences in operon organization in different organisms at a genomics level. These analyses may also permit identification of some of the prevailing forces that dictated specific gene rearrangements during the course of evolution. Operons concerned with Trp biosynthesis in prokaryotes have been in a dynamic state of flux. Analysis of closely related organisms among the Bacteria at various phylogenetic nodes reveals many examples of operon scission, gene dispersal, gene fusion, gene scrambling, and gene loss from which the direction of evolutionary events can be deduced. Two milestone evolutionary events have been mapped to the 16S rRNA tree of Bacteria, one splitting the operon in two, and the other rejoining it by gene fusion. The Archaea, though less resolved due to a lesser genome representation, appear to exhibit more gene scrambling than the Bacteria. The trp operon appears to have been an ancient innovation; it was already present in the common ancestor of Bacteria and Archaea. Although the operon has been subjected, even in recent times, to dynamic changes in gene rearrangement, the ancestral gene order can be deduced with confidence. The evolutionary history of the genes of the pathway is discernible in rough outline as a vertical line of descent, with events of lateral gene transfer or paralogy enriching the analysis as interesting

  5. Ancient origin of the tryptophan operon and the dynamics of evolutionary change.

    Science.gov (United States)

    Xie, Gary; Keyhani, Nemat O; Bonner, Carol A; Jensen, Roy A

    2003-09-01

    The seven conserved enzymatic domains required for tryptophan (Trp) biosynthesis are encoded in seven genetic regions that are organized differently (whole-pathway operons, multiple partial-pathway operons, and dispersed genes) in prokaryotes. A comparative bioinformatics evaluation of the conservation and organization of the genes of Trp biosynthesis in prokaryotic operons should serve as an excellent model for assessing the feasibility of predicting the evolutionary histories of genes and operons associated with other biochemical pathways. These comparisons should provide a better understanding of possible explanations for differences in operon organization in different organisms at a genomics level. These analyses may also permit identification of some of the prevailing forces that dictated specific gene rearrangements during the course of evolution. Operons concerned with Trp biosynthesis in prokaryotes have been in a dynamic state of flux. Analysis of closely related organisms among the Bacteria at various phylogenetic nodes reveals many examples of operon scission, gene dispersal, gene fusion, gene scrambling, and gene loss from which the direction of evolutionary events can be deduced. Two milestone evolutionary events have been mapped to the 16S rRNA tree of Bacteria, one splitting the operon in two, and the other rejoining it by gene fusion. The Archaea, though less resolved due to a lesser genome representation, appear to exhibit more gene scrambling than the Bacteria. The trp operon appears to have been an ancient innovation; it was already present in the common ancestor of Bacteria and Archaea. Although the operon has been subjected, even in recent times, to dynamic changes in gene rearrangement, the ancestral gene order can be deduced with confidence. The evolutionary history of the genes of the pathway is discernible in rough outline as a vertical line of descent, with events of lateral gene transfer or paralogy enriching the analysis as interesting

  6. S187. SEARCHING FOR BRAIN CO-EXPRESSION MODULES THAT CONTRIBUTE DISPROPORTIONATELY TO THE COMMON POLYGENIC RISK FOR SCHIZOPHRENIA

    Science.gov (United States)

    Costas, Javier; Paramo, Mario; Arrojo, Manuel

    2018-01-01

    Abstract Background Genomic research has revealed that schizophrenia is a highly polygenic disease. Recent estimates indicate that at least 71% of genomic segments of 1 Mb include one or more risk loci for schizophrenia (Loh et al., Nature Genet 2015). This extremely high polygenicity represents a challenge to decipher the biological basis of schizophrenia, as it is expected that any set of SNPs with enough size will be associated with the disorder. Among the different gene sets available for study (such as those from Gene Ontology, KEGG pathway, Reactome pathways or protein protein interaction datasets), those based on brain co-expression networks represent putative functional relationships in the relevant tissue. The aim of this work was to identify brain co-expression networks that contribute disproportionately to the common polygenic risk for schizophrenia to get more insight on schizophrenia etiopathology. Methods We analyzed a case -control dataset consisting of 582 schizophrenia patients from Galicia, NW Spain, and 591 ancestrally matched controls, genotyped with the Illumina PsychArray. Using as discovery sample the summary results from the largest GWAS of schizophrenia to date (Psychiatric Genomics Consortium, SCZ2), we generated polygenic risk scores (PRS) in our sample based on SNPs located at genes belonging to brain co-expression modules determined by the CommonMind Consortium (Fromer et al., Nature Neurosci 2016). PRS were generated using the clumping procedure of PLINK, considering several different thresholds to select SNPs from the discovery sample. In order to test if any specific module increased risk to schizophrenia more than expected by their size, we generated up to 10,000 random permutations of the same number of SNPs, matched by frequency, distance to nearest gene, number of SNPs in LD and gene density, using SNPsnap. Results As expected, most modules with enough number of independent SNPs belonging to them showed a significant increase in

  7. Ethylene-Related Gene Expression Networks in Wood Formation

    Directory of Open Access Journals (Sweden)

    Carolin Seyfferth

    2018-03-01

    Full Text Available Thickening of tree stems is the result of secondary growth, accomplished by the meristematic activity of the vascular cambium. Secondary growth of the stem entails developmental cascades resulting in the formation of secondary phloem outwards and secondary xylem (i.e., wood inwards of the stem. Signaling and transcriptional reprogramming by the phytohormone ethylene modifies cambial growth and cell differentiation, but the molecular link between ethylene and secondary growth remains unknown. We addressed this shortcoming by analyzing expression profiles and co-expression networks of ethylene pathway genes using the AspWood transcriptome database which covers all stages of secondary growth in aspen (Populus tremula stems. ACC synthase expression suggests that the ethylene precursor 1-aminocyclopropane-1-carboxylic acid (ACC is synthesized during xylem expansion and xylem cell maturation. Ethylene-mediated transcriptional reprogramming occurs during all stages of secondary growth, as deduced from AspWood expression profiles of ethylene-responsive genes. A network centrality analysis of the AspWood dataset identified EIN3D and 11 ERFs as hubs. No overlap was found between the co-expressed genes of the EIN3 and ERF hubs, suggesting target diversification and hence independent roles for these transcription factor families during normal wood formation. The EIN3D hub was part of a large co-expression gene module, which contained 16 transcription factors, among them several new candidates that have not been earlier connected to wood formation and a VND-INTERACTING 2 (VNI2 homolog. We experimentally demonstrated Populus EIN3D function in ethylene signaling in Arabidopsis thaliana. The ERF hubs ERF118 and ERF119 were connected on the basis of their expression pattern and gene co-expression module composition to xylem cell expansion and secondary cell wall formation, respectively. We hereby establish data resources for ethylene-responsive genes and

  8. New insights into the evolutionary origins of the recombination-activating gene proteins and V(D)J recombination.

    Science.gov (United States)

    Carmona, Lina Marcela; Schatz, David G

    2017-06-01

    The adaptive immune system of jawed vertebrates relies on V(D)J recombination as one of the main processes to generate the diverse array of receptors necessary for the recognition of a wide range of pathogens. The DNA cleavage reaction necessary for the assembly of the antigen receptor genes from an array of potential gene segments is mediated by the recombination-activating gene proteins RAG1 and RAG2. The RAG proteins have been proposed to originate from a transposable element (TE) as they share mechanistic and structural similarities with several families of transposases and are themselves capable of mediating transposition. A number of RAG-like proteins and TEs with sequence similarity to RAG1 and RAG2 have been identified, but only recently has their function begun to be characterized, revealing mechanistic links to the vertebrate RAGs. Of particular significance is the discovery of ProtoRAG, a transposon superfamily found in the genome of the basal chordate amphioxus. ProtoRAG has many of the sequence and mechanistic features predicted for the ancestral RAG transposon and is likely to be an evolutionary relative of RAG1 and RAG2. In addition, early observations suggesting that RAG1 is able to mediate V(D)J recombination in the absence of RAG2 have been confirmed, implying independent evolutionary origins for the two RAG genes. Here, recent progress in identifying and characterizing RAG-like proteins and the TEs that encode them is summarized and a refined model for the evolution of V(D)J recombination and the RAG proteins is presented. © 2016 Federation of European Biochemical Societies.

  9. Incorporating evolutionary principles into environmental management and policy

    DEFF Research Database (Denmark)

    Lankau, Richard; Jørgensen, Peter Søgaard; Harris, David J.

    2011-01-01

    As policymakers and managers work to mitigate the effects of rapid anthropogenic environmental changes, they need to consider organisms’ responses. In light of recent evidence that evolution can be quite rapid, this now includes evolutionary responses. Evolutionary principles have a long history...... in conservation biology, and the necessary next step for the field is to consider ways in which conservation policy makers and managers can proactively manipulate evolutionary processes to achieve their goals. In this review, we aim to illustrate the potential conservation benefits of an increased understanding...... of evolutionary history and prescriptive manipulation of three basic evolutionary factors: selection, variation, and gene flow. For each, we review and propose ways that policy makers and managers can use evolutionary thinking to preserve threatened species, combat pest species, or reduce undesirable evolutionary...

  10. Selfish genetic elements, genetic conflict, and evolutionary innovation.

    Science.gov (United States)

    Werren, John H

    2011-06-28

    Genomes are vulnerable to selfish genetic elements (SGEs), which enhance their own transmission relative to the rest of an individual's genome but are neutral or harmful to the individual as a whole. As a result, genetic conflict occurs between SGEs and other genetic elements in the genome. There is growing evidence that SGEs, and the resulting genetic conflict, are an important motor for evolutionary change and innovation. In this review, the kinds of SGEs and their evolutionary consequences are described, including how these elements shape basic biological features, such as genome structure and gene regulation, evolution of new genes, origin of new species, and mechanisms of sex determination and development. The dynamics of SGEs are also considered, including possible "evolutionary functions" of SGEs.

  11. Phylodynamics and evolutionary epidemiology of African swine fever p72-CVR genes in Eurasia and Africa.

    Science.gov (United States)

    Alkhamis, Moh A; Gallardo, Carmina; Jurado, Cristina; Soler, Alejandro; Arias, Marisa; Sánchez-Vizcaíno, José M

    2018-01-01

    African swine fever (ASF) is a complex infectious disease of swine that constitutes devastating impacts on animal health and the world economy. Here, we investigated the evolutionary epidemiology of ASF virus (ASFV) in Eurasia and Africa using the concatenated gene sequences of the viral protein 72 and the central variable region of isolates collected between 1960 and 2015. We used Bayesian phylodynamic models to reconstruct the evolutionary history of the virus, to identify virus population demographics and to quantify dispersal patterns between host species. Results suggest that ASFV exhibited a significantly high evolutionary rate and population growth through time since its divergence in the 18th century from East Africa, with no signs of decline till recent years. This increase corresponds to the growing pig trade activities between continents during the 19th century, and may be attributed to an evolutionary drift that resulted from either continuous circulation or maintenance of the virus within Africa and Eurasia. Furthermore, results implicate wild suids as the ancestral host species (root state posterior probability = 0.87) for ASFV in the early 1700s in Africa. Moreover, results indicate the transmission cycle between wild suids and pigs is an important cycle for ASFV spread and maintenance in pig populations, while ticks are an important natural reservoir that can facilitate ASFV spread and maintenance in wild swine populations. We illustrated the prospects of phylodynamic methods in improving risk-based surveillance, support of effective animal health policies, and epidemic preparedness in countries at high risk of ASFV incursion.

  12. Differential Network Analysis Reveals Evolutionary Complexity in Secondary Metabolism of Rauvolfia serpentina over Catharanthus roseus.

    Science.gov (United States)

    Pathania, Shivalika; Bagler, Ganesh; Ahuja, Paramvir S

    2016-01-01

    Comparative co-expression analysis of multiple species using high-throughput data is an integrative approach to determine the uniformity as well as diversification in biological processes. Rauvolfia serpentina and Catharanthus roseus, both members of Apocyanacae family, are reported to have remedial properties against multiple diseases. Despite of sharing upstream of terpenoid indole alkaloid pathway, there is significant diversity in tissue-specific synthesis and accumulation of specialized metabolites in these plants. This led us to implement comparative co-expression network analysis to investigate the modules and genes responsible for differential tissue-specific expression as well as species-specific synthesis of metabolites. Toward these goals differential network analysis was implemented to identify candidate genes responsible for diversification of metabolites profile. Three genes were identified with significant difference in connectivity leading to differential regulatory behavior between these plants. These genes may be responsible for diversification of secondary metabolism, and thereby for species-specific metabolite synthesis. The network robustness of R. serpentina, determined based on topological properties, was also complemented by comparison of gene-metabolite networks of both plants, and may have evolved to have complex metabolic mechanisms as compared to C. roseus under the influence of various stimuli. This study reveals evolution of complexity in secondary metabolism of R. serpentina, and key genes that contribute toward diversification of specific metabolites.

  13. Coexpression of EGFR and CXCR4 predicts poor prognosis in resected pancreatic ductal adenocarcinoma.

    Directory of Open Access Journals (Sweden)

    Huanwen Wu

    Full Text Available Epidermal growth factor receptor (EGFR is highly expressed in pancreatic ductal adenocarcinoma (PDAC and is involved in tumorigenesis and development. However, EGFR expression alone has limited clinical and prognostic significance. Recently, the cross-talk between EGFR and G-protein-coupled chemokine receptor CXCR4 has become increasingly recognized.In the present study, immunohistochemical staining of EGFR and CXCR4 was performed on paraffin-embedded specimens from 131 patients with surgically resected PDAC. Subsequently, the associations between EGFR expression, CXCR4 expression, EGFR/CXCR4 coexpression and clinicopathologic factors were assessed, and survival analyses were performed.In total, 64 (48.9% patients expressed EGFR, 68 (51.9% expressed CXCR4, and 33 (25.2% coexpressed EGFR and CXCR4. No significant association between EGFR and CXCR4 expression was observed (P = 0.938. EGFR expression significantly correlated with tumor differentiation (P = 0.031, whereas CXCR4 expression significantly correlated with lymph node metastasis (P = 0.001. EGFR/CXCR4 coexpression was significantly associated with lymph node metastasis (P = 0.026, TNM stage (P = 0.048, and poor tumor differentiation (P = 0.004. By univariate survival analysis, both CXCR4 expression and EGFR/CXCR4 coexpression were significant prognostic factors for poor disease-free survival (DFS and overall survival (OS. Moreover, EGFR/CXCR4 coexpression significantly increased the hazard ratio for both recurrence and death compared with EGFR or CXCR4 protein expression alone. Multivariate survival analysis demonstrated that EGFR/CXCR4 coexpression was an independent prognostic factor for DFS (HR = 2.33, P<0.001 and OS (HR = 2.48, P = 0.001.In conclusion, our data indicate that although EGFR expression alone has limited clinical and prognostic significance, EGFR/CXCR4 coexpression identified a subset of PDAC patients with more aggressive tumor characteristics and a significantly worse

  14. Cooperation and conflict in cancer: An evolutionary perspective

    Directory of Open Access Journals (Sweden)

    Jonathan Featherston

    2012-09-01

    Full Text Available Evolutionary approaches to carcinogenesis have gained prominence in the literature and enhanced our understanding of cancer. However, an appreciation of neoplasia in the context of evolutionary transitions, particularly the transition from independent genes to a fullyintegrated genome, is largely absent. In the gene–genome evolutionary transition, mobile genetic elements (MGEs can be studied as the extant exemplars of selfish autonomous lowerlevel units that cooperated to form a higher-level, functionally integrated genome. Here,we discuss levels of selection in cancer cells. In particular, we examine the tension between gene and genome units of selection by examining the expression profiles of MGE domains in an array of human cancers. Overall, across diverse cancers, there is an aberrant expression of several families of mobile elements, including the most common MGE in the human genome, retrotransposon LINE 1. These results indicate an alternative life-history strategy for MGEs in the cancers studied. Whether the aberrant expression is the cause or effect oftumourigenesis is unknown, although some evidence suggests that dysregulation of MGEs can play a role in cancer origin and progression. These data are interpreted in combination with phylostratigraphic reports correlating the origin of cancer genes with multicellularity and other potential increases in complexity in cancer cell populations. Cooperation and conflict between individuals at the gene, genome and cell level provide an evolutionary medicineperspective of cancer that enhances our understanding of disease pathogenesis and treatment.

  15. Highly restricted gene flow and deep evolutionary lineages in the giant clam Tridacna maxima

    Science.gov (United States)

    Nuryanto, A.; Kochzius, M.

    2009-09-01

    The tropical Indo-West Pacific is the biogeographic region with the highest diversity of marine shallow water species, with its centre in the Indo-Malay Archipelago. However, due to its high endemism, the Red Sea is also considered as an important centre of evolution. Currently, not much is known about exchange among the Red Sea, Indian Ocean and West Pacific, as well as connectivity within the Indo-Malay Archipelago, even though such information is important to illuminate ecological and evolutionary processes that shape marine biodiversity in these regions. In addition, the inference of connectivity among populations is important for conservation. This study aims to test the hypothesis that the Indo-Malay Archipelago and the Red Sea are important centres of evolution by studying the genetic population structure of the giant clam Tridacna maxima. This study is based on a 484-bp fragment of the cytochrome c oxidase I gene from 211 individuals collected at 14 localities in the Indo-West Pacific to infer lineage diversification and gene flow as a measure for connectivity. The analysis showed a significant genetic differentiation among sample sites in the Indo-West Pacific (Φst = 0.74, P < 0.001) and across the Indo-Malay Archipelago (Φst = 0.72, P < 0.001), indicating restricted gene flow. Hierarchical AMOVA revealed the highest fixation index (Φct = 0.8, P < 0.001) when sample sites were assigned to the following regions: (1) Red Sea, (2) Indian Ocean and Java Sea, (3) Indonesian throughflow and seas in the East of Sulawesi, and (4) Western Pacific. Geological history as well as oceanography are important factors that shape the genetic structure of T. maxima in the Indo-Malay Archipelago and Red Sea. The observed deep evolutionary lineages might include cryptic species and this result supports the notion that the Indo-Malay Archipelago and the Red Sea are important centres of evolution.

  16. The sociobiology of genes: the gene's eye view as a unifying behavioural-ecological framework for biological evolution.

    Science.gov (United States)

    De Tiège, Alexis; Van de Peer, Yves; Braeckman, Johan; Tanghe, Koen B

    2017-11-22

    Although classical evolutionary theory, i.e., population genetics and the Modern Synthesis, was already implicitly 'gene-centred', the organism was, in practice, still generally regarded as the individual unit of which a population is composed. The gene-centred approach to evolution only reached a logical conclusion with the advent of the gene-selectionist or gene's eye view in the 1960s and 1970s. Whereas classical evolutionary theory can only work with (genotypically represented) fitness differences between individual organisms, gene-selectionism is capable of working with fitness differences among genes within the same organism and genome. Here, we explore the explanatory potential of 'intra-organismic' and 'intra-genomic' gene-selectionism, i.e., of a behavioural-ecological 'gene's eye view' on genetic, genomic and organismal evolution. First, we give a general outline of the framework and how it complements the-to some extent-still 'organism-centred' approach of classical evolutionary theory. Secondly, we give a more in-depth assessment of its explanatory potential for biological evolution, i.e., for Darwin's 'common descent with modification' or, more specifically, for 'historical continuity or homology with modular evolutionary change' as it has been studied by evolutionary developmental biology (evo-devo) during the last few decades. In contrast with classical evolutionary theory, evo-devo focuses on 'within-organism' developmental processes. Given the capacity of gene-selectionism to adopt an intra-organismal gene's eye view, we outline the relevance of the latter model for evo-devo. Overall, we aim for the conceptual integration between the gene's eye view on the one hand, and more organism-centred evolutionary models (both classical evolutionary theory and evo-devo) on the other.

  17. Identification of conserved drought stress responsive gene-network across tissues and developmental stages in rice.

    Science.gov (United States)

    Smita, Shuchi; Katiyar, Amit; Pandey, Dev Mani; Chinnusamy, Viswanathan; Archak, Sunil; Bansal, Kailash Chander

    2013-01-01

    Identification of genes that are coexpressed across various tissues and environmental stresses is biologically interesting, since they may play coordinated role in similar biological processes. Genes with correlated expression patterns can be best identified by using coexpression network analysis of transcriptome data. In the present study, we analyzed the temporal-spatial coordination of gene expression in root, leaf and panicle of rice under drought stress and constructed network using WGCNA and Cytoscape. Total of 2199 differentially expressed genes (DEGs) were identified in at least three or more tissues, wherein 88 genes have coordinated expression profile among all the six tissues under drought stress. These 88 highly coordinated genes were further subjected to module identification in the coexpression network. Based on chief topological properties we identified 18 hub genes such as ABC transporter, ATP-binding protein, dehydrin, protein phosphatase 2C, LTPL153 - Protease inhibitor, phosphatidylethanolaminebinding protein, lactose permease-related, NADP-dependent malic enzyme, etc. Motif enrichment analysis showed the presence of ABRE cis-elements in the promoters of > 62% of the coordinately expressed genes. Our results suggest that drought stress mediated upregulated gene expression was coordinated through an ABA-dependent signaling pathway across tissues, at least for the subset of genes identified in this study, while down regulation appears to be regulated by tissue specific pathways in rice.

  18. Identification of Gene Biomarkers for Distinguishing Small-Cell Lung Cancer from Non-Small-Cell Lung Cancer Using a Network-Based Approach

    Directory of Open Access Journals (Sweden)

    Fei Long

    2015-01-01

    Full Text Available Lung cancer consists of two main subtypes: small-cell lung cancer (SCLC and non-small-cell lung cancer (NSCLC that are classified according to their physiological phenotypes. In this study, we have developed a network-based approach to identify molecular biomarkers that can distinguish SCLC from NSCLC. By identifying positive and negative coexpression gene pairs in normal lung tissues, SCLC, or NSCLC samples and using functional association information from the STRING network, we first construct a lung cancer-specific gene association network. From the network, we obtain gene modules in which genes are highly functionally associated with each other and are either positively or negatively coexpressed in the three conditions. Then, we identify gene modules that not only are differentially expressed between cancer and normal samples, but also show distinctive expression patterns between SCLC and NSCLC. Finally, we select genes inside those modules with discriminating coexpression patterns between the two lung cancer subtypes and predict them as candidate biomarkers that are of diagnostic use.

  19. Phylogenetic distribution and evolutionary dynamics of the sex determination genes doublesex and transformer in insects.

    Science.gov (United States)

    Geuverink, E; Beukeboom, L W

    2014-01-01

    Sex determination in insects is characterized by a gene cascade that is conserved at the bottom but contains diverse primary signals at the top. The bottom master switch gene doublesex is found in all insects. Its upstream regulator transformer is present in the orders Hymenoptera, Coleoptera and Diptera, but has thus far not been found in Lepidoptera and in the basal lineages of Diptera. transformer is presumed to be ancestral to the holometabolous insects based on its shared domains and conserved features of autoregulation and sex-specific splicing. We interpret that its absence in basal lineages of Diptera and its order-specific conserved domains indicate multiple independent losses or recruitments into the sex determination cascade. Duplications of transformer are found in derived families within the Hymenoptera, characterized by their complementary sex determination mechanism. As duplications are not found in any other insect order, they appear linked to the haplodiploid reproduction of the Hymenoptera. Further phylogenetic analyses combined with functional studies are needed to understand the evolutionary history of the transformer gene among insects. © 2013 S. Karger AG, Basel.

  20. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature

    Directory of Open Access Journals (Sweden)

    Ning Ye

    2015-01-01

    Full Text Available The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  1. Uncovering robust patterns of microRNA co-expression across cancers using Bayesian Relevance Networks.

    Directory of Open Access Journals (Sweden)

    Parameswaran Ramachandran

    Full Text Available Co-expression networks have long been used as a tool for investigating the molecular circuitry governing biological systems. However, most algorithms for constructing co-expression networks were developed in the microarray era, before high-throughput sequencing-with its unique statistical properties-became the norm for expression measurement. Here we develop Bayesian Relevance Networks, an algorithm that uses Bayesian reasoning about expression levels to account for the differing levels of uncertainty in expression measurements between highly- and lowly-expressed entities, and between samples with different sequencing depths. It combines data from groups of samples (e.g., replicates to estimate group expression levels and confidence ranges. It then computes uncertainty-moderated estimates of cross-group correlations between entities, and uses permutation testing to assess their statistical significance. Using large scale miRNA data from The Cancer Genome Atlas, we show that our Bayesian update of the classical Relevance Networks algorithm provides improved reproducibility in co-expression estimates and lower false discovery rates in the resulting co-expression networks. Software is available at www.perkinslab.ca.

  2. Differential network analysis reveals evolutionary complexity in secondary metabolism of Rauvolfia serpentina over Catharanthus roseus

    Directory of Open Access Journals (Sweden)

    Shivalika Pathania

    2016-08-01

    Full Text Available Comparative co-expression analysis of multiple species using high-throughput data is an integrative approach to determine the uniformity as well as diversification in biological processes. Rauvolfia serpentina and Catharanthus roseus, both members of Apocyanacae family, are reported to have remedial properties against multiple diseases. Despite of sharing upstream of terpenoid indole alkaloid pathway, there is significant diversity in tissue-specific synthesis and accumulation of specialized metabolites in these plants. This led us to implement comparative co-expression network analysis to investigate the modules and genes responsible for differential tissue-specific expression as well as species-specific synthesis of metabolites. Towards these goals differential network analysis was implemented to identify candidate genes responsible for diversification of metabolites profile. Three genes were identified with significant difference in connectivity leading to differential regulatory behavior between these plants. These mechanisms may be responsible for diversification of secondary metabolism, and thereby for species-specific metabolite synthesis. The network robustness of R. serpentina, determined based on topological properties, was also complemented by comparison of gene-metabolite networks of both plants, and may have evolved to have complex metabolic mechanisms as compared to C. roseus under the influence of various stimuli. This study reveals evolution of complexity in secondary metabolism of Rauvolfia serpentina, and key genes that contribute towards diversification of specific metabolites.

  3. Co-expression networks reveal the tissue-specific regulation of transcription and splicing.

    Science.gov (United States)

    Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D H; Jo, Brian; Gao, Chuan; McDowell, Ian C; Engelhardt, Barbara E; Battle, Alexis

    2017-11-01

    Gene co-expression networks capture biologically important patterns in gene expression data, enabling functional analyses of genes, discovery of biomarkers, and interpretation of genetic variants. Most network analyses to date have been limited to assessing correlation between total gene expression levels in a single tissue or small sets of tissues. Here, we built networks that additionally capture the regulation of relative isoform abundance and splicing, along with tissue-specific connections unique to each of a diverse set of tissues. We used the Genotype-Tissue Expression (GTEx) project v6 RNA sequencing data across 50 tissues and 449 individuals. First, we developed a framework called Transcriptome-Wide Networks (TWNs) for combining total expression and relative isoform levels into a single sparse network, capturing the interplay between the regulation of splicing and transcription. We built TWNs for 16 tissues and found that hubs in these networks were strongly enriched for splicing and RNA binding genes, demonstrating their utility in unraveling regulation of splicing in the human transcriptome. Next, we used a Bayesian biclustering model that identifies network edges unique to a single tissue to reconstruct Tissue-Specific Networks (TSNs) for 26 distinct tissues and 10 groups of related tissues. Finally, we found genetic variants associated with pairs of adjacent nodes in our networks, supporting the estimated network structures and identifying 20 genetic variants with distant regulatory impact on transcription and splicing. Our networks provide an improved understanding of the complex relationships of the human transcriptome across tissues. © 2017 Saha et al.; Published by Cold Spring Harbor Laboratory Press.

  4. Assessing fluctuating evolutionary pressure in yeast and mammal evolutionary rate covariation using bioinformatics of meiotic protein genetic sequences

    Science.gov (United States)

    Dehipawala, Sunil; Nguyen, A.; Tremberger, G.; Cheung, E.; Holden, T.; Lieberman, D.; Cheung, T.

    2013-09-01

    The evolutionary rate co-variation in meiotic proteins has been reported for yeast and mammal using phylogenic branch lengths which assess retention, duplication and mutation. The bioinformatics of the corresponding DNA sequences could be classified as a diagram of fractal dimension and Shannon entropy. Results from biomedical gene research provide examples on the diagram methodology. The identification of adaptive selection using entropy marker and functional-structural diversity using fractal dimension would support a regression analysis where the coefficient of determination would serve as evolutionary pathway marker for DNA sequences and be an important component in the astrobiology community. Comparisons between biomedical genes such as EEF2 (elongation factor 2 human, mouse, etc), WDR85 in epigenetics, HAR1 in human specificity, clinical trial targeted cancer gene CD47, SIRT6 in spermatogenesis, and HLA-C in mosquito bite immunology demonstrate the diagram classification methodology. Comparisons to the SEPT4-XIAP pair in stem cell apoptosis, testesexpressed taste genes TAS1R3-GNAT3 pair, and amyloid beta APLP1-APLP2 pair with the yeast-mammal DNA sequences for meiotic proteins RAD50-MRE11 pair and NCAPD2-ICK pair have accounted for the observed fluctuating evolutionary pressure systematically. Regression with high R-sq values or a triangular-like cluster pattern for concordant pairs in co-variation among the studied species could serve as evidences for the possible location of common ancestors in the entropy-fractal dimension diagram, consistent with an example of the human-chimp common ancestor study using the FOXP2 regulated genes reported in human fetal brain study. The Deinococcus radiodurans R1 Rad-A could be viewed as an outlier in the RAD50 diagram and also in the free energy versus fractal dimension regression Cook's distance, consistent with a non-Earth source for this radiation resistant bacterium. Convergent and divergent fluctuating evolutionary

  5. Do we need an extended evolutionary synthesis?

    Science.gov (United States)

    Pigliucci, Massimo

    2007-12-01

    The Modern Synthesis (MS) is the current paradigm in evolutionary biology. It was actually built by expanding on the conceptual foundations laid out by its predecessors, Darwinism and neo-Darwinism. For sometime now there has been talk of a new Extended Evolutionary Synthesis (EES), and this article begins to outline why we may need such an extension, and how it may come about. As philosopher Karl Popper has noticed, the current evolutionary theory is a theory of genes, and we still lack a theory of forms. The field began, in fact, as a theory of forms in Darwin's days, and the major goal that an EES will aim for is a unification of our theories of genes and of forms. This may be achieved through an organic grafting of novel concepts onto the foundational structure of the MS, particularly evolvability, phenotypic plasticity, epigenetic inheritance, complexity theory, and the theory of evolution in highly dimensional adaptive landscapes.

  6. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    Science.gov (United States)

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.

  7. [Evolutionary medicine: the future looking at the past].

    Science.gov (United States)

    Carvalho, Serafim; Rosado, Margarida

    2008-01-01

    Evolutionary medicine is an emergent basic science that offers new and varied perspectives to the comprehension of the human health and disease, considering them as a result of a gap between our modern lives and the environment where human beings evolve. This work's goals are to understand the importance of the evolutionary theories on concepts of health and disease, providing a new insight on medicine investigation. This bibliography review is based on Medline and PsycINFO articles research between 1996 and 2007 about review and experimental studies published in English, using the key words evolutionary and medicine, psychiatry, psychology, behaviour, health, disease, gene. There were selected forty-five articles based on and with special interest on the authors' practice. There were also consulted some allusive books. The present human genome and phenotypes are essentially Palaeolithic ones: they are not adapted to the modern life style, thus favouring the so called diseases of civilization. Fitting evolutionary strategies, apparently protective ones, when excessive, are the core syndromes of many emotional disruptive behaviours and diseases. Having the stone age's genes, we are obliged to live in the space age. With the evolutionary approach, postmodern medicine is detecting better the vulnerabilities, restrictions, biases, adaptations and maladaptations of human body, its actual diseases and its preventions and treatment.

  8. Evolutionary analysis of hepatitis C virus gene sequences from 1953

    Science.gov (United States)

    Gray, Rebecca R.; Tanaka, Yasuhito; Takebe, Yutaka; Magiorkinis, Gkikas; Buskell, Zelma; Seeff, Leonard; Alter, Harvey J.; Pybus, Oliver G.

    2013-01-01

    Reconstructing the transmission history of infectious diseases in the absence of medical or epidemiological records often relies on the evolutionary analysis of pathogen genetic sequences. The precision of evolutionary estimates of epidemic history can be increased by the inclusion of sequences derived from ‘archived’ samples that are genetically distinct from contemporary strains. Historical sequences are especially valuable for viral pathogens that circulated for many years before being formally identified, including HIV and the hepatitis C virus (HCV). However, surprisingly few HCV isolates sampled before discovery of the virus in 1989 are currently available. Here, we report and analyse two HCV subgenomic sequences obtained from infected individuals in 1953, which represent the oldest genetic evidence of HCV infection. The pairwise genetic diversity between the two sequences indicates a substantial period of HCV transmission prior to the 1950s, and their inclusion in evolutionary analyses provides new estimates of the common ancestor of HCV in the USA. To explore and validate the evolutionary information provided by these sequences, we used a new phylogenetic molecular clock method to estimate the date of sampling of the archived strains, plus the dates of four more contemporary reference genomes. Despite the short fragments available, we conclude that the archived sequences are consistent with a proposed sampling date of 1953, although statistical uncertainty is large. Our cross-validation analyses suggest that the bias and low statistical power observed here likely arise from a combination of high evolutionary rate heterogeneity and an unstructured, star-like phylogeny. We expect that attempts to date other historical viruses under similar circumstances will meet similar problems. PMID:23938759

  9. The evolutionary fate of the genes encoding the purine catabolic enzymes in hominoids, birds, and reptiles.

    Science.gov (United States)

    Keebaugh, Alaine C; Thomas, James W

    2010-06-01

    Gene loss has been proposed to play a major role in adaptive evolution, and recent studies are beginning to reveal its importance in human evolution. However, the potential consequence of a single gene-loss event upon the fates of functionally interrelated genes is poorly understood. Here, we use the purine metabolic pathway as a model system in which to explore this important question. The loss of urate oxidase (UOX) activity, a necessary step in this pathway, has occurred independently in the hominoid and bird/reptile lineages. Because the loss of UOX would have removed the functional constraint upon downstream genes in this pathway, these downstream genes are generally assumed to have subsequently deteriorated. In this study, we used a comparative genomics approach to empirically determine the fate of UOX itself and the downstream genes in five hominoids, two birds, and a reptile. Although we found that the loss of UOX likely triggered the genetic deterioration of the immediate downstream genes in the hominoids, surprisingly in the birds and reptiles, the UOX locus itself and some of the downstream genes were present in the genome and predicted to encode proteins. To account for the variable pattern of gene retention and loss after the inactivation of UOX, we hypothesize that although gene loss is a common fate for genes that have been rendered obsolete due to the upstream loss of an enzyme a metabolic pathway, it is also possible that same lack of constraint will foster the evolution of new functions or allow the optimization of preexisting alternative functions in the downstream genes, thereby resulting in gene retention. Thus, adaptive single-gene losses have the potential to influence the long-term evolutionary fate of functionally interrelated genes.

  10. Hydroxylation of recombinant human collagen type I alpha 1 in transgenic maize co-expressed with a recombinant human prolyl 4-hydroxylase

    Directory of Open Access Journals (Sweden)

    Pappu Kameshwari M

    2011-06-01

    Full Text Available Abstract Background Collagens require the hydroxylation of proline (Pro residues in their triple-helical domain repeating sequence Xaa-Pro-Gly to function properly as a main structural component of the extracellular matrix in animals at physiologically relevant conditions. The regioselective proline hydroxylation is catalyzed by a specific prolyl 4-hydroxylase (P4H as a posttranslational processing step. Results A recombinant human collagen type I α-1 (rCIα1 with high percentage of hydroxylated prolines (Hyp was produced in transgenic maize seeds when co-expressed with both the α- and β- subunits of a recombinant human P4H (rP4H. Germ-specific expression of rCIα1 using maize globulin-1 gene promoter resulted in an average yield of 12 mg/kg seed for the full-length rCIα1 in seeds without co-expression of rP4H and 4 mg/kg seed for the rCIα1 (rCIα1-OH in seeds with co-expression of rP4H. High-resolution mass spectrometry (HRMS analysis revealed that nearly half of the collagenous repeating triplets in rCIα1 isolated from rP4H co-expressing maize line had the Pro residues changed to Hyp residues. The HRMS analysis determined the Hyp content of maize-derived rCIα1-OH as 18.11%, which is comparable to the Hyp level of yeast-derived rCIα1-OH (17.47% and the native human CIa1 (14.59%, respectively. The increased Hyp percentage was correlated with a markedly enhanced thermal stability of maize-derived rCIα1-OH when compared to the non-hydroxylated rCIα1. Conclusions This work shows that maize has potential to produce adequately modified exogenous proteins with mammalian-like post-translational modifications that may be require for their use as pharmaceutical and industrial products.

  11. Evolutionary paths of streptococcal and staphylococcal superantigens

    Directory of Open Access Journals (Sweden)

    Okumura Kayo

    2012-08-01

    Full Text Available Abstract Background Streptococcus pyogenes (GAS harbors several superantigens (SAgs in the prophage region of its genome, although speG and smez are not located in this region. The diversity of SAgs is thought to arise during horizontal transfer, but their evolutionary pathways have not yet been determined. We recently completed sequencing the entire genome of S. dysgalactiae subsp. equisimilis (SDSE, the closest relative of GAS. Although speG is the only SAg gene of SDSE, speG was present in only 50% of clinical SDSE strains and smez in none. In this study, we analyzed the evolutionary paths of streptococcal and staphylococcal SAgs. Results We compared the sequences of the 12–60 kb speG regions of nine SDSE strains, five speG+ and four speG–. We found that the synteny of this region was highly conserved, whether or not the speG gene was present. Synteny analyses based on genome-wide comparisons of GAS and SDSE indicated that speG is the direct descendant of a common ancestor of streptococcal SAgs, whereas smez was deleted from SDSE after SDSE and GAS split from a common ancestor. Cumulative nucleotide skew analysis of SDSE genomes suggested that speG was located outside segments of steeper slopes than the stable region in the genome, whereas the region flanking smez was unstable, as expected from the results of GAS. We also detected a previously undescribed staphylococcal SAg gene, selW, and a staphylococcal SAg -like gene, ssl, in the core genomes of all Staphylococcus aureus strains sequenced. Amino acid substitution analyses, based on dN/dS window analysis of the products encoded by speG, selW and ssl suggested that all three genes have been subjected to strong positive selection. Evolutionary analysis based on the Bayesian Markov chain Monte Carlo method showed that each clade included at least one direct descendant. Conclusions Our findings reveal a plausible model for the comprehensive evolutionary pathway of streptococcal and

  12. Early Developmental and Evolutionary Origins of Gene Body DNA Methylation Patterns in Mammalian Placentas.

    Directory of Open Access Journals (Sweden)

    Diane I Schroeder

    2015-08-01

    Full Text Available Over the last 20-80 million years the mammalian placenta has taken on a variety of morphologies through both divergent and convergent evolution. Recently we have shown that the human placenta genome has a unique epigenetic pattern of large partially methylated domains (PMDs and highly methylated domains (HMDs with gene body DNA methylation positively correlating with level of gene expression. In order to determine the evolutionary conservation of DNA methylation patterns and transcriptional regulatory programs in the placenta, we performed a genome-wide methylome (MethylC-seq analysis of human, rhesus macaque, squirrel monkey, mouse, dog, horse, and cow placentas as well as opossum extraembryonic membrane. We found that, similar to human placenta, mammalian placentas and opossum extraembryonic membrane have globally lower levels of methylation compared to somatic tissues. Higher relative gene body methylation was the conserved feature across all mammalian placentas, despite differences in PMD/HMDs and absolute methylation levels. Specifically, higher methylation over the bodies of genes involved in mitosis, vesicle-mediated transport, protein phosphorylation, and chromatin modification was observed compared with the rest of the genome. As in human placenta, higher methylation is associated with higher gene expression and is predictive of genic location across species. Analysis of DNA methylation in oocytes and preimplantation embryos shows a conserved pattern of gene body methylation similar to the placenta. Intriguingly, mouse and cow oocytes and mouse early embryos have PMD/HMDs but their placentas do not, suggesting that PMD/HMDs are a feature of early preimplantation methylation patterns that become lost during placental development in some species and following implantation of the embryo.

  13. Evolution by Pervasive Gene Fusion in Antibiotic Resistance and Antibiotic Synthesizing Genes

    Directory of Open Access Journals (Sweden)

    Orla Coleman

    2015-03-01

    Full Text Available Phylogenetic (tree-based approaches to understanding evolutionary history are unable to incorporate convergent evolutionary events where two genes merge into one. In this study, as exemplars of what can be achieved when a tree is not assumed a priori, we have analysed the evolutionary histories of polyketide synthase genes and antibiotic resistance genes and have shown that their history is replete with convergent events as well as divergent events. We demonstrate that the overall histories of these genes more closely resembles the remodelling that might be seen with the children’s toy Lego, than the standard model of the phylogenetic tree. This work demonstrates further that genes can act as public goods, available for re-use and incorporation into other genetic goods.

  14. Genome-wide identification, evolutionary and expression analysis of the aspartic protease gene superfamily in grape

    Science.gov (United States)

    2013-01-01

    Background Aspartic proteases (APs) are a large family of proteolytic enzymes found in almost all organisms. In plants, they are involved in many biological processes, such as senescence, stress responses, programmed cell death, and reproduction. Prior to the present study, no grape AP gene(s) had been reported, and their research on woody species was very limited. Results In this study, a total of 50 AP genes (VvAP) were identified in the grape genome, among which 30 contained the complete ASP domain. Synteny analysis within grape indicated that segmental and tandem duplication events contributed to the expansion of the grape AP family. Additional analysis between grape and Arabidopsis demonstrated that several grape AP genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grape and Arabidopsis. Phylogenetic relationships of the 30 VvAPs with the complete ASP domain and their Arabidopsis orthologs, as well as their gene and protein features were analyzed and their cellular localization was predicted. Moreover, expression profiles of VvAP genes in six different tissues were determined, and their transcript abundance under various stresses and hormone treatments were measured. Twenty-seven VvAP genes were expressed in at least one of the six tissues examined; nineteen VvAPs responded to at least one abiotic stress, 12 VvAPs responded to powdery mildew infection, and most of the VvAPs responded to SA and ABA treatments. Furthermore, integrated synteny and phylogenetic analysis identified orthologous AP genes between grape and Arabidopsis, providing a unique starting point for investigating the function of grape AP genes. Conclusions The genome-wide identification, evolutionary and expression analyses of grape AP genes provide a framework for future analysis of AP genes in defining their roles during stress response. Integrated synteny and phylogenetic analyses provide novel insight into the

  15. Oxytocin Pathway Genes: Evolutionary Ancient System Impacting on Human Affiliation, Sociality, and Psychopathology.

    Science.gov (United States)

    Feldman, Ruth; Monakhov, Mikhail; Pratt, Maayan; Ebstein, Richard P

    2016-02-01

    Oxytocin (OT), a nonapeptide signaling molecule originating from an ancestral peptide, appears in different variants across all vertebrate and several invertebrate species. Throughout animal evolution, neuropeptidergic signaling has been adapted by organisms for regulating response to rapidly changing environments. The family of OT-like molecules affects both peripheral tissues implicated in reproduction, homeostasis, and energy balance, as well as neuromodulation of social behavior, stress regulation, and associative learning in species ranging from nematodes to humans. After describing the OT-signaling pathway, we review research on the three genes most extensively studied in humans: the OT receptor (OXTR), the structural gene for OT (OXT/neurophysin-I), and CD38. Consistent with the notion that sociality should be studied from the perspective of social life at the species level, we address human social functions in relation to OT-pathway genes, including parenting, empathy, and using social relationships to manage stress. We then describe associations between OT-pathway genes with psychopathologies involving social dysfunctions such as autism, depression, or schizophrenia. Human research particularly underscored the involvement of two OXTR single nucleotide polymorphisms (rs53576, rs2254298) with fewer studies focusing on other OXTR (rs7632287, rs1042778, rs2268494, rs2268490), OXT (rs2740210, rs4813627, rs4813625), and CD38 (rs3796863, rs6449197) single nucleotide polymorphisms. Overall, studies provide evidence for the involvement of OT-pathway genes in human social functions but also suggest that factors such as gender, culture, and early environment often confound attempts to replicate first findings. We conclude by discussing epigenetics, conceptual implications within an evolutionary perspective, and future directions, especially the need to refine phenotypes, carefully characterize early environments, and integrate observations of social behavior across

  16. Significant Microsynteny with New Evolutionary Highlights Is Detected through Comparative Genomic Sequence Analysis of Maize CCCH IX Gene Subfamily

    Directory of Open Access Journals (Sweden)

    Wei-Jun Chen

    2015-01-01

    Full Text Available CCCH zinc finger proteins, which are characterized by the presence of three cysteine residues and one histidine residue, play important roles in RNA processing in plants. Subfamily IX CCCH proteins were recently shown to function in stress tolerances. In this study, we analyzed CCCH IX genes in Zea mays, Oryza sativa, and Sorghum bicolor. These genes, which are almost intronless, were divided into four groups based on phylogenetic analysis. Microsynteny analysis revealed microsynteny in regions of some gene pairs, indicating that segmental duplication has played an important role in the expansion of this gene family. In addition, we calculated the dates of duplication by Ks analysis, finding that all microsynteny blocks were formed after the monocot-eudicot divergence. We found that deletions, multiplications, and inversions were shown to have occurred over the course of evolution. Moreover, the Ka/Ks ratios indicated that the genes in these three grass species are under strong purifying selection. Finally, we investigated the evolutionary patterns of some gene pairs conferring tolerance to abiotic stress, laying the foundation for future functional studies of these transcription factors.

  17. THD-Module Extractor: An Application for CEN Module Extraction and Interesting Gene Identification for Alzheimer's Disease.

    Science.gov (United States)

    Kakati, Tulika; Kashyap, Hirak; Bhattacharyya, Dhruba K

    2016-11-30

    There exist many tools and methods for construction of co-expression network from gene expression data and for extraction of densely connected gene modules. In this paper, a method is introduced to construct co-expression network and to extract co-expressed modules having high biological significance. The proposed method has been validated on several well known microarray datasets extracted from a diverse set of species, using statistical measures, such as p and q values. The modules obtained in these studies are found to be biologically significant based on Gene Ontology enrichment analysis, pathway analysis, and KEGG enrichment analysis. Further, the method was applied on an Alzheimer's disease dataset and some interesting genes are found, which have high semantic similarity among them, but are not significantly correlated in terms of expression similarity. Some of these interesting genes, such as MAPT, CASP2, and PSEN2, are linked with important aspects of Alzheimer's disease, such as dementia, increase cell death, and deposition of amyloid-beta proteins in Alzheimer's disease brains. The biological pathways associated with Alzheimer's disease, such as, Wnt signaling, Apoptosis, p53 signaling, and Notch signaling, incorporate these interesting genes. The proposed method is evaluated in regard to existing literature.

  18. Estimating true evolutionary distances under the DCJ model.

    Science.gov (United States)

    Lin, Yu; Moret, Bernard M E

    2008-07-01

    Modern techniques can yield the ordering and strandedness of genes on each chromosome of a genome; such data already exists for hundreds of organisms. The evolutionary mechanisms through which the set of the genes of an organism is altered and reordered are of great interest to systematists, evolutionary biologists, comparative genomicists and biomedical researchers. Perhaps the most basic concept in this area is that of evolutionary distance between two genomes: under a given model of genomic evolution, how many events most likely took place to account for the difference between the two genomes? We present a method to estimate the true evolutionary distance between two genomes under the 'double-cut-and-join' (DCJ) model of genome rearrangement, a model under which a single multichromosomal operation accounts for all genomic rearrangement events: inversion, transposition, translocation, block interchange and chromosomal fusion and fission. Our method relies on a simple structural characterization of a genome pair and is both analytically and computationally tractable. We provide analytical results to describe the asymptotic behavior of genomes under the DCJ model, as well as experimental results on a wide variety of genome structures to exemplify the very high accuracy (and low variance) of our estimator. Our results provide a tool for accurate phylogenetic reconstruction from multichromosomal gene rearrangement data as well as a theoretical basis for refinements of the DCJ model to account for biological constraints. All of our software is available in source form under GPL at http://lcbb.epfl.ch.

  19. Evolutionary engineering of industrial microorganisms-strategies and applications.

    Science.gov (United States)

    Zhu, Zhengming; Zhang, Juan; Ji, Xiaomei; Fang, Zhen; Wu, Zhimeng; Chen, Jian; Du, Guocheng

    2018-06-01

    Microbial cells have been widely used in the industry to obtain various biochemical products, and evolutionary engineering is a common method in biological research to improve their traits, such as high environmental tolerance and improvement of product yield. To obtain better integrate functions of microbial cells, evolutionary engineering combined with other biotechnologies have attracted more attention in recent years. Classical laboratory evolution has been proven effective to letting more beneficial mutations occur in different genes but also has some inherent limitations such as a long evolutionary period and uncontrolled mutation frequencies. However, recent studies showed that some new strategies may gradually overcome these limitations. In this review, we summarize the evolutionary strategies commonly used in industrial microorganisms and discuss the combination of evolutionary engineering with other biotechnologies such as systems biology and inverse metabolic engineering. Finally, we prospect the importance and application prospect of evolutionary engineering as a powerful tool especially in optimization of industrial microbial cell factories.

  20. Similar but not the same: insights into the evolutionary history of paralogous sex-determining genes of the dwarf honey bee Apis florea.

    Science.gov (United States)

    Biewer, M; Lechner, S; Hasselmann, M

    2016-01-01

    Studying the fate of duplicated genes provides informative insight into the evolutionary plasticity of biological pathways to which they belong. In the paralogous sex-determining genes complementary sex determiner (csd) and feminizer (fem) of honey bee species (genus Apis), only heterozygous csd initiates female development. Here, the full-length coding sequences of the genes csd and fem of the phylogenetically basal dwarf honey bee Apis florea are characterized. Compared with other Apis species, remarkable evolutionary changes in the formation and localization of a protein-interacting (coiled-coil) motif and in the amino acids coding for the csd characteristic hypervariable region (HVR) are observed. Furthermore, functionally different csd alleles were isolated as genomic fragments from a random population sample. In the predicted potential specifying domain (PSD), a high ratio of πN/πS=1.6 indicated positive selection, whereas signs of balancing selection, commonly found in other Apis species, are missing. Low nucleotide diversity on synonymous and genome-wide, non-coding sites as well as site frequency analyses indicated a strong impact of genetic drift in A. florea, likely linked to its biology. Along the evolutionary trajectory of ~30 million years of csd evolution, episodic diversifying selection seems to have acted differently among distinct Apis branches. Consistently low amino-acid differences within the PSD among pairs of functional heterozygous csd alleles indicate that the HVR is the most important region for determining allele specificity. We propose that in the early history of the lineage-specific fem duplication giving rise to csd in Apis, A. florea csd stands as a remarkable example for the plasticity of initial sex-determining signals.

  1. Recombinant amyloid beta-peptide production by coexpression with an affibody ligand

    Science.gov (United States)

    Macao, Bertil; Hoyer, Wolfgang; Sandberg, Anders; Brorsson, Ann-Christin; Dobson, Christopher M; Härd, Torleif

    2008-01-01

    Background Oligomeric and fibrillar aggregates of the amyloid β-peptide (Aβ) have been implicated in the pathogenesis of Alzheimer's disease (AD). The characterization of Aβ assemblies is essential for the elucidation of the mechanisms of Aβ neurotoxicity, but requires large quantities of pure peptide. Here we describe a novel approach to the recombinant production of Aβ. The method is based on the coexpression of the affibody protein ZAβ3, a selected affinity ligand derived from the Z domain three-helix bundle scaffold. ZAβ3 binds to the amyloidogenic central and C-terminal part of Aβ with nanomolar affinity and consequently inhibits aggregation. Results Coexpression of ZAβ3 affords the overexpression of both major Aβ isoforms, Aβ(1–40) and Aβ(1–42), yielding 4 or 3 mg, respectively, of pure 15N-labeled peptide per liter of culture. The method does not rely on a protein-fusion or -tag and thus does not require a cleavage reaction. The purified peptides were characterized by NMR, circular dichroism, SDS-PAGE and size exclusion chromatography, and their aggregation propensities were assessed by thioflavin T fluorescence and electron microscopy. The data coincide with those reported previously for monomeric, largely unstructured Aβ. ZAβ3 coexpression moreover permits the recombinant production of Aβ(1–42) carrying the Arctic (E22G) mutation, which causes early onset familial AD. Aβ(1–42)E22G is obtained in predominantly monomeric form and suitable, e.g., for NMR studies. Conclusion The coexpression of an engineered aggregation-inhibiting binding protein offers a novel route to the recombinant production of amyloidogenic Aβ peptides that can be advantageously employed to study the molecular basis of AD. The presented expression system is the first for which expression and purification of the aggregation-prone Arctic variant (E22G) of Aβ(1–42) is reported. PMID:18973685

  2. Recombinant amyloid beta-peptide production by coexpression with an affibody ligand

    Directory of Open Access Journals (Sweden)

    Dobson Christopher M

    2008-10-01

    Full Text Available Abstract Background Oligomeric and fibrillar aggregates of the amyloid β-peptide (Aβ have been implicated in the pathogenesis of Alzheimer's disease (AD. The characterization of Aβ assemblies is essential for the elucidation of the mechanisms of Aβ neurotoxicity, but requires large quantities of pure peptide. Here we describe a novel approach to the recombinant production of Aβ. The method is based on the coexpression of the affibody protein ZAβ3, a selected affinity ligand derived from the Z domain three-helix bundle scaffold. ZAβ3 binds to the amyloidogenic central and C-terminal part of Aβ with nanomolar affinity and consequently inhibits aggregation. Results Coexpression of ZAβ3 affords the overexpression of both major Aβ isoforms, Aβ(1–40 and Aβ(1–42, yielding 4 or 3 mg, respectively, of pure 15N-labeled peptide per liter of culture. The method does not rely on a protein-fusion or -tag and thus does not require a cleavage reaction. The purified peptides were characterized by NMR, circular dichroism, SDS-PAGE and size exclusion chromatography, and their aggregation propensities were assessed by thioflavin T fluorescence and electron microscopy. The data coincide with those reported previously for monomeric, largely unstructured Aβ. ZAβ3 coexpression moreover permits the recombinant production of Aβ(1–42 carrying the Arctic (E22G mutation, which causes early onset familial AD. Aβ(1–42E22G is obtained in predominantly monomeric form and suitable, e.g., for NMR studies. Conclusion The coexpression of an engineered aggregation-inhibiting binding protein offers a novel route to the recombinant production of amyloidogenic Aβ peptides that can be advantageously employed to study the molecular basis of AD. The presented expression system is the first for which expression and purification of the aggregation-prone Arctic variant (E22G of Aβ(1–42 is reported.

  3. Evolutionary history and functional divergence of the cytochrome P450 gene superfamily between Arabidopsis thaliana and Brassica species uncover effects of whole genome and tandem duplications.

    Science.gov (United States)

    Yu, Jingyin; Tehrim, Sadia; Wang, Linhai; Dossa, Komivi; Zhang, Xiurong; Ke, Tao; Liao, Boshou

    2017-09-18

    The cytochrome P450 monooxygenase (P450) superfamily is involved in the biosynthesis of various primary and secondary metabolites. However, little is known about the effects of whole genome duplication (WGD) and tandem duplication (TD) events on the evolutionary history and functional divergence of P450s in Brassica after splitting from a common ancestor with Arabidopsis thaliana. Using Hidden Markov Model search and manual curation, we detected that Brassica species have nearly 1.4-fold as many P450 members as A. thaliana. Most P450s in A. thaliana and Brassica species were located on pseudo-chromosomes. The inferred phylogeny indicated that all P450s were clustered into two different subgroups. Analysis of WGD event revealed that different P450 gene families had appeared after evolutionary events of species. For the TD event analyses, the P450s from TD events in Brassica species can be divided into ancient and recent parts. Our comparison of influence of WGD and TD events on the P450 gene superfamily between A. thaliana and Brassica species indicated that the family-specific evolution in the Brassica lineage can be attributed to both WGD and TD, whereas WGD was recognized as the major mechanism for the recent evolution of the P450 super gene family. Expression analysis of P450s from A. thaliana and Brassica species indicated that WGD-type P450s showed the same expression pattern but completely different expression with TD-type P450s across different tissues in Brassica species. Selection force analysis suggested that P450 orthologous gene pairs between A. thaliana and Brassica species underwent negative selection, but no significant differences were found between P450 orthologous gene pairs in A. thaliana-B. rapa and A. thaliana-B. oleracea lineages, as well as in different subgenomes in B. rapa or B. oleracea compared with A. thaliana. This study is the first to investigate the effects of WGD and TD on the evolutionary history and functional divergence of P450

  4. Ageing genes

    DEFF Research Database (Denmark)

    Rattan, Suresh

    2018-01-01

    The idea of gerontogenes is in line with the evolutionary explanation of ageing as being an emergent phenomenon as a result of the imperfect maintenance and repair systems. Although evolutionary processes did not select for any specific ageing genes that restrict and determine the lifespan...... of an individual, the term ‘gerontogenes’ primarily refers to any genes that may seem to influence ageing and longevity, without being specifically selected for that role. Such genes can also be called ‘virtual gerontogenes’ by virtue of their indirect influence on the rate and process of ageing. More than 1000...... virtual gerontogenes have been associated with ageing and longevity in model organisms and humans. The ‘real’ genes, which do influence the essential lifespan of a species, and have been selected for in accordance with the evolutionary life history of the species, are known as the longevity assurance...

  5. Evolutionary Dynamics and Diversity in Microbial Populations

    Science.gov (United States)

    Thompson, Joel; Fisher, Daniel

    2013-03-01

    Diseases such as flu and cancer adapt at an astonishing rate. In large part, viruses and cancers are so difficult to prevent because they are continually evolving. Controlling such ``evolutionary diseases'' requires a better understanding of the underlying evolutionary dynamics. It is conventionally assumed that adaptive mutations are rare and therefore will occur and sweep through the population in succession. Recent experiments using modern sequencing technologies have illuminated the many ways in which real population sequence data does not conform to the predictions of conventional theory. We consider a very simple model of asexual evolution and perform simulations in a range of parameters thought to be relevant for microbes and cancer. Simulation results reveal complex evolutionary dynamics typified by competition between lineages with different sets of adaptive mutations. This dynamical process leads to a distribution of mutant gene frequencies different than expected under the conventional assumption that adaptive mutations are rare. Simulated gene frequencies share several conspicuous features with data collected from laboratory-evolved yeast and the worldwide population of influenza.

  6. Expression of lycopene biosynthesis genes fused in line with Shine-Dalgarno sequences improves the stress-tolerance of Lactococcus lactis.

    Science.gov (United States)

    Dong, Xiangrong; Wang, Yanping; Yang, Fengyuan; Zhao, Shanshan; Tian, Bing; Li, Tao

    2017-01-01

    Lycopene biosynthetic genes from Deinococcus radiodurans were co-expressed in Lactococcus lactis to produce lycopene and improve its tolerance to stress. Lycopene-related genes from D. radiodurans, DR1395 (crtE), DR0862 (crtB), and DR0861 (crtI), were fused in line with S hine-Dalgarno (SD) sequences and co-expressed in L. lactis. The recombinant strain produced 0.36 mg lycopene g -1  dry cell wt after 48 h fermentation. The survival rate to UV irradiation of the recombinant strain was higher than that of the non-transformed strain. The L. lactis with co-expressed genes responsible for lycopene biosynthesis from D. radiodurans produced lycopene and exhibited increased resistance to UV stress, suggesting that the recombinant strain has important application potential in food industry.

  7. Functional comparison of the nematode Hox gene lin-39 in C. elegans and P. pacificus reveals evolutionary conservation of protein function despite divergence of primary sequences

    OpenAIRE

    Grandien, Kaj; Sommer, Ralf J.

    2001-01-01

    Hox transcription factors have been implicated in playing a central role in the evolution of animal morphology. Many studies indicate the evolutionary importance of regulatory changes in Hox genes, but little is known about the role of functional changes in Hox proteins. In the nematodes Pristionchus pacificus and Caenorhabditis elegans, developmental processes can be compared at the cellular, genetic, and molecular levels and differences in gene function can be identified. The Hox gene lin-3...

  8. Keratitis-Ichthyosis-Deafness syndrome-associated Cx26 mutants produce nonfunctional gap junctions but hyperactive hemichannels when co-expressed with wild type Cx43

    Science.gov (United States)

    García, Isaac E.; Maripillán, Jaime; Jara, Oscar; Ceriani, Ricardo; Palacios-Muñoz, Angelina; Ramachandran, Jayalakshimi; Olivero, Pablo; Pérez-Acle, Tomás; González, Carlos; Sáez, Juan C.; Contreras, Jorge E.; Martínez, Agustín D.

    2015-01-01

    Mutations in Cx26 gene are found in most cases of human genetic deafness. Some mutations produce syndromic deafness associated with skin disorders, like Keratitis Ichthyosis Deafness syndrome (KID). Because in the human skin Cx26 is co-expressed with other connexins, like Cx43 and Cx30, and since KID syndrome is inherited as autosomal dominant condition, it is possible that KID mutations change the way Cx26 interacts with other co-expressed connexins. Indeed, some Cx26 syndromic mutations showed gap junction dominant negative effect when co-expressed with wild type connexins, including Cx26 and Cx43. The nature of these interactions and the consequences on hemichannels and gap junction channels functions remain unknown. In this study we demonstrate that syndromic mutations at the N-terminus segment of Cx26, change connexin oligomerization compatibility, allowing aberrant interactions with Cx43. Strikingly, heteromeric oligomer formed by Cx43/Cx26 (syndromic mutants) show exacerbated hemichannel activity, but nonfunctional gap junction channels; this also occurs for those Cx26 KID mutants that do not show functional homomeric hemichannels. Heterologous expression of these hyperactive heteromeric hemichannels increases cell membrane permeability, favoring ATP release and Ca2+ overload. The functional paradox produced by oligomerization of Cx43 and Cx26 KID mutants could underlie the severe syndromic phenotype in human skin. PMID:25625422

  9. Identification of a gene module associated with BMD through the integration of network analysis and genome-wide association data.

    Science.gov (United States)

    Farber, Charles R

    2010-11-01

    Bone mineral density (BMD) is influenced by a complex network of gene interactions; therefore, elucidating the relationships between genes and how those genes, in turn, influence BMD is critical for developing a comprehensive understanding of osteoporosis. To investigate the role of transcriptional networks in the regulation of BMD, we performed a weighted gene coexpression network analysis (WGCNA) using microarray expression data on monocytes from young individuals with low or high BMD. WGCNA groups genes into modules based on patterns of gene coexpression. and our analysis identified 11 gene modules. We observed that the overall expression of one module (referred to as module 9) was significantly higher in the low-BMD group (p = .03). Module 9 was highly enriched for genes belonging to the immune system-related gene ontology (GO) category "response to virus" (p = 7.6 × 10(-11)). Using publically available genome-wide association study data, we independently validated the importance of module 9 by demonstrating that highly connected module 9 hubs were more likely, relative to less highly connected genes, to be genetically associated with BMD. This study highlights the advantages of systems-level analyses to uncover coexpression modules associated with bone mass and suggests that particular monocyte expression patterns may mediate differences in BMD. © 2010 American Society for Bone and Mineral Research.

  10. Evolutionary institutionalism.

    Science.gov (United States)

    Fürstenberg, Dr Kai

    Institutions are hard to define and hard to study. Long prominent in political science have been two theories: Rational Choice Institutionalism (RCI) and Historical Institutionalism (HI). Arising from the life sciences is now a third: Evolutionary Institutionalism (EI). Comparative strengths and weaknesses of these three theories warrant review, and the value-to-be-added by expanding the third beyond Darwinian evolutionary theory deserves consideration. Should evolutionary institutionalism expand to accommodate new understanding in ecology, such as might apply to the emergence of stability, and in genetics, such as might apply to political behavior? Core arguments are reviewed for each theory with more detailed exposition of the third, EI. Particular attention is paid to EI's gene-institution analogy; to variation, selection, and retention of institutional traits; to endogeneity and exogeneity; to agency and structure; and to ecosystem effects, institutional stability, and empirical limitations in behavioral genetics. RCI, HI, and EI are distinct but complementary. Institutional change, while amenable to rational-choice analysis and, retrospectively, to criticaljuncture and path-dependency analysis, is also, and importantly, ecological. Stability, like change, is an emergent property of institutions, which tend to stabilize after change in a manner analogous to allopatric speciation. EI is more than metaphorically biological in that institutional behaviors are driven by human behaviors whose evolution long preceded the appearance of institutions themselves.

  11. EMMPRIN co-expressed with matrix metalloproteinases predicts poor prognosis in patients with osteosarcoma.

    Science.gov (United States)

    Futamura, Naohisa; Nishida, Yoshihiro; Urakawa, Hiroshi; Kozawa, Eiji; Ikuta, Kunihiro; Hamada, Shunsuke; Ishiguro, Naoki

    2014-06-01

    Several studies have focused on the relationships between the expression of extracellular matrix metalloproteinase inducer (EMMPRIN) and the prognosis of patients with malignant tumors. However, few of these have investigated the expression of EMMPRIN in osteosarcoma. We examined expression levels of EMMPRIN immunohistochemically in 53 cases of high-grade osteosarcoma of the extremities and analyzed the correlation of its expression with patient prognosis. The correlation between matrix metalloproteinases (MMPs) and EMMPRIN expression and the prognostic value of co-expression were also analyzed. Staining positivity for EMMPRIN was negative in 7 cases, low in 17, moderate in 19, and strong in 10. The overall and disease-free survivals (OS and DFS) in patients with higher EMMPRIN expression (strong-moderate) were significantly lower than those in the lower (weak-negative) group (0.037 and 0.024, respectively). In multivariate analysis, age (P=0.004), location (P=0.046), and EMMPRIN expression (P=0.038) were significant prognostic factors for overall survival. EMMPRIN expression (P=0.024) was also a significant prognostic factor for disease-free survival. Co-expression analyses of EMMPRIN and MMPs revealed that strong co-expression of EMMPRIN and membrane-type 1 (MT1)-MMP had a poor prognostic value (P=0.056 for DFS, P=0.006 for OS). EMMPRIN expression and co-expression with MMPs well predict the prognosis of patients with extremity osteosarcoma, making EMMPRIN a possible therapeutic target in these patients.

  12. Genome-wide identification of nuclear receptor (NR) genes and the evolutionary significance of the NR1O subfamily in the monogonont rotifer Brachionus spp.

    Science.gov (United States)

    Kim, Duck-Hyun; Kim, Hui-Su; Hwang, Dae-Sik; Kim, Hee-Jin; Hagiwara, Atsushi; Lee, Jae-Seong; Jeong, Chang-Bum

    2017-10-01

    Nuclear receptors (NRs) are a large family of transcription factors that are involved in many fundamental biological processes. NRs are considered to have originated from a common ancestor, and are highly conserved throughout the whole animal taxa. Therefore, the genome-wide identification of NR genes in an animal taxon can provide insight into the evolutionary tendencies of NRs. Here, we identified all the NR genes in the monogonont rotifer Brachionus spp., which are considered an ecologically key species due to their abundance and world-wide distribution. The NR family was composed of 40, 32, 29, and 32 genes in the genomes of the rotifers B. calyciflorus, B. koreanus, B. plicatilis, and B. rotundiformis, respectively, which were classified into seven distinct subfamilies. The composition of each subfamily was highly conserved between species, except for NR1O genes, suggesting that they have undergone sporadic evolutionary processes for adaptation to their different environmental pressures. In addition, despite the dynamics of NR evolution, the significance of the conserved endocrine system, particularly for estrogen receptor (ER)-signaling, in rotifers was discussed on the basis of phylogenetic analyses. The results of this study may help provide a better understanding the evolution of NRs, and expand our knowledge of rotifer endocrine systems. Copyright © 2017 Elsevier Inc. All rights reserved.

  13. Vampire bats exhibit evolutionary reduction of bitter taste receptor genes common to other bats

    Science.gov (United States)

    Hong, Wei; Zhao, Huabin

    2014-01-01

    The bitter taste serves as an important natural defence against the ingestion of poisonous foods and is thus believed to be indispensable in animals. However, vampire bats are obligate blood feeders that show a reduced behavioural response towards bitter-tasting compounds. To test whether bitter taste receptor genes (T2Rs) have been relaxed from selective constraint in vampire bats, we sampled all three vampire bat species and 11 non-vampire bats, and sequenced nine one-to-one orthologous T2Rs that are assumed to be functionally conserved in all bats. We generated 85 T2R sequences and found that vampire bats have a significantly greater percentage of pseudogenes than other bats. These results strongly suggest a relaxation of selective constraint and a reduction of bitter taste function in vampire bats. We also found that vampire bats retain many intact T2Rs, and that the taste signalling pathway gene Calhm1 remains complete and intact with strong functional constraint. These results suggest the presence of some bitter taste function in vampire bats, although it is not likely to play a major role in food selection. Together, our study suggests that the evolutionary reduction of bitter taste function in animals is more pervasive than previously believed, and highlights the importance of extra-oral functions of taste receptor genes. PMID:24966321

  14. Evolutionary Co-Option of Floral Meristem Identity Genes for Patterning of the Flower-Like Asteraceae Inflorescence1

    Science.gov (United States)

    Broholm, Suvi K.; Tähtiharju, Sari

    2016-01-01

    The evolutionary success of Asteraceae, the largest family of flowering plants, has been attributed to the unique inflorescence architecture of the family, which superficially resembles an individual flower. Here, we show that Asteraceae inflorescences (flower heads, or capitula) resemble solitary flowers not only morphologically but also at the molecular level. By conducting functional analyses for orthologs of the flower meristem identity genes LEAFY (LFY) and UNUSUAL FLORAL ORGANS (UFO) in Gerbera hybrida, we show that GhUFO is the master regulator of flower meristem identity, while GhLFY has evolved a novel, homeotic function during the evolution of head-like inflorescences. Resembling LFY expression in a single flower meristem, uniform expression of GhLFY in the inflorescence meristem defines the capitulum as a determinate structure that can assume floral fate upon ectopic GhUFO expression. We also show that GhLFY uniquely regulates the ontogeny of outer, expanded ray flowers but not inner, compact disc flowers, indicating that the distinction of different flower types in Asteraceae is connected with their independent evolutionary origins from separate branching systems. PMID:27382139

  15. Transcriptional dynamics of a conserved gene expression network associated with craniofacial divergence in Arctic charr.

    Science.gov (United States)

    Ahi, Ehsan Pashay; Kapralova, Kalina Hristova; Pálsson, Arnar; Maier, Valerie Helene; Gudbrandsson, Jóhannes; Snorrason, Sigurdur S; Jónsson, Zophonías O; Franzdóttir, Sigrídur Rut

    2014-01-01

    Understanding the molecular basis of craniofacial variation can provide insights into key developmental mechanisms of adaptive changes and their role in trophic divergence and speciation. Arctic charr (Salvelinus alpinus) is a polymorphic fish species, and, in Lake Thingvallavatn in Iceland, four sympatric morphs have evolved distinct craniofacial structures. We conducted a gene expression study on candidates from a conserved gene coexpression network, focusing on the development of craniofacial elements in embryos of two contrasting Arctic charr morphotypes (benthic and limnetic). Four Arctic charr morphs were studied: one limnetic and two benthic morphs from Lake Thingvallavatn and a limnetic reference aquaculture morph. The presence of morphological differences at developmental stages before the onset of feeding was verified by morphometric analysis. Following up on our previous findings that Mmp2 and Sparc were differentially expressed between morphotypes, we identified a network of genes with conserved coexpression across diverse vertebrate species. A comparative expression study of candidates from this network in developing heads of the four Arctic charr morphs verified the coexpression relationship of these genes and revealed distinct transcriptional dynamics strongly correlated with contrasting craniofacial morphologies (benthic versus limnetic). A literature review and Gene Ontology analysis indicated that a significant proportion of the network genes play a role in extracellular matrix organization and skeletogenesis, and motif enrichment analysis of conserved noncoding regions of network candidates predicted a handful of transcription factors, including Ap1 and Ets2, as potential regulators of the gene network. The expression of Ets2 itself was also found to associate with network gene expression. Genes linked to glucocorticoid signalling were also studied, as both Mmp2 and Sparc are responsive to this pathway. Among those, several transcriptional

  16. Evolutionary Co-Option of Floral Meristem Identity Genes for Patterning of the Flower-Like Asteraceae Inflorescence.

    Science.gov (United States)

    Zhao, Yafei; Zhang, Teng; Broholm, Suvi K; Tähtiharju, Sari; Mouhu, Katriina; Albert, Victor A; Teeri, Teemu H; Elomaa, Paula

    2016-09-01

    The evolutionary success of Asteraceae, the largest family of flowering plants, has been attributed to the unique inflorescence architecture of the family, which superficially resembles an individual flower. Here, we show that Asteraceae inflorescences (flower heads, or capitula) resemble solitary flowers not only morphologically but also at the molecular level. By conducting functional analyses for orthologs of the flower meristem identity genes LEAFY (LFY) and UNUSUAL FLORAL ORGANS (UFO) in Gerbera hybrida, we show that GhUFO is the master regulator of flower meristem identity, while GhLFY has evolved a novel, homeotic function during the evolution of head-like inflorescences. Resembling LFY expression in a single flower meristem, uniform expression of GhLFY in the inflorescence meristem defines the capitulum as a determinate structure that can assume floral fate upon ectopic GhUFO expression. We also show that GhLFY uniquely regulates the ontogeny of outer, expanded ray flowers but not inner, compact disc flowers, indicating that the distinction of different flower types in Asteraceae is connected with their independent evolutionary origins from separate branching systems. © 2016 American Society of Plant Biologists. All rights reserved.

  17. Heat Shock Protein 70 Enhances Mucosal Immunity against Human Norovirus When Coexpressed from a Vesicular Stomatitis Virus Vector

    Science.gov (United States)

    Ma, Yuanmei; Duan, Yue; Wei, Yongwei; Liang, Xueya; Niewiesk, Stefan; Oglesbee, Michael

    2014-01-01

    uncultivable. Thus, a live vector-based vaccine may provide an alternative vaccine strategy. In this study, we developed a vesicular stomatitis virus (VSV)-based human NoV vaccine candidate. We constructed rVSV-HSP70-VP1, coexpressing heat shock protein (HSP70) and capsid (VP1) genes of human NoV, and rVSV-Luc-VP1, coexpressing firefly luciferase (Luc) and VP1 genes. We found that VSVs with a double gene insertion were significantly more attenuated than VSV with a single VP1 insertion (rVSV-VP1). Furthermore, we found that coexpression or coadministration of HSP70 from VSV vector significantly enhanced human NoV-specific mucosal immunity. Collectively, we developed an improved live vectored vaccine candidate for human NoV which will be useful for future clinical studies. PMID:24574391

  18. Once for All: A Novel Robust System for Co-expression of Multiple Chimeric Fluorescent Fusion Proteins in Plants

    Directory of Open Access Journals (Sweden)

    Guitao Zhong

    2017-06-01

    Full Text Available Chimeric fluorescent fusion proteins have been employed as a powerful tool to reveal the subcellular localizations and dynamics of proteins in living cells. Co-expression of a fluorescent fusion protein with well-known organelle markers in the same cell is especially useful in revealing its spatial and temporal functions of the protein in question. However, the conventional methods for co-expressing multiple fluorescent tagged proteins in plants have the drawbacks of low expression efficiency, variations in the expression level and time-consuming genetic crossing. Here, we have developed a novel robust system that allows for high-efficient co-expression of multiple chimeric fluorescent fusion proteins in plants in a time-saving fashion. This system takes advantage of employing a single expression vector which consists of multiple semi-independent expressing cassettes for the protein co-expression thereby overcoming the limitations of using multiple independent expressing plasmids. In addition, it is a highly manipulable DNA assembly system, in which modification and recombination of DNA molecules are easily achieved through an optimized one-step assembly reaction. By employing this effective system, we demonstrated that co-expression of two chimeric fluorescent fusion reporter proteins of vacuolar sorting receptor and secretory carrier membrane protein gave rise to their perspective subcellular localizations in plants via both transient expression and stable transformation. Thus, we believed that this technical advance represents a promising approach for multi-color-protein co-expression in plant cells.

  19. Characterization of differentially expressed genes involved in pathways associated with gastric cancer.

    Directory of Open Access Journals (Sweden)

    Hao Li

    Full Text Available To explore the patterns of gene expression in gastric cancer, a total of 26 paired gastric cancer and noncancerous tissues from patients were enrolled for gene expression microarray analyses. Limma methods were applied to analyze the data, and genes were considered to be significantly differentially expressed if the False Discovery Rate (FDR value was 2. Subsequently, Gene Ontology (GO categories were used to analyze the main functions of the differentially expressed genes. According to the Kyoto Encyclopedia of Genes and Genomes (KEGG database, we found pathways significantly associated with the differential genes. Gene-Act network and co-expression network were built respectively based on the relationships among the genes, proteins and compounds in the database. 2371 mRNAs and 350 lncRNAs considered as significantly differentially expressed genes were selected for the further analysis. The GO categories, pathway analyses and the Gene-Act network showed a consistent result that up-regulated genes were responsible for tumorigenesis, migration, angiogenesis and microenvironment formation, while down-regulated genes were involved in metabolism. These results of this study provide some novel findings on coding RNAs, lncRNAs, pathways and the co-expression network in gastric cancer which will be useful to guide further investigation and target therapy for this disease.

  20. Mouse Models as Predictors of Human Responses: Evolutionary Medicine.

    Science.gov (United States)

    Uhl, Elizabeth W; Warner, Natalie J

    Mice offer a number of advantages and are extensively used to model human diseases and drug responses. Selective breeding and genetic manipulation of mice have made many different genotypes and phenotypes available for research. However, in many cases, mouse models have failed to be predictive. Important sources of the prediction problem have been the failure to consider the evolutionary basis for species differences, especially in drug metabolism, and disease definitions that do not reflect the complexity of gene expression underlying disease phenotypes. Incorporating evolutionary insights into mouse models allow for unique opportunities to characterize the effects of diet, different gene expression profiles, and microbiomics underlying human drug responses and disease phenotypes.

  1. Evolutionary Dynamics of the Gametologous CTNNB1 Gene on the Z and W Chromosomes of Snakes.

    Science.gov (United States)

    Laopichienpong, Nararat; Muangmai, Narongrit; Chanhome, Lawan; Suntrarachun, Sunutcha; Twilprawat, Panupon; Peyachoknagul, Surin; Srikulnath, Kornsorn

    2017-03-01

    Snakes exhibit genotypic sex determination with female heterogamety (ZZ males and ZW females), and the state of sex chromosome differentiation also varies among lineages. To investigate the evolutionary history of homologous genes located in the nonrecombining region of differentiated sex chromosomes in snakes, partial sequences of the gametologous CTNNB1 gene were analyzed for 12 species belonging to henophid (Cylindrophiidae, Xenopeltidae, and Pythonidae) and caenophid snakes (Viperidae, Elapidae, and Colubridae). Nonsynonymous/synonymous substitution ratios (Ka/Ks) in coding sequences were low (Ka/Ks < 1) between CTNNB1Z and CTNNB1W, suggesting that these 2 genes may have similar functional properties. However, frequencies of intron sequence substitutions and insertion–deletions were higher in CTNNB1Z than CTNNB1W, suggesting that Z-linked sequences evolved faster than W-linked sequences. Molecular phylogeny based on both intron and exon sequences showed the presence of 2 major clades: 1) Z-linked sequences of Caenophidia and 2) W-linked sequences of Caenophidia clustered with Z-linked sequences of Henophidia, which suggests that the sequence divergence between CTNNB1Z and CTNNB1W in Caenophidia may have occurred by the cessation of recombination after the split from Henophidia.

  2. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-01-01

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. PMID:27189991

  3. Evolutionary analysis of the kinesin light chain genes in the yellow fever mosquito Aedes aegypti: gene duplication as a source for novel early zygotic genes.

    Science.gov (United States)

    Biedler, James K; Tu, Zhijian

    2010-07-08

    codon shows promoter activity at least as early as 3 hours in the developing Ae. aegypti embryo. The AaKLC2.1 promoter activity reached ~1600 fold over the negative control at 5 hr after egg deposition. Transcriptome profiling by use of high throughput sequencing technologies has proven to be a valuable method for the identification and discovery of early and transient zygotic genes. The evolutionary investigation of the KLC gene family reveals that duplication is a source for the evolution of new genes that play a role in the dynamic process of early embryonic development. AaKLC2.1 may provide a promoter for early zygotic-specific transgene expression, which is a key component of the Medea gene drive system.

  4. Evolutionary analysis of the kinesin light chain genes in the yellow fever mosquito Aedes aegypti: gene duplication as a source for novel early zygotic genes

    Directory of Open Access Journals (Sweden)

    Tu Zhijian

    2010-07-01

    1 kb fragment upstream of the AaKLC2.1 start codon shows promoter activity at least as early as 3 hours in the developing Ae. aegypti embryo. The AaKLC2.1 promoter activity reached ~1600 fold over the negative control at 5 hr after egg deposition. Conclusions Transcriptome profiling by use of high throughput sequencing technologies has proven to be a valuable method for the identification and discovery of early and transient zygotic genes. The evolutionary investigation of the KLC gene family reveals that duplication is a source for the evolution of new genes that play a role in the dynamic process of early embryonic development. AaKLC2.1 may provide a promoter for early zygotic-specific transgene expression, which is a key component of the Medea gene drive system.

  5. [Evolutionary perspective in precocious puberty].

    Science.gov (United States)

    Hochberg, Ze'ev

    2014-10-01

    Pubertal development is subject to substantial heritability, but much variation remains to be explained, including fast changes over the last 150 years, that cannot be explained by changes of gene frequency in the population. This article discusses the influence of environmental factors to adjust maturational tempo in the service of fitness goals. Utilizing evolutionary development thinking (evo-devo), the author examines adolescence as an evolutionary life-history stage in its developmental context. The transition from the preceding stage of juvenility entails adaptive plasticity in response to energy resources, social needs of adolescence and maturation toward youth and adulthood. Using Belsky's evolutionary theory of socialization, I show that familial psychosocial environment during the infancy-childhood and childhood-juvenility transitions foster a fast life-history and reproductive strategy rather than early maturation being just a risk factor for aggression and delinquency. The implications of the evo-devo framework for theory building, illuminates new directions in the understanding of precocious puberty other than a diagnosis of a disease.

  6. Application of network methods for understanding evolutionary dynamics in discrete habitats.

    Science.gov (United States)

    Greenbaum, Gili; Fefferman, Nina H

    2017-06-01

    In populations occupying discrete habitat patches, gene flow between habitat patches may form an intricate population structure. In such structures, the evolutionary dynamics resulting from interaction of gene-flow patterns with other evolutionary forces may be exceedingly complex. Several models describing gene flow between discrete habitat patches have been presented in the population-genetics literature; however, these models have usually addressed relatively simple settings of habitable patches and have stopped short of providing general methodologies for addressing nontrivial gene-flow patterns. In the last decades, network theory - a branch of discrete mathematics concerned with complex interactions between discrete elements - has been applied to address several problems in population genetics by modelling gene flow between habitat patches using networks. Here, we present the idea and concepts of modelling complex gene flows in discrete habitats using networks. Our goal is to raise awareness to existing network theory applications in molecular ecology studies, as well as to outline the current and potential contribution of network methods to the understanding of evolutionary dynamics in discrete habitats. We review the main branches of network theory that have been, or that we believe potentially could be, applied to population genetics and molecular ecology research. We address applications to theoretical modelling and to empirical population-genetic studies, and we highlight future directions for extending the integration of network science with molecular ecology. © 2017 John Wiley & Sons Ltd.

  7. MEGA5: Molecular Evolutionary Genetics Analysis Using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods

    Science.gov (United States)

    Tamura, Koichiro; Peterson, Daniel; Peterson, Nicholas; Stecher, Glen; Nei, Masatoshi; Kumar, Sudhir

    2011-01-01

    Comparative analysis of molecular sequence data is essential for reconstructing the evolutionary histories of species and inferring the nature and extent of selective forces shaping the evolution of genes and species. Here, we announce the release of Molecular Evolutionary Genetics Analysis version 5 (MEGA5), which is a user-friendly software for mining online databases, building sequence alignments and phylogenetic trees, and using methods of evolutionary bioinformatics in basic biology, biomedicine, and evolution. The newest addition in MEGA5 is a collection of maximum likelihood (ML) analyses for inferring evolutionary trees, selecting best-fit substitution models (nucleotide or amino acid), inferring ancestral states and sequences (along with probabilities), and estimating evolutionary rates site-by-site. In computer simulation analyses, ML tree inference algorithms in MEGA5 compared favorably with other software packages in terms of computational efficiency and the accuracy of the estimates of phylogenetic trees, substitution parameters, and rate variation among sites. The MEGA user interface has now been enhanced to be activity driven to make it easier for the use of both beginners and experienced scientists. This version of MEGA is intended for the Windows platform, and it has been configured for effective use on Mac OS X and Linux desktops. It is available free of charge from http://www.megasoftware.net. PMID:21546353

  8. Evolutionary hierarchy of vertebrate-like heterotrimeric G protein families.

    Science.gov (United States)

    Krishnan, Arunkumar; Mustafa, Arshi; Almén, Markus Sällman; Fredriksson, Robert; Williams, Michael J; Schiöth, Helgi B

    2015-10-01

    Heterotrimeric G proteins perform a crucial role as molecular switches controlling various cellular responses mediated by G protein-coupled receptor (GPCR) signaling pathway. Recent data have shown that the vertebrate-like G protein families are found across metazoans and their closest unicellular relatives. However, an overall evolutionary hierarchy of vertebrate-like G proteins, including gene family annotations and in particular mapping individual gene gain/loss events across diverse holozoan lineages is still incomplete. Here, with more expanded invertebrate taxon sampling, we have reconstructed phylogenetic trees for each of the G protein classes/families and provide a robust classification and hierarchy of vertebrate-like heterotrimeric G proteins. Our results further extend the evidence that the common ancestor (CA) of holozoans had at least five ancestral Gα genes corresponding to all major vertebrate Gα classes and contain a total of eight genes including two Gβ and one Gγ. Our results also indicate that the GNAI/O-like gene likely duplicated in the last CA of metazoans to give rise to GNAI- and GNAO-like genes, which are conserved across invertebrates. Moreover, homologs of GNB1-4 paralogon- and GNB5 family-like genes are found in most metazoans and that the unicellular holozoans encode two ancestral Gβ genes. Similarly, most bilaterian invertebrates encode two Gγ genes which include a representative of the GNG gene cluster and a putative homolog of GNG13. Interestingly, our results also revealed key evolutionary events such as the Drosophila melanogaster eye specific Gβ subunit that is found conserved in most arthropods and several previously unidentified species specific expansions within Gαi/o, Gαs, Gαq, Gα12/13 classes and the GNB1-4 paralogon. Also, we provide an overall proposed evolutionary scenario on the expansions of all G protein families in vertebrate tetraploidizations. Our robust classification/hierarchy is essential to further

  9. Intra- and interspecies gene expression models for predicting drug response in canine osteosarcoma.

    Science.gov (United States)

    Fowles, Jared S; Brown, Kristen C; Hess, Ann M; Duval, Dawn L; Gustafson, Daniel L

    2016-02-19

    Genomics-based predictors of drug response have the potential to improve outcomes associated with cancer therapy. Osteosarcoma (OS), the most common primary bone cancer in dogs, is commonly treated with adjuvant doxorubicin or carboplatin following amputation of the affected limb. We evaluated the use of gene-expression based models built in an intra- or interspecies manner to predict chemosensitivity and treatment outcome in canine OS. Models were built and evaluated using microarray gene expression and drug sensitivity data from human and canine cancer cell lines, and canine OS tumor datasets. The "COXEN" method was utilized to filter gene signatures between human and dog datasets based on strong co-expression patterns. Models were built using linear discriminant analysis via the misclassification penalized posterior algorithm. The best doxorubicin model involved genes identified in human lines that were co-expressed and trained on canine OS tumor data, which accurately predicted clinical outcome in 73 % of dogs (p = 0.0262, binomial). The best carboplatin model utilized canine lines for gene identification and model training, with canine OS tumor data for co-expression. Dogs whose treatment matched our predictions had significantly better clinical outcomes than those that didn't (p = 0.0006, Log Rank), and this predictor significantly associated with longer disease free intervals in a Cox multivariate analysis (hazard ratio = 0.3102, p = 0.0124). Our data show that intra- and interspecies gene expression models can successfully predict response in canine OS, which may improve outcome in dogs and serve as pre-clinical validation for similar methods in human cancer research.

  10. Reconciliation of Gene and Species Trees

    Directory of Open Access Journals (Sweden)

    L. Y. Rusin

    2014-01-01

    Full Text Available The first part of the paper briefly overviews the problem of gene and species trees reconciliation with the focus on defining and algorithmic construction of the evolutionary scenario. Basic ideas are discussed for the aspects of mapping definitions, costs of the mapping and evolutionary scenario, imposing time scales on a scenario, incorporating horizontal gene transfers, binarization and reconciliation of polytomous trees, and construction of species trees and scenarios. The review does not intend to cover the vast diversity of literature published on these subjects. Instead, the authors strived to overview the problem of the evolutionary scenario as a central concept in many areas of evolutionary research. The second part provides detailed mathematical proofs for the solutions of two problems: (i inferring a gene evolution along a species tree accounting for various types of evolutionary events and (ii trees reconciliation into a single species tree when only gene duplications and losses are allowed. All proposed algorithms have a cubic time complexity and are mathematically proved to find exact solutions. Solving algorithms for problem (ii can be naturally extended to incorporate horizontal transfers, other evolutionary events, and time scales on the species tree.

  11. Novel algorithms reveal streptococcal transcriptomes and clues about undefined genes.

    Science.gov (United States)

    Ryan, Patricia A; Kirk, Brian W; Euler, Chad W; Schuch, Raymond; Fischetti, Vincent A

    2007-07-01

    Bacteria-host interactions are dynamic processes, and understanding transcriptional responses that directly or indirectly regulate the expression of genes involved in initial infection stages would illuminate the molecular events that result in host colonization. We used oligonucleotide microarrays to monitor (in vitro) differential gene expression in group A streptococci during pharyngeal cell adherence, the first overt infection stage. We present neighbor clustering, a new computational method for further analyzing bacterial microarray data that combines two informative characteristics of bacterial genes that share common function or regulation: (1) similar gene expression profiles (i.e., co-expression); and (2) physical proximity of genes on the chromosome. This method identifies statistically significant clusters of co-expressed gene neighbors that potentially share common function or regulation by coupling statistically analyzed gene expression profiles with the chromosomal position of genes. We applied this method to our own data and to those of others, and we show that it identified a greater number of differentially expressed genes, facilitating the reconstruction of more multimeric proteins and complete metabolic pathways than would have been possible without its application. We assessed the biological significance of two identified genes by assaying deletion mutants for adherence in vitro and show that neighbor clustering indeed provides biologically relevant data. Neighbor clustering provides a more comprehensive view of the molecular responses of streptococci during pharyngeal cell adherence.

  12. Phylogeny and evolutionary history of Leymus (Triticeae; Poaceae based on a single-copy nuclear gene encoding plastid acetyl-CoA carboxylase

    Directory of Open Access Journals (Sweden)

    Ding Cun-Bang

    2009-10-01

    Full Text Available Abstract Background Single- and low- copy genes are less likely subject to concerted evolution, thus making themselves ideal tools for studying the origin and evolution of polyploid taxa. Leymus is a polyploid genus with a diverse array of morphology, ecology and distribution in Triticeae. The genomic constitution of Leymus was assigned as NsXm, where Ns was presumed to be originated from Psathyrostachys, while Xm represented a genome of unknown origin. In addition, little is known about the evolutionary history of Leymus. Here, we investigate the phylogenetic relationship, genome donor, and evolutionary history of Leymus based on a single-copy nuclear Acc1 gene. Results Two homoeologues of the Acc1 gene were isolated from nearly all the sampled Leymus species using allele-specific primer and were analyzed with those from 35 diploid taxa representing 18 basic genomes in Triticeae. Sequence diversity patterns and genealogical analysis suggested that (1 Leymus is closely related to Psathyrostachys, Agropyron, and Eremopyrum; (2 Psathyrostachys juncea is an ancestral Ns-genome donor of Leymus species; (3 the Xm genome in Leymus may be originated from an ancestral lineage of Agropyron and Eremopyrum triticeum; (4 the Acc1 sequences of Leymus species from the Qinghai-Tibetan plateau are evolutionarily distinct; (5 North America Leymus species might originate from colonization via the Bering land bridge; (6 Leymus originated about 11-12MYA in Eurasia, and adaptive radiation might have occurred in Leymus during the period of 3.7-4.3 MYA and 1.7-2.1 MYA. Conclusion Leymus species have allopolyploid origin. It is hypothesized that the adaptive radiation of Leymus species might have been triggered by the recent upliftings of the Qinghai-Tibetan plateau and subsequent climatic oscillations. Adaptive radiation may have promoted the rapid speciation, as well as the fixation of unique morphological characters in Leymus. Our results shed new light on our

  13. The evolutionary ecology of molecular replicators.

    Science.gov (United States)

    Nee, Sean

    2016-08-01

    By reasonable criteria, life on the Earth consists mainly of molecular replicators. These include viruses, transposons, transpovirons, coviruses and many more, with continuous new discoveries like Sputnik Virophage. Their study is inherently multidisciplinary, spanning microbiology, genetics, immunology and evolutionary theory, and the current view is that taking a unified approach has great power and promise. We support this with a new, unified, model of their evolutionary ecology, using contemporary evolutionary theory coupling the Price equation with game theory, studying the consequences of the molecular replicators' promiscuous use of each others' gene products for their natural history and evolutionary ecology. Even at this simple expository level, we can make a firm prediction of a new class of replicators exploiting viruses such as lentiviruses like SIVs, a family which includes HIV: these have been explicitly stated in the primary literature to be non-existent. Closely connected to this departure is the view that multicellular organism immunology is more about the management of chronic infections rather than the elimination of acute ones and new understandings emerging are changing our view of the kind of theatre we ourselves provide for the evolutionary play of molecular replicators. This study adds molecular replicators to bacteria in the emerging field of sociomicrobiology.

  14. Coexpression of β-D-galactosidase and L-arabinose isomerase in the production of D-tagatose: a functional sweetener.

    Science.gov (United States)

    Zhan, Yijing; Xu, Zheng; Li, Sha; Liu, Xiaoliu; Xu, Lu; Feng, Xiaohai; Xu, Hong

    2014-03-19

    The functional sweetener, d-tagatose, is commonly transformed from galactose by l-arabinose isomerase. To make use of a much cheaper starting material, lactose, hydrolization, and isomerization are required to take place collaboratively. Therefore, a single-step method involving β-d-galactosidase was explored for d-tagatose production. The two vital genes, β-d-galactosidase gene (lacZ) and l-arabinose isomerase mutant gene (araA') were extracted separately from Escherichia coli strains and incorporated into E. coli simultaneously. This gave us E. coli-ZY, a recombinant producing strain capable of coexpressing the two key enzymes. The resulted cells exhibited maximum d-tagatose producing activity at 34 °C and pH 6.5 and in the presence of borate, 10 mM Fe(2+), and 1 mM Mn(2+). Further monitoring showed that the recombinant cells could hydrolyze more than 95% lactose and convert 43% d-galactose into d-tagatose. This research has verified the feasibility of single-step d-tagatose fermentation, thereby laying down the foundation for industrial usage of lactose.

  15. The Epstein-Barr virus BFRF1 and BFLF2 proteins interact and coexpression alters their cellular localization

    International Nuclear Information System (INIS)

    Lake, Cathleen M.; Hutt-Fletcher, Lindsey M.

    2004-01-01

    The BFRF1 protein of Epstein-Barr virus (EBV) is a recently identified membrane protein that is the homolog of the alphaherpesvirus UL34 gene product. We report here that a yeast two-hybrid screen identified the BFLF2 gene product, a homolog of alphaherpesvirus UL31, as a protein that interacts with BFRF1. Expression of BFLF2 in mammalian cells revealed a protein of approximately 28 kDa that associated with BFRF1 in a noncovalently linked complex. When expressed alone, the BFRF1 protein was found in the cytoplasm and perinuclear region. BFLF2 was found diffusely in the nucleus in the absence of BFRF1, but coexpression of BFRF1 and BFLF2 resulted in colocalization of the two proteins at the nuclear rim. These data recapitulate the behavior of the alphaherpesvirus homologs of BFRF1 and BFLF2 and suggest that functional as well as structural and positional homology may be conserved

  16. From genes to milk: genomic organization and epigenetic regulation of the mammary transcriptome.

    Science.gov (United States)

    Lemay, Danielle G; Pollard, Katherine S; Martin, William F; Freeman Zadrowski, Courtneay; Hernandez, Joseph; Korf, Ian; German, J Bruce; Rijnkels, Monique

    2013-01-01

    Even in genomes lacking operons, a gene's position in the genome influences its potential for expression. The mechanisms by which adjacent genes are co-expressed are still not completely understood. Using lactation and the mammary gland as a model system, we explore the hypothesis that chromatin state contributes to the co-regulation of gene neighborhoods. The mammary gland represents a unique evolutionary model, due to its recent appearance, in the context of vertebrate genomes. An understanding of how the mammary gland is regulated to produce milk is also of biomedical and agricultural importance for human lactation and dairying. Here, we integrate epigenomic and transcriptomic data to develop a comprehensive regulatory model. Neighborhoods of mammary-expressed genes were determined using expression data derived from pregnant and lactating mice and a neighborhood scoring tool, G-NEST. Regions of open and closed chromatin were identified by ChIP-Seq of histone modifications H3K36me3, H3K4me2, and H3K27me3 in the mouse mammary gland and liver tissue during lactation. We found that neighborhoods of genes in regions of uniquely active chromatin in the lactating mammary gland, compared with liver tissue, were extremely rare. Rather, genes in most neighborhoods were suppressed during lactation as reflected in their expression levels and their location in regions of silenced chromatin. Chromatin silencing was largely shared between the liver and mammary gland during lactation, and what distinguished the mammary gland was mainly a small tissue-specific repertoire of isolated, expressed genes. These findings suggest that an advantage of the neighborhood organization is in the collective repression of groups of genes via a shared mechanism of chromatin repression. Genes essential to the mammary gland's uniqueness are isolated from neighbors, and likely have less tolerance for variation in expression, properties they share with genes responsible for an organism's survival.

  17. Comparative genomics explains the evolutionary success of reef-forming corals

    KAUST Repository

    Bhattacharya, Debashish

    2016-05-24

    Transcriptome and genome data from twenty stony coral species and a selection of reference bilaterians were studied to elucidate coral evolutionary history. We identified genes that encode the proteins responsible for the precipitation and aggregation of the aragonite skeleton on which the organisms live, and revealed a network of environmental sensors that coordinate responses of the host animals to temperature, light, and pH. Furthermore, we describe a variety of stress-related pathways, including apoptotic pathways that allow the host animals to detoxify reactive oxygen and nitrogen species that are generated by their intracellular photosynthetic symbionts, and determine the fate of corals under environmental stress. Some of these genes arose through horizontal gene transfer and comprise at least 0.2% of the animal gene inventory. Our analysis elucidates the evolutionary strategies that have allowed symbiotic corals to adapt and thrive for hundreds of millions of years.

  18. Comparative genomics explains the evolutionary success of reef-forming corals

    KAUST Repository

    Bhattacharya, Debashish; Agrawal, Shobhit; Aranda, Manuel; Baumgarten, Sebastian; Belcaid, Mahdi; Drake, Jeana L; Erwin, Douglas; Foret, Sylvian; Gates, Ruth D; Gruber, David F; Kamel, Bishoy; Lesser, Michael P; Levy, Oren; Liew, Yi Jin; MacManes, Matthew; Mass, Tali; Medina, Monica; Mehr, Shaadi; Meyer, Eli; Price, Dana C; Putnam, Hollie M; Qiu, Huan; Shinzato, Chuya; Shoguchi, Eiichi; Stokes, Alexander J; Tambutté , Sylvie; Tchernov, Dan; Voolstra, Christian R.; Wagner, Nicole; Walker, Charles W; Weber, Andreas PM; Weis, Virginia; Zelzion, Ehud; Zoccola, Didier; Falkowski, Paul G

    2016-01-01

    Transcriptome and genome data from twenty stony coral species and a selection of reference bilaterians were studied to elucidate coral evolutionary history. We identified genes that encode the proteins responsible for the precipitation and aggregation of the aragonite skeleton on which the organisms live, and revealed a network of environmental sensors that coordinate responses of the host animals to temperature, light, and pH. Furthermore, we describe a variety of stress-related pathways, including apoptotic pathways that allow the host animals to detoxify reactive oxygen and nitrogen species that are generated by their intracellular photosynthetic symbionts, and determine the fate of corals under environmental stress. Some of these genes arose through horizontal gene transfer and comprise at least 0.2% of the animal gene inventory. Our analysis elucidates the evolutionary strategies that have allowed symbiotic corals to adapt and thrive for hundreds of millions of years.

  19. Pooled Enrichment Sequencing Identifies Diversity and Evolutionary Pressures at NLR Resistance Genes within a Wild Tomato Population.

    Science.gov (United States)

    Stam, Remco; Scheikl, Daniela; Tellier, Aurélien

    2016-06-02

    Nod-like receptors (NLRs) are nucleotide-binding domain and leucine-rich repeats containing proteins that are important in plant resistance signaling. Many of the known pathogen resistance (R) genes in plants are NLRs and they can recognize pathogen molecules directly or indirectly. As such, divergence and copy number variants at these genes are found to be high between species. Within populations, positive and balancing selection are to be expected if plants coevolve with their pathogens. In order to understand the complexity of R-gene coevolution in wild nonmodel species, it is necessary to identify the full range of NLRs and infer their evolutionary history. Here we investigate and reveal polymorphism occurring at 220 NLR genes within one population of the partially selfing wild tomato species Solanum pennellii. We use a combination of enrichment sequencing and pooling ten individuals, to specifically sequence NLR genes in a resource and cost-effective manner. We focus on the effects which different mapping and single nucleotide polymorphism calling software and settings have on calling polymorphisms in customized pooled samples. Our results are accurately verified using Sanger sequencing of polymorphic gene fragments. Our results indicate that some NLRs, namely 13 out of 220, have maintained polymorphism within our S. pennellii population. These genes show a wide range of πN/πS ratios and differing site frequency spectra. We compare our observed rate of heterozygosity with expectations for this selfing and bottlenecked population. We conclude that our method enables us to pinpoint NLR genes which have experienced natural selection in their habitat. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. Conservation of gene linkage in dispersed vertebrate NK homeobox clusters.

    Science.gov (United States)

    Wotton, Karl R; Weierud, Frida K; Juárez-Morales, José L; Alvares, Lúcia E; Dietrich, Susanne; Lewis, Katharine E

    2009-10-01

    Nk homeobox genes are important regulators of many different developmental processes including muscle, heart, central nervous system and sensory organ development. They are thought to have arisen as part of the ANTP megacluster, which also gave rise to Hox and ParaHox genes, and at least some NK genes remain tightly linked in all animals examined so far. The protostome-deuterostome ancestor probably contained a cluster of nine Nk genes: (Msx)-(Nk4/tinman)-(Nk3/bagpipe)-(Lbx/ladybird)-(Tlx/c15)-(Nk7)-(Nk6/hgtx)-(Nk1/slouch)-(Nk5/Hmx). Of these genes, only NKX2.6-NKX3.1, LBX1-TLX1 and LBX2-TLX2 remain tightly linked in humans. However, it is currently unclear whether this is unique to the human genome as we do not know which of these Nk genes are clustered in other vertebrates. This makes it difficult to assess whether the remaining linkages are due to selective pressures or because chance rearrangements have "missed" certain genes. In this paper, we identify all of the paralogs of these ancestrally clustered NK genes in several distinct vertebrates. We demonstrate that tight linkages of Lbx1-Tlx1, Lbx2-Tlx2 and Nkx3.1-Nkx2.6 have been widely maintained in both the ray-finned and lobe-finned fish lineages. Moreover, the recently duplicated Hmx2-Hmx3 genes are also tightly linked. Finally, we show that Lbx1-Tlx1 and Hmx2-Hmx3 are flanked by highly conserved noncoding elements, suggesting that shared regulatory regions may have resulted in evolutionary pressure to maintain these linkages. Consistent with this, these pairs of genes have overlapping expression domains. In contrast, Lbx2-Tlx2 and Nkx3.1-Nkx2.6, which do not seem to be coexpressed, are also not associated with conserved noncoding sequences, suggesting that an alternative mechanism may be responsible for the continued clustering of these genes.

  1. In vivo modification of tyrosine residues in recombinant mussel adhesive protein by tyrosinase co-expression in Escherichia coli

    Directory of Open Access Journals (Sweden)

    Choi Yoo

    2012-10-01

    Full Text Available Abstract Background In nature, mussel adhesive proteins (MAPs show remarkable adhesive properties, biocompatibility, and biodegradability. Thus, they have been considered promising adhesive biomaterials for various biomedical and industrial applications. However, limited production of natural MAPs has hampered their practical applications. Recombinant production in bacterial cells could be one alternative to obtain useable amounts of MAPs, although additional post-translational modification of tyrosine residues into 3,4-dihydroxyphenyl-alanine (Dopa and Dopaquinone is required. The superior properties of MAPs are mainly attributed to the introduction of quinone-derived intermolecular cross-links. To solve this problem, we utilized a co-expression strategy of recombinant MAP and tyrosinase in Escherichia coli to successfully modify tyrosine residues in vivo. Results A recombinant hybrid MAP, fp-151, was used as a target for in vivo modification, and a dual vector system of pET and pACYC-Duet provided co-expression of fp-151 and tyrosinase. As a result, fp-151 was over-expressed and mainly obtained from the soluble fraction in the co-expression system. Without tyrosinase co-expression, fp-151 was over-expressed in an insoluble form in inclusion bodies. The modification of tyrosine residues in the soluble-expressed fp-151 was clearly observed from nitroblue tetrazolium staining and liquid-chromatography-mass/mass spectrometry analyses. The purified, in vivo modified, fp-151 from the co-expression system showed approximately 4-fold higher bulk-scale adhesive strength compared to in vitro tyrosinase-treated fp-151. Conclusion Here, we reported a co-expression system to obtain in vivo modified MAP; additional in vitro tyrosinase modification was not needed to obtain adhesive properties and the in vivo modified MAP showed superior adhesive strength compared to in vitro modified protein. It is expected that this co-expression strategy will accelerate

  2. Evolutionary genomics and HIV restriction factors.

    Science.gov (United States)

    Pyndiah, Nitisha; Telenti, Amalio; Rausell, Antonio

    2015-03-01

    To provide updated insights into innate antiviral immunity and highlight prototypical evolutionary features of well characterized HIV restriction factors. Recently, a new HIV restriction factor, Myxovirus resistance 2, has been discovered and the region/residue responsible for its activity identified using an evolutionary approach. Furthermore, IFI16, an innate immunity protein known to sense several viruses, has been shown to contribute to the defense to HIV-1 by causing cell death upon sensing HIV-1 DNA. Restriction factors against HIV show characteristic signatures of positive selection. Different patterns of accelerated sequence evolution can distinguish antiviral strategies--offense or defence--as well as the level of specificity of the antiviral properties. Sequence analysis of primate orthologs of restriction factors serves to localize functional domains and sites responsible for antiviral action. We use recent discoveries to illustrate how evolutionary genomic analyses help identify new antiviral genes and their mechanisms of action.

  3. Protein Coexpression Using FMDV 2A: Effect of “Linker” Residues

    Directory of Open Access Journals (Sweden)

    Ekaterina Minskaia

    2013-01-01

    Full Text Available Many biomedical applications absolutely require, or are substantially enhanced by, coexpression of multiple proteins from a single vector. Foot-and-mouth disease virus 2A (F2A and “2A-like” sequences (e.g., Thosea asigna virus 2A; T2A are used widely for this purpose since multiple proteins can be coexpressed by linking open reading frames (ORFs to form a single cistron. The activity of F2A “cleavage” may, however, be compromised by both the use of shorter versions of F2A and the sequences (derived from multiple-purpose cloning sites used to link F2A to the upstream protein. To characterise these effects, different lengths of F2A and T2A were inserted between green and cherry fluorescent proteins. Mutations were introduced in the linker region immediately upstream of both F2A- and T2A-based constructs and activities determined using both cell-free translation systems and transfected cells. In shorter versions of F2A, activity may be affected by both the C-terminal sequence of the protein upstream and, equally strikingly, the residues immediately upstream introduced during cloning. Mutations significantly improved activity for shorter versions of F2A but could decrease activity in the case of T2A. These data will aid the design of cloning strategies for the co-expression of multiple proteins in biomedical/biotechnological applications.

  4. Evolution of microbes and viruses: A paradigm shift in evolutionary biology?

    Directory of Open Access Journals (Sweden)

    Eugene V. Koonin

    2012-09-01

    Full Text Available When Charles Darwin formulated the central principles of evolutionary biology in the Origin of Species in 1859 and the architects of the Modern Synthesis integrated these principles with population genetics almost a century later, the principal if not the sole objects of evolutionary biology were multicellular eukaryotes, primarily animals and plants. Before the advent of efficient gene sequencing, all attempts to extend evolutionary studies to bacteria have been futile. Sequencing of the rRNA genes in thousands of microbes allowed the construction of the three- domain ‘ribosomal Tree of Life’ that was widely thought to have resolved the evolutionary relationships between the cellular life forms. However, subsequent massive sequencing of numerous, complete microbial genomes revealed novel evolutionary phenomena, the most fundamental of these being: i pervasive horizontal gene transfer (HGT, in large part mediated by viruses and plasmids, that shapes the genomes of archaea and bacteria and call for a radical revision (if not abandonment of the Tree of Life concept, ii Lamarckian-type inheritance that appears to be critical for antivirus defense and other forms of adaptation in prokaryotes, and iii evolution of evolvability, i.e. dedicated mechanisms for evolution such as vehicles for HGT and stress-induced mutagenesis systems. In the non-cellular part of the microbial world, phylogenomics and metagenomics of viruses and related selfish genetic elements revealed enormous genetic and molecular diversity and extremely high abundance of viruses that come across as the dominant biological entities on earth. Furthermore, the perennial arms race between viruses and their hosts is one of the defining factors of evolution. Thus, microbial phylogenomics adds new dimensions to the fundamental picture of evolution even as the principle of descent with modification discovered by Darwin and the laws of population genetics remain at the core of evolutionary

  5. Microarray profiling and co-expression network analysis of circulating lncRNAs and mRNAs associated with major depressive disorder.

    Directory of Open Access Journals (Sweden)

    Zhifen Liu

    Full Text Available LncRNAs, which represent one of the most highly expressed classes of ncRNAs in the brain, are becoming increasingly interesting with regard to brain functions and disorders. However, changes in the expression of regulatory lncRNAs in Major Depressive Disorder (MDD have not yet been reported. Using microarrays, we profiled the expression of 34834 lncRNAs and 39224 mRNAs in peripheral blood sampled from MDD patients as well as demographically-matched controls. Among these, we found that 2007 lncRNAs and 1667 mRNAs were differentially expressed, 17 of which were documented as depression-related gene in previous studies. Gene Ontology (GO and pathway analyses indicated that the biological functions of differentially expressed mRNAs were related to fundamental metabolic processes and neurodevelopment diseases. To investigate the potential regulatory roles of the differentially expressed lncRNAs on the mRNAs, we also constructed co-expression networks composed of the lncRNAs and mRNAs, which shows significant correlated patterns of expression. In the MDD-derived network, there were a greater number of nodes and connections than that in the control-derived network. The lncRNAs located at chr10:874695-874794, chr10:75873456-75873642, and chr3:47048304-47048512 may be important factors regulating the expression of mRNAs as they have previously been reported associations with MDD. This study is the first to explore genome-wide lncRNA expression and co-expression with mRNA patterns in MDD using microarray technology. We identified circulating lncRNAs that are aberrantly expressed in MDD and the results suggest that lncRNAs may contribute to the molecular pathogenesis of MDD.

  6. TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES.

    Science.gov (United States)

    Zhu, Lingxue; Lei, Jing; Devlin, Bernie; Roeder, Kathryn

    2017-09-01

    Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however statistical methods have been limited by the high dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue-Driven (sLED) test for comparing two high-dimensional covariance matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates what we assume to be common, namely sparse and weak signals in gene expression data, and it is closely related with Sparse Principal Component Analysis. We prove that sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures under many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset obtained from post-mortem brain tissue from Schizophrenia patients and controls, we provide a novel list of genes implicated in Schizophrenia and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects. We also illustrate that sLED can be generalized to compare other gene-gene "relationship" matrices that are of practical interest, such as the weighted adjacency matrices.

  7. Modalities of gene action predicted by the classical evolutionary biological theory of aging.

    Science.gov (United States)

    Martin, George M

    2007-04-01

    What might now be referred to as the "classical" evolutionary biological theory of why we age has had a number of serious challenges in recent years. While the theory might therefore have to be modified under certain circumstances, in the author's opinion, it still provides the soundest theoretical basis for thinking about how we age. Nine modalities of gene action that have the potential to modulate processes of aging are reviewed, including the two most widely reviewed and accepted concepts ("antagonistic pleiotropy" and "mutation accumulation"). While several of these nine mechanisms can be regarded as derivatives of the antagonistic pleiotropic concept, they frame more specific questions for future research. Such research should pursue what appears to be the dominant factor in the determination of intraspecific variations in longevity-stochastic mechanisms, most likely based upon epigenetics. This contrasts with the dominant factor in the determination of interspecific variations in longevity-the constitutional genome, most likely based upon variations in regulatory loci.

  8. VSNL1 Co-expression networks in aging include calcium signaling, synaptic plasticity, and Alzheimer’s disease pathways

    Directory of Open Access Journals (Sweden)

    C W Lin

    2015-03-01

    Full Text Available The Visinin-like 1 (VSNL1 gene encodes Visinin-like protein 1, a peripheral biomarker for Alzheimer disease (AD. Little is known, however, about normal VSNL1 expression in brain and the biologic networks in which it participates. Frontal cortex gray matter from 209 subjects without neurodegenerative or psychiatric illness, ranging in age from 16–91, were processed on Affymetrix GeneChip 1.1 ST and Human SNP Array 6.0. VSNL1 expression was unaffected by age and sex, and not significantly associated with SNPs in cis or trans. VSNL1 was significantly co-expressed with genes in pathways for Calcium Signaling, AD, Long Term Potentiation, Long Term Depression, and Trafficking of AMPA Receptors. The association with AD was driven, in part, by correlation with amyloid precursor protein (APP expression. These findings provide an unbiased link between VSNL1 and molecular mechanisms of AD, including pathways implicated in synaptic pathology in AD. Whether APP may drive increased VSNL1 expression, VSNL1 drives increased APP expression, or both are downstream of common pathogenic regulators will need to be evaluated in model systems.

  9. Functional comparison of the nematode Hox gene lin-39 in C. elegans and P. pacificus reveals evolutionary conservation of protein function despite divergence of primary sequences.

    Science.gov (United States)

    Grandien, K; Sommer, R J

    2001-08-15

    Hox transcription factors have been implicated in playing a central role in the evolution of animal morphology. Many studies indicate the evolutionary importance of regulatory changes in Hox genes, but little is known about the role of functional changes in Hox proteins. In the nematodes Pristionchus pacificus and Caenorhabditis elegans, developmental processes can be compared at the cellular, genetic, and molecular levels and differences in gene function can be identified. The Hox gene lin-39 is involved in the regulation of nematode vulva development. Comparison of known lin-39 mutations in P. pacificus and C. elegans revealed both conservation and changes of gene function. Here, we study evolutionary changes of lin-39 function using hybrid transgenes and site-directed mutagenesis in an in vivo assay using C. elegans lin-39 mutants. Our data show that despite the functional differences of LIN-39 between the two species, Ppa-LIN-39, when driven by Cel-lin-39 regulatory elements, can functionally replace Cel-lin-39. Furthermore, we show that the MAPK docking and phosphorylation motifs unique for Cel-LIN-39 are dispensable for Cel-lin-39 function. Therefore, the evolution of lin-39 function is driven by changes in regulatory elements rather than changes in the protein itself.

  10. Interactions between co-expressed Arabidopsis sucrose transporters in the split-ubiquitin system

    Directory of Open Access Journals (Sweden)

    Lalonde Sylvie

    2003-03-01

    Full Text Available Abstract Background The Arabidopsis genome contains nine sucrose transporter paralogs falling into three clades: SUT1-like, SUT2 and SUT4. The carriers differ in their kinetic properties. Many transport proteins are known to exist as oligomers. The yeast-based split ubiquitin system can be used to analyze the ability of membrane proteins to interact. Results Promoter-GUS fusions were used to analyze the cellular expression of the three transporter genes in transgenic Arabidopsis plants. All three fusion genes are co-expressed in companion cells. Protein-protein interactions between Arabidopsis sucrose transporters were tested using the split ubiquitin system. Three paralogous sucrose transporters are capable of interacting as either homo- or heteromers. The interactions are specific, since a potassium channel and a glucose transporter did not show interaction with sucrose transporters. Also the biosynthetic and metabolizing enzymes, sucrose phosphate phosphatase and sucrose synthase, which were found to be at least in part bound to the plasma membrane, did not specifically interact with sucrose transporters. Conclusions The split-ubiquitin system provides a powerful tool to detect potential interactions between plant membrane proteins by heterologous expression in yeast, and can be used to screen for interactions with membrane proteins as baits. Like other membrane proteins, the Arabidopsis sucrose transporters are able to form oligomers. The biochemical approaches are required to confirm the in planta interaction.

  11. Application of DETECTER, an evolutionary genomic tool to analyze genetic variation, to the cystic fibrosis gene family

    Directory of Open Access Journals (Sweden)

    De Kee Danny W

    2006-03-01

    Full Text Available Abstract Background The medical community requires computational tools that distinguish missense genetic differences having phenotypic impact within the vast number of sense mutations that do not. Tools that do this will become increasingly important for those seeking to use human genome sequence data to predict disease, make prognoses, and customize therapy to individual patients. Results An approach, termed DETECTER, is proposed to identify sites in a protein sequence where amino acid replacements are likely to have a significant effect on phenotype, including causing genetic disease. This approach uses a model-dependent tool to estimate the normalized replacement rate at individual sites in a protein sequence, based on a history of those sites extracted from an evolutionary analysis of the corresponding protein family. This tool identifies sites that have higher-than-average, average, or lower-than-average rates of change in the lineage leading to the sequence in the population of interest. The rates are then combined with sequence data to determine the likelihoods that particular amino acids were present at individual sites in the evolutionary history of the gene family. These likelihoods are used to predict whether any specific amino acid replacements, if introduced at the site in a modern human population, would have a significant impact on fitness. The DETECTER tool is used to analyze the cystic fibrosis transmembrane conductance regulator (CFTR gene family. Conclusion In this system, DETECTER retrodicts amino acid replacements associated with the cystic fibrosis disease with greater accuracy than alternative approaches. While this result validates this approach for this particular family of proteins only, the approach may be applicable to the analysis of polymorphisms generally, including SNPs in a human population.

  12. Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice

    Directory of Open Access Journals (Sweden)

    Shuchi eSmita

    2015-12-01

    Full Text Available MYB transcription factor (TF is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by top down and guide gene approaches. More than 50% of OsMYBs were strongly correlated under fifty experimental conditions with 51 hub genes via top down approach. Further, clusters were identified using Markov Clustering (MCL. To maximize the clustering performance, parameter evaluation of the MCL inflation score (I was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by guide gene approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought

  13. Late replication domains are evolutionary conserved in the Drosophila genome.

    Science.gov (United States)

    Andreyenkova, Natalya G; Kolesnikova, Tatyana D; Makunin, Igor V; Pokholkova, Galina V; Boldyreva, Lidiya V; Zykova, Tatyana Yu; Zhimulev, Igor F; Belyaeva, Elena S

    2013-01-01

    Drosophila chromosomes are organized into distinct domains differing in their predominant chromatin composition, replication timing and evolutionary conservation. We show on a genome-wide level that genes whose order has remained unaltered across 9 Drosophila species display late replication timing and frequently map to the regions of repressive chromatin. This observation is consistent with the existence of extensive domains of repressive chromatin that replicate extremely late and have conserved gene order in the Drosophila genome. We suggest that such repressive chromatin domains correspond to a handful of regions that complete replication at the very end of S phase. We further demonstrate that the order of genes in these regions is rarely altered in evolution. Substantial proportion of such regions significantly coincide with large synteny blocks. This indicates that there are evolutionary mechanisms maintaining the integrity of these late-replicating chromatin domains. The synteny blocks corresponding to the extremely late-replicating regions in the D. melanogaster genome consistently display two-fold lower gene density across different Drosophila species.

  14. Co-expression of the C-terminal domain of Yersinia enterocolitica ...

    Indian Academy of Sciences (India)

    Home; Journals; Journal of Biosciences; Volume 40; Issue 1. Co-expression of the C-terminal domain of Yersinia enterocolitica invasin enhances the efficacy of classical swine-fever-vectored vaccine based on human adenovirus. Helin Li Pengbo Ning Zhi Lin Wulong Liang Kai Kang Lei He Yanming Zhang. Articles Volume ...

  15. Identification and network-enabled characterization of auxin response factor genes in Medicago truncatula

    Directory of Open Access Journals (Sweden)

    David J. Burks

    2016-12-01

    Full Text Available The Auxin Response Factor (ARF family of transcription factors is an important regulator of environmental response and symbiotic nodulation in the legume Medicago truncatula. While previous studies have identified members of this family, a recent spurt in gene expression data coupled with genome update and reannotation calls for a reassessment of the prevalence of ARF genes and their interaction networks in M. truncatula. We performed a comprehensive analysis of the M. truncatula genome and transcriptome that entailed search for novel ARF genes and the co-expression networks. Our investigation revealed 8 novel M. truncatula ARF (MtARF genes, of the total 22 identified, and uncovered novel gene co-expression networks as well. Furthermore, the topological clustering and single enrichment analysis of several network models revealed the roles of individual members of the MtARF family in nitrogen regulation, nodule initiation, and post-embryonic development through a specialized protein packaging and secretory pathway. In summary, this study not just shines new light on an important gene family, but also provides a guideline for identification of new members of gene families and their functional characterization through network analyses.

  16. Sieve element occlusion (SEO) genes encode structural phloem proteins involved in wound sealing of the phloem.

    Science.gov (United States)

    Ernst, Antonia M; Jekat, Stephan B; Zielonka, Sascia; Müller, Boje; Neumann, Ulla; Rüping, Boris; Twyman, Richard M; Krzyzanek, Vladislav; Prüfer, Dirk; Noll, Gundula A

    2012-07-10

    The sieve element occlusion (SEO) gene family originally was delimited to genes encoding structural components of forisomes, which are specialized crystalloid phloem proteins found solely in the Fabaceae. More recently, SEO genes discovered in various non-Fabaceae plants were proposed to encode the common phloem proteins (P-proteins) that plug sieve plates after wounding. We carried out a comprehensive characterization of two tobacco (Nicotiana tabacum) SEO genes (NtSEO). Reporter genes controlled by the NtSEO promoters were expressed specifically in immature sieve elements, and GFP-SEO fusion proteins formed parietal agglomerates in intact sieve elements as well as sieve plate plugs after wounding. NtSEO proteins with and without fluorescent protein tags formed agglomerates similar in structure to native P-protein bodies when transiently coexpressed in Nicotiana benthamiana, and the analysis of these protein complexes by electron microscopy revealed ultrastructural features resembling those of native P-proteins. NtSEO-RNA interference lines were essentially devoid of P-protein structures and lost photoassimilates more rapidly after injury than control plants, thus confirming the role of P-proteins in sieve tube sealing. We therefore provide direct evidence that SEO genes in tobacco encode P-protein subunits that affect translocation. We also found that peptides recently identified in fascicular phloem P-protein plugs from squash (Cucurbita maxima) represent cucurbit members of the SEO family. Our results therefore suggest a common evolutionary origin for P-proteins found in the sieve elements of all dicotyledonous plants and demonstrate the exceptional status of extrafascicular P-proteins in cucurbits.

  17. Comparative ecological transcriptomics and the contribution of gene expression to the evolutionary potential of a threatened fish.

    Science.gov (United States)

    Brauer, Chris J; Unmack, Peter J; Beheregaray, Luciano B

    2017-12-01

    Understanding whether small populations with low genetic diversity can respond to rapid environmental change via phenotypic plasticity is an outstanding research question in biology. RNA sequencing (RNA-seq) has recently provided the opportunity to examine variation in gene expression, a surrogate for phenotypic variation, in nonmodel species. We used a comparative RNA-seq approach to assess expression variation within and among adaptively divergent populations of a threatened freshwater fish, Nannoperca australis, found across a steep hydroclimatic gradient in the Murray-Darling Basin, Australia. These populations evolved under contrasting selective environments (e.g., dry/hot lowland; wet/cold upland) and represent opposite ends of the species' spectrum of genetic diversity and population size. We tested the hypothesis that environmental variation among isolated populations has driven the evolution of divergent expression at ecologically important genes using differential expression (DE) analysis and an anova-based comparative phylogenetic expression variance and evolution model framework based on 27,425 de novo assembled transcripts. Additionally, we tested whether gene expression variance within populations was correlated with levels of standing genetic diversity. We identified 290 DE candidate transcripts, 33 transcripts with evidence for high expression plasticity, and 50 candidates for divergent selection on gene expression after accounting for phylogenetic structure. Variance in gene expression appeared unrelated to levels of genetic diversity. Functional annotation of the candidate transcripts revealed that variation in water quality is an important factor influencing expression variation for N. australis. Our findings suggest that gene expression variation can contribute to the evolutionary potential of small populations. © 2017 John Wiley & Sons Ltd.

  18. On the evolution of misunderstandings about evolutionary psychology.

    Science.gov (United States)

    Young, J; Persell, R

    2000-04-01

    Some of the controversy surrounding evolutionary explanations of human behavior may be due to cognitive information-processing patterns that are themselves the result of evolutionary processes. Two such patterns are (1) the tendency to oversimplify information so as to reduce demand on cognitive resources and (2) our strong desire to generate predictability and stability from perceptions of the external world. For example, research on social stereotyping has found that people tend to focus automatically on simplified social-categorical information, to use such information when deciding how to behave, and to rely on such information even in the face of contradictory evidence. Similarly, an undying debate over nature vs. nurture is shaped by various data-reduction strategies that frequently oversimplify, and thus distort, the intent of the supporting arguments. This debate is also often marked by an assumption that either the nature or the nurture domain may be justifiably excluded at an explanatory level because one domain appears to operate in a sufficiently stable and predictable way for a particular argument. As a result, critiques in-veighed against evolutionary explanations of behavior often incorporate simplified--and erroneous--assumptions about either the mechanics of how evolution operates or the inevitable implications of evolution for understanding human behavior. The influences of these tendencies are applied to a discussion of the heritability of behavioral characteristics. It is suggested that the common view that Mendelian genetics can explain the heritability of complex behaviors, with a one-gene-one-trait process, is misguided. Complex behaviors are undoubtedly a product of a more complex interaction between genes and environment, ensuring that both nature and nurture must be accommodated in a yet-to-be-developed post-Mendelian model of genetic influence. As a result, current public perceptions of evolutionary explanations of behavior are

  19. Phylogenetic footprint of the plant clock system in angiosperms: evolutionary processes of Pseudo-Response Regulators

    Directory of Open Access Journals (Sweden)

    Saito Shigeru

    2010-05-01

    Full Text Available Abstract Background Plant circadian clocks regulate many photoperiodic and diurnal responses that are conserved among plant species. The plant circadian clock system has been uncovered in the model plant, Arabidopsis thaliana, using genetics and systems biology approaches. However, it is still not clear how the clock system had been organized in the evolutionary history of plants. We recently revealed the molecular phylogeny of LHY/CCA1 genes, one of the essential components of the clock system. The aims of this study are to reconstruct the phylogenetic relationships of angiosperm clock-associated PRR genes, the partner of the LHY/CCA1 genes, and to clarify the evolutionary history of the plant clock system in angiosperm lineages. Results In the present study, to investigate the molecular phylogeny of PRR genes, we performed two approaches: reconstruction of phylogenetic trees and examination of syntenic relationships. Phylogenetic analyses revealed that PRR genes had diverged into three clades prior to the speciation of monocots and eudicots. Furthermore, copy numbers of PRR genes have been independently increased in monocots and eudicots as a result of ancient chromosomal duplication events. Conclusions Based on the molecular phylogenies of both PRR genes and LHY/CCA1 genes, we inferred the evolutionary process of the plant clock system in angiosperms. This scenario provides evolutionary information that a common ancestor of monocots and eudicots had retained the basic components required for reconstructing a clock system and that the plant circadian clock may have become a more elaborate mechanism after the speciation of monocots and eudicots because of the gene expansion that resulted from polyploidy events.

  20. Evolutionary Significance of Wolbachia-to-Animal Horizontal Gene Transfer: Female Sex Determination and the f Element in the Isopod Armadillidium vulgare.

    Science.gov (United States)

    Cordaux, Richard; Gilbert, Clément

    2017-07-21

    An increasing number of horizontal gene transfer (HGT) events from bacteria to animals have been reported in the past years, many of which involve Wolbachia bacterial endosymbionts and their invertebrate hosts. Most transferred Wolbachia genes are neutrally-evolving fossils embedded in host genomes. A remarkable case of Wolbachia HGT for which a clear evolutionary significance has been demonstrated is the " f element", a nuclear Wolbachia insert involved in female sex determination in the terrestrial isopod Armadillidium vulgare . The f element represents an instance of bacteria-to-animal HGT that has occurred so recently that it was possible to infer the donor (feminizing Wolbachia closely related to the w VulC Wolbachia strain of A. vulgare ) and the mechanism of integration (a nearly complete genome inserted by micro-homology-mediated recombination). In this review, we summarize our current knowledge of the f element and discuss arising perspectives regarding female sex determination, unstable inheritance, population dynamics and the molecular evolution of the f element. Overall, the f element unifies three major areas in evolutionary biology: symbiosis, HGT and sex determination. Its characterization highlights the tremendous impact sex ratio distorters can have on the evolution of sex determination mechanisms and sex chromosomes in animals and plants.

  1. Evolutionary Significance of Wolbachia-to-Animal Horizontal Gene Transfer: Female Sex Determination and the f Element in the Isopod Armadillidium vulgare

    Directory of Open Access Journals (Sweden)

    Richard Cordaux

    2017-07-01

    Full Text Available An increasing number of horizontal gene transfer (HGT events from bacteria to animals have been reported in the past years, many of which involve Wolbachia bacterial endosymbionts and their invertebrate hosts. Most transferred Wolbachia genes are neutrally-evolving fossils embedded in host genomes. A remarkable case of Wolbachia HGT for which a clear evolutionary significance has been demonstrated is the “f element”, a nuclear Wolbachia insert involved in female sex determination in the terrestrial isopod Armadillidium vulgare. The f element represents an instance of bacteria-to-animal HGT that has occurred so recently that it was possible to infer the donor (feminizing Wolbachia closely related to the wVulC Wolbachia strain of A. vulgare and the mechanism of integration (a nearly complete genome inserted by micro-homology-mediated recombination. In this review, we summarize our current knowledge of the f element and discuss arising perspectives regarding female sex determination, unstable inheritance, population dynamics and the molecular evolution of the f element. Overall, the f element unifies three major areas in evolutionary biology: symbiosis, HGT and sex determination. Its characterization highlights the tremendous impact sex ratio distorters can have on the evolution of sex determination mechanisms and sex chromosomes in animals and plants.

  2. Lengths of Orthologous Prokaryotic Proteins Are Affected by Evolutionary Factors

    Directory of Open Access Journals (Sweden)

    Tatiana Tatarinova

    2015-01-01

    Full Text Available Proteins of the same functional family (for example, kinases may have significantly different lengths. It is an open question whether such variation in length is random or it appears as a response to some unknown evolutionary driving factors. The main purpose of this paper is to demonstrate existence of factors affecting prokaryotic gene lengths. We believe that the ranking of genomes according to lengths of their genes, followed by the calculation of coefficients of association between genome rank and genome property, is a reasonable approach in revealing such evolutionary driving factors. As we demonstrated earlier, our chosen approach, Bubble-sort, combines stability, accuracy, and computational efficiency as compared to other ranking methods. Application of Bubble Sort to the set of 1390 prokaryotic genomes confirmed that genes of Archaeal species are generally shorter than Bacterial ones. We observed that gene lengths are affected by various factors: within each domain, different phyla have preferences for short or long genes; thermophiles tend to have shorter genes than the soil-dwellers; halophiles tend to have longer genes. We also found that species with overrepresentation of cytosines and guanines in the third position of the codon (GC3 content tend to have longer genes than species with low GC3 content.

  3. Lengths of Orthologous Prokaryotic Proteins Are Affected by Evolutionary Factors.

    Science.gov (United States)

    Tatarinova, Tatiana; Salih, Bilal; Dien Bard, Jennifer; Cohen, Irit; Bolshoy, Alexander

    2015-01-01

    Proteins of the same functional family (for example, kinases) may have significantly different lengths. It is an open question whether such variation in length is random or it appears as a response to some unknown evolutionary driving factors. The main purpose of this paper is to demonstrate existence of factors affecting prokaryotic gene lengths. We believe that the ranking of genomes according to lengths of their genes, followed by the calculation of coefficients of association between genome rank and genome property, is a reasonable approach in revealing such evolutionary driving factors. As we demonstrated earlier, our chosen approach, Bubble-sort, combines stability, accuracy, and computational efficiency as compared to other ranking methods. Application of Bubble Sort to the set of 1390 prokaryotic genomes confirmed that genes of Archaeal species are generally shorter than Bacterial ones. We observed that gene lengths are affected by various factors: within each domain, different phyla have preferences for short or long genes; thermophiles tend to have shorter genes than the soil-dwellers; halophiles tend to have longer genes. We also found that species with overrepresentation of cytosines and guanines in the third position of the codon (GC3 content) tend to have longer genes than species with low GC3 content.

  4. The odyssey of a young gene: structure-function studies in human glutamate dehydrogenases reveal evolutionary-acquired complex allosteric regulation mechanisms.

    Science.gov (United States)

    Zaganas, Ioannis V; Kanavouras, Konstantinos; Borompokas, Nikolas; Arianoglou, Giovanna; Dimovasili, Christina; Latsoudis, Helen; Vlassi, Metaxia; Mastorodemos, Vasileios

    2014-01-01

    Mammalian glutamate dehydrogenase (GDH) catalyzes the reversible inter-conversion of glutamate to α-ketoglutarate and ammonia, interconnecting carbon skeleton and nitrogen metabolism. In addition, it functions as an energy switch by its ability to fuel the Krebs cycle depending on the energy status of the cell. As GDH lies at the intersection of several metabolic pathways, its activity is tightly regulated by several allosteric compounds that are metabolic intermediates. In contrast to other mammals that have a single GDH-encoding gene, humans and great apes possess two isoforms of GDH (hGDH1 and hGDH2, encoded by the GLUD1 and GLUD2 genes, respectively) with distinct regulation pattern, but remarkable sequence similarity (they differ, in their mature form, in only 15 of their 505 amino-acids). The GLUD2 gene is considered a very young gene, emerging from the GLUD1 gene through retro-position only recently (<23 million years ago). The new hGDH2 iso-enzyme, through random mutations and natural selection, is thought to have conferred an evolutionary advantage that helped its persistence through primate evolution. The properties of the two highly homologous human GDHs have been studied using purified recombinant hGDH1 and hGDH2 proteins obtained by expression of the corresponding cDNAs in Sf21 cells. According to these studies, in contrast to hGDH1 that maintains basal activity at 35-40 % of its maximal, hGDH2 displays low basal activity that is highly responsive to activation by rising levels of ADP and/or L-leucine which can also act synergistically. While hGDH1 is inhibited potently by GTP, hGDH2 shows remarkable GTP resistance. Furthermore, the two iso-enzymes are differentially inhibited by estrogens, polyamines and neuroleptics, and also differ in heat-lability. To elucidate the molecular mechanisms that underlie these different regulation patterns of the two iso-enzymes (and consequently the evolutionary adaptation of hGDH2 to a new functional role), we have

  5. Phylogenetic and CRISPR/Cas9 Studies in Deciphering the Evolutionary Trajectory and Phenotypic Impacts of Rice ERECTA Genes

    Directory of Open Access Journals (Sweden)

    Yanchun Zhang

    2018-04-01

    Full Text Available The ERECTA family genes (ERfs have been found to play diverse functions in Arabidopsis, including controlling cell proliferation and cell growth, regulating stomata patterning, and responding to various stresses. This wide range of functions has rendered them as a potential candidate for crop improvement. However, information on their functional roles, particularly their morphological impact, in crop genomes, such as rice, is limited. Here, through evolutionary prediction, we first depict the evolutionary trajectory of the ER family, and show that the ER family is actually highly conserved across different species, suggesting that most of their functions may also be observed in other plant species. We then take advantage of the CRISPR/Cas9 (clustered regularly interspaced short palindromic repeats–associated nuclease 9 system to assess their morphological impact on one of the most important crops, rice. Loss-of-function mutants of OsER1 and OsER2 display shortened plant stature and reduced panicle size, suggesting they possibly also functioned in regulating cell proliferation and cell growth in rice. In addition to functions similar to that in Arabidopsis, we also find clues that rice ERfs may play unique functional roles. The OsER2 displayed more severe phenotypic changes than OsER1, indicating putative differentiation in their functions. The OsERL might be of essential in its function, and the proper function of all three rice ER genes might be dependent of their genetic background. Future investigations relating to these functions are key to exploiting ERfs in crop development.

  6. Usage of U7 snRNA in gene therapy of hemoglobin C disorder ...

    African Journals Online (AJOL)

    Here, a bioinformatic analysis was performed to study the effect of co-expression between human Hb C b-globin chain gene and U7.623. The gene ontological results show that full recovery of hemoglobin function and biological process can be derived. This confirms that U7 snRNA can be a good tool for gene therapy in Hb ...

  7. An efficient co-expression and purification system for the complex of Stx4 and C-terminal domain of Synip

    International Nuclear Information System (INIS)

    Tian Wei; Ma Cong; Liu Yingfang; Xu Tao

    2008-01-01

    Synip and Stx4 complex plays a key role in GLUT4 vesicle trafficking and fusion with plasma membrane. The interaction of Synip with Stx4 prevents interaction of VAMP2 located in GLUT4 vesicle with Stx4 in basal state. Insulin induces the dissociation of the Synip and Stx4 complex, and then triggers VAMP2 to interact with Stx4 to form the SNARE complex, thus promoting the vesicle fusion. In this report, we adopt a novel system for co-expression of the Synip and Stx4 by using two common vectors pGEX6p-1 and pET28a(+) to investigate their expression, purification, and interaction. Through this co-expression system, we successfully co-expressed the Synip and Stx4 complex with high yield, and co-purified at an approximate 1:1 molar ratio with high purity (95%). We also demonstrate that the 1-28 residues of Stx4 are dispensable for interaction with Synip using this co-expression system

  8. An Evolutionary Psychology Approach to Consumer Choice

    Directory of Open Access Journals (Sweden)

    ZURINA BT MOHAIDIN

    2013-07-01

    Full Text Available Human behaviour can be explained not only through experience and environments but also by incorporating evolutionary explanation. Consumer behaviour could not be understood accurately without infusing Darwinian evolutionary theory which has contributed in the knowledge of human nature. Evolutionary psychology revolves around the human’s evolved mental and the impact on human’s traits and behaviour where the influence of the environment to our genes would determine our individual behaviour and traits, resulting in variation among us. Foraging which is a part of behavioural ecology involves many sequences or repetitions of animals’ activities and decision making which is useful to relate these patterns of activities to the decisions made in human consumption. The aim of this research is to investigate the similarities of human consumption and ecological behaviour by employing interpretative and comparative approach. It is hoped that by applying the evolutionary theory in explaining consumer choice, this study is able to contribute to the development of behavioural ecology in human consumption. The analysis of the data is done aggregately for 200 consumers and individually for 20 consumers, who have purchased four product categories over a year. This study concludes that the theories of evolutionary psychology can fit to the consumers’ buying behaviour implicating its usefulness in explaining the consumers’ choice.

  9. Hardy–Weinberg Equilibrium and the Foundations of Evolutionary ...

    Indian Academy of Sciences (India)

    Home; Journals; Resonance – Journal of Science Education; Volume 13; Issue 9. Hardy–Weinberg Equilibrium and the Foundations of Evolutionary Genetics - The Dance of the Genes. Amitabh Joshi. Series Article Volume 13 Issue 9 September 2008 pp 812-835 ...

  10. Efficient androst-1,4-diene-3,17-dione production by co-expressing 3-ketosteroid-Δ1 -dehydrogenase and catalase in Bacillus subtilis.

    Science.gov (United States)

    Shao, M; Sha, Z; Zhang, X; Rao, Z; Xu, M; Yang, T; Xu, Z; Yang, S

    2017-01-01

    3-ketosteroid-Δ 1 -dehydrogenase (KSDD), a flavin adenine dinucleotide (FAD)-dependent enzyme involved in sterol metabolism, specifically catalyses the conversion of androst-4-ene-3,17-dione (AD) to androst-1,4-diene-3,17-dione (ADD). However, the low KSDD activity and the toxic effects of hydrogen peroxide (H 2 O 2 ) generated during the biotransformation of AD to ADD with FAD regeneration hinder its application on AD conversion. The aim of this work was to improve KSDD activity and eliminate the toxic effects of the generated H 2 O 2 to enhance ADD production. The ksdd gene obtained from Mycobacterium neoaurum JC-12 was codon-optimized to increase its expression level in Bacillus subtilis, and the KSDD activity reached 12·3 U mg -1 , which was sevenfold of that of codon-unoptimized gene. To improve AD conversion, catalase was co-expressed with KSDD in B. subtilis 168/pMA5-ksdd opt -katA to eliminate the toxic effects of H 2 O 2 generated during AD conversion. Finally, under optimized bioconversion conditions, fed-batch strategy was carried out and the ADD yield improved to 8·76 g l -1 . This work demonstrates the potential to improve enzyme activity by codon-optimization and eliminate the toxic effects of H 2 O 2 by co-expressing catalase. This study showed the highest ADD productivity ever reported and provides a promising strain for efficient ADD production in the pharmaceutical industry. © 2016 The Society for Applied Microbiology.

  11. Evolutionary Genomics of Life in (and from) the Sea

    Energy Technology Data Exchange (ETDEWEB)

    Boore, Jeffrey L.; Dehal, Paramvir; Fuerstenberg, Susan I.

    2006-01-09

    High throughput genome sequencing centers that were originally built for the Human Genome Project (Lander et al., 2001; Venter et al., 2001) have now become an engine for comparative genomics. The six largest centers alone are now producing over 150 billion nucleotides per year, more than 50 times the amount of DNA in the human genome, and nearly all of this is directed at projects that promise great insights into the pattern and processes of evolution. Unfortunately, this data is being produced at a pace far exceeding the capacity of the scientific community to provide insightful analysis, and few scientists with training and experience in evolutionary biology have played prominent roles to date. One of the consequences is that poor quality analyses are typical; for example, orthology among genes is generally determined by simple measures of sequence similarity, when this has been discredited by molecular evolutionary biologists decades ago. Here we discuss the how genomes are chosen for sequencing and how the scientific community can have input. We describe the PhIGs database and web tools (Dehal and Boore 2005a; http://PhIGs.org), which provide phylogenetic analysis of all gene families for all completely sequenced genomes and the associated 'Synteny Viewer', which allows comparisons of the relative positions of orthologous genes. This is the best tool available for inferring gene function across multiple genomes. We also describe how we have used the PhIGs methods with the whole genome sequences of a tunicate, fish, mouse, and human to conclusively demonstrate that two rounds of whole genome duplication occurred at the base of vertebrates (Dehal and Boore 2005b). This evidence is found in the large scale structure of the positions of paralogous genes that arose from duplications inferred by evolutionary analysis to have occurred at the base of vertebrates.

  12. Phenotypic characterization of neurotensin messenger RNA-expressing cells in the neuroleptic-treated rat striatum: a detailed cellular co-expression study

    Energy Technology Data Exchange (ETDEWEB)

    Emson, P C; Westmore, K; Augood, S J [MRC Molecular Neuroscience Group, The Department of Neurobiology, The Babraham Institute, Babraham, Cambridge (United Kingdom)

    1996-12-11

    The chemical phenotype of proneurotensin messenger RNA-expressing cells was determined in the acute haloperidol-treated rat striatum using a combination of [{sup 35}S]-labelled and alkaline phosphatase-labelled oligonucleotides. Cellular sites of proneurotensin messenger RNA expression were visualized simultaneously on tissue sections processed to reveal cellular sites of preproenkephalin A messenger RNA or the dopamine and adenylate cyclase phosphoprotein-32, messenger RNA. The cellular co-expression of preproenkepahlin A and preprotachykinin messenger RNA was also examined within forebrain structures. Cellular sites of preproenkephalin A and dopamine and adenylate cyclase phosphoprotein-32 messenger RNAs were visualized using alkaline phosphatase-labelled oligonucleotides whilst sites of preprotachykinin and proneurotensin messenger RNA expression were detected using [{sup 35}S]-labelled oligos. Cellular sites of enkephalin and dopamine and adenylate cyclase phosphoprotein-32 gene expression were identified microscopically by the concentration of purple alkaline phosphatase reaction product within the cell cytoplasm, whereas sites of substance P and proneurotensin gene expression were identified by the dense clustering of silver grains overlying cells.An intense hybridization signal was detected for all three neuropeptide messenger RNAs in the striatum, the nucleus accumbens and septum. Dopamine and adenylate cyclase phosphoprotein-32 messenger RNA was detected within the neostriatum but not within the septum. In all forebrain regions examined, with the exception of the islands of Cajella, the cellular expression of enkephalin messenger RNA and substance P messenger RNA was discordant; the two neuropeptide messenger RNAs were detected essentially in different cells, although in the striatum and nucleus accumbens occasional isolated cells were detected which contained both hybridization signals; dense clusters of silver grains overlay alkaline phosphatase

  13. Phenotypic characterization of neurotensin messenger RNA-expressing cells in the neuroleptic-treated rat striatum: a detailed cellular co-expression study

    International Nuclear Information System (INIS)

    Emson, P.C.; Westmore, K.; Augood, S.J.

    1996-01-01

    The chemical phenotype of proneurotensin messenger RNA-expressing cells was determined in the acute haloperidol-treated rat striatum using a combination of [ 35 S]-labelled and alkaline phosphatase-labelled oligonucleotides. Cellular sites of proneurotensin messenger RNA expression were visualized simultaneously on tissue sections processed to reveal cellular sites of preproenkephalin A messenger RNA or the dopamine and adenylate cyclase phosphoprotein-32, messenger RNA. The cellular co-expression of preproenkepahlin A and preprotachykinin messenger RNA was also examined within forebrain structures. Cellular sites of preproenkephalin A and dopamine and adenylate cyclase phosphoprotein-32 messenger RNAs were visualized using alkaline phosphatase-labelled oligonucleotides whilst sites of preprotachykinin and proneurotensin messenger RNA expression were detected using [ 35 S]-labelled oligos. Cellular sites of enkephalin and dopamine and adenylate cyclase phosphoprotein-32 gene expression were identified microscopically by the concentration of purple alkaline phosphatase reaction product within the cell cytoplasm, whereas sites of substance P and proneurotensin gene expression were identified by the dense clustering of silver grains overlying cells.An intense hybridization signal was detected for all three neuropeptide messenger RNAs in the striatum, the nucleus accumbens and septum. Dopamine and adenylate cyclase phosphoprotein-32 messenger RNA was detected within the neostriatum but not within the septum. In all forebrain regions examined, with the exception of the islands of Cajella, the cellular expression of enkephalin messenger RNA and substance P messenger RNA was discordant; the two neuropeptide messenger RNAs were detected essentially in different cells, although in the striatum and nucleus accumbens occasional isolated cells were detected which contained both hybridization signals; dense clusters of silver grains overlay alkaline phosphatase-positive cells

  14. Radiochemotherapy of hepatocarcinoma via lentivirus-mediated transfer of human sodium iodide symporter gene and herpes simplex virus thymidine kinase gene

    Energy Technology Data Exchange (ETDEWEB)

    Chen Libo, E-mail: libochen888@hotmail.com [Department of Nuclear Medicine, Shanghai Sixth People' s Hospital, Shanghai Jiao Tong University, Shanghai 200233 (China); Guo Guoying [Xinyuan Institute of Medicine and Biotechnology, School of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018 (China); Liu Tianjing; Guo Lihe [Division of Biochemistry and Cell Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Zhu Ruisen [Department of Nuclear Medicine, Shanghai Sixth People' s Hospital, Shanghai Jiao Tong University, Shanghai 200233 (China)

    2011-07-15

    Herpes simplex virus thymidine kinase (HSV-TK) gene/ganciclovir (GCV) system has been widely used as a traditional gene therapy modality, and the sodium/iodide symporter gene (NIS) has been found to be a novel therapeutic gene. Since the therapeutic effects of radioiodine therapy or prodrug chemotherapy on cancers following NIS or HSV-TK gene transfer need to be enhanced, this study was designed to investigate the feasibility of radiochemotherapy for hepatocarcinoma via coexpression of NIS gene and HSV-TK gene. Methods: HepG2 cells were stably transfected with NIS, TK and GFP gene via recombinant lentiviral vector and named HepG2/NTG. Gene expression was examined by reverse transcriptase polymerase chain reaction, fluorescence imaging and iodide uptake. The therapeutic effects were assessed by MTT assay and clonogenic assay. Results: HepG2/NTG cells concentrated {sup 125}I{sup -} up to 76-fold higher than the wild-type cells within 20 min, and the efflux happened with a T{sub 1/2eff} of less than 10 min. The iodide uptake in HepG2/NTG cells was specifically inhibited by sodium perchlorate. Dose-dependent toxicity to HepG2/NTG cells by either GCV or {sup 131}I was revealed by clonogenic assay and MTT assay, respectively. The survival rate of HepG2/NTG cells decreased to 49.7%{+-}2.5%, 43.4%{+-}2.8% and 8.6%{+-}1.2% after exposure to {sup 131}I, GCV and combined therapy, respectively. Conclusion: We demonstrate that radiochemotherapy of hepatocarcinoma via lentiviral-mediated coexpression of NIS gene and HSV-TK gene leads to stronger killing effect than single treatment, and in vivo studies are needed to verify these findings.

  15. Radiochemotherapy of hepatocarcinoma via lentivirus-mediated transfer of human sodium iodide symporter gene and herpes simplex virus thymidine kinase gene

    International Nuclear Information System (INIS)

    Chen Libo; Guo Guoying; Liu Tianjing; Guo Lihe; Zhu Ruisen

    2011-01-01

    Herpes simplex virus thymidine kinase (HSV-TK) gene/ganciclovir (GCV) system has been widely used as a traditional gene therapy modality, and the sodium/iodide symporter gene (NIS) has been found to be a novel therapeutic gene. Since the therapeutic effects of radioiodine therapy or prodrug chemotherapy on cancers following NIS or HSV-TK gene transfer need to be enhanced, this study was designed to investigate the feasibility of radiochemotherapy for hepatocarcinoma via coexpression of NIS gene and HSV-TK gene. Methods: HepG2 cells were stably transfected with NIS, TK and GFP gene via recombinant lentiviral vector and named HepG2/NTG. Gene expression was examined by reverse transcriptase polymerase chain reaction, fluorescence imaging and iodide uptake. The therapeutic effects were assessed by MTT assay and clonogenic assay. Results: HepG2/NTG cells concentrated 125 I - up to 76-fold higher than the wild-type cells within 20 min, and the efflux happened with a T 1/2eff of less than 10 min. The iodide uptake in HepG2/NTG cells was specifically inhibited by sodium perchlorate. Dose-dependent toxicity to HepG2/NTG cells by either GCV or 131 I was revealed by clonogenic assay and MTT assay, respectively. The survival rate of HepG2/NTG cells decreased to 49.7%±2.5%, 43.4%±2.8% and 8.6%±1.2% after exposure to 131 I, GCV and combined therapy, respectively. Conclusion: We demonstrate that radiochemotherapy of hepatocarcinoma via lentiviral-mediated coexpression of NIS gene and HSV-TK gene leads to stronger killing effect than single treatment, and in vivo studies are needed to verify these findings.

  16. Enhanced phytoremediation of mixed heavy metal (mercury)-organic pollutants (trichloroethylene) with transgenic alfalfa co-expressing glutathione S-transferase and human P450 2E1.

    Science.gov (United States)

    Zhang, Yuanyuan; Liu, Junhong; Zhou, Yuanming; Gong, Tingyun; Wang, Jing; Ge, Yinlin

    2013-09-15

    Soil contamination is a global environmental problem and many efforts have been made to find efficient remediation methods over the last decade. Moreover, remediation of mixed contaminated soils are more difficult. In the present study, transgenic alfalfa plants pKHCG co-expressing glutathione S-transferase (GST) and human P450 2E1 (CYP2E1) genes were used for phytoremediation of mixed mercury (Hg)-trichloroethylene (TCE) contaminants. Simultaneous expression of GST and CYP2E1 may produce a significant synergistic effect, and leads to improved resistance and accumulation to heavy metal-organic complex contaminants. Based on the tolerance and accumulation assays, pKHCG transgenic plants were more resistant to Hg/TCE complex pollutants and many folds higher in Hg/TCE-accumulation than the non-transgenic control plants in mixed contaminated soil. It is confirmed that GST and CYP2E1 co-expression may be a useful strategy to help achieve mixed heavy metal-organic pollutants phytoremediation. Copyright © 2013 Elsevier B.V. All rights reserved.

  17. Functional and evolutionary analysis of alternatively spliced genes is consistent with an early eukaryotic origin of alternative splicing

    Directory of Open Access Journals (Sweden)

    Penny David

    2007-10-01

    Full Text Available Abstract Background Alternative splicing has been reported in various eukaryotic groups including plants, apicomplexans, diatoms, amoebae, animals and fungi. However, whether widespread alternative splicing has evolved independently in the different eukaryotic groups or was inherited from their last common ancestor, and may therefore predate multicellularity, is still unknown. To better understand the origin and evolution of alternative splicing and its usage in diverse organisms, we studied alternative splicing in 12 eukaryotic species, comparing rates of alternative splicing across genes of different functional classes, cellular locations, intron/exon structures and evolutionary origins. Results For each species, we find that genes from most functional categories are alternatively spliced. Ancient genes (shared between animals, fungi and plants show high levels of alternative splicing. Genes with products expressed in the nucleus or plasma membrane are generally more alternatively spliced while those expressed in extracellular location show less alternative splicing. We find a clear correspondence between incidence of alternative splicing and intron number per gene both within and between genomes. In general, we find several similarities in patterns of alternative splicing across these diverse eukaryotes. Conclusion Along with previous studies indicating intron-rich genes with weak intron boundary consensus and complex spliceosomes in ancestral organisms, our results suggest that at least a simple form of alternative splicing may already have been present in the unicellular ancestor of plants, fungi and animals. A role for alternative splicing in the evolution of multicellularity then would largely have arisen by co-opting the preexisting process.

  18. Phylogenetic comparison of F-Box (FBX gene superfamily within the plant kingdom reveals divergent evolutionary histories indicative of genomic drift.

    Directory of Open Access Journals (Sweden)

    Zhihua Hua

    Full Text Available The emergence of multigene families has been hypothesized as a major contributor to the evolution of complex traits and speciation. To help understand how such multigene families arose and diverged during plant evolution, we examined the phylogenetic relationships of F-Box (FBX genes, one of the largest and most polymorphic superfamilies known in the plant kingdom. FBX proteins comprise the target recognition subunit of SCF-type ubiquitin-protein ligases, where they individually recruit specific substrates for ubiquitylation. Through the extensive analysis of 10,811 FBX loci from 18 plant species, ranging from the alga Chlamydomonas reinhardtii to numerous monocots and eudicots, we discovered strikingly diverse evolutionary histories. The number of FBX loci varies widely and appears independent of the growth habit and life cycle of land plants, with a little as 198 predicted for Carica papaya to as many as 1350 predicted for Arabidopsis lyrata. This number differs substantially even among closely related species, with evidence for extensive gains/losses. Despite this extraordinary inter-species variation, one subset of FBX genes was conserved among most species examined. Together with evidence of strong purifying selection and expression, the ligases synthesized from these conserved loci likely direct essential ubiquitylation events. Another subset was much more lineage specific, showed more relaxed purifying selection, and was enriched in loci with little or no evidence of expression, suggesting that they either control more limited, species-specific processes or arose from genomic drift and thus may provide reservoirs for evolutionary innovation. Numerous FBX loci were also predicted to be pseudogenes with their numbers tightly correlated with the total number of FBX genes in each species. Taken together, it appears that the FBX superfamily has independently undergone substantial birth/death in many plant lineages, with its size and rapid

  19. Co-Expression and Co-Localization of Cartilage Glycoproteins CHI3L1 and Lubricin in Osteoarthritic Cartilage: Morphological, Immunohistochemical and Gene Expression Profiles

    Directory of Open Access Journals (Sweden)

    Marta Anna Szychlinska

    2016-03-01

    Full Text Available Osteoarthritis is the most common human arthritis characterized by degeneration of articular cartilage. Several studies reported that levels of human cartilage glycoprotein chitinase 3-like-1 (CHI3L1 are known as a potential marker for the activation of chondrocytes and the progression of Osteoarthritis (OA, whereas lubricin appears to be chondroprotective. The aim of this study was to investigate the co-expression and co-localization of CHI3L1 and lubricin in normal and osteoarthritic rat articular cartilage to correlate their modified expression to a specific grade of OA. Samples of normal and osteoarthritic rat articular cartilage were analyzed by the Kellgren–Lawrence OA severity scores, the Kraus’ modified Mankin score and the Histopathology Osteoarthritis Research Society International (OARSI system for histomorphometric evaluations, and through CHI3L1 and lubricin gene expression, immunohistochemistry and double immuno-staining analysis. The immunoexpression and the mRNA levels of lubricin increased in normal cartilage and decreased in OA cartilage (normal vs. OA, p < 0.01. By contrast, the immunoexpression and the mRNA levels of CHI3L1 increased in OA cartilage and decreased in normal cartilage (normal vs. OA, p < 0.01. Our findings are consistent with reports suggesting that these two glycoproteins are functionally associated with the development of OA and in particular with grade 2/3 of OA, suggesting that in the future they could be helpful to stage the severity and progression of the disease.

  20. Gene Expression Analysis in Tubule Interstitial Compartments Reveals Candidate Agents for IgA Nephropathy

    Directory of Open Access Journals (Sweden)

    Jinling Wang

    2014-09-01

    Full Text Available Background/Aims: Our aim was to explore the molecular mechanism underlying development of IgA nephropathy and discover candidate agents for IgA nephropathy. Methods: The differentially expressed genes (DEGs between patients with IgA nephropathy and normal controls were identified by the data of GSE35488 downloaded from GEO (Gene Expression Omnibus database. The co-expressed gene pairs among DEGs were screened to construct the gene-gene interaction network. Gene Ontology (GO enrichment analysis was performed to analyze the functions of DEGs. The biologically active small molecules capable of targeting IgA nephropathy were identified using the Connectivity Map (cMap database. Results: A total of 55 genes involved in response to organic substance, transcription factor activity and response to steroid hormone stimulus were identified to be differentially expressed in IgA nephropathy patients compared to healthy individuals. A network with 45 co-expressed gene pairs was constructed. DEGs in the network were significantly enriched in response to organic substance. Additionally, a group of small molecules were identified, such as doxorubicin and thapsigargin. Conclusion: Our work provided a systematic insight in understanding the mechanism of IgA nephropathy. Small molecules such as thapsigargin might be potential candidate agents for the treatment of IgA nephropathy.

  1. [Evolutionary medicine: A new look on health and disease].

    Science.gov (United States)

    Bauduer, F

    2017-03-01

    Evolutionary medicine represents an innovative approach deriving from evolutionary biology. It includes the initial Darwin's view, its actualization in the light of progresses in genetics and also dissident theories (i.e. non gene-based) particularly epigenetics. This approach enables us to reconsider the pathophysiology of numerous diseases, as for instance, infection, and our so-called diseases of civilization especially obesity, type 2 diabetes, allergy or cancer. Evolutionary medicine may also improve our knowledge regarding inter-individual variation in susceptibility to disease or drugs. Furthermore, it points out the impact of our behaviors and environment on the genesis of a series of diseases. Copyright © 2016 Société Nationale Française de Médecine Interne (SNFMI). Published by Elsevier SAS. All rights reserved.

  2. Comparison of loline alkaloid gene clusters across fungal endophytes: predicting the co-regulatory sequence motifs and the evolutionary history.

    Science.gov (United States)

    Kutil, Brandi L; Greenwald, Charles; Liu, Gang; Spiering, Martin J; Schardl, Christopher L; Wilkinson, Heather H

    2007-10-01

    LOL, a fungal secondary metabolite gene cluster found in Epichloë and Neotyphodium species, is responsible for production of insecticidal loline alkaloids. To analyze the genetic architecture and to predict the evolutionary history of LOL, we compared five clusters from four fungal species (single clusters from Epichloë festucae, Neotyphodium sp. PauTG-1, Neotyphodium coenophialum, and two clusters we previously characterized in Neotyphodium uncinatum). Using PhyloCon to compare putative lol gene promoter regions, we have identified four motifs conserved across the lol genes in all five clusters. Each motif has significant similarity to known fungal transcription factor binding sites in the TRANSFAC database. Conservation of these motifs is further support for the hypothesis that the lol genes are co-regulated. Interestingly, the history of asexual Neotyphodium spp. includes multiple interspecific hybridization events. Comparing clusters from three Neotyphodium species and E. festucae allowed us to determine which Epichloë ancestors are the most likely contributors of LOL in these asexual species. For example, while no present day Epichloë typhina isolates are known to produce lolines, our data support the hypothesis that the E. typhina ancestor(s) of three asexual endophyte species contained a LOL gene cluster. Thus, these data support a model of evolution in which the polymorphism in loline alkaloid production phenotypes among endophyte species is likely due to the loss of the trait over time.

  3. Phylogeny and phylogeography of functional genes shared among seven terrestrial subsurface metagenomes reveal N-cycling and microbial evolutionary relationships

    Directory of Open Access Journals (Sweden)

    Maggie CY Lau

    2014-10-01

    Full Text Available Comparative studies on community phylogenetics and phylogeography of microorganisms living in extreme environments are rare. Terrestrial subsurface habitats are valuable for studying microbial biogeographical patterns due to their isolation and the restricted dispersal mechanisms. Since the taxonomic identity of a microorganism does not always correspond well with its functional role in a particular community, the use of taxonomic assignments or patterns may give limited inference on how microbial functions are affected by historical, geographical and environmental factors. With seven metagenomic libraries generated from fracture water samples collected from five South African mines, this study was carried out to (1 screen for ubiquitous functions or pathways of biogeochemical cycling of CH4, S and N; (2 to characterize the biodiversity represented by the common functional genes; (3 to investigate the subsurface biogeography as revealed by this subset of genes; and (4 to explore the possibility of using metagenomic data for evolutionary study. The ubiquitous functional genes are NarV, NPD, PAP reductase, NifH, NifD, NifK, NifE and NifN genes. Although these 8 common functional genes were taxonomically and phylogenetically diverse and distinct from each other, the dissimilarity between samples did not correlate strongly with either geographical, environmental or residence time of the water. Por genes homologous to those of Thermodesulfovibrio yellowstonii detected in all metagenomes were deep lineages of Nitrospirae, suggesting that subsurface habitats have preserved ancestral genetic signatures that inform the study of the origin and evolution of prokaryotes.

  4. NorWood: a gene expression resource for evo-devo studies of conifer wood development.

    Science.gov (United States)

    Jokipii-Lukkari, Soile; Sundell, David; Nilsson, Ove; Hvidsten, Torgeir R; Street, Nathaniel R; Tuominen, Hannele

    2017-10-01

    The secondary xylem of conifers is composed mainly of tracheids that differ anatomically and chemically from angiosperm xylem cells. There is currently no high-spatial-resolution data available profiling gene expression during wood formation for any coniferous species, which limits insight into tracheid development. RNA-sequencing data from replicated, high-spatial-resolution section series throughout the cambial and woody tissues of Picea abies were used to generate the NorWood.conGenIE.org web resource, which facilitates exploration of the associated gene expression profiles and co-expression networks. Integration within PlantGenIE.org enabled a comparative regulomics analysis, revealing divergent co-expression networks between P. abies and the two angiosperm species Arabidopsis thaliana and Populus tremula for the secondary cell wall (SCW) master regulator NAC Class IIB transcription factors. The SCW cellulose synthase genes (CesAs) were located in the neighbourhoods of the NAC factors in A. thaliana and P. tremula, but not in P. abies. The NorWood co-expression network enabled identification of potential SCW CesA regulators in P. abies. The NorWood web resource represents a powerful community tool for generating evo-devo insights into the divergence of wood formation between angiosperms and gymnosperms and for advancing understanding of the regulation of wood development in P. abies. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  5. Darwin in Mind: New Opportunities for Evolutionary Psychology

    Science.gov (United States)

    Bolhuis, Johan J.; Brown, Gillian R.; Richardson, Robert C.; Laland, Kevin N.

    2011-01-01

    Evolutionary Psychology (EP) views the human mind as organized into many modules, each underpinned by psychological adaptations designed to solve problems faced by our Pleistocene ancestors. We argue that the key tenets of the established EP paradigm require modification in the light of recent findings from a number of disciplines, including human genetics, evolutionary biology, cognitive neuroscience, developmental psychology, and paleoecology. For instance, many human genes have been subject to recent selective sweeps; humans play an active, constructive role in co-directing their own development and evolution; and experimental evidence often favours a general process, rather than a modular account, of cognition. A redefined EP could use the theoretical insights of modern evolutionary biology as a rich source of hypotheses concerning the human mind, and could exploit novel methods from a variety of adjacent research fields. PMID:21811401

  6. Home and away- the evolutionary dynamics of homing endonucleases

    Directory of Open Access Journals (Sweden)

    Barzel Adi

    2011-11-01

    Full Text Available Abstract Background Homing endonucleases (HEases are a large and diverse group of site-specific DNAases. They reside within self-splicing introns and inteins, and promote their horizontal dissemination. In recent years, HEases have been the focus of extensive research due to their promising potential use in gene targeting procedures for the treatment of genetic diseases and for the genetic engineering of crop, animal models and cell lines. Results Using mathematical analysis and computational modeling, we present here a novel account for the evolution and population dynamics of HEase genes (HEGs. We describe HEGs as paradoxical selfish elements whose long-term persistence in a single population relies on low transmission rates and a positive correlation between transmission efficiency and toxicity. Conclusion Plausible conditions allow HEGs to sustain at high frequency through long evolutionary periods, with the endonuclease frequency being either at equilibrium or periodically oscillating. The predictions of our model may prove important not only for evolutionary theory but also for gene therapy and bio-engineering applications of HEases.

  7. Tracing the evolutionary origin of vertebrate skeletal tissues: insights from cephalochordate amphioxus.

    Science.gov (United States)

    Yong, Luok Wen; Yu, Jr-Kai

    2016-08-01

    Vertebrate mineralized skeletal tissues are widely considered as an evolutionary novelty. Despite the importance of these tissues to the adaptation and radiation of vertebrate animals, the evolutionary origin of vertebrate skeletal tissues remains largely unclear. Cephalochordates (Amphioxus) occupy a key phylogenetic position and can serve as a valuable model for studying the evolution of vertebrate skeletal tissues. Here we summarize recent advances in amphioxus developmental biology and comparative genomics that can help to elucidate the evolutionary origins of the vertebrate skeletal tissues and their underlying developmental gene regulatory networks (GRN). By making comparisons to the developmental studies in vertebrate models and recent discoveries in paleontology and genomics, it becomes evident that the collagen matrix-based connective tissues secreted by the somite-derived cells in amphioxus likely represent the rudimentary skeletal tissues in chordates. We propose that upon the foundation of this collagenous precursor, novel tissue mineralization genes that arose from gene duplications were incorporated into an ancestral mesodermal GRN that makes connective and supporting tissues, leading to the emergence of highly-mineralized skeletal tissues in early vertebrates. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Coexpression of the high molecular weight glutenin subunit 1Ax1 and puroindoline improves dough mixing properties in durum wheat (Triticum turgidum L. ssp. durum).

    Science.gov (United States)

    Li, Yin; Wang, Qiong; Li, Xiaoyan; Xiao, Xin; Sun, Fusheng; Wang, Cheng; Hu, Wei; Feng, Zhijuan; Chang, Junli; Chen, Mingjie; Wang, Yuesheng; Li, Kexiu; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    Wheat end-use quality mainly derives from two interrelated characteristics: the compositions of gluten proteins and grain hardness. The composition of gluten proteins determines dough rheological properties and thus confers the unique viscoelastic property on dough. One group of gluten proteins, high molecular weight glutenin subunits (HMW-GS), plays an important role in dough functional properties. On the other hand, grain hardness, which influences the milling process of flour, is controlled by Puroindoline a (Pina) and Puroindoline b (Pinb) genes. However, little is known about the combined effects of HMW-GS and PINs on dough functional properties. In this study, we crossed a Pina-expressing transgenic line with a 1Ax1-expressing line of durum wheat and screened out lines coexpressing 1Ax1 and Pina or lines expressing either 1Ax1 or Pina. Dough mixing analysis of these lines demonstrated that expression of 1Ax1 improved both dough strength and over-mixing tolerance, while expression of PINA detrimentally affected the dough resistance to extension. In lines coexpressing 1Ax1 and Pina, faster hydration of flour during mixing was observed possibly due to the lower water absorption and damaged starch caused by PINA expression. In addition, expression of 1Ax1 appeared to compensate the detrimental effect of PINA on dough resistance to extension. Consequently, coexpression of 1Ax1 and PINA in durum wheat had combined effects on dough mixing behaviors with a better dough strength and resistance to extension than those from lines expressing either 1Ax1 or Pina. The results in our study suggest that simultaneous modulation of dough strength and grain hardness in durum wheat could significantly improve its breadmaking quality and may not even impair its pastamaking potential. Therefore, coexpression of 1Ax1 and PINA in durum wheat has useful implications for breeding durum wheat with dual functionality (for pasta and bread) and may improve the economic values of durum

  9. Coexpression of the high molecular weight glutenin subunit 1Ax1 and puroindoline improves dough mixing properties in durum wheat (Triticum turgidum L. ssp. durum.

    Directory of Open Access Journals (Sweden)

    Yin Li

    Full Text Available Wheat end-use quality mainly derives from two interrelated characteristics: the compositions of gluten proteins and grain hardness. The composition of gluten proteins determines dough rheological properties and thus confers the unique viscoelastic property on dough. One group of gluten proteins, high molecular weight glutenin subunits (HMW-GS, plays an important role in dough functional properties. On the other hand, grain hardness, which influences the milling process of flour, is controlled by Puroindoline a (Pina and Puroindoline b (Pinb genes. However, little is known about the combined effects of HMW-GS and PINs on dough functional properties. In this study, we crossed a Pina-expressing transgenic line with a 1Ax1-expressing line of durum wheat and screened out lines coexpressing 1Ax1 and Pina or lines expressing either 1Ax1 or Pina. Dough mixing analysis of these lines demonstrated that expression of 1Ax1 improved both dough strength and over-mixing tolerance, while expression of PINA detrimentally affected the dough resistance to extension. In lines coexpressing 1Ax1 and Pina, faster hydration of flour during mixing was observed possibly due to the lower water absorption and damaged starch caused by PINA expression. In addition, expression of 1Ax1 appeared to compensate the detrimental effect of PINA on dough resistance to extension. Consequently, coexpression of 1Ax1 and PINA in durum wheat had combined effects on dough mixing behaviors with a better dough strength and resistance to extension than those from lines expressing either 1Ax1 or Pina. The results in our study suggest that simultaneous modulation of dough strength and grain hardness in durum wheat could significantly improve its breadmaking quality and may not even impair its pastamaking potential. Therefore, coexpression of 1Ax1 and PINA in durum wheat has useful implications for breeding durum wheat with dual functionality (for pasta and bread and may improve the economic

  10. Chromosome-Centric Human Proteome Project Allies with Developmental Biology: A Case Study of the Role of Y Chromosome Genes in Organ Development.

    Science.gov (United States)

    Meyfour, Anna; Pooyan, Paria; Pahlavan, Sara; Rezaei-Tavirani, Mostafa; Gourabi, Hamid; Baharvand, Hossein; Salekdeh, Ghasem Hosseini

    2017-12-01

    One of the main goals of Chromosome-Centric Human Proteome Project is to identify protein evidence for missing proteins (MPs). Here, we present a case study of the role of Y chromosome genes in organ development and how to overcome the challenges facing MPs identification by employing human pluripotent stem cell differentiation into cells of different organs yielding unprecedented biological insight into adult silenced proteins. Y chromosome is a male-specific sex chromosome which escapes meiotic recombination. From an evolutionary perspective, Y chromosome has preserved 3% of ancestral genes compared to 98% preservation of the X chromosome based on Ohno's law. Male specific region of Y chromosome (MSY) contains genes that contribute to central dogma and govern the expression of various targets throughout the genome. One of the most well-known functions of MSY genes is to decide the male-specific characteristics including sex, testis formation, and spermatogenesis, which are majorly formed by ampliconic gene families. Beyond its role in sex-specific gonad development, MSY genes in coexpression with their X counterparts, as single copy and broadly expressed genes, inhibit haplolethality and play a key role in embryogenesis. The role of X-Y related gene mutations in the development of hereditary syndromes suggests an essential contribution of sex chromosome genes to development. MSY genes, solely and independent of their X counterparts and/or in association with sex hormones, have a considerable impact on organ development. In this Review, we present major recent findings on the contribution of MSY genes to gonad formation, spermatogenesis, and the brain, heart, and kidney development and discuss how Y chromosome proteome project may exploit developmental biology to find missing proteins.

  11. Polycistronic gene expression in Aspergillus niger.

    Science.gov (United States)

    Schuetze, Tabea; Meyer, Vera

    2017-09-25

    Genome mining approaches predict dozens of biosynthetic gene clusters in each of the filamentous fungal genomes sequenced so far. However, the majority of these gene clusters still remain cryptic because they are not expressed in their natural host. Simultaneous expression of all genes belonging to a biosynthetic pathway in a heterologous host is one approach to activate biosynthetic gene clusters and to screen the metabolites produced for bioactivities. Polycistronic expression of all pathway genes under control of a single and tunable promoter would be the method of choice, as this does not only simplify cloning procedures, but also offers control on timing and strength of expression. However, polycistronic gene expression is a feature not commonly found in eukaryotic host systems, such as Aspergillus niger. In this study, we tested the suitability of the viral P2A peptide for co-expression of three genes in A. niger. Two genes descend from Fusarium oxysporum and are essential to produce the secondary metabolite enniatin (esyn1, ekivR). The third gene (luc) encodes the reporter luciferase which was included to study position effects. Expression of the polycistronic gene cassette was put under control of the Tet-On system to ensure tunable gene expression in A. niger. In total, three polycistronic expression cassettes which differed in the position of luc were constructed and targeted to the pyrG locus in A. niger. This allowed direct comparison of the luciferase activity based on the position of the luciferase gene. Doxycycline-mediated induction of the Tet-On expression cassettes resulted in the production of one long polycistronic mRNA as proven by Northern analyses, and ensured comparable production of enniatin in all three strains. Notably, gene position within the polycistronic expression cassette matters, as, luciferase activity was lowest at position one and had a comparable activity at positions two and three. The P2A peptide can be used to express at

  12. The four cornerstones of Evolutionary Toxicology.

    Science.gov (United States)

    Bickham, John W

    2011-05-01

    Evolutionary Toxicology is the study of the effects of chemical pollutants on the genetics of natural populations. Research in Evolutionary Toxicology uses experimental designs familiar to the ecotoxicologist with matched reference and contaminated sites and the selection of sentinel species. It uses the methods of molecular genetics and population genetics, and is based on the theories and concepts of evolutionary biology and conservation genetics. Although it is a relatively young field, interest is rapidly growing among ecotoxicologists and more and more field studies and even controlled laboratory experiments are appearing in the literature. A number of population genetic impacts have been observed in organisms exposed to pollutants which I refer to here as the four cornerstones of Evolutionary Toxicology. These include (1) genome-wide changes in genetic diversity, (2) changes in allelic or genotypic frequencies caused by contaminant-induced selection acting at survivorship loci, (3) changes in dispersal patterns or gene flow which alter the genetic relationships among populations, and (4) changes in allelic or genotypic frequencies caused by increased mutation rates. It is concluded that population genetic impacts of pollution exposure are emergent effects that are not necessarily predictable from the mode of toxicity of the pollutant. Thus, to attribute an effect to a particular contaminant requires a careful experimental design which includes selection of appropriate reference sites, detailed chemistry analyses of environmental samples and tissues, and the use of appropriate biomarkers to establish exposure and effect. This paper describes the field of Evolutionary Toxicology and discusses relevant field studies and their findings. © Springer Science+Business Media, LLC 2011

  13. Human Management of a Wild Plant Modulates the Evolutionary Dynamics of a Gene Determining Recessive Resistance to Virus Infection.

    Science.gov (United States)

    Poulicard, Nils; Pacios, Luis Fernández; Gallois, Jean-Luc; Piñero, Daniel; García-Arenal, Fernando

    2016-08-01

    This work analyses the genetic variation and evolutionary patterns of recessive resistance loci involved in matching-allele (MA) host-pathogen interactions, focusing on the pvr2 resistance gene to potyviruses of the wild pepper Capsicum annuum glabriusculum (chiltepin). Chiltepin grows in a variety of wild habitats in Mexico, and its cultivation in home gardens started about 25 years ago. Potyvirus infection of Capsicum plants requires the physical interaction of the viral VPg with the pvr2 product, the translation initiation factor eIF4E1. Mutations impairing this interaction result in resistance, according to the MA model. The diversity of pvr2/eIF4E1 in wild and cultivated chiltepin populations from six biogeographical provinces in Mexico was analysed in 109 full-length coding sequences from 97 plants. Eleven alleles were found, and their interaction with potyvirus VPg in yeast-two-hybrid assays, plus infection assays of plants, identified six resistance alleles. Mapping resistance mutations on a pvr2/eIF4E1 model structure showed that most were around the cap-binding pocket and strongly altered its surface electrostatic potential, suggesting resistance-associated costs due to functional constraints. The pvr2/eIF4E1 phylogeny established that susceptibility was ancestral and resistance was derived. The spatial structure of pvr2/eIF4E1 diversity differed from that of neutral markers, but no evidence of selection for resistance was found in wild populations. In contrast, the resistance alleles were much more frequent, and positive selection stronger, in cultivated chiltepin populations, where diversification of pvr2/eIF4E1 was higher. This analysis of the genetic variation of a recessive resistance gene involved in MA host-pathogen interactions in populations of a wild plant show that evolutionary patterns differ according to the plant habitat, wild or cultivated. It also demonstrates that human management of the plant population has profound effects on the

  14. Human Management of a Wild Plant Modulates the Evolutionary Dynamics of a Gene Determining Recessive Resistance to Virus Infection.

    Directory of Open Access Journals (Sweden)

    Nils Poulicard

    2016-08-01

    Full Text Available This work analyses the genetic variation and evolutionary patterns of recessive resistance loci involved in matching-allele (MA host-pathogen interactions, focusing on the pvr2 resistance gene to potyviruses of the wild pepper Capsicum annuum glabriusculum (chiltepin. Chiltepin grows in a variety of wild habitats in Mexico, and its cultivation in home gardens started about 25 years ago. Potyvirus infection of Capsicum plants requires the physical interaction of the viral VPg with the pvr2 product, the translation initiation factor eIF4E1. Mutations impairing this interaction result in resistance, according to the MA model. The diversity of pvr2/eIF4E1 in wild and cultivated chiltepin populations from six biogeographical provinces in Mexico was analysed in 109 full-length coding sequences from 97 plants. Eleven alleles were found, and their interaction with potyvirus VPg in yeast-two-hybrid assays, plus infection assays of plants, identified six resistance alleles. Mapping resistance mutations on a pvr2/eIF4E1 model structure showed that most were around the cap-binding pocket and strongly altered its surface electrostatic potential, suggesting resistance-associated costs due to functional constraints. The pvr2/eIF4E1 phylogeny established that susceptibility was ancestral and resistance was derived. The spatial structure of pvr2/eIF4E1 diversity differed from that of neutral markers, but no evidence of selection for resistance was found in wild populations. In contrast, the resistance alleles were much more frequent, and positive selection stronger, in cultivated chiltepin populations, where diversification of pvr2/eIF4E1 was higher. This analysis of the genetic variation of a recessive resistance gene involved in MA host-pathogen interactions in populations of a wild plant show that evolutionary patterns differ according to the plant habitat, wild or cultivated. It also demonstrates that human management of the plant population has profound

  15. [Evolutionary medicine: an introduction. Evolutionary biology, a missing element in medical teaching].

    Science.gov (United States)

    Swynghedauw, Bernard

    2009-05-01

    The aim of this brief review article is to help to reconcile medicine with evolutionary biology, a subject that should be taught in medical school. Evolutionary medicine takes the view that contemporary ills are related to an incompatibility between the environment in which humans currently live and their genomes, which have been shaped by diferent environmental conditions during biological evolution. Human activity has recently induced acute environmental modifications that have profoundly changed the medical landscape. Evolutionary biology is an irreversible, ongoing and discontinuous process characterized by periods of stasis followed by accelerations. Evolutionary biology is determined by genetic mutations, which are selected either by Darwinian selective pressure or randomly by genetic drift. Most medical events result from a genome/environment conflict. Some may be purely genetic, as in monogenic diseases, and others purely environmental, such as traffic accidents. Nevertheless, in most common diseases the clinical landscape is determined by the conflict between these two factors, the genetic elements of which are gradually being unraveled Three examples are examined in depth:--The medical consequences of the greenhouse effect. The absence of excess mortality during recent heat waves suggests that the main determinant of mortality in the 2003 heatwave was heatstroke and old age. The projected long-term effects of global warming call for research on thermolysis, a forgotten branch of physiology.--The hygiene hypothesis postulates that the exponential rise in autoimmune and allergic diseases is linked to lesser exposure to infectious agents, possibly involving counter-regulatory factors such as IL-10.--The recent rise in the incidence of obesity and type 2 diabetes in rich countries can be considered to result from a conflict between a calorie-rich environment and gene variants that control appetite. These variants are currently being identified by genome

  16. Comparative gene expression between two yeast species

    Directory of Open Access Journals (Sweden)

    Guan Yuanfang

    2013-01-01

    Full Text Available Abstract Background Comparative genomics brings insight into sequence evolution, but even more may be learned by coupling sequence analyses with experimental tests of gene function and regulation. However, the reliability of such comparisons is often limited by biased sampling of expression conditions and incomplete knowledge of gene functions across species. To address these challenges, we previously systematically generated expression profiles in Saccharomyces bayanus to maximize functional coverage as compared to an existing Saccharomyces cerevisiae data repository. Results In this paper, we take advantage of these two data repositories to compare patterns of ortholog expression in a wide variety of conditions. First, we developed a scalable metric for expression divergence that enabled us to detect a significant correlation between sequence and expression conservation on the global level, which previous smaller-scale expression studies failed to detect. Despite this global conservation trend, between-species gene expression neighborhoods were less well-conserved than within-species comparisons across different environmental perturbations, and approximately 4% of orthologs exhibited a significant change in co-expression partners. Furthermore, our analysis of matched perturbations collected in both species (such as diauxic shift and cell cycle synchrony demonstrated that approximately a quarter of orthologs exhibit condition-specific expression pattern differences. Conclusions Taken together, these analyses provide a global view of gene expression patterns between two species, both in terms of the conditions and timing of a gene's expression as well as co-expression partners. Our results provide testable hypotheses that will direct future experiments to determine how these changes may be specified in the genome.

  17. Extracting Gene Networks for Low-Dose Radiation Using Graph Theoretical Algorithms

    Energy Technology Data Exchange (ETDEWEB)

    Voy, Brynn H [ORNL; Scharff, Jon [University of Tennessee, Knoxville (UTK); Perkins, Andy [University of Tennessee, Knoxville (UTK); Saxton, Arnold [University of Tennessee, Knoxville (UTK); Borate, Bhavesh [University of Tennessee, Knoxville (UTK); Chesler, Elissa J [ORNL; Branstetter, Lisa R [ORNL; Langston, Michael A [University of Tennessee, Knoxville (UTK)

    2006-01-01

    Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most relevant gene interactions. We describe a graph theoretical approach to extracting co-expressed sets of genes, based on the computation of cliques. Unlike the results of traditional clustering algorithms, cliques are not disjoint and allow genes to be assigned to multiple sets of interacting partners, consistent with biological reality. A graph is created by thresholding the correlation matrix to include only the correlations most likely to signify functional relationships. Cliques computed from the graph correspond to sets of genes for which significant edges are present between all members of the set, representing potential members of common or interacting pathways. Clique membership can be used to infer function about poorly annotated genes, based on the known functions of better-annotated genes with which they share clique membership (i.e., ''guilt-by-association''). We illustrate our method by applying it to microarray data collected from the spleens of mice exposed to low-dose ionizing radiation. Differential analysis is used to identify sets of genes whose interactions are impacted by radiation exposure. The correlation graph is also queried independently of clique to extract edges that are impacted by radiation. We present several examples of multiple gene interactions that are altered by radiation exposure and thus represent potential molecular pathways that mediate the radiation response.

  18. Extracting gene networks for low-dose radiation using graph theoretical algorithms.

    Directory of Open Access Journals (Sweden)

    Brynn H Voy

    2006-07-01

    Full Text Available Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most relevant gene interactions. We describe a graph theoretical approach to extracting co-expressed sets of genes, based on the computation of cliques. Unlike the results of traditional clustering algorithms, cliques are not disjoint and allow genes to be assigned to multiple sets of interacting partners, consistent with biological reality. A graph is created by thresholding the correlation matrix to include only the correlations most likely to signify functional relationships. Cliques computed from the graph correspond to sets of genes for which significant edges are present between all members of the set, representing potential members of common or interacting pathways. Clique membership can be used to infer function about poorly annotated genes, based on the known functions of better-annotated genes with which they share clique membership (i.e., "guilt-by-association". We illustrate our method by applying it to microarray data collected from the spleens of mice exposed to low-dose ionizing radiation. Differential analysis is used to identify sets of genes whose interactions are impacted by radiation exposure. The correlation graph is also queried independently of clique to extract edges that are impacted by radiation. We present several examples of multiple gene interactions that are altered by radiation exposure and thus represent potential molecular pathways that mediate the radiation response.

  19. Exploring the Genomic Roadmap and Molecular Phylogenetics Associated with MODY Cascades Using Computational Biology.

    Science.gov (United States)

    Chakraborty, Chiranjib; Bandyopadhyay, Sanghamitra; Doss, C George Priya; Agoramoorthy, Govindasamy

    2015-04-01

    Maturity onset diabetes of the young (MODY) is a metabolic and genetic disorder. It is different from type 1 and type 2 diabetes with low occurrence level (1-2%) among all diabetes. This disorder is a consequence of β-cell dysfunction. Till date, 11 subtypes of MODY have been identified, and all of them can cause gene mutations. However, very little is known about the gene mapping, molecular phylogenetics, and co-expression among MODY genes and networking between cascades. This study has used latest servers and software such as VarioWatch, ClustalW, MUSCLE, G Blocks, Phylogeny.fr, iTOL, WebLogo, STRING, and KEGG PATHWAY to perform comprehensive analyses of gene mapping, multiple sequences alignment, molecular phylogenetics, protein-protein network design, co-expression analysis of MODY genes, and pathway development. The MODY genes are located in chromosomes-2, 7, 8, 9, 11, 12, 13, 17, and 20. Highly aligned block shows Pro, Gly, Leu, Arg, and Pro residues are highly aligned in the positions of 296, 386, 437, 455, 456 and 598, respectively. Alignment scores inform us that HNF1A and HNF1B proteins have shown high sequence similarity among MODY proteins. Protein-protein network design shows that HNF1A, HNF1B, HNF4A, NEUROD1, PDX1, PAX4, INS, and GCK are strongly connected, and the co-expression analyses between MODY genes also show distinct association between HNF1A and HNF4A genes. This study has used latest tools of bioinformatics to develop a rapid method to assess the evolutionary relationship, the network development, and the associations among eleven MODY genes and cascades. The prediction of sequence conservation, molecular phylogenetics, protein-protein network and the association between the MODY cascades enhances opportunities to get more insights into the less-known MODY disease.

  20. Characterization of claustral neurons by comparative gene expression profiling and dye-injection analyses

    Directory of Open Access Journals (Sweden)

    Akiya eWatakabe

    2014-05-01

    Full Text Available The identity of the claustrum as a part of cerebral cortex, and in particular of the adjacent insular cortex, has been investigated by connectivity features and patterns of gene expression. In the present paper, we mapped the cortical and claustral expression of several cortical genes in rodent and macaque monkey brains (nurr1, latexin, cux2, and netrinG2 to further assess shared features between cortex and claustrum. In mice, these genes were densely expressed in the claustrum, but very sparsely in the cortex and not present in the striatum. To test whether the cortical vs. claustral cell types can be distinguished by co-expression of these genes, we performed a panel of double ISH in mouse and macaque brain. NetrinG2 and nurr1 genes were co-expressed across entire cortex and claustrum, but cux2 and nurr1 were co-expressed only in the insular cortex and claustrum. Latexin was expressed, in the macaque, only in the claustrum. The nurr1+ claustral neurons expressed VGluT1, a marker for cortical glutamatergic cells and send cortical projections. Taken together, our data suggest a partial commonality between claustral neurons and a subtype of cortical neurons in the monkey brain. Moreover, in the embryonic (E110 macaque brain, many nurr1+ neurons were scattered in the white matter between the claustrum and the insular cortex, possibly representing their migratory history. In a second set of experiments, we injected Lucifer Yellow intracellularly in mouse and rat slices to investigate whether dendrites of insular and claustral neurons can cross the border of the two brain regions. Dendrites of claustral neurons did not invade the overlying insular territory. In summary, gene expression profile of the claustrum is similar to that of the neocortex, in both rodent and macaque brains, but with modifications in density of expression and cellular co-localization of specific genes.

  1. Identification of estrogen receptor dimer selective ligands reveals growth-inhibitory effects on cells that co-express ERα and ERβ.

    Directory of Open Access Journals (Sweden)

    Emily Powell

    Full Text Available Estrogens play essential roles in the progression of mammary and prostatic diseases. The transcriptional effects of estrogens are transduced by two estrogen receptors, ERα and ERβ, which elicit opposing roles in regulating proliferation: ERα is proliferative while ERβ is anti-proliferative. Exogenous expression of ERβ in ERα-positive cancer cell lines inhibits cell proliferation in response to estrogen and reduces xenografted tumor growth in vivo, suggesting that ERβ might oppose ERα's proliferative effects via formation of ERα/β heterodimers. Despite biochemical and cellular evidence of ERα/β heterodimer formation in cells co-expressing both receptors, the biological roles of the ERα/β heterodimer remain to be elucidated. Here we report the identification of two phytoestrogens that selectively activate ERα/β heterodimers at specific concentrations using a cell-based, two-step high throughput small molecule screen for ER transcriptional activity and ER dimer selectivity. Using ERα/β heterodimer-selective ligands at defined concentrations, we demonstrate that ERα/β heterodimers are growth inhibitory in breast and prostate cells which co-express the two ER isoforms. Furthermore, using Automated Quantitative Analysis (AQUA to examine nuclear expression of ERα and ERβ in human breast tissue microarrays, we demonstrate that ERα and ERβ are co-expressed in the same cells in breast tumors. The co-expression of ERα and ERβ in the same cells supports the possibility of ERα/β heterodimer formation at physio- and pathological conditions, further suggesting that targeting ERα/β heterodimers might be a novel therapeutic approach to the treatment of cancers which co-express ERα and ERβ.

  2. Holistic Darwinism: the new evolutionary paradigm and some implications for political science.

    Science.gov (United States)

    Corning, Peter A

    2008-03-01

    Holistic Darwinism is a candidate name for a major paradigm shift that is currently underway in evolutionary biology and related disciplines. Important developments include (1) a growing appreciation for the fact that evolution is a multilevel process, from genes to ecosystems, and that interdependent coevolution is a ubiquitous phenomenon in nature; (2) a revitalization of group selection theory, which was banned (prematurely) from evolutionary biology over 30 years ago (groups may in fact be important evolutionary units); (3) a growing respect for the fact that the genome is not a "bean bag" (in biologist Ernst Mayr's caricature), much less a gladiatorial arena for competing selfish genes, but a complex, interdependent, cooperating system; (4) an increased recognition that symbiosis is an important phenomenon in nature and that symbiogenesis is a major source of innovation in evolution; (5) an array of new, more advanced game theory models, which support the growing evidence that cooperation is commonplace in nature and not a rare exception; (6) new research and theoretical work that stresses the role of nurture in evolution, including developmental processes, phenotypic plasticity, social information transfer (culture), and especially the role of behavioral innovations as pacemakers of evolutionary change (e.g., niche construction theory, which is concerned with the active role of organisms in shaping the evolutionary process, and gene-culture coevolution theory, which relates especially to the dynamics of human evolution); (7) and, not least, a broad effort to account for the evolution of biological complexity--from major transition theory to the "Synergism Hypothesis." Here I will briefly review these developments and will present a case for the proposition that this paradigm shift has profound implications for the social sciences, including specifically political theory, economic theory, and political science as a discipline. Interdependent superorganisms, it

  3. Horizontal transfer of short and degraded DNA has evolutionary implications for microbes and eukaryotic sexual reproduction.

    Science.gov (United States)

    Overballe-Petersen, Søren; Willerslev, Eske

    2014-10-01

    Horizontal gene transfer in the form of long DNA fragments has changed our view of bacterial evolution. Recently, we discovered that such processes may also occur with the massive amounts of short and damaged DNA in the environment, and even with truly ancient DNA. Although it presently remains unclear how often it takes place in nature, horizontal gene transfer of short and damaged DNA opens up the possibility for genetic exchange across distinct species in both time and space. In this essay, we speculate on the potential evolutionary consequences of this phenomenon. We argue that it may challenge basic assumptions in evolutionary theory; that it may have distant origins in life's history; and that horizontal gene transfer should be viewed as an evolutionary strategy not only preceding but causally underpinning the evolution of sexual reproduction. © 2014 The Authors. BioEssays Published by WILEY Periodicals, Inc.

  4. Functional modules by relating protein interaction networks and gene expression.

    Science.gov (United States)

    Tornow, Sabine; Mewes, H W

    2003-11-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.

  5. Genomicus 2018: karyotype evolutionary trees and on-the-fly synteny computing.

    Science.gov (United States)

    Nguyen, Nga Thi Thuy; Vincens, Pierre; Roest Crollius, Hugues; Louis, Alexandra

    2018-01-04

    Since 2010, the Genomicus web server is available online at http://genomicus.biologie.ens.fr/genomicus. This graphical browser provides access to comparative genomic analyses in four different phyla (Vertebrate, Plants, Fungi, and non vertebrate Metazoans). Users can analyse genomic information from extant species, as well as ancestral gene content and gene order for vertebrates and flowering plants, in an integrated evolutionary context. New analyses and visualization tools have recently been implemented in Genomicus Vertebrate. Karyotype structures from several genomes can now be compared along an evolutionary pathway (Multi-KaryotypeView), and synteny blocks can be computed and visualized between any two genomes (PhylDiagView). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Regulatory RNA design through evolutionary computation and strand displacement.

    Science.gov (United States)

    Rostain, William; Landrain, Thomas E; Rodrigo, Guillermo; Jaramillo, Alfonso

    2015-01-01

    The discovery and study of a vast number of regulatory RNAs in all kingdoms of life over the past decades has allowed the design of new synthetic RNAs that can regulate gene expression in vivo. Riboregulators, in particular, have been used to activate or repress gene expression. However, to accelerate and scale up the design process, synthetic biologists require computer-assisted design tools, without which riboregulator engineering will remain a case-by-case design process requiring expert attention. Recently, the design of RNA circuits by evolutionary computation and adapting strand displacement techniques from nanotechnology has proven to be suited to the automated generation of DNA sequences implementing regulatory RNA systems in bacteria. Herein, we present our method to carry out such evolutionary design and how to use it to create various types of riboregulators, allowing the systematic de novo design of genetic control systems in synthetic biology.

  7. Evolutionary reversion of editing sites of ndh genes suggests their origin in the Permian-Triassic, before the increase of atmospheric CO2

    Directory of Open Access Journals (Sweden)

    Mercedes eMartin

    2015-07-01

    Full Text Available The plastid ndh genes have hovered frequently on the edge of dispensability. They are absent in the plastid DNA of many algae and certain higher plants and present editing sites requiring C-to-U corrections of primary transcripts. The evolutionary origin of editing sites and their loss due to C-to-T reversions at the DNA level are unknown and must be related to the dispensability of the ndh genes in specific environments. In order to better understand the evolution of ndh gene editing sites, we have created expandable data banks with the 12 editing sites of the ndhB gene (600 GenBank sequences and both editing sites of the ndhF gene (1,600 GenBank sequences. Since their origin via T-to-C mutations that probably occurred between 300 and 200 Myr BP (Permian-Triassic, ndh editing sites have undergone independent and random C-to-T reversions in the different angiosperm lineages. Some of these reversions appear early in angiosperm diversification. Old C-to-T reversions can be traced back to radiation steps that gave origin to main classes, orders and some families.

  8. Uncovering the liver’s role in immunity through RNA co-expression networks

    Czech Academy of Sciences Publication Activity Database

    Harrall, K. K.; Kechris, K. J.; Tabakoff, B.; Hoffman, P.L.; Hines, L. M.; Tsukamoto, H.; Pravenec, Michal; Printz, M.; Saba, L. M.

    2016-01-01

    Roč. 27, 9-10 (2016), s. 469-484 ISSN 0938-8990 R&D Projects: GA ČR(CZ) GAP301/12/0696 Institutional support: RVO:67985823 Keywords : RNA coexpression networks * liver * immunity * rat * recombinant inbred strains Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.509, year: 2016

  9. Integration of heterogeneous molecular networks to unravel gene-regulation in Mycobacterium tuberculosis.

    Science.gov (United States)

    van Dam, Jesse C J; Schaap, Peter J; Martins dos Santos, Vitor A P; Suárez-Diez, María

    2014-09-26

    Different methods have been developed to infer regulatory networks from heterogeneous omics datasets and to construct co-expression networks. Each algorithm produces different networks and efforts have been devoted to automatically integrate them into consensus sets. However each separate set has an intrinsic value that is diluted and partly lost when building a consensus network. Here we present a methodology to generate co-expression networks and, instead of a consensus network, we propose an integration framework where the different networks are kept and analysed with additional tools to efficiently combine the information extracted from each network. We developed a workflow to efficiently analyse information generated by different inference and prediction methods. Our methodology relies on providing the user the means to simultaneously visualise and analyse the coexisting networks generated by different algorithms, heterogeneous datasets, and a suite of analysis tools. As a show case, we have analysed the gene co-expression networks of Mycobacterium tuberculosis generated using over 600 expression experiments. Regarding DNA damage repair, we identified SigC as a key control element, 12 new targets for LexA, an updated LexA binding motif, and a potential mismatch repair system. We expanded the DevR regulon with 27 genes while identifying 9 targets wrongly assigned to this regulon. We discovered 10 new genes linked to zinc uptake and a new regulatory mechanism for ZuR. The use of co-expression networks to perform system level analysis allows the development of custom made methodologies. As show cases we implemented a pipeline to integrate ChIP-seq data and another method to uncover multiple regulatory layers. Our workflow is based on representing the multiple types of information as network representations and presenting these networks in a synchronous framework that allows their simultaneous visualization while keeping specific associations from the different

  10. Radionuclide reporter gene imaging for cardiac gene therapy

    International Nuclear Information System (INIS)

    Inubushi, Masayuki; Tamaki, Nagara

    2007-01-01

    In the field of cardiac gene therapy, angiogenic gene therapy has been most extensively investigated. The first clinical trial of cardiac angiogenic gene therapy was reported in 1998, and at the peak, more than 20 clinical trial protocols were under evaluation. However, most trials have ceased owing to the lack of decisive proof of therapeutic effects and the potential risks of viral vectors. In order to further advance cardiac angiogenic gene therapy, remaining open issues need to be resolved: there needs to be improvement of gene transfer methods, regulation of gene expression, development of much safer vectors and optimisation of therapeutic genes. For these purposes, imaging of gene expression in living organisms is of great importance. In radionuclide reporter gene imaging, ''reporter genes'' transferred into cell nuclei encode for a protein that retains a complementary ''reporter probe'' of a positron or single-photon emitter; thus expression of the reporter genes can be imaged with positron emission tomography or single-photon emission computed tomography. Accordingly, in the setting of gene therapy, the location, magnitude and duration of the therapeutic gene co-expression with the reporter genes can be monitored non-invasively. In the near future, gene therapy may evolve into combination therapy with stem/progenitor cell transplantation, so-called cell-based gene therapy or gene-modified cell therapy. Radionuclide reporter gene imaging is now expected to contribute in providing evidence on the usefulness of this novel therapeutic approach, as well as in investigating the molecular mechanisms underlying neovascularisation and safety issues relevant to further progress in conventional gene therapy. (orig.)

  11. Segmental duplications and evolutionary acquisition of UV damage response in the SPATA31 gene family of primates and humans.

    Science.gov (United States)

    Bekpen, Cemalettin; Künzel, Sven; Xie, Chen; Eaaswarkhanth, Muthukrishnan; Lin, Yen-Lung; Gokcumen, Omer; Akdis, Cezmi A; Tautz, Diethard

    2017-03-06

    Segmental duplications are an abundant source for novel gene functions and evolutionary adaptations. This mechanism of generating novelty was very active during the evolution of primates particularly in the human lineage. Here, we characterize the evolution and function of the SPATA31 gene family (former designation FAM75A), which was previously shown to be among the gene families with the strongest signal of positive selection in hominoids. The mouse homologue for this gene family is a single copy gene expressed during spermatogenesis. We show that in primates, the SPATA31 gene duplicated into SPATA31A and SPATA31C types and broadened the expression into many tissues. Each type became further segmentally duplicated in the line towards humans with the largest number of full-length copies found for SPATA31A in humans. Copy number estimates of SPATA31A based on digital PCR show an average of 7.5 with a range of 5-11 copies per diploid genome among human individuals. The primate SPATA31 genes also acquired new protein domains that suggest an involvement in UV response and DNA repair. We generated antibodies and show that the protein is re-localized from the nucleolus to the whole nucleus upon UV-irradiation suggesting a UV damage response. We used CRISPR/Cas mediated mutagenesis to knockout copies of the gene in human primary fibroblast cells. We find that cell lines with reduced functional copies as well as naturally occurring low copy number HFF cells show enhanced sensitivity towards UV-irradiation. The acquisition of new SPATA31 protein functions and its broadening of expression may be related to the evolution of the diurnal life style in primates that required a higher UV tolerance. The increased segmental duplications in hominoids as well as its fast evolution suggest the acquisition of further specific functions particularly in humans.

  12. Evolutionary Analyses Suggest a Function of MxB Immunity Proteins Beyond Lentivirus Restriction.

    Directory of Open Access Journals (Sweden)

    Patrick S Mitchell

    2015-12-01

    Full Text Available Viruses impose diverse and dynamic challenges on host defenses. Diversifying selection of codons and gene copy number variation are two hallmarks of genetic innovation in antiviral genes engaged in host-virus genetic conflicts. The myxovirus resistance (Mx genes encode interferon-inducible GTPases that constitute a major arm of the cell-autonomous defense against viral infection. Unlike the broad antiviral activity of MxA, primate MxB was recently shown to specifically inhibit lentiviruses including HIV-1. We carried out detailed evolutionary analyses to investigate whether genetic conflict with lentiviruses has shaped MxB evolution in primates. We found strong evidence for diversifying selection in the MxB N-terminal tail, which contains molecular determinants of MxB anti-lentivirus specificity. However, we found no overlap between previously-mapped residues that dictate lentiviral restriction and those that have evolved under diversifying selection. Instead, our findings are consistent with MxB having a long-standing and important role in the interferon response to viral infection against a broader range of pathogens than is currently appreciated. Despite its critical role in host innate immunity, we also uncovered multiple functional losses of MxB during mammalian evolution, either by pseudogenization or by gene conversion from MxA genes. Thus, although the majority of mammalian genomes encode two Mx genes, this apparent stasis masks the dramatic effects that recombination and diversifying selection have played in shaping the evolutionary history of Mx genes. Discrepancies between our study and previous publications highlight the need to account for recombination in analyses of positive selection, as well as the importance of using sequence datasets with appropriate depth of divergence. Our study also illustrates that evolutionary analyses of antiviral gene families are critical towards understanding molecular principles that govern host

  13. Organization and evolution of the rat tyrosine hydroxylase gene

    International Nuclear Information System (INIS)

    Brown, E.R.; Coker, G.T. III; O'Malley, K.L.

    1987-01-01

    This report describes the organization of the rat tyrosine hydroxylase (TH) gene and compares its structure with the human phenylalanine hydroxylase gene. Both genes are single copy and contain 13 exons separated by 12 introns. Remarkably, the positions of 10 out 12 intron/exon boundaries are identical for the two genes. These results support the idea that these hydroxylases genes are members of a gene family which has a common evolutionary origin. The authors predict that this ancestral gene would have encoded exons similar to those of TH prior to evolutionary drift to other members of this gene family

  14. The dynamic landscape of gene regulation during Bombyx mori oogenesis.

    Science.gov (United States)

    Zhang, Qiang; Sun, Wei; Sun, Bang-Yong; Xiao, Yang; Zhang, Ze

    2017-09-11

    Oogenesis in the domestic silkworm (Bombyx mori) is a complex process involving previtellogenesis, vitellogenesis and choriogenesis. During this process, follicles show drastic morphological and physiological changes. However, the genome-wide regulatory profiles of gene expression during oogenesis remain to be determined. In this study, we obtained time-series transcriptome data and used these data to reveal the dynamic landscape of gene regulation during oogenesis. A total of 1932 genes were identified to be differentially expressed among different stages, most of which occurred during the transition from late vitellogenesis to early choriogenesis. Using weighted gene co-expression network analysis, we identified six stage-specific gene modules that correspond to multiple regulatory pathways. Strikingly, the biosynthesis pathway of the molting hormone 20-hydroxyecdysone (20E) was enriched in one of the modules. Further analysis showed that the ecdysteroid 20-hydroxylase gene (CYP314A1) of steroidgenesis genes was mainly expressed in previtellogenesis and early vitellogenesis. However, the 20E-inactivated genes, particularly the ecdysteroid 26-hydroxylase encoding gene (Cyp18a1), were highly expressed in late vitellogenesis. These distinct expression patterns between 20E synthesis and catabolism-related genes might ensure the rapid decline of the hormone titer at the transition point from vitellogenesis to choriogenesis. In addition, we compared landscapes of gene regulation between silkworm (Lepidoptera) and fruit fly (Diptera) oogeneses. Our results show that there is some consensus in the modules of gene co-expression during oogenesis in these insects. The data presented in this study provide new insights into the regulatory mechanisms underlying oogenesis in insects with polytrophic meroistic ovaries. The results also provide clues for further investigating the roles of epigenetic reconfiguration and circadian rhythm in insect oogenesis.

  15. Evolutionary Games with Randomly Changing Payoff Matrices

    Science.gov (United States)

    Yakushkina, Tatiana; Saakian, David B.; Bratus, Alexander; Hu, Chin-Kun

    2015-06-01

    Evolutionary games are used in various fields stretching from economics to biology. In most of these games a constant payoff matrix is assumed, although some works also consider dynamic payoff matrices. In this article we assume a possibility of switching the system between two regimes with different sets of payoff matrices. Potentially such a model can qualitatively describe the development of bacterial or cancer cells with a mutator gene present. A finite population evolutionary game is studied. The model describes the simplest version of annealed disorder in the payoff matrix and is exactly solvable at the large population limit. We analyze the dynamics of the model, and derive the equations for both the maximum and the variance of the distribution using the Hamilton-Jacobi equation formalism.

  16. Molecular evolutionary patterns of NAD+/Sirtuin aging signaling pathway across taxa.

    Directory of Open Access Journals (Sweden)

    Uma Gaur

    Full Text Available A deeper understanding of the conserved molecular mechanisms in different taxa have been made possible only because of the evolutionary conservation of crucial signaling pathways. In the present study, we explored the molecular evolutionary pattern of selection signatures in 51 species for 10 genes which are important components of NAD+/Sirtuin pathway and have already been directly linked to lifespan extension in worms and mice. Selection pressure analysis using PAML program revealed that MRPS5 and PPARGC1A were under significant constraints because of their functional significance. FOXO3a also displayed strong purifying selection. All three sirtuins, which were SIRT1, SIRT2 and SIRT6, displayed a great degree of conservation between taxa, which is consistent with the previous report. A significant evolutionary constraint is seen on the anti-oxidant gene, SOD3. As expected, TP53 gene was under significant selection pressure in mammals, owing to its major role in tumor progression. Poly-ADP-ribose polymerase (PARP genes displayed the most sites under positive selection. Further 3D structural analysis of PARP1 and PARP2 protein revealed that some of these positively selected sites caused a change in the electrostatic potential of the protein structure, which may allow a change in its interaction with other proteins and molecules ultimately leading to difference in the function. Although the functional significance of the positively selected sites could not be established in the variants databases, yet it will be interesting to see if these sites actually affect the function of PARP1 and PARP2.

  17. Horizontal gene transfer between Wolbachia and the mosquito Aedes aegypti

    Directory of Open Access Journals (Sweden)

    Walker Thomas

    2009-01-01

    Full Text Available Abstract Background The evolutionary importance of horizontal gene transfer (HGT from Wolbachia endosymbiotic bacteria to their eukaryotic hosts is a topic of considerable interest and debate. Recent transfers of genome fragments from Wolbachia into insect chromosomes have been reported, but it has been argued that these fragments may be on an evolutionary trajectory to degradation and loss. Results We have discovered a case of HGT, involving two adjacent genes, between the genomes of Wolbachia and the currently Wolbachia-uninfected mosquito Aedes aegypti, an important human disease vector. The lower level of sequence identity between Wolbachia and insect, the transcription of all the genes involved, and the fact that we have identified homologs of the two genes in another Aedes species (Ae. mascarensis, suggest that these genes are being expressed after an extended evolutionary period since horizontal transfer, and therefore that the transfer has functional significance. The association of these genes with Wolbachia prophage regions also provides a mechanism for the transfer. Conclusion The data support the argument that HGT between Wolbachia endosymbiotic bacteria and their hosts has produced evolutionary innovation.

  18. Evolutionary Nephrology.

    Science.gov (United States)

    Chevalier, Robert L

    2017-05-01

    Progressive kidney disease follows nephron loss, hyperfiltration, and incomplete repair, a process described as "maladaptive." In the past 20 years, a new discipline has emerged that expands research horizons: evolutionary medicine. In contrast to physiologic (homeostatic) adaptation, evolutionary adaptation is the result of reproductive success that reflects natural selection. Evolutionary explanations for physiologically maladaptive responses can emerge from mismatch of the phenotype with environment or evolutionary tradeoffs. Evolutionary adaptation to a terrestrial environment resulted in a vulnerable energy-consuming renal tubule and a hypoxic, hyperosmolar microenvironment. Natural selection favors successful energy investment strategy: energy is allocated to maintenance of nephron integrity through reproductive years, but this declines with increasing senescence after ~40 years of age. Risk factors for chronic kidney disease include restricted fetal growth or preterm birth (life history tradeoff resulting in fewer nephrons), evolutionary selection for APOL1 mutations (that provide resistance to trypanosome infection, a tradeoff), and modern life experience (Western diet mismatch leading to diabetes and hypertension). Current advances in genomics, epigenetics, and developmental biology have revealed proximate causes of kidney disease, but attempts to slow kidney disease remain elusive. Evolutionary medicine provides a complementary approach by addressing ultimate causes of kidney disease. Marked variation in nephron number at birth, nephron heterogeneity, and changing susceptibility to kidney injury throughout life history are the result of evolutionary processes. Combined application of molecular genetics, evolutionary developmental biology (evo-devo), developmental programming and life history theory may yield new strategies for prevention and treatment of chronic kidney disease.

  19. Evolutionary Nephrology

    Directory of Open Access Journals (Sweden)

    Robert L. Chevalier

    2017-05-01

    Full Text Available Progressive kidney disease follows nephron loss, hyperfiltration, and incomplete repair, a process described as “maladaptive.” In the past 20 years, a new discipline has emerged that expands research horizons: evolutionary medicine. In contrast to physiologic (homeostatic adaptation, evolutionary adaptation is the result of reproductive success that reflects natural selection. Evolutionary explanations for physiologically maladaptive responses can emerge from mismatch of the phenotype with environment or from evolutionary tradeoffs. Evolutionary adaptation to a terrestrial environment resulted in a vulnerable energy-consuming renal tubule and a hypoxic, hyperosmolar microenvironment. Natural selection favors successful energy investment strategy: energy is allocated to maintenance of nephron integrity through reproductive years, but this declines with increasing senescence after ∼40 years of age. Risk factors for chronic kidney disease include restricted fetal growth or preterm birth (life history tradeoff resulting in fewer nephrons, evolutionary selection for APOL1 mutations (which provide resistance to trypanosome infection, a tradeoff, and modern life experience (Western diet mismatch leading to diabetes and hypertension. Current advances in genomics, epigenetics, and developmental biology have revealed proximate causes of kidney disease, but attempts to slow kidney disease remain elusive. Evolutionary medicine provides a complementary approach by addressing ultimate causes of kidney disease. Marked variation in nephron number at birth, nephron heterogeneity, and changing susceptibility to kidney injury throughout the life history are the result of evolutionary processes. Combined application of molecular genetics, evolutionary developmental biology (evo-devo, developmental programming, and life history theory may yield new strategies for prevention and treatment of chronic kidney disease.

  20. Functional evolution of a multigene family: orthologous and paralogous pheromone receptor genes in the turnip moth, Agrotis segetum.

    Directory of Open Access Journals (Sweden)

    Dan-Dan Zhang

    Full Text Available Lepidopteran pheromone receptors (PRs, for which orthologies are evident among closely related species, provide an intriguing example of gene family evolution in terms of how new functions may arise. However, only a limited number of PRs have been functionally characterized so far and thus evolutionary scenarios suffer from elements of speculation. In this study we investigated the turnip moth Agrotis segetum, in which female moths produce a mixture of chemically related pheromone components that elicit specific responses from receptor cells on male antennae. We cloned nine A. segetum PR genes and the Orco gene by degenerate primer based RT-PCR. The nine PR genes, named as AsegOR1 and AsegOR3-10, fall into four distinct orthologous clusters of known lepidopteran PRs, of which one contains six paralogues. The paralogues are under relaxed selective pressure, contrasting with the purifying selection on other clusters. We identified the receptors AsegOR9, AsegOR4 and AsegOR5, specific for the respective homologous pheromone components (Z-5-decenyl, (Z-7-dodecenyl and (Z-9-tetradecenyl acetates, by two-electrode voltage clamp recording from Xenopus laevis oocytes co-expressing Orco and each PR candidate. These receptors occur in three different orthologous clusters. We also found that the six paralogues with high sequence similarity vary dramatically in ligand selectivity and sensitivity. Different from AsegOR9, AsegOR6 showed a relatively large response to the behavioural antagonist (Z-5-decenol, and a small response to (Z-5-decenyl acetate. AsegOR1 was broadly tuned, but most responsive to (Z-5-decenyl acetate, (Z-7-dodecenyl acetate and the behavioural antagonist (Z-8-dodecenyl acetate. AsegOR8 and AsegOR7, which differ from AsegOR6 and AsegOR1 by 7 and 10 aa respectively, showed much lower sensitivities. AsegOR10 showed only small responses to all the tested compounds. These results suggest that new receptors arise through gene duplication, and

  1. The evolutionary capacitor HSP90 buffers the regulatory effects of mammalian endogenous retroviruses.

    Science.gov (United States)

    Hummel, Barbara; Hansen, Erik C; Yoveva, Aneliya; Aprile-Garcia, Fernando; Hussong, Rebecca; Sawarkar, Ritwick

    2017-03-01

    Understanding how genotypes are linked to phenotypes is important in biomedical and evolutionary studies. The chaperone heat-shock protein 90 (HSP90) buffers genetic variation by stabilizing proteins with variant sequences, thereby uncoupling phenotypes from genotypes. Here we report an unexpected role of HSP90 in buffering cis-regulatory variation affecting gene expression. By using the tripartite-motif-containing 28 (TRIM28; also known as KAP1)-mediated epigenetic pathway, HSP90 represses the regulatory influence of endogenous retroviruses (ERVs) on neighboring genes that are critical for mouse development. Our data based on natural variations in the mouse genome show that genes respond to HSP90 inhibition in a manner dependent on their genomic location with regard to strain-specific ERV-insertion sites. The evolutionary-capacitor function of HSP90 may thus have facilitated the exaptation of ERVs as key modifiers of gene expression and morphological diversification. Our findings add a new regulatory layer through which HSP90 uncouples phenotypic outcomes from individual genotypes.

  2. An evolutionary conserved region (ECR in the human dopamine receptor D4 gene supports reporter gene expression in primary cultures derived from the rat cortex

    Directory of Open Access Journals (Sweden)

    Haddley Kate

    2011-05-01

    Full Text Available Abstract Background Detecting functional variants contributing to diversity of behaviour is crucial for dissecting genetics of complex behaviours. At a molecular level, characterisation of variation in exons has been studied as they are easily identified in the current genome annotation although the functional consequences are less well understood; however, it has been difficult to prioritise regions of non-coding DNA in which genetic variation could also have significant functional consequences. Comparison of multiple vertebrate genomes has allowed the identification of non-coding evolutionary conserved regions (ECRs, in which the degree of conservation can be comparable with exonic regions suggesting functional significance. Results We identified ECRs at the dopamine receptor D4 gene locus, an important gene for human behaviours. The most conserved non-coding ECR (D4ECR1 supported high reporter gene expression in primary cultures derived from neonate rat frontal cortex. Computer aided analysis of the sequence of the D4ECR1 indicated the potential transcription factors that could modulate its function. D4ECR1 contained multiple consensus sequences for binding the transcription factor Sp1, a factor previously implicated in DRD4 expression. Co-transfection experiments demonstrated that overexpression of Sp1 significantly decreased the activity of the D4ECR1 in vitro. Conclusion Bioinformatic analysis complemented by functional analysis of the DRD4 gene locus has identified a a strong enhancer that functions in neurons and b a transcription factor that may modulate the function of that enhancer.

  3. Comparative Genomic Analyses Provide New Insights into the Evolutionary Dynamics of Heterochromatin in Drosophila.

    Science.gov (United States)

    Caizzi, Ruggiero; Moschetti, Roberta; Piacentini, Lucia; Fanti, Laura; Marsano, Renè Massimiliano; Dimitri, Patrizio

    2016-08-01

    The term heterochromatin has been long considered synonymous with gene silencing, but it is now clear that the presence of transcribed genes embedded in pericentromeric heterochromatin is a conserved feature in the evolution of eukaryotic genomes. Several studies have addressed the epigenetic changes that enable the expression of genes in pericentric heterochromatin, yet little is known about the evolutionary processes through which this has occurred. By combining genome annotation analysis and high-resolution cytology, we have identified and mapped 53 orthologs of D. melanogaster heterochromatic genes in the genomes of two evolutionarily distant species, D. pseudoobscura and D. virilis. Our results show that the orthologs of the D. melanogaster heterochromatic genes are clustered at three main genomic regions in D. virilis and D. pseudoobscura. In D. virilis, the clusters lie in the middle of euchromatin, while those in D. pseudoobscura are located in the proximal portion of the chromosome arms. Some orthologs map to the corresponding Muller C element in D. pseudoobscura and D. virilis, while others localize on the Muller B element, suggesting that chromosomal rearrangements that have been instrumental in the fusion of two separate elements involved the progenitors of genes currently located in D. melanogaster heterochromatin. These results demonstrate an evolutionary repositioning of gene clusters from ancestral locations in euchromatin to the pericentromeric heterochromatin of descendent D. melanogaster chromosomes. Remarkably, in both D. virilis and D. pseudoobscura the gene clusters show a conserved association with the HP1a protein, one of the most highly evolutionarily conserved epigenetic marks. In light of these results, we suggest a new scenario whereby ancestral HP1-like proteins (and possibly other epigenetic marks) may have contributed to the evolutionary repositioning of gene clusters into heterochromatin.

  4. Coexpression and Secretion of Endoglucanase and Phytase Genes in Lactobacillus reuteri

    Science.gov (United States)

    Wang, Lei; Yang, Yuxin; Cai, Bei; Cao, Pinghua; Yang, Mingming; Chen, Yulin

    2014-01-01

    A multifunctional transgenic Lactobacillus with probiotic characteristics and an ability to degrade β-glucan and phytic acid (phytate) was engineered to improve nutrient utilization, increase production performance and decrease digestive diseases in broiler chickens. The Bacillus subtilis WL001 endoglucanase gene (celW) and Aspergillus fumigatus WL002 phytase gene (phyW) mature peptide (phyWM) were cloned into an expression vector with the lactate dehydrogenase promoter of Lactobacillus casei and the secretion signal peptide of the Lactococcus lactis usp45 gene. This construct was then transformed into Lactobacillus reuteri XC1 that had been isolated from the gastrointestinal tract of broilers. Heterologous enzyme production and feed effectiveness of this genetically modified L. reuteri strain were investigated and evaluated. Sodium dodecyl sulfate polyacrylamide gel electrophoresis analysis showed that the molecular mass of phyWM and celW was approximately 48.2 and 55 kDa, respectively, consistent with their predicted molecular weights. Endoglucanase and phytase activities in the extracellular fraction of the transformed L. reuteri culture were 0.68 and 0.42 U/mL, respectively. Transformed L. reuteri improved the feed conversion ratio of broilers from 21 to 42 days of age and over the whole feeding period. However, there was no effect on body weight gain and feed intake of chicks. Transformed L. reuteri supplementation improved levels of ash, calcium and phosphorus in tibiae at day 21 and of phosphorus at day 42. In addition, populations of Escherichia coli, Veillonella spp. and Bacteroides vulgatus were decreased, while populations of Bifidobacterium genus and Lactobacillus spp. were increased in the cecum at day 21. PMID:25050780

  5. Putative recombination events and evolutionary history of five economically important viruses of fruit trees based on coat protein-encoding gene sequence analysis.

    Science.gov (United States)

    Boulila, Moncef

    2010-06-01

    To enhance the knowledge of recombination as an evolutionary process, 267 accessions retrieved from GenBank were investigated, all belonging to five economically important viruses infecting fruit crops (Plum pox, Apple chlorotic leaf spot, Apple mosaic, Prune dwarf, and Prunus necrotic ringspot viruses). Putative recombinational events were detected in the coat protein (CP)-encoding gene using RECCO and RDP version 3.31beta algorithms. Based on RECCO results, all five viruses were shown to contain potential recombination signals in the CP gene. Reconstructed trees with modified topologies were proposed. Furthermore, RECCO performed better than the RDP package in detecting recombination events and exhibiting their evolution rate along the sequences of the five viruses. RDP, however, provided the possible major and minor parents of the recombinants. Thus, the two methods should be considered complementary.

  6. Genomic signatures of evolutionary transitions from solitary to group living

    DEFF Research Database (Denmark)

    Kapheim, Karen M.; Pan, Hailin; Li, Cai

    2015-01-01

    . First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly...

  7. Comparative genomics sheds light on niche differentiation and the evolutionary history of comammox Nitrospira

    DEFF Research Database (Denmark)

    Palomo, Alejandro; Pedersen, Anders Gorm; Fowler, Jane

    2018-01-01

    genomes encode genes that might allow efficient growth at low oxygen concentrations. Regarding the evolutionary history of comammox Nitrospira, our analyses indicate that several genes belonging to the ammonia oxidation pathway could have been laterally transferred from β-AOB to comammox Nitrospira. We...

  8. Evolutionary molecular medicine.

    Science.gov (United States)

    Nesse, Randolph M; Ganten, Detlev; Gregory, T Ryan; Omenn, Gilbert S

    2012-05-01

    Evolution has long provided a foundation for population genetics, but some major advances in evolutionary biology from the twentieth century that provide foundations for evolutionary medicine are only now being applied in molecular medicine. They include the need for both proximate and evolutionary explanations, kin selection, evolutionary models for cooperation, competition between alleles, co-evolution, and new strategies for tracing phylogenies and identifying signals of selection. Recent advances in genomics are transforming evolutionary biology in ways that create even more opportunities for progress at its interfaces with genetics, medicine, and public health. This article reviews 15 evolutionary principles and their applications in molecular medicine in hopes that readers will use them and related principles to speed the development of evolutionary molecular medicine.

  9. On Reciprocal Causation in the Evolutionary Process.

    Science.gov (United States)

    Svensson, Erik I

    2018-01-01

    Recent calls for a revision of standard evolutionary theory (SET) are based partly on arguments about the reciprocal causation. Reciprocal causation means that cause-effect relationships are bi-directional, as a cause could later become an effect and vice versa. Such dynamic cause-effect relationships raise questions about the distinction between proximate and ultimate causes, as originally formulated by Ernst Mayr. They have also motivated some biologists and philosophers to argue for an Extended Evolutionary Synthesis (EES). The EES will supposedly expand the scope of the Modern Synthesis (MS) and SET, which has been characterized as gene-centred, relying primarily on natural selection and largely neglecting reciprocal causation. Here, I critically examine these claims, with a special focus on the last conjecture. I conclude that reciprocal causation has long been recognized as important by naturalists, ecologists and evolutionary biologists working in the in the MS tradition, although it it could be explored even further. Numerous empirical examples of reciprocal causation in the form of positive and negative feedback are now well known from both natural and laboratory systems. Reciprocal causation have also been explicitly incorporated in mathematical models of coevolutionary arms races, frequency-dependent selection, eco-evolutionary dynamics and sexual selection. Such dynamic feedback were already recognized by Richard Levins and Richard Lewontin in their bok The Dialectical Biologist . Reciprocal causation and dynamic feedback might also be one of the few contributions of dialectical thinking and Marxist philosophy in evolutionary theory. I discuss some promising empirical and analytical tools to study reciprocal causation and the implications for the EES. Finally, I briefly discuss how quantitative genetics can be adapated to studies of reciprocal causation, constructive inheritance and phenotypic plasticity and suggest that the flexibility of this approach

  10. Coexpression network analysis in abdominal and gluteal adipose tissue reveals regulatory genetic loci for metabolic syndrome and related phenotypes

    DEFF Research Database (Denmark)

    Min, Josine L; Nicholson, George; Halgrimsdottir, Ingileif

    2012-01-01

    Metabolic Syndrome (MetS) is highly prevalent and has considerable public health impact, but its underlying genetic factors remain elusive. To identify gene networks involved in MetS, we conducted whole-genome expression and genotype profiling on abdominal (ABD) and gluteal (GLU) adipose tissue...... and 51 (0.6%) in GLU only. Differential eigengene network analysis of 8,256 shared probesets detected 22 shared modules with high preservation across adipose depots (D(ABD-GLU) = 0.89), seven of which were associated with MetS (FDR P100,000 individuals; rs10282458, affecting expression of RARRES2...... and their interactions influence complex traits such as MetS, integrated analysis of genotypes and coexpression networks across multiple tissues relevant to clinical traits is an efficient strategy to identify novel associations....

  11. Lignin, mitochondrial family and photorespiratory transporter classification as case studies in using co-expression, co-response and protein locations to aid in identifying transport functions

    Directory of Open Access Journals (Sweden)

    Takayuki eTohge

    2014-03-01

    Full Text Available Whole genome sequencing and the relative ease of transcript profiling have facilitated the collection and data warehousing of immense quantities of expression data. However, a substantial proportion of genes are not yet functionally annotated a problem which is particularly acute for transport proteins. In Arabidopsis, for example, only a minor fraction of the estimated 700 intracellular transporters have been identified at the molecular genetic level. Furthermore it is only within the last couple of years that critical genes such as those encoding the final transport step required for the long distance transport of sucrose and the first transporter of the core photorespiratory pathway have been identified. Here we will describe how transcriptional coordination between genes of known function and non-annotated genes allows the identification of putative transporters on the premise that such co-expressed genes tend to be functionally related. We will additionally extend this to include the expansion of this approach to include phenotypic information from other levels of cellular organization such as proteomic and metabolomic data and provide case studies wherein this approach has successfully been used to fill knowledge gaps in important metabolic pathways and physiological processes.

  12. Diurnal Transcriptome and Gene Network Represented through Sparse Modeling in Brachypodium distachyon

    Directory of Open Access Journals (Sweden)

    Satoru Koda

    2017-11-01

    Full Text Available We report the comprehensive identification of periodic genes and their network inference, based on a gene co-expression analysis and an Auto-Regressive eXogenous (ARX model with a group smoothly clipped absolute deviation (SCAD method using a time-series transcriptome dataset in a model grass, Brachypodium distachyon. To reveal the diurnal changes in the transcriptome in B. distachyon, we performed RNA-seq analysis of its leaves sampled through a diurnal cycle of over 48 h at 4 h intervals using three biological replications, and identified 3,621 periodic genes through our wavelet analysis. The expression data are feasible to infer network sparsity based on ARX models. We found that genes involved in biological processes such as transcriptional regulation, protein degradation, and post-transcriptional modification and photosynthesis are significantly enriched in the periodic genes, suggesting that these processes might be regulated by circadian rhythm in B. distachyon. On the basis of the time-series expression patterns of the periodic genes, we constructed a chronological gene co-expression network and identified putative transcription factors encoding genes that might be involved in the time-specific regulatory transcriptional network. Moreover, we inferred a transcriptional network composed of the periodic genes in B. distachyon, aiming to identify genes associated with other genes through variable selection by grouping time points for each gene. Based on the ARX model with the group SCAD regularization using our time-series expression datasets of the periodic genes, we constructed gene networks and found that the networks represent typical scale-free structure. Our findings demonstrate that the diurnal changes in the transcriptome in B. distachyon leaves have a sparse network structure, demonstrating the spatiotemporal gene regulatory network over the cyclic phase transitions in B. distachyon diurnal growth.

  13. Characterization of resistance gene analogues (RGAs) in apple (Malus × domestica Borkh.) and their evolutionary history of the Rosaceae family.

    Science.gov (United States)

    Perazzolli, Michele; Malacarne, Giulia; Baldo, Angela; Righetti, Laura; Bailey, Aubrey; Fontana, Paolo; Velasco, Riccardo; Malnoy, Mickael

    2014-01-01

    The family of resistance gene analogues (RGAs) with a nucleotide-binding site (NBS) domain accounts for the largest number of disease resistance genes and is one of the largest gene families in plants. We have identified 868 RGAs in the genome of the apple (Malus × domestica Borkh.) cultivar 'Golden Delicious'. This represents 1.51% of the total number of predicted genes for this cultivar. Several evolutionary features are pronounced in M. domestica, including a high fraction (80%) of RGAs occurring in clusters. This suggests frequent tandem duplication and ectopic translocation events. Of the identified RGAs, 56% are located preferentially on six chromosomes (Chr 2, 7, 8, 10, 11, and 15), and 25% are located on Chr 2. TIR-NBS and non-TIR-NBS classes of RGAs are primarily exclusive of different chromosomes, and 99% of non-TIR-NBS RGAs are located on Chr 11. A phylogenetic reconstruction was conducted to study the evolution of RGAs in the Rosaceae family. More than 1400 RGAs were identified in six species based on their NBS domain, and a neighbor-joining analysis was used to reconstruct the phylogenetic relationships among the protein sequences. Specific phylogenetic clades were found for RGAs of Malus, Fragaria, and Rosa, indicating genus-specific evolution of resistance genes. However, strikingly similar RGAs were shared in Malus, Pyrus, and Prunus, indicating high conservation of specific RGAs and suggesting a monophyletic origin of these three genera.

  14. Characterization of resistance gene analogues (RGAs in apple (Malus × domestica Borkh. and their evolutionary history of the Rosaceae family.

    Directory of Open Access Journals (Sweden)

    Michele Perazzolli

    Full Text Available The family of resistance gene analogues (RGAs with a nucleotide-binding site (NBS domain accounts for the largest number of disease resistance genes and is one of the largest gene families in plants. We have identified 868 RGAs in the genome of the apple (Malus × domestica Borkh. cultivar 'Golden Delicious'. This represents 1.51% of the total number of predicted genes for this cultivar. Several evolutionary features are pronounced in M. domestica, including a high fraction (80% of RGAs occurring in clusters. This suggests frequent tandem duplication and ectopic translocation events. Of the identified RGAs, 56% are located preferentially on six chromosomes (Chr 2, 7, 8, 10, 11, and 15, and 25% are located on Chr 2. TIR-NBS and non-TIR-NBS classes of RGAs are primarily exclusive of different chromosomes, and 99% of non-TIR-NBS RGAs are located on Chr 11. A phylogenetic reconstruction was conducted to study the evolution of RGAs in the Rosaceae family. More than 1400 RGAs were identified in six species based on their NBS domain, and a neighbor-joining analysis was used to reconstruct the phylogenetic relationships among the protein sequences. Specific phylogenetic clades were found for RGAs of Malus, Fragaria, and Rosa, indicating genus-specific evolution of resistance genes. However, strikingly similar RGAs were shared in Malus, Pyrus, and Prunus, indicating high conservation of specific RGAs and suggesting a monophyletic origin of these three genera.

  15. Characterization of Resistance Gene Analogues (RGAs) in Apple (Malus × domestica Borkh.) and Their Evolutionary History of the Rosaceae Family

    Science.gov (United States)

    Baldo, Angela; Righetti, Laura; Bailey, Aubrey; Fontana, Paolo; Velasco, Riccardo; Malnoy, Mickael

    2014-01-01

    The family of resistance gene analogues (RGAs) with a nucleotide-binding site (NBS) domain accounts for the largest number of disease resistance genes and is one of the largest gene families in plants. We have identified 868 RGAs in the genome of the apple (Malus × domestica Borkh.) cultivar ‘Golden Delicious’. This represents 1.51% of the total number of predicted genes for this cultivar. Several evolutionary features are pronounced in M. domestica, including a high fraction (80%) of RGAs occurring in clusters. This suggests frequent tandem duplication and ectopic translocation events. Of the identified RGAs, 56% are located preferentially on six chromosomes (Chr 2, 7, 8, 10, 11, and 15), and 25% are located on Chr 2. TIR-NBS and non-TIR-NBS classes of RGAs are primarily exclusive of different chromosomes, and 99% of non-TIR-NBS RGAs are located on Chr 11. A phylogenetic reconstruction was conducted to study the evolution of RGAs in the Rosaceae family. More than 1400 RGAs were identified in six species based on their NBS domain, and a neighbor-joining analysis was used to reconstruct the phylogenetic relationships among the protein sequences. Specific phylogenetic clades were found for RGAs of Malus, Fragaria, and Rosa, indicating genus-specific evolution of resistance genes. However, strikingly similar RGAs were shared in Malus, Pyrus, and Prunus, indicating high conservation of specific RGAs and suggesting a monophyletic origin of these three genera. PMID:24505246

  16. The grapevine kinome: annotation, classification and expression patterns in developmental processes and stress responses.

    Science.gov (United States)

    Zhu, Kaikai; Wang, Xiaolong; Liu, Jinyi; Tang, Jun; Cheng, Qunkang; Chen, Jin-Gui; Cheng, Zong-Ming Max

    2018-01-01

    Protein kinases (PKs) have evolved as the largest family of molecular switches that regulate protein activities associated with almost all essential cellular functions. Only a fraction of plant PKs, however, have been functionally characterized even in model plant species. In the present study, the entire grapevine kinome was identified and annotated using the most recent version of the grapevine genome. A total of 1168 PK-encoding genes were identified and classified into 20 groups and 121 families, with the RLK-Pelle group being the largest, with 872 members. The 1168 kinase genes were unevenly distributed over all 19 chromosomes, and both tandem and segmental duplications contributed to the expansion of the grapevine kinome, especially of the RLK-Pelle group. Ka/Ks values indicated that most of the tandem and segmental duplication events were under purifying selection. The grapevine kinome families exhibited different expression patterns during plant development and in response to various stress treatments, with many being coexpressed. The comprehensive annotation of grapevine kinase genes, their patterns of expression and coexpression, and the related information facilitate a more complete understanding of the roles of various grapevine kinases in growth and development, responses to abiotic stress, and evolutionary history.

  17. Exploring the Diagnostic Potential of Immune Biomarker Co-expression in Gulf War Illness.

    Science.gov (United States)

    Broderick, Gordon; Fletcher, Mary Ann; Gallagher, Michael; Barnes, Zachary; Vernon, Suzanne D; Klimas, Nancy G

    2018-01-01

    Complex disorders like Gulf War illness (GWI) often defy diagnosis on the basis of a single biomarker and may only be distinguishable by considering the co-expression of multiple markers measured in response to a challenge. We demonstrate the practical application of such an approach using an example where blood was collected from 26 GWI, 13 healthy control subjects, and 9 unhealthy controls with chronic fatigue at three points during a graded exercise challenge. A 3-way multivariate projection model based on 12 markers of endocrine and immune function was constructed using a training set of n = 10 GWI and n = 11 healthy controls. These groups were separated almost completely on the basis of two co-expression patterns. In a separate test set these same features allowed for discrimination of new GWI subjects (n = 16) from unhealthy (n = 9) and healthy control subjects with a sensitivity of 70% and a specificity of 90%.

  18. Improved soluble expression and characterization of the Hc domain of Clostridium botulinum neurotoxin serotype A in Escherichia coli by using a PCR-synthesized gene and a Trx co-expression strain.

    Science.gov (United States)

    Chen, Rongchang; Shi, Jing; Cai, Kun; Tu, Wei; Hou, Xiaojun; Liu, Hao; Xiao, Le; Wang, Qin; Tang, Yunming; Wang, Hui

    2010-05-01

    Botulinum neurotoxin serotype A (BoNT/A) is an extremely potent bacterial protein toxin. The Hc fragment of BoNT/A (AHc) was shown to be non-toxic, antigenic, and capable of eliciting a protective immunity in animals challenged with homologous BoNT. In this study, we synthesized AHc gene by using T4 DNA ligase and PCR. The AHc was expressed at a high level in Escherichia coli successfully. Because of using the Trx co-expression strain, the expressed AHc is in a soluble and active form. The yield of the purified AHc was about 70mg/L, and its purity was up to 90% through one-step affinity chromatography. The AHc was positively identified by the antibodies raised against BoNT/A using immunological-dot-blot and Western blot assays. AHc was shown to bind with gangliosides and elicit immunity against BoNT/A, indicating that the expressed and purified AHc protein retains a functionally active conformation. Furthermore, the purified AHc has a strong immunogenicity and can be used as a potential subunit candidate vaccine for botulinum toxin serotype A. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  19. The Pectin Methylesterase Gene Complement of Phytophthora sojae: Structural and Functional Analyses, and the Evolutionary Relationships with Its Oomycete Homologs.

    Science.gov (United States)

    Horowitz, Brent B; Ospina-Giraldo, Manuel D

    2015-01-01

    Phytophthora sojae is an oomycete pathogen that causes the disease known as root and stem rot in soybean plants, frequently leading to massive economic damage. Additionally, P. sojae is increasingly being utilized as a model for phytopathogenic oomycete research. Despite the economic and scientific importance of P. sojae, the mechanism by which it penetrates the host roots is not yet fully understood. It has been found that oomycetes are not capable of penetrating the cell wall solely through mechanical force, suggesting that alternative factors facilitate breakdown of the host cell wall. Pectin methylesterases have been suggested to be important for Phytophthora pathogenicity, but no data exist on their role in the P. sojae infection process. We have scanned the newly revised version of the annotated P. sojae genome for the presence of putative pectin methylesterases genes and conducted a sequence analysis of all gene models found. We also searched for potential regulatory motifs in the promoter region of the proposed P. sojae models, and investigated the gene expression levels throughout the early course of infection on soybean plants. We found that P. sojae contains a large repertoire of pectin methylesterase-coding genes and that most of these genes display similar motifs in the promoter region, indicating the possibility of a shared regulatory mechanism. Phylogenetic analyses confirmed the evolutionary relatedness of the pectin methylesterase-coding genes within and across Phytophthora spp. In addition, the gene duplication events that led to the emergence of this gene family appear to have occurred prior to many speciation events in the genus Phytophthora. Our results also indicate that the highest levels of expression occurred in the first 24 hours post inoculation, with expression falling after this time. Our study provides evidence that pectin methylesterases may be important for the early action of the P. sojae infection process.

  20. The Pectin Methylesterase Gene Complement of Phytophthora sojae: Structural and Functional Analyses, and the Evolutionary Relationships with Its Oomycete Homologs.

    Directory of Open Access Journals (Sweden)

    Brent B Horowitz

    Full Text Available Phytophthora sojae is an oomycete pathogen that causes the disease known as root and stem rot in soybean plants, frequently leading to massive economic damage. Additionally, P. sojae is increasingly being utilized as a model for phytopathogenic oomycete research. Despite the economic and scientific importance of P. sojae, the mechanism by which it penetrates the host roots is not yet fully understood. It has been found that oomycetes are not capable of penetrating the cell wall solely through mechanical force, suggesting that alternative factors facilitate breakdown of the host cell wall. Pectin methylesterases have been suggested to be important for Phytophthora pathogenicity, but no data exist on their role in the P. sojae infection process. We have scanned the newly revised version of the annotated P. sojae genome for the presence of putative pectin methylesterases genes and conducted a sequence analysis of all gene models found. We also searched for potential regulatory motifs in the promoter region of the proposed P. sojae models, and investigated the gene expression levels throughout the early course of infection on soybean plants. We found that P. sojae contains a large repertoire of pectin methylesterase-coding genes and that most of these genes display similar motifs in the promoter region, indicating the possibility of a shared regulatory mechanism. Phylogenetic analyses confirmed the evolutionary relatedness of the pectin methylesterase-coding genes within and across Phytophthora spp. In addition, the gene duplication events that led to the emergence of this gene family appear to have occurred prior to many speciation events in the genus Phytophthora. Our results also indicate that the highest levels of expression occurred in the first 24 hours post inoculation, with expression falling after this time. Our study provides evidence that pectin methylesterases may be important for the early action of the P. sojae infection process.