WorldWideScience

Sample records for gene functional analysis

  1. Function analysis of unknown genes

    DEFF Research Database (Denmark)

    Rogowska-Wrzesinska, A.

    2002-01-01

      This thesis entitled "Function analysis of unknown genes" presents the use of proteome analysis for the characterisation of yeast (Saccharomyces cerevisiae) genes and their products (proteins especially those of unknown function). This study illustrates that proteome analysis can be used...... to describe different aspects of molecular biology of the cell, to study changes that occur in the cell due to overexpression or deletion of a gene and to identify various protein modifications. The biological questions and the results of the described studies show the diversity of the information that can...... genes and proteins. It reports the first global proteome database collecting 36 yeast single gene deletion mutants and selecting over 650 differences between analysed mutants and the wild type strain. The obtained results show that two-dimensional gel electrophoresis and mass spectrometry based proteome...

  2. Gene coexpression network analysis as a source of functional annotation for rice genes.

    Directory of Open Access Journals (Sweden)

    Kevin L Childs

    Full Text Available With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found that gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional

  3. Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

    Science.gov (United States)

    Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

    2010-10-07

    PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out

  4. Inferring gene expression dynamics via functional regression analysis

    Directory of Open Access Journals (Sweden)

    Leng Xiaoyan

    2008-01-01

    Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporary gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.

  5. DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis

    Directory of Open Access Journals (Sweden)

    Baseler Michael W

    2007-11-01

    Full Text Available Abstract Background Due to the complex and distributed nature of biological research, our current biological knowledge is spread over many redundant annotation databases maintained by many independent groups. Analysts usually need to visit many of these bioinformatics databases in order to integrate comprehensive annotation information for their genes, which becomes one of the bottlenecks, particularly for the analytic task associated with a large gene list. Thus, a highly centralized and ready-to-use gene-annotation knowledgebase is in demand for high throughput gene functional analysis. Description The DAVID Knowledgebase is built around the DAVID Gene Concept, a single-linkage method to agglomerate tens of millions of gene/protein identifiers from a variety of public genomic resources into DAVID gene clusters. The grouping of such identifiers improves the cross-reference capability, particularly across NCBI and UniProt systems, enabling more than 40 publicly available functional annotation sources to be comprehensively integrated and centralized by the DAVID gene clusters. The simple, pair-wise, text format files which make up the DAVID Knowledgebase are freely downloadable for various data analysis uses. In addition, a well organized web interface allows users to query different types of heterogeneous annotations in a high-throughput manner. Conclusion The DAVID Knowledgebase is designed to facilitate high throughput gene functional analysis. For a given gene list, it not only provides the quick accessibility to a wide range of heterogeneous annotation data in a centralized location, but also enriches the level of biological information for an individual gene. Moreover, the entire DAVID Knowledgebase is freely downloadable or searchable at http://david.abcc.ncifcrf.gov/knowledgebase/.

  6. Protein functional links in Trypanosoma brucei, identified by gene fusion analysis

    Directory of Open Access Journals (Sweden)

    Trimpalis Philip

    2011-07-01

    Full Text Available Abstract Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions, to the development of new trypanosome therapeutic drugs.

  7. Gene function analysis by artificial microRNAs in Physcomitrella patens.

    KAUST Repository

    Khraiwesh, Basel

    2011-01-01

    MicroRNAs (miRNAs) are ~21 nt long small RNAs transcribed from endogenous MIR genes which form precursor RNAs with a characteristic hairpin structure. miRNAs control the expression of cognate target genes by binding to reverse complementary sequences resulting in cleavage or translational inhibition of the target RNA. Artificial miRNAs (amiRNAs) can be generated by exchanging the miRNA/miRNA sequence of endogenous MIR precursor genes, while maintaining the general pattern of matches and mismatches in the foldback. Thus, for functional gene analysis amiRNAs can be designed to target any gene of interest. During the last decade the moss Physcomitrella patens emerged as a model plant for functional gene analysis based on its unique ability to integrate DNA into the nuclear genome by homologous recombination which allows for the generation of targeted gene knockout mutants. In addition to this, we developed a protocol to express amiRNAs in P. patens that has particular advantages over the generation of knockout mutants and might be used to speed up reverse genetics approaches in this model species.

  8. Expression and functional analysis of apoptosis-related gene ...

    African Journals Online (AJOL)

    Administrator

    2011-10-19

    Oct 19, 2011 ... conducted a molecular cloning and functional analysis to study a specific silkworm gene BmICAD related to apoptosis. .... blocking with 5% non-fat milk for 1 h at room temperature, the .... requirements for all next experiments.

  9. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.

  10. Weighted functional linear regression models for gene-based association analysis.

    Science.gov (United States)

    Belonogova, Nadezhda M; Svishcheva, Gulnara R; Wilson, James F; Campbell, Harry; Axenovich, Tatiana I

    2018-01-01

    Functional linear regression models are effectively used in gene-based association analysis of complex traits. These models combine information about individual genetic variants, taking into account their positions and reducing the influence of noise and/or observation errors. To increase the power of methods, where several differently informative components are combined, weights are introduced to give the advantage to more informative components. Allele-specific weights have been introduced to collapsing and kernel-based approaches to gene-based association analysis. Here we have for the first time introduced weights to functional linear regression models adapted for both independent and family samples. Using data simulated on the basis of GAW17 genotypes and weights defined by allele frequencies via the beta distribution, we demonstrated that type I errors correspond to declared values and that increasing the weights of causal variants allows the power of functional linear models to be increased. We applied the new method to real data on blood pressure from the ORCADES sample. Five of the six known genes with P models. Moreover, we found an association between diastolic blood pressure and the VMP1 gene (P = 8.18×10-6), when we used a weighted functional model. For this gene, the unweighted functional and weighted kernel-based models had P = 0.004 and 0.006, respectively. The new method has been implemented in the program package FREGAT, which is freely available at https://cran.r-project.org/web/packages/FREGAT/index.html.

  11. A multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors for functional gene analysis.

    Science.gov (United States)

    Weber, Kristoffer; Bartsch, Udo; Stocking, Carol; Fehse, Boris

    2008-04-01

    Functional gene analysis requires the possibility of overexpression, as well as downregulation of one, or ideally several, potentially interacting genes. Lentiviral vectors are well suited for this purpose as they ensure stable expression of complementary DNAs (cDNAs), as well as short-hairpin RNAs (shRNAs), and can efficiently transduce a wide spectrum of cell targets when packaged within the coat proteins of other viruses. Here we introduce a multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors designed according to the "building blocks" principle. Using a wide spectrum of different fluorescent markers, including drug-selectable enhanced green fluorescent protein (eGFP)- and dTomato-blasticidin-S resistance fusion proteins, LeGO vectors allow simultaneous analysis of multiple genes and shRNAs of interest within single, easily identifiable cells. Furthermore, each functional module is flanked by unique cloning sites, ensuring flexibility and individual optimization. The efficacy of these vectors for analyzing multiple genes in a single cell was demonstrated in several different cell types, including hematopoietic, endothelial, and neural stem and progenitor cells, as well as hepatocytes. LeGO vectors thus represent a valuable tool for investigating gene networks using conditional ectopic expression and knock-down approaches simultaneously.

  12. Large-scale gene function analysis with the PANTHER classification system.

    Science.gov (United States)

    Mi, Huaiyu; Muruganujan, Anushya; Casagrande, John T; Thomas, Paul D

    2013-08-01

    The PANTHER (protein annotation through evolutionary relationship) classification system (http://www.pantherdb.org/) is a comprehensive system that combines gene function, ontology, pathways and statistical analysis tools that enable biologists to analyze large-scale, genome-wide data from sequencing, proteomics or gene expression experiments. The system is built with 82 complete genomes organized into gene families and subfamilies, and their evolutionary relationships are captured in phylogenetic trees, multiple sequence alignments and statistical models (hidden Markov models or HMMs). Genes are classified according to their function in several different ways: families and subfamilies are annotated with ontology terms (Gene Ontology (GO) and PANTHER protein class), and sequences are assigned to PANTHER pathways. The PANTHER website includes a suite of tools that enable users to browse and query gene functions, and to analyze large-scale experimental data with a number of statistical tests. It is widely used by bench scientists, bioinformaticians, computer scientists and systems biologists. In the 2013 release of PANTHER (v.8.0), in addition to an update of the data content, we redesigned the website interface to improve both user experience and the system's analytical capability. This protocol provides a detailed description of how to analyze genome-wide experimental data with the PANTHER classification system.

  13. FunGene: the functional gene pipeline and repository.

    Science.gov (United States)

    Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

    2013-01-01

    Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.

  14. FunGene: the Functional Gene Pipeline and Repository

    Directory of Open Access Journals (Sweden)

    Jordan A. Fish

    2013-10-01

    Full Text Available Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer.While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/ offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.

  15. ERC analysis: web-based inference of gene function via evolutionary rate covariation.

    Science.gov (United States)

    Wolfe, Nicholas W; Clark, Nathan L

    2015-12-01

    The recent explosion of comparative genomics data presents an unprecedented opportunity to construct gene networks via the evolutionary rate covariation (ERC) signature. ERC is used to identify genes that experienced similar evolutionary histories, and thereby draws functional associations between them. The ERC Analysis website allows researchers to exploit genome-wide datasets to infer novel genes in any biological function and to explore deep evolutionary connections between distinct pathways and complexes. The website provides five analytical methods, graphical output, statistical support and access to an increasing number of taxonomic groups. Analyses and data at http://csb.pitt.edu/erc_analysis/ nclark@pitt.edu. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  16. Phylogenetic Analysis, Lineage-Specific Expansion and Functional Divergence of seed dormancy 4-Like Genes in Plants.

    Directory of Open Access Journals (Sweden)

    Saminathan Subburaj

    Full Text Available The rice gene seed dormancy 4 (OsSdr4 functions in seed dormancy and is a major factor associated with pre-harvest sprouting (PHS. Although previous studies of this protein family were reported for rice and other species, knowledge of the evolution of genes homologous to OsSdr4 in plants remains inadequate. Fifty four Sdr4-like (hereafter designated Sdr4L genes were identified in nine plant lineages including 36 species. Phylogenetic analysis placed these genes in eight subfamilies (I-VIII. Genes from the same lineage clustered together, supported by analysis of conserved motifs and exon-intron patterns. Segmental duplications were present in both dicot and monocot clusters, while tandemly duplicated genes occurred only in monocot clusters indicating that both tandem and segmental duplications contributed to expansion of the grass I and II subfamilies. Estimation of the approximate ages of the duplication events indicated that ancestral Sdr4 genes evolved from a common angiosperm ancestor, about 160 million years ago (MYA. Moreover, diversification of Sdr4L genes in mono and dicot plants was mainly associated with genome-wide duplication and speciation events. Functional divergence was observed in all subfamily pairs, except IV/VIIIa. Further analysis indicated that functional constraints between subfamily pairs I/II, I/VIIIb, II/VI, II/VIIIb, II/IV, and VI/VIIIb were statistically significant. Site and branch-site model analyses of positive selection suggested that these genes were under strong adaptive selection pressure. Critical amino acids detected for both functional divergence and positive selection were mostly located in the loops, pointing to functional importance of these regions in this protein family. In addition, differential expression studies by transcriptome atlas of 11 Sdr4L genes showed that the duplicated genes may have undergone divergence in expression between plant species. Our findings showed that Sdr4L genes are

  17. Genome-wide analysis of the expansin gene superfamily reveals grapevine-specific structural and functional characteristics.

    Directory of Open Access Journals (Sweden)

    Silvia Dal Santo

    Full Text Available BACKGROUND: Expansins are proteins that loosen plant cell walls in a pH-dependent manner, probably by increasing the relative movement among polymers thus causing irreversible expansion. The expansin superfamily (EXP comprises four distinct families: expansin A (EXPA, expansin B (EXPB, expansin-like A (EXLA and expansin-like B (EXLB. There is experimental evidence that EXPA and EXPB proteins are required for cell expansion and developmental processes involving cell wall modification, whereas the exact functions of EXLA and EXLB remain unclear. The complete grapevine (Vitis vinifera genome sequence has allowed the characterization of many gene families, but an exhaustive genome-wide analysis of expansin gene expression has not been attempted thus far. METHODOLOGY/PRINCIPAL FINDINGS: We identified 29 EXP superfamily genes in the grapevine genome, representing all four EXP families. Members of the same EXP family shared the same exon-intron structure, and phylogenetic analysis confirmed a closer relationship between EXP genes from woody species, i.e. grapevine and poplar (Populus trichocarpa, compared to those from Arabidopsis thaliana and rice (Oryza sativa. We also identified grapevine-specific duplication events involving the EXLB family. Global gene expression analysis confirmed a strong correlation among EXP genes expressed in mature and green/vegetative samples, respectively, as reported for other gene families in the recently-published grapevine gene expression atlas. We also observed the specific co-expression of EXLB genes in woody organs, and the involvement of certain grapevine EXP genes in berry development and post-harvest withering. CONCLUSION: Our comprehensive analysis of the grapevine EXP superfamily confirmed and extended current knowledge about the structural and functional characteristics of this gene family, and also identified properties that are currently unique to grapevine expansin genes. Our data provide a model for the

  18. Genome-wide analysis of the Dof transcription factor gene family reveals soybean-specific duplicable and functional characteristics.

    Directory of Open Access Journals (Sweden)

    Yong Guo

    Full Text Available The Dof domain protein family is a classic plant-specific zinc-finger transcription factor family involved in a variety of biological processes. There is great diversity in the number of Dof genes in different plants. However, there are only very limited reports on the characterization of Dof transcription factors in soybean (Glycine max. In the present study, 78 putative Dof genes were identified from the whole-genome sequence of soybean. The predicted GmDof genes were non-randomly distributed within and across 19 out of 20 chromosomes and 97.4% (38 pairs were preferentially retained duplicate paralogous genes located in duplicated regions of the genome. Soybean-specific segmental duplications contributed significantly to the expansion of the soybean Dof gene family. These Dof proteins were phylogenetically clustered into nine distinct subgroups among which the gene structure and motif compositions were considerably conserved. Comparative phylogenetic analysis of these Dof proteins revealed four major groups, similar to those reported for Arabidopsis and rice. Most of the GmDofs showed specific expression patterns based on RNA-seq data analyses. The expression patterns of some duplicate genes were partially redundant while others showed functional diversity, suggesting the occurrence of sub-functionalization during subsequent evolution. Comprehensive expression profile analysis also provided insights into the soybean-specific functional divergence among members of the Dof gene family. Cis-regulatory element analysis of these GmDof genes suggested diverse functions associated with different processes. Taken together, our results provide useful information for the functional characterization of soybean Dof genes by combining phylogenetic analysis with global gene-expression profiling.

  19. Genetic interaction analysis of point mutations enables interrogation of gene function at a residue-level resolution

    Science.gov (United States)

    Braberg, Hannes; Moehle, Erica A.; Shales, Michael; Guthrie, Christine; Krogan, Nevan J.

    2014-01-01

    We have achieved a residue-level resolution of genetic interaction mapping – a technique that measures how the function of one gene is affected by the alteration of a second gene – by analyzing point mutations. Here, we describe how to interpret point mutant genetic interactions, and outline key applications for the approach, including interrogation of protein interaction interfaces and active sites, and examination of post-translational modifications. Genetic interaction analysis has proven effective for characterizing cellular processes; however, to date, systematic high-throughput genetic interaction screens have relied on gene deletions or knockdowns, which limits the resolution of gene function analysis and poses problems for multifunctional genes. Our point mutant approach addresses these issues, and further provides a tool for in vivo structure-function analysis that complements traditional biophysical methods. We also discuss the potential for genetic interaction mapping of point mutations in human cells and its application to personalized medicine. PMID:24842270

  20. Elucidating gene function and function evolution through comparison of co-expression networks in plants

    Directory of Open Access Journals (Sweden)

    Marek eMutwil

    2014-08-01

    Full Text Available The analysis of gene expression data has shown that transcriptionally coordinated (co-expressed genes are often functionally related, enabling scientists to use expression data in gene function prediction. This Focused Review discusses our original paper (Large-scale co-expression approach to dissect secondary cell wall formation across plant species, Frontiers in Plant Science 2:23. In this paper we applied cross-species analysis to co-expression networks of genes involved in cellulose biosynthesis. We show that the co-expression networks from different species are highly similar, indicating that whole biological pathways are conserved across species. This finding has two important implications. First, the analysis can transfer gene function annotation from well-studied plants, such as Arabidopsis, to other, uncharacterized plant species. As the analysis finds genes that have similar sequence and similar expression pattern across different organisms, functionally equivalent genes can be identified. Second, since co-expression analyses are often noisy, a comparative analysis should have higher performance, as parts of co-expression networks that are conserved are more likely to be functionally relevant. In this Focused Review, we outline the comparative analysis done in the original paper and comment on the recent advances and approaches that allow comparative analyses of co-function networks. We hypothesize that, in comparison to simple co-expression analysis, comparative analysis would yield more accurate gene function predictions. Finally, by combining comparative analysis with genomic information of green plants, we propose a possible composition of cellulose biosynthesis machinery during earlier stages of plant evolution.

  1. Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

    Science.gov (United States)

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2015-01-01

    Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and

  2. Analysis of breast cancer metastasis candidate genes from next generation-sequencing via systematic functional genomics

    DEFF Research Database (Denmark)

    Blomstrøm, Monica Marie

    2016-01-01

    several growth modulators and invasion modulators were identified and independently validated. These candidates revealed a group of genes with metastasis-related functions in vitro that are involved in RNA-related processes, such as RNA-processing. Moreover, a general feature was that proliferation......) and non-CSCs. The main goal of this project was to functionally characterize a set of candidate genes recovered from next-generation sequencing analysis for their role in breast cancer metastasis formation. The starting gene set comprised 104 gene variants; i.e. 57 wildtype and 47 mutated variants. During...

  3. Development of resources for the analysis of gene function in Pucciniomycotina red yeasts.

    Science.gov (United States)

    Ianiri, Giuseppe; Wright, Sandra A I; Castoria, Raffaello; Idnurm, Alexander

    2011-07-01

    The Pucciniomycotina is an important subphylum of basidiomycete fungi but with limited tools to analyze gene functions. Transformation protocols were established for a Sporobolomyces species (strain IAM 13481), the first Pucciniomycotina species with a completed draft genome sequence, to enable assessment of gene function through phenotypic characterization of mutant strains. Transformation markers were the URA3 and URA5 genes that enable selection and counter-selection based on uracil auxotrophy and resistance to 5-fluoroorotic acid. The wild type copies of these genes were cloned into plasmids that were used for transformation of Sporobolomyces sp. by both biolistic and Agrobacterium-mediated approaches. These resources have been deposited to be available from the Fungal Genetics Stock Center. To show that these techniques could be used to elucidate gene functions, the LEU1 gene was targeted for specific homologous replacement, and also demonstrating that this gene is required for the biosynthesis of leucine in basidiomycete fungi. T-DNA insertional mutants were isolated and further characterized, revealing insertions in genes that encode the homologs of Chs7, Erg3, Kre6, Kex1, Pik1, Sad1, Ssu1 and Tlg1. Phenotypic analysis of these mutants reveals both conserved and divergent functions compared with other fungi. Some of these strains exhibit reduced resistance to detergents, the antifungal agent fluconazole or sodium sulfite, or lower recovery from heat stress. While there are current experimental limitations for Sporobolomyces sp. such as the lack of Mendelian genetics for conventional mating, these findings demonstrate the facile nature of at least one Pucciniomycotina species for genetic manipulation and the potential to develop these organisms into new models for understanding gene function and evolution in the fungi. Copyright © 2011 Elsevier Inc. All rights reserved.

  4. Genome-Wide Analysis of Soybean LATERAL ORGAN BOUNDARIES Domain-Containing Genes: A Functional Investigation of GmLBD12

    Directory of Open Access Journals (Sweden)

    Hui Yang

    2017-03-01

    Full Text Available Plant-specific ( genes play critical roles in various plant growth and development processes. However, the number and characteristics of genes in soybean [ (L. Merr.] remain unknown. Here, we identified 90 homologous genes in the soybean genome that phylogenetically clustered into two classes (I and II. The majority of the genes were evenly distributed across all 20 soybean chromosomes, and 77 (81.11% of them were detected in segmental duplicated regions. Furthermore, the exon–intron organization and motif composition for each were analyzed. A close phylogenetic relationship was identified between the soybean genes and 41 previously reported genes of different plants in the same group, providing insights into their putative functions. Expression analysis indicated that more than half of the genes were expressed, with the two gene classes showing differential tissue expression characteristics; in addition, they were differentially induced by biotic and abiotic stresses. To further explore the functions of genes in soybean, was selected for functional characterization. GmLBD12 was mainly localized to the nucleus and showed high expression in root and seed tissues. Overexpressing in (L. Heynh resulted in increases in lateral root (LR number and plant height. Quantitative real-time polymerase chain reaction (qRT-PCR analysis demonstrated that was induced by drought, salt, cold, indole acetic acid (IAA, abscisic acid (ABA, and salicylic acid SA treatments. This study provides the first comprehensive analysis of the soybean gene family and a valuable foundation for future functional studies of genes.

  5. Analysis of the functional gene structure and metabolic potential of microbial community in high arsenic groundwater.

    Science.gov (United States)

    Li, Ping; Jiang, Zhou; Wang, Yanhong; Deng, Ye; Van Nostrand, Joy D; Yuan, Tong; Liu, Han; Wei, Dazhun; Zhou, Jizhong

    2017-10-15

    Microbial functional potential in high arsenic (As) groundwater ecosystems remains largely unknown. In this study, the microbial community functional composition of nineteen groundwater samples was investigated using a functional gene array (GeoChip 5.0). Samples were divided into low and high As groups based on the clustering analysis of geochemical parameters and microbial functional structures. The results showed that As related genes (arsC, arrA), sulfate related genes (dsrA and dsrB), nitrogen cycling related genes (ureC, amoA, and hzo) and methanogen genes (mcrA, hdrB) in groundwater samples were correlated with As, SO 4 2- , NH 4 + or CH 4 concentrations, respectively. Canonical correspondence analysis (CCA) results indicated that some geochemical parameters including As, total organic content, SO 4 2- , NH 4 + , oxidation-reduction potential (ORP) and pH were important factors shaping the functional microbial community structures. Alkaline and reducing conditions with relatively low SO 4 2- , ORP, and high NH 4 + , as well as SO 4 2- and Fe reduction and ammonification involved in microbially-mediated geochemical processes could be associated with As enrichment in groundwater. This study provides an overall picture of functional microbial communities in high As groundwater aquifers, and also provides insights into the critical role of microorganisms in As biogeochemical cycling. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. Characterization and functional analysis of Calmodulin and Calmodulin-like genes in Fragaria vesca

    Directory of Open Access Journals (Sweden)

    Kai Zhang

    2016-12-01

    Full Text Available Calcium is a universal messenger that is involved in the modulation of diverse developmental and adaptive processes in response to various stimuli. Calmodulin (CaM and calmodulin-like (CML proteins are major calcium sensors in all eukaryotes, and they have been extensively investigated for many years in plants and animals. However, little is known about CaMs and CMLs in woodland strawberry (Fragaria vesca. In this study, we performed a genome-wide analysis of the strawberry genome and identified 4 CaM and 36 CML genes. Bioinformatics analyses, including gene structure, phylogenetic tree, synteny and three-dimensional model assessments, revealed the conservation and divergence of FvCaMs and FvCMLs, thus providing insight regarding their functions. In addition, the transcript abundance of four FvCaM genes and the four most related FvCML genes were examined in different tissues and in response to multiple stress and hormone treatments. Moreover, we investigated the subcellular localization of several FvCaMs and FvCMLs, revealing their potential interactions based on the localizations and potential functions. Furthermore, overexpression of five FvCaM and FvCML genes could not induce a hypersensitive response, but four of the five genes could increase resistance to Agrobacterium tumefaciens in Nicotiana benthamiana leaves. This study provides evidence for the biological roles of FvCaM and CML genes, and the results lay the foundation for future functional studies of these genes.

  7. Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eQTL hotspots associated with functional gene modules.

    Science.gov (United States)

    Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P

    2013-03-21

    Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group

  8. NetGen: a novel network-based probabilistic generative model for gene set functional enrichment analysis.

    Science.gov (United States)

    Sun, Duanchen; Liu, Yinliang; Zhang, Xiang-Sun; Wu, Ling-Yun

    2017-09-21

    High-throughput experimental techniques have been dramatically improved and widely applied in the past decades. However, biological interpretation of the high-throughput experimental results, such as differential expression gene sets derived from microarray or RNA-seq experiments, is still a challenging task. Gene Ontology (GO) is commonly used in the functional enrichment studies. The GO terms identified via current functional enrichment analysis tools often contain direct parent or descendant terms in the GO hierarchical structure. Highly redundant terms make users difficult to analyze the underlying biological processes. In this paper, a novel network-based probabilistic generative model, NetGen, was proposed to perform the functional enrichment analysis. An additional protein-protein interaction (PPI) network was explicitly used to assist the identification of significantly enriched GO terms. NetGen achieved a superior performance than the existing methods in the simulation studies. The effectiveness of NetGen was explored further on four real datasets. Notably, several GO terms which were not directly linked with the active gene list for each disease were identified. These terms were closely related to the corresponding diseases when accessed to the curated literatures. NetGen has been implemented in the R package CopTea publicly available at GitHub ( http://github.com/wulingyun/CopTea/ ). Our procedure leads to a more reasonable and interpretable result of the functional enrichment analysis. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods, and can help to explore the underlying pathogenesis of complex diseases.

  9. GeoChip-based analysis of microbial functional gene diversity in a landfill leachate-contaminated aquifer

    Science.gov (United States)

    Lu, Zhenmei; He, Zhili; Parisi, Victoria A.; Kang, Sanghoon; Deng, Ye; Van Nostrand, Joy D.; Masoner, Jason R.; Cozzarelli, Isabelle M.; Suflita, Joseph M.; Zhou, Jizhong

    2012-01-01

    The functional gene diversity and structure of microbial communities in a shallow landfill leachate-contaminated aquifer were assessed using a comprehensive functional gene array (GeoChip 3.0). Water samples were obtained from eight wells at the same aquifer depth immediately below a municipal landfill or along the predominant downgradient groundwater flowpath. Functional gene richness and diversity immediately below the landfill and the closest well were considerably lower than those in downgradient wells. Mantel tests and canonical correspondence analysis (CCA) suggested that various geochemical parameters had a significant impact on the subsurface microbial community structure. That is, leachate from the unlined landfill impacted the diversity, composition, structure, and functional potential of groundwater microbial communities as a function of groundwater pH, and concentrations of sulfate, ammonia, and dissolved organic carbon (DOC). Historical geochemical records indicate that all sampled wells chronically received leachate, and the increase in microbial diversity as a function of distance from the landfill is consistent with mitigation of the impact of leachate on the groundwater system by natural attenuation mechanisms.

  10. Iron homeostasis in Arabidopsis thaliana: transcriptomic analyses reveal novel FIT-regulated genes, iron deficiency marker genes and functional gene networks.

    Science.gov (United States)

    Mai, Hans-Jörg; Pateyron, Stéphanie; Bauer, Petra

    2016-10-03

    FIT (FER-LIKE IRON DEFICIENCY-INDUCED TRANSCRIPTION FACTOR) is the central regulator of iron uptake in Arabidopsis thaliana roots. We performed transcriptome analyses of six day-old seedlings and roots of six week-old plants using wild type, a fit knock-out mutant and a FIT over-expression line grown under iron-sufficient or iron-deficient conditions. We compared genes regulated in a FIT-dependent manner depending on the developmental stage of the plants. We assembled a high likelihood dataset which we used to perform co-expression and functional analysis of the most stably iron deficiency-induced genes. 448 genes were found FIT-regulated. Out of these, 34 genes were robustly FIT-regulated in root and seedling samples and included 13 novel FIT-dependent genes. Three hundred thirty-one genes showed differential regulation in response to the presence and absence of FIT only in the root samples, while this was the case for 83 genes in the seedling samples. We assembled a virtual dataset of iron-regulated genes based on a total of 14 transcriptomic analyses of iron-deficient and iron-sufficient wild-type plants to pinpoint the best marker genes for iron deficiency and analyzed this dataset in depth. Co-expression analysis of this dataset revealed 13 distinct regulons part of which predominantly contained functionally related genes. We could enlarge the list of FIT-dependent genes and discriminate between genes that are robustly FIT-regulated in roots and seedlings or only in one of those. FIT-regulated genes were mostly induced, few of them were repressed by FIT. With the analysis of a virtual dataset we could filter out and pinpoint new candidates among the most reliable marker genes for iron deficiency. Moreover, co-expression and functional analysis of this virtual dataset revealed iron deficiency-induced and functionally distinct regulons.

  11. Integrated analysis of microRNA and gene expression profiles reveals a functional regulatory module associated with liver fibrosis.

    Science.gov (United States)

    Chen, Wei; Zhao, Wenshan; Yang, Aiting; Xu, Anjian; Wang, Huan; Cong, Min; Liu, Tianhui; Wang, Ping; You, Hong

    2017-12-15

    Liver fibrosis, characterized with the excessive accumulation of extracellular matrix (ECM) proteins, represents the final common pathway of chronic liver inflammation. Ever-increasing evidence indicates microRNAs (miRNAs) dysregulation has important implications in the different stages of liver fibrosis. However, our knowledge of miRNA-gene regulation details pertaining to such disease remains unclear. The publicly available Gene Expression Omnibus (GEO) datasets of patients suffered from cirrhosis were extracted for integrated analysis. Differentially expressed miRNAs (DEMs) and genes (DEGs) were identified using GEO2R web tool. Putative target gene prediction of DEMs was carried out using the intersection of five major algorithms: DIANA-microT, TargetScan, miRanda, PICTAR5 and miRWalk. Functional miRNA-gene regulatory network (FMGRN) was constructed based on the computational target predictions at the sequence level and the inverse expression relationships between DEMs and DEGs. DAVID web server was selected to perform KEGG pathway enrichment analysis. Functional miRNA-gene regulatory module was generated based on the biological interpretation. Internal connections among genes in liver fibrosis-related module were determined using String database. MiRNA-gene regulatory modules related to liver fibrosis were experimentally verified in recombinant human TGFβ1 stimulated and specific miRNA inhibitor treated LX-2 cells. We totally identified 85 and 923 dysregulated miRNAs and genes in liver cirrhosis biopsy samples compared to their normal controls. All evident miRNA-gene pairs were identified and assembled into FMGRN which consisted of 990 regulations between 51 miRNAs and 275 genes, forming two big sub-networks that were defined as down-network and up-network, respectively. KEGG pathway enrichment analysis revealed that up-network was prominently involved in several KEGG pathways, in which "Focal adhesion", "PI3K-Akt signaling pathway" and "ECM

  12. Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data

    Directory of Open Access Journals (Sweden)

    Merchant Sabeeha S

    2011-07-01

    Full Text Available Abstract Background Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. Description The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of

  13. Gene2Function: An Integrated Online Resource for Gene Function Discovery

    Directory of Open Access Journals (Sweden)

    Yanhui Hu

    2017-08-01

    Full Text Available One of the most powerful ways to develop hypotheses regarding the biological functions of conserved genes in a given species, such as humans, is to first look at what is known about their function in another species. Model organism databases and other resources are rich with functional information but difficult to mine. Gene2Function addresses a broad need by integrating information about conserved genes in a single online resource.

  14. Functional network analysis of genes differentially expressed during xylogenesis in soc1ful woody Arabidopsis plants.

    Science.gov (United States)

    Davin, Nicolas; Edger, Patrick P; Hefer, Charles A; Mizrachi, Eshchar; Schuetz, Mathias; Smets, Erik; Myburg, Alexander A; Douglas, Carl J; Schranz, Michael E; Lens, Frederic

    2016-06-01

    Many plant genes are known to be involved in the development of cambium and wood, but how the expression and functional interaction of these genes determine the unique biology of wood remains largely unknown. We used the soc1ful loss of function mutant - the woodiest genotype known in the otherwise herbaceous model plant Arabidopsis - to investigate the expression and interactions of genes involved in secondary growth (wood formation). Detailed anatomical observations of the stem in combination with mRNA sequencing were used to assess transcriptome remodeling during xylogenesis in wild-type and woody soc1ful plants. To interpret the transcriptome changes, we constructed functional gene association networks of differentially expressed genes using the STRING database. This analysis revealed functionally enriched gene association hubs that are differentially expressed in herbaceous and woody tissues. In particular, we observed the differential expression of genes related to mechanical stress and jasmonate biosynthesis/signaling during wood formation in soc1ful plants that may be an effect of greater tension within woody tissues. Our results suggest that habit shifts from herbaceous to woody life forms observed in many angiosperm lineages could have evolved convergently by genetic changes that modulate the gene expression and interaction network, and thereby redeploy the conserved wood developmental program. © 2016 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.

  15. Reveal genes functionally associated with ACADS by a network study.

    Science.gov (United States)

    Chen, Yulong; Su, Zhiguang

    2015-09-15

    Establishing a systematic network is aimed at finding essential human gene-gene/gene-disease pathway by means of network inter-connecting patterns and functional annotation analysis. In the present study, we have analyzed functional gene interactions of short-chain acyl-coenzyme A dehydrogenase gene (ACADS). ACADS plays a vital role in free fatty acid β-oxidation and regulates energy homeostasis. Modules of highly inter-connected genes in disease-specific ACADS network are derived by integrating gene function and protein interaction data. Among the 8 genes in ACADS web retrieved from both STRING and GeneMANIA, ACADS is effectively conjoined with 4 genes including HAHDA, HADHB, ECHS1 and ACAT1. The functional analysis is done via ontological briefing and candidate disease identification. We observed that the highly efficient-interlinked genes connected with ACADS are HAHDA, HADHB, ECHS1 and ACAT1. Interestingly, the ontological aspect of genes in the ACADS network reveals that ACADS, HAHDA and HADHB play equally vital roles in fatty acid metabolism. The gene ACAT1 together with ACADS indulges in ketone metabolism. Our computational gene web analysis also predicts potential candidate disease recognition, thus indicating the involvement of ACADS, HAHDA, HADHB, ECHS1 and ACAT1 not only with lipid metabolism but also with infant death syndrome, skeletal myopathy, acute hepatic encephalopathy, Reye-like syndrome, episodic ketosis, and metabolic acidosis. The current study presents a comprehensible layout of ACADS network, its functional strategies and candidate disease approach associated with ACADS network. Copyright © 2015 Elsevier B.V. All rights reserved.

  16. Analysis of mammalian gene function through broad based phenotypic screens across a consortium of mouse clinics

    Science.gov (United States)

    Adams, David J; Adams, Niels C; Adler, Thure; Aguilar-Pimentel, Antonio; Ali-Hadji, Dalila; Amann, Gregory; André, Philippe; Atkins, Sarah; Auburtin, Aurelie; Ayadi, Abdel; Becker, Julien; Becker, Lore; Bedu, Elodie; Bekeredjian, Raffi; Birling, Marie-Christine; Blake, Andrew; Bottomley, Joanna; Bowl, Mike; Brault, Véronique; Busch, Dirk H; Bussell, James N; Calzada-Wack, Julia; Cater, Heather; Champy, Marie-France; Charles, Philippe; Chevalier, Claire; Chiani, Francesco; Codner, Gemma F; Combe, Roy; Cox, Roger; Dalloneau, Emilie; Dierich, André; Di Fenza, Armida; Doe, Brendan; Duchon, Arnaud; Eickelberg, Oliver; Esapa, Chris T; El Fertak, Lahcen; Feigel, Tanja; Emelyanova, Irina; Estabel, Jeanne; Favor, Jack; Flenniken, Ann; Gambadoro, Alessia; Garrett, Lilian; Gates, Hilary; Gerdin, Anna-Karin; Gkoutos, George; Greenaway, Simon; Glasl, Lisa; Goetz, Patrice; Da Cruz, Isabelle Goncalves; Götz, Alexander; Graw, Jochen; Guimond, Alain; Hans, Wolfgang; Hicks, Geoff; Hölter, Sabine M; Höfler, Heinz; Hancock, John M; Hoehndorf, Robert; Hough, Tertius; Houghton, Richard; Hurt, Anja; Ivandic, Boris; Jacobs, Hughes; Jacquot, Sylvie; Jones, Nora; Karp, Natasha A; Katus, Hugo A; Kitchen, Sharon; Klein-Rodewald, Tanja; Klingenspor, Martin; Klopstock, Thomas; Lalanne, Valerie; Leblanc, Sophie; Lengger, Christoph; le Marchand, Elise; Ludwig, Tonia; Lux, Aline; McKerlie, Colin; Maier, Holger; Mandel, Jean-Louis; Marschall, Susan; Mark, Manuel; Melvin, David G; Meziane, Hamid; Micklich, Kateryna; Mittelhauser, Christophe; Monassier, Laurent; Moulaert, David; Muller, Stéphanie; Naton, Beatrix; Neff, Frauke; Nolan, Patrick M; Nutter, Lauryl MJ; Ollert, Markus; Pavlovic, Guillaume; Pellegata, Natalia S; Peter, Emilie; Petit-Demoulière, Benoit; Pickard, Amanda; Podrini, Christine; Potter, Paul; Pouilly, Laurent; Puk, Oliver; Richardson, David; Rousseau, Stephane; Quintanilla-Fend, Leticia; Quwailid, Mohamed M; Racz, Ildiko; Rathkolb, Birgit; Riet, Fabrice; Rossant, Janet; Roux, Michel; Rozman, Jan; Ryder, Ed; Salisbury, Jennifer; Santos, Luis; Schäble, Karl-Heinz; Schiller, Evelyn; Schrewe, Anja; Schulz, Holger; Steinkamp, Ralf; Simon, Michelle; Stewart, Michelle; Stöger, Claudia; Stöger, Tobias; Sun, Minxuan; Sunter, David; Teboul, Lydia; Tilly, Isabelle; Tocchini-Valentini, Glauco P; Tost, Monica; Treise, Irina; Vasseur, Laurent; Velot, Emilie; Vogt-Weisenhorn, Daniela; Wagner, Christelle; Walling, Alison; Weber, Bruno; Wendling, Olivia; Westerberg, Henrik; Willershäuser, Monja; Wolf, Eckhard; Wolter, Anne; Wood, Joe; Wurst, Wolfgang; Yildirim, Ali Önder; Zeh, Ramona; Zimmer, Andreas; Zimprich, Annemarie

    2015-01-01

    The function of the majority of genes in the mouse and human genomes remains unknown. The mouse ES cell knockout resource provides a basis for characterisation of relationships between gene and phenotype. The EUMODIC consortium developed and validated robust methodologies for broad-based phenotyping of knockouts through a pipeline comprising 20 disease-orientated platforms. We developed novel statistical methods for pipeline design and data analysis aimed at detecting reproducible phenotypes with high power. We acquired phenotype data from 449 mutant alleles, representing 320 unique genes, of which half had no prior functional annotation. We captured data from over 27,000 mice finding that 83% of the mutant lines are phenodeviant, with 65% demonstrating pleiotropy. Surprisingly, we found significant differences in phenotype annotation according to zygosity. Novel phenotypes were uncovered for many genes with unknown function providing a powerful basis for hypothesis generation and further investigation in diverse systems. PMID:26214591

  17. Functional analysis of the cathepsin-like cysteine protease genes in adult Brugia malayi using RNA interference.

    Directory of Open Access Journals (Sweden)

    Louise Ford

    Full Text Available Cathepsin-like enzymes have been identified as potential targets for drug or vaccine development in many parasites, as their functions appear to be essential in a variety of important biological processes within the host, such as molting, cuticle remodeling, embryogenesis, feeding and immune evasion. Functional analysis of Caenorhabditis elegans cathepsin L (Ce-cpl-1 and cathepsin Z (Ce-cpz-1 has established that both genes are required for early embryogenesis, with Ce-cpl-1 having a role in regulating in part the processing of yolk proteins. Ce-cpz-1 also has an important role during molting.RNA interference assays have allowed us to verify whether the functions of the orthologous filarial genes in Brugia malayi adult female worms are similar. Treatment of B. malayi adult female worms with Bm-cpl-1, Bm-cpl-5, which belong to group Ia of the filarial cpl gene family, or Bm-cpz-1 dsRNA resulted in decreased numbers of secreted microfilariae in vitro. In addition, analysis of the intrauterine progeny of the Bm-cpl-5 or Bm-cpl Pro dsRNA- and siRNA-treated worms revealed a clear disruption in the process of embryogenesis resulting in structural abnormalities in embryos and a varied differential development of embryonic stages.Our studies suggest that these filarial cathepsin-like cysteine proteases are likely to be functional orthologs of the C. elegans genes. This functional conservation may thus allow for a more thorough investigation of their distinct functions and their development as potential drug targets.

  18. Sugarcane genes related to mitochondrial function

    Directory of Open Access Journals (Sweden)

    Fonseca Ghislaine V.

    2001-01-01

    Full Text Available Mitochondria function as metabolic powerhouses by generating energy through oxidative phosphorylation and have become the focus of renewed interest due to progress in understanding the subtleties of their biogenesis and the discovery of the important roles which these organelles play in senescence, cell death and the assembly of iron-sulfur (Fe/S centers. Using proteins from the yeast Saccharomyces cerevisiae, Homo sapiens and Arabidopsis thaliana we searched the sugarcane expressed sequence tag (SUCEST database for the presence of expressed sequence tags (ESTs with similarity to nuclear genes related to mitochondrial functions. Starting with 869 protein sequences, we searched for sugarcane EST counterparts to these proteins using the basic local alignment search tool TBLASTN similarity searching program run against 260,781 sugarcane ESTs contained in 81,223 clusters. We were able to recover 367 clusters likely to represent sugarcane orthologues of the corresponding genes from S. cerevisiae, H. sapiens and A. thaliana with E-value <= 10-10. Gene products belonging to all functional categories related to mitochondrial functions were found and this allowed us to produce an overview of the nuclear genes required for sugarcane mitochondrial biogenesis and function as well as providing a starting point for detailed analysis of sugarcane gene structure and physiology.

  19. Characteristics of functional enrichment and gene expression level of human putative transcriptional target genes.

    Science.gov (United States)

    Osato, Naoki

    2018-01-19

    Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional

  20. Transcriptome analysis by GeneTrail revealed regulation of functional categories in response to alterations of iron homeostasis in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Lenhof Hans-Peter

    2011-05-01

    Full Text Available Abstract Background High-throughput technologies have opened new avenues to study biological processes and pathways. The interpretation of the immense amount of data sets generated nowadays needs to be facilitated in order to enable biologists to identify complex gene networks and functional pathways. To cope with this task multiple computer-based programs have been developed. GeneTrail is a freely available online tool that screens comparative transcriptomic data for differentially regulated functional categories and biological pathways extracted from common data bases like KEGG, Gene Ontology (GO, TRANSPATH and TRANSFAC. Additionally, GeneTrail offers a feature that allows screening of individually defined biological categories that are relevant for the respective research topic. Results We have set up GeneTrail for the use of Arabidopsis thaliana. To test the functionality of this tool for plant analysis, we generated transcriptome data of root and leaf responses to Fe deficiency and the Arabidopsis metal homeostasis mutant nas4x-1. We performed Gene Set Enrichment Analysis (GSEA with eight meaningful pairwise comparisons of transcriptome data sets. We were able to uncover several functional pathways including metal homeostasis that were affected in our experimental situations. Representation of the differentially regulated functional categories in Venn diagrams uncovered regulatory networks at the level of whole functional pathways. Over-Representation Analysis (ORA of differentially regulated genes identified in pairwise comparisons revealed specific functional plant physiological categories as major targets upon Fe deficiency and in nas4x-1. Conclusion Here, we obtained supporting evidence, that the nas4x-1 mutant was defective in metal homeostasis. It was confirmed that nas4x-1 showed Fe deficiency in roots and signs of Fe deficiency and Fe sufficiency in leaves. Besides metal homeostasis, biotic stress, root carbohydrate, leaf

  1. The functional landscape of mouse gene expression

    Directory of Open Access Journals (Sweden)

    Zhang Wen

    2004-12-01

    Full Text Available Abstract Background Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has not been objectively tested nor compared with the related but distinct strategy of correlating gene co-expression as a means to predict gene function. Results We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes are in which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis. Conclusions We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.

  2. Gene function in early mouse embryonic stem cell differentiation

    Directory of Open Access Journals (Sweden)

    Campbell Pearl A

    2007-03-01

    Full Text Available Abstract Background Little is known about the genes that drive embryonic stem cell differentiation. However, such knowledge is necessary if we are to exploit the therapeutic potential of stem cells. To uncover the genetic determinants of mouse embryonic stem cell (mESC differentiation, we have generated and analyzed 11-point time-series of DNA microarray data for three biologically equivalent but genetically distinct mESC lines (R1, J1, and V6.5 undergoing undirected differentiation into embryoid bodies (EBs over a period of two weeks. Results We identified the initial 12 hour period as reflecting the early stages of mESC differentiation and studied probe sets showing consistent changes of gene expression in that period. Gene function analysis indicated significant up-regulation of genes related to regulation of transcription and mRNA splicing, and down-regulation of genes related to intracellular signaling. Phylogenetic analysis indicated that the genes showing the largest expression changes were more likely to have originated in metazoans. The probe sets with the most consistent gene changes in the three cell lines represented 24 down-regulated and 12 up-regulated genes, all with closely related human homologues. Whereas some of these genes are known to be involved in embryonic developmental processes (e.g. Klf4, Otx2, Smn1, Socs3, Tagln, Tdgf1, our analysis points to others (such as transcription factor Phf21a, extracellular matrix related Lama1 and Cyr61, or endoplasmic reticulum related Sc4mol and Scd2 that have not been previously related to mESC function. The majority of identified functions were related to transcriptional regulation, intracellular signaling, and cytoskeleton. Genes involved in other cellular functions important in ESC differentiation such as chromatin remodeling and transmembrane receptors were not observed in this set. Conclusion Our analysis profiles for the first time gene expression at a very early stage of m

  3. Text mining and network analysis to find functional associations of genes in high altitude diseases.

    Science.gov (United States)

    Bhasuran, Balu; Subramanian, Devika; Natarajan, Jeyakumar

    2018-05-02

    Travel to elevations above 2500 m is associated with the risk of developing one or more forms of acute altitude illness such as acute mountain sickness (AMS), high altitude cerebral edema (HACE) or high altitude pulmonary edema (HAPE). Our work aims to identify the functional association of genes involved in high altitude diseases. In this work we identified the gene networks responsible for high altitude diseases by using the principle of gene co-occurrence statistics from literature and network analysis. First, we mined the literature data from PubMed on high-altitude diseases, and extracted the co-occurring gene pairs. Next, based on their co-occurrence frequency, gene pairs were ranked. Finally, a gene association network was created using statistical measures to explore potential relationships. Network analysis results revealed that EPO, ACE, IL6 and TNF are the top five genes that were found to co-occur with 20 or more genes, while the association between EPAS1 and EGLN1 genes is strongly substantiated. The network constructed from this study proposes a large number of genes that work in-toto in high altitude conditions. Overall, the result provides a good reference for further study of the genetic relationships in high altitude diseases. Copyright © 2018 Elsevier Ltd. All rights reserved.

  4. Analysis of mammalian gene function through broad-based phenotypic screens across a consortium of mouse clinics.

    Science.gov (United States)

    de Angelis, Martin Hrabě; Nicholson, George; Selloum, Mohammed; White, Jacqui; Morgan, Hugh; Ramirez-Solis, Ramiro; Sorg, Tania; Wells, Sara; Fuchs, Helmut; Fray, Martin; Adams, David J; Adams, Niels C; Adler, Thure; Aguilar-Pimentel, Antonio; Ali-Hadji, Dalila; Amann, Gregory; André, Philippe; Atkins, Sarah; Auburtin, Aurelie; Ayadi, Abdel; Becker, Julien; Becker, Lore; Bedu, Elodie; Bekeredjian, Raffi; Birling, Marie-Christine; Blake, Andrew; Bottomley, Joanna; Bowl, Mike; Brault, Véronique; Busch, Dirk H; Bussell, James N; Calzada-Wack, Julia; Cater, Heather; Champy, Marie-France; Charles, Philippe; Chevalier, Claire; Chiani, Francesco; Codner, Gemma F; Combe, Roy; Cox, Roger; Dalloneau, Emilie; Dierich, André; Di Fenza, Armida; Doe, Brendan; Duchon, Arnaud; Eickelberg, Oliver; Esapa, Chris T; El Fertak, Lahcen; Feigel, Tanja; Emelyanova, Irina; Estabel, Jeanne; Favor, Jack; Flenniken, Ann; Gambadoro, Alessia; Garrett, Lilian; Gates, Hilary; Gerdin, Anna-Karin; Gkoutos, George; Greenaway, Simon; Glasl, Lisa; Goetz, Patrice; Da Cruz, Isabelle Goncalves; Götz, Alexander; Graw, Jochen; Guimond, Alain; Hans, Wolfgang; Hicks, Geoff; Hölter, Sabine M; Höfler, Heinz; Hancock, John M; Hoehndorf, Robert; Hough, Tertius; Houghton, Richard; Hurt, Anja; Ivandic, Boris; Jacobs, Hughes; Jacquot, Sylvie; Jones, Nora; Karp, Natasha A; Katus, Hugo A; Kitchen, Sharon; Klein-Rodewald, Tanja; Klingenspor, Martin; Klopstock, Thomas; Lalanne, Valerie; Leblanc, Sophie; Lengger, Christoph; le Marchand, Elise; Ludwig, Tonia; Lux, Aline; McKerlie, Colin; Maier, Holger; Mandel, Jean-Louis; Marschall, Susan; Mark, Manuel; Melvin, David G; Meziane, Hamid; Micklich, Kateryna; Mittelhauser, Christophe; Monassier, Laurent; Moulaert, David; Muller, Stéphanie; Naton, Beatrix; Neff, Frauke; Nolan, Patrick M; Nutter, Lauryl Mj; Ollert, Markus; Pavlovic, Guillaume; Pellegata, Natalia S; Peter, Emilie; Petit-Demoulière, Benoit; Pickard, Amanda; Podrini, Christine; Potter, Paul; Pouilly, Laurent; Puk, Oliver; Richardson, David; Rousseau, Stephane; Quintanilla-Fend, Leticia; Quwailid, Mohamed M; Racz, Ildiko; Rathkolb, Birgit; Riet, Fabrice; Rossant, Janet; Roux, Michel; Rozman, Jan; Ryder, Ed; Salisbury, Jennifer; Santos, Luis; Schäble, Karl-Heinz; Schiller, Evelyn; Schrewe, Anja; Schulz, Holger; Steinkamp, Ralf; Simon, Michelle; Stewart, Michelle; Stöger, Claudia; Stöger, Tobias; Sun, Minxuan; Sunter, David; Teboul, Lydia; Tilly, Isabelle; Tocchini-Valentini, Glauco P; Tost, Monica; Treise, Irina; Vasseur, Laurent; Velot, Emilie; Vogt-Weisenhorn, Daniela; Wagner, Christelle; Walling, Alison; Weber, Bruno; Wendling, Olivia; Westerberg, Henrik; Willershäuser, Monja; Wolf, Eckhard; Wolter, Anne; Wood, Joe; Wurst, Wolfgang; Yildirim, Ali Önder; Zeh, Ramona; Zimmer, Andreas; Zimprich, Annemarie; Holmes, Chris; Steel, Karen P; Herault, Yann; Gailus-Durner, Valérie; Mallon, Ann-Marie; Brown, Steve Dm

    2015-09-01

    The function of the majority of genes in the mouse and human genomes remains unknown. The mouse embryonic stem cell knockout resource provides a basis for the characterization of relationships between genes and phenotypes. The EUMODIC consortium developed and validated robust methodologies for the broad-based phenotyping of knockouts through a pipeline comprising 20 disease-oriented platforms. We developed new statistical methods for pipeline design and data analysis aimed at detecting reproducible phenotypes with high power. We acquired phenotype data from 449 mutant alleles, representing 320 unique genes, of which half had no previous functional annotation. We captured data from over 27,000 mice, finding that 83% of the mutant lines are phenodeviant, with 65% demonstrating pleiotropy. Surprisingly, we found significant differences in phenotype annotation according to zygosity. New phenotypes were uncovered for many genes with previously unknown function, providing a powerful basis for hypothesis generation and further investigation in diverse systems.

  5. Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

    Science.gov (United States)

    Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

    2018-05-09

    Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.

  6. fabp4 is central to eight obesity associated genes: a functional gene network-based polymorphic study.

    Science.gov (United States)

    Bag, Susmita; Ramaiah, Sudha; Anbarasu, Anand

    2015-01-07

    Network study on genes and proteins offers functional basics of the complexity of gene and protein, and its interacting partners. The gene fatty acid-binding protein 4 (fabp4) is found to be highly expressed in adipose tissue, and is one of the most abundant proteins in mature adipocytes. Our investigations on functional modules of fabp4 provide useful information on the functional genes interacting with fabp4, their biochemical properties and their regulatory functions. The present study shows that there are eight set of candidate genes: acp1, ext2, insr, lipe, ostf1, sncg, usp15, and vim that are strongly and functionally linked up with fabp4. Gene ontological analysis of network modules of fabp4 provides an explicit idea on the functional aspect of fabp4 and its interacting nodes. The hierarchal mapping on gene ontology indicates gene specific processes and functions as well as their compartmentalization in tissues. The fabp4 along with its interacting genes are involved in lipid metabolic activity and are integrated in multi-cellular processes of tissues and organs. They also have important protein/enzyme binding activity. Our study elucidated disease-associated nsSNP prediction for fabp4 and it is interesting to note that there are four rsID׳s (rs1051231, rs3204631, rs140925685 and rs141169989) with disease allelic variation (T104P, T126P, G27D and G90V respectively). On the whole, our gene network analysis presents a clear insight about the interactions and functions associated with fabp4 gene network. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Functional and bioinformatics analysis of an exopolysaccharide-related gene (epsN) from Lactobacillus kefiranofaciens ZW3.

    Science.gov (United States)

    Wang, Jingrui; Tang, Wei; Zheng, Yongna; Xing, Zhuqing; Wang, Yanping

    2016-09-01

    A novel lactic acid bacteria strain Lactobacillus kefiranofaciens ZW3 exhibited the characteristics of high production of exopolysaccharide (EPS). The epsN gene, located in the eps gene cluster of this strain, is associated with EPS biosynthesis. Bioinformatics analysis of this gene was performed. The conserved domain analysis showed that the EpsN protein contained MATE-Wzx-like domains. Then the epsN gene was amplified to construct the recombinant expression vector pMG36e-epsN. The results showed that the EPS yields of the recombinants were significantly improved. By determining the yields of EPS and intracellular polysaccharide, it was considered that epsN gene could play its Wzx flippase role in the EPS biosynthesis. This is the first time to prove the effect of EpsN on L. kefiranofaciens EPS biosynthesis and further prove its functional property.

  8. Analysis of functional importance of binding sites in the Drosophila gap gene network model.

    Science.gov (United States)

    Kozlov, Konstantin; Gursky, Vitaly V; Kulakovskiy, Ivan V; Dymova, Arina; Samsonova, Maria

    2015-01-01

    The statistical thermodynamics based approach provides a promising framework for construction of the genotype-phenotype map in many biological systems. Among important aspects of a good model connecting the DNA sequence information with that of a molecular phenotype (gene expression) is the selection of regulatory interactions and relevant transcription factor bindings sites. As the model may predict different levels of the functional importance of specific binding sites in different genomic and regulatory contexts, it is essential to formulate and study such models under different modeling assumptions. We elaborate a two-layer model for the Drosophila gap gene network and include in the model a combined set of transcription factor binding sites and concentration dependent regulatory interaction between gap genes hunchback and Kruppel. We show that the new variants of the model are more consistent in terms of gene expression predictions for various genetic constructs in comparison to previous work. We quantify the functional importance of binding sites by calculating their impact on gene expression in the model and calculate how these impacts correlate across all sites under different modeling assumptions. The assumption about the dual interaction between hb and Kr leads to the most consistent modeling results, but, on the other hand, may obscure existence of indirect interactions between binding sites in regulatory regions of distinct genes. The analysis confirms the previously formulated regulation concept of many weak binding sites working in concert. The model predicts a more or less uniform distribution of functionally important binding sites over the sets of experimentally characterized regulatory modules and other open chromatin domains.

  9. Regulatory network analysis of Epstein-Barr virus identifies functional modules and hub genes involved in infectious mononucleosis.

    Science.gov (United States)

    Poorebrahim, Mansour; Salarian, Ali; Najafi, Saeideh; Abazari, Mohammad Foad; Aleagha, Maryam Nouri; Dadras, Mohammad Nasr; Jazayeri, Seyed Mohammad; Ataei, Atousa; Poortahmasebi, Vahdat

    2017-05-01

    Epstein-Barr virus (EBV) is the most common cause of infectious mononucleosis (IM) and establishes lifetime infection associated with a variety of cancers and autoimmune diseases. The aim of this study was to develop an integrative gene regulatory network (GRN) approach and overlying gene expression data to identify the representative subnetworks for IM and EBV latent infection (LI). After identifying differentially expressed genes (DEGs) in both IM and LI gene expression profiles, functional annotations were applied using gene ontology (GO) and BiNGO tools, and construction of GRNs, topological analysis and identification of modules were carried out using several plugins of Cytoscape. In parallel, a human-EBV GRN was generated using the Hu-Vir database for further analyses. Our analysis revealed that the majority of DEGs in both IM and LI were involved in cell-cycle and DNA repair processes. However, these genes showed a significant negative correlation in the IM and LI states. Furthermore, cyclin-dependent kinase 2 (CDK2) - a hub gene with the highest centrality score - appeared to be the key player in cell cycle regulation in IM disease. The most significant functional modules in the IM and LI states were involved in the regulation of the cell cycle and apoptosis, respectively. Human-EBV network analysis revealed several direct targets of EBV proteins during IM disease. Our study provides an important first report on the response to IM/LI EBV infection in humans. An important aspect of our data was the upregulation of genes associated with cell cycle progression and proliferation.

  10. More powerful significant testing for time course gene expression data using functional principal component analysis approaches.

    Science.gov (United States)

    Wu, Shuang; Wu, Hulin

    2013-01-16

    One of the fundamental problems in time course gene expression data analysis is to identify genes associated with a biological process or a particular stimulus of interest, like a treatment or virus infection. Most of the existing methods for this problem are designed for data with longitudinal replicates. But in reality, many time course gene experiments have no replicates or only have a small number of independent replicates. We focus on the case without replicates and propose a new method for identifying differentially expressed genes by incorporating the functional principal component analysis (FPCA) into a hypothesis testing framework. The data-driven eigenfunctions allow a flexible and parsimonious representation of time course gene expression trajectories, leaving more degrees of freedom for the inference compared to that using a prespecified basis. Moreover, the information of all genes is borrowed for individual gene inferences. The proposed approach turns out to be more powerful in identifying time course differentially expressed genes compared to the existing methods. The improved performance is demonstrated through simulation studies and a real data application to the Saccharomyces cerevisiae cell cycle data.

  11. Hunting down frame shifts: Ecological analysis of diverse functional gene sequences

    Directory of Open Access Journals (Sweden)

    Michal eStrejcek

    2015-11-01

    Full Text Available Functional gene ecological analyses using amplicon sequencing can be challenging as translated sequences are often burdened with shifted reading frames. The aim of this work was to evaluate several bioinformatics tools designed to correct errors which arise during sequencing in an effort to reduce the number of frame-shifts (FS. Genes encoding for alpha subunits of biphenyl (bphA and benzoate (benA dioxygenases were used as model sequences. FrameBot, a FS correction tool, was able to reduce the number of detected FS to zero. However, up to 43.1% of sequences were discarded by FrameBot as non-specific targets. Therefore, we proposed a de novo mode of FrameBot for FS correction, which works on a similar basis as common chimera identifying platforms and is not dependent on reference sequences. By nature of FrameBot de novo design, it is crucial to provide it with data as error free as possible. We tested the ability of several publicly available correction tools to decrease the number of errors in the data sets. The combination of Maximum Expected Error (MEE filtering and single linkage pre-clustering (SLP proved the most efficient read procession. Applying FrameBot de novo on the processed data enabled analysis of BphA sequences with minimal losses of potentially functional sequences not homologous to those previously known. This experiment also demonstrated the extensive diversity of dioxygenases in soil. A script which performs FrameBot de novo is presented in the supplementary material to the study and the tool was implemented into FunGene Pipeline available at http://fungene.cme.msu.edu/FunGenePipeline/ and https://github.com/rdpstaff/Framebot.

  12. [Gene deletion and functional analysis of the heptyl glycosyltransferase (waaF) gene in Vibrio parahemolyticus O-antigen cluster].

    Science.gov (United States)

    Zhao, Feng; Meng, Songsong; Zhou, Deqing

    2016-02-04

    To construct heptyl glycosyltransferase gene II (waaF) gene deletion mutant of Vibrio parahaemolyticus, and explore the function of the waaF gene in Vibrio parahaemolyticus. The waaF gene deletion mutant was constructed by chitin-based transformation technology using clinical isolates, and then the growth rate, morphology and serotypes were identified. The different sources (O3, O5 and O10) waaF gene complementations were constructed through E. coli S17λpir strains conjugative transferring with Vibrio parahaemolyticus, and the function of the waaF gene was further verified by serotypes. The waaF gene deletion mutant strain was successfully constructed and it grew normally. The growth rate and morphology of mutant were similar with the wild type strains (WT), but the mutant could not occurred agglutination reaction with O antisera. The O3 and O5 sources waaF gene complementations occurred agglutination reaction with O antisera, but the O10 sources waaF gene complementations was not. The waaF gene was related with O-antigen synthesis and it was the key gene of O-antigen synthesis pathway in Vibrio parahaemolyticus. The function of different sources waaF gene were not the same.

  13. cDNA cloning and transcriptional controlling of a novel low dose radiation-induced gene and its function analysis

    International Nuclear Information System (INIS)

    Zhou Pingkun; Sui Jianli

    2002-01-01

    Objective: To clone a novel low dose radiation-induced gene (LRIGx) and study its function as well as its transcriptional changes after irradiation. Methods: Its cDNA was obtained by DDRT-PCR and RACE techniques. Northern blot hybridization was used to investigate the gene transcription. Bioinformatics was employed to analysis structure and function of this gene. Results: LRIGx cDNA was cloned. The sequence of LRIGx was identical to a DNA clone located in human chromosome 20 q 11.2-12 Bioinformatics analysis predicted an encoded protein with a conserved helicase domain. Northern analysis revealed a ∼8.5 kb transcript which was induced after 0.2 Gy as well as 0.02 Gy irradiation, and the transcript level was increased 5 times at 4 h after 0.2 Gy irradiation. The induced level of LRIGx transcript by 2.0 Gy high dose was lower than by 0.2 Gy. Conclusion: A novel low dose radiation-induced gene has been cloned. It encodes a protein with a conserved helicase domain that could involve in DNA metabolism in the cellular process of radiation response

  14. Genome-wide profiling of 24 hr diel rhythmicity in the water flea, Daphnia pulex: network analysis reveals rhythmic gene expression and enhances functional gene annotation.

    Science.gov (United States)

    Rund, Samuel S C; Yoo, Boyoung; Alam, Camille; Green, Taryn; Stephens, Melissa T; Zeng, Erliang; George, Gary F; Sheppard, Aaron D; Duffield, Giles E; Milenković, Tijana; Pfrender, Michael E

    2016-08-18

    Marine and freshwater zooplankton exhibit daily rhythmic patterns of behavior and physiology which may be regulated directly by the light:dark (LD) cycle and/or a molecular circadian clock. One of the best-studied zooplankton taxa, the freshwater crustacean Daphnia, has a 24 h diel vertical migration (DVM) behavior whereby the organism travels up and down through the water column daily. DVM plays a critical role in resource tracking and the behavioral avoidance of predators and damaging ultraviolet radiation. However, there is little information at the transcriptional level linking the expression patterns of genes to the rhythmic physiology/behavior of Daphnia. Here we analyzed genome-wide temporal transcriptional patterns from Daphnia pulex collected over a 44 h time period under a 12:12 LD cycle (diel) conditions using a cosine-fitting algorithm. We used a comprehensive network modeling and analysis approach to identify novel co-regulated rhythmic genes that have similar network topological properties and functional annotations as rhythmic genes identified by the cosine-fitting analyses. Furthermore, we used the network approach to predict with high accuracy novel gene-function associations, thus enhancing current functional annotations available for genes in this ecologically relevant model species. Our results reveal that genes in many functional groupings exhibit 24 h rhythms in their expression patterns under diel conditions. We highlight the rhythmic expression of immunity, oxidative detoxification, and sensory process genes. We discuss differences in the chronobiology of D. pulex from other well-characterized terrestrial arthropods. This research adds to a growing body of literature suggesting the genetic mechanisms governing rhythmicity in crustaceans may be divergent from other arthropod lineages including insects. Lastly, these results highlight the power of using a network analysis approach to identify differential gene expression and provide novel

  15. Characterization and functional analysis of the Paralichthys olivaceus prdm1 gene promoter.

    Science.gov (United States)

    Li, Peizhen; Wang, Bo; Cao, Dandan; Liu, Yuezhong; Zhang, Quanqi; Wang, Xubo

    2017-10-01

    PR domain containing protein 1 (Prdm1) is a transcriptional repressor identified in various species and plays multiple important roles in immune response and embryonic development. However, little is known about the transcriptional regulation of the prdm1 gene. This study aims to characterize the promoter of Paralichthys olivaceus prdm1 (Po-prdm1) gene and determine the regulatory mechanism of Po-prdm1 expression. A 2000bp-long 5'-flanking region (translation initiation site designated as +1) of the Po-prdm1 gene was isolated and characterized. The regulatory elements in this fragment were then investigated and many putative transcription factor (TF) binding sites involved in immunity and multiple tissue development were identified. A 5'-deletion analysis was then conducted, and the ability of the deletion mutants to promote luciferase and green fluorescent protein (GFP) expression in a flounder gill cell line was examined. The results revealed that the minimal promoter is located in the region between -446 and -13bp, and the region between -1415 and -13bp enhanced the promoter activity. Site-directed mutation analysis was subsequently performed on the putative regulatory elements sites, and the results indicated that FOXP1, MSX and BCL6 binding sites play negative functional roles in the regulation of the Po-prdm1 expression in FG cells. In vivo analysis demonstrated that a GFP reporter gene containing 1.4kb-long promoter fragment (-1415/-13) was expressed in the head and trunk muscle fibres of transient transgenic zebrafish embryos. Our study provided the basic information for the exploration of Po-prdm1 regulation and expression. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. Genome-Wide Detection and Analysis of Multifunctional Genes

    Science.gov (United States)

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  17. Effect of the absolute statistic on gene-sampling gene-set analysis methods.

    Science.gov (United States)

    Nam, Dougu

    2017-06-01

    Gene-set enrichment analysis and its modified versions have commonly been used for identifying altered functions or pathways in disease from microarray data. In particular, the simple gene-sampling gene-set analysis methods have been heavily used for datasets with only a few sample replicates. The biggest problem with this approach is the highly inflated false-positive rate. In this paper, the effect of absolute gene statistic on gene-sampling gene-set analysis methods is systematically investigated. Thus far, the absolute gene statistic has merely been regarded as a supplementary method for capturing the bidirectional changes in each gene set. Here, it is shown that incorporating the absolute gene statistic in gene-sampling gene-set analysis substantially reduces the false-positive rate and improves the overall discriminatory ability. Its effect was investigated by power, false-positive rate, and receiver operating curve for a number of simulated and real datasets. The performances of gene-set analysis methods in one-tailed (genome-wide association study) and two-tailed (gene expression data) tests were also compared and discussed.

  18. PageRank analysis reveals topologically expressed genes correspond to psoriasis and their functions are associated with apoptosis resistance.

    Science.gov (United States)

    Zeng, Xue; Zhao, Jingjing; Wu, Xiaohong; Shi, Hongbo; Liu, Wali; Cui, Bingnan; Yang, Li; Ding, Xu; Song, Ping

    2016-05-01

    Psoriasis is an inflammatory skin disease. Deceleration in keratinocyte apoptosis is the most significant pathological change observed in psoriasis. To detect a meaningful correlation between the genes and gene functions associated with the mechanism underlying psoriasis, 927 differentially expressed genes (DEGs) were identified using the Gene Expression Omnibus database, GSE13355 [false discovery rate (FDR) 1] with the package in R langue. The selected DEGs were further constructed using the search tool for the retrieval of interacting genes, in order to analyze the interaction network between the DEGs. Subsequent to PageRank analysis, 14 topological hub genes were identified, and the functions and pathways in the hub genes network were analyzed. The top‑ranked hub gene, estrogen receptor‑1 (ESR1) is downregulated in psoriasis, exhibited binding sites enriched with genes possessing anti‑apoptotic functions. The ESR1 gene encodes estrogen receptor α (ERα); a reduced level of ERα expression provides a crucial foundation in response to the anti‑apoptotic activity of psoriatic keratinocytes by activating the expression of anti‑apoptotic genes. Furthermore, it was detected that the pathway that is associated most significantly with psoriasis is the pathways in cancer. Pathways in cancer may protect psoriatic cells from apoptosis by inhibition of ESR1 expression. The present study provides support towards the investigation of ESR1 gene function and elucidates that the interaction with anti‑apoptotic genes is involved in the underlying biological mechanisms of resistance to apoptosis in psoriasis. However, further investigation is required to confirm the present results.

  19. Expansion and Functional Divergence of AP2 Group Genes in Spermatophytes Determined by Molecular Evolution and Arabidopsis Mutant Analysis

    Directory of Open Access Journals (Sweden)

    Pengkai Wang

    2016-09-01

    Full Text Available The APETALA2 (AP2 genes represent the AP2 group within a large group of DNA-binding proteins called AP2/EREBP. The AP2 gene is functional and necessary for flower development, stem cell maintenance, and seed development, whereas the other members of AP2 group redundantly affect flowering time. Here we study the phylogeny of AP2 group genes in spermatophytes. Spermatophyte AP2 group genes can be classified into AP2 and TOE types, six clades, and we found that the AP2 group homologs in gymnosperms belong to the AP2 type, whereas TOE types are absent, which indicates the AP2 type gene are more ancient and TOE type was split out of AP2 type and losing the major function. In Brassicaceae, the expansion of AP2 and TOE type lead to the gene number of AP2 group were up to six. Purifying selection appears to have been the primary driving force of spermatophyte AP2 group evolution, although positive selection occurred in the AP2 clade. The transition from exon to intron of AtAP2 in Arabidopsis mutant leads to the loss of gene function and the same situation was found in AtTOE2. Combining this evolutionary analysis and published research, the results suggest that typical AP2 group genes may first appear in gymnosperms and diverged in angiosperms, following expansion of group members and functional differentiation. In angiosperms, AP2 genes (AP2 clade inherited key functions from ancestors and other genes of AP2 group lost most function but just remained flowering time controlling in gene formation. In this study, the phylogenies of AP2 group genes in spermatophytes was analyzed, which supported the evidence for the research of gene functional evolution of AP2 group.

  20. Integrative Functional Genomics for Systems Genetics in GeneWeaver.org.

    Science.gov (United States)

    Bubier, Jason A; Langston, Michael A; Baker, Erich J; Chesler, Elissa J

    2017-01-01

    The abundance of existing functional genomics studies permits an integrative approach to interpreting and resolving the results of diverse systems genetics studies. However, a major challenge lies in assembling and harmonizing heterogeneous data sets across species for facile comparison to the positional candidate genes and coexpression networks that come from systems genetic studies. GeneWeaver is an online database and suite of tools at www.geneweaver.org that allows for fast aggregation and analysis of gene set-centric data. GeneWeaver contains curated experimental data together with resource-level data such as GO annotations, MP annotations, and KEGG pathways, along with persistent stores of user entered data sets. These can be entered directly into GeneWeaver or transferred from widely used resources such as GeneNetwork.org. Data are analyzed using statistical tools and advanced graph algorithms to discover new relations, prioritize candidate genes, and generate function hypotheses. Here we use GeneWeaver to find genes common to multiple gene sets, prioritize candidate genes from a quantitative trait locus, and characterize a set of differentially expressed genes. Coupling a large multispecies repository curated and empirical functional genomics data to fast computational tools allows for the rapid integrative analysis of heterogeneous data for interpreting and extrapolating systems genetics results.

  1. Gene analogue finder: a GRID solution for finding functionally analogous gene products

    Directory of Open Access Journals (Sweden)

    Licciulli Flavio

    2007-09-01

    Full Text Available Abstract Background To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO. Such vast, well defined knowledge opens the possibility of compare gene products at the level of functionality, finding gene products which have a similar function or are involved in similar biological processes without relying on the conventional sequence similarity approach. Comparisons within such a large space of knowledge are highly data and computing intensive. For this reason this project was based upon the use of the computational GRID, a technology offering large computing and storage resources. Results We have developed a tool, GENe AnaloGue FINdEr (ENGINE that parallelizes the search process and distributes the calculation and data over the computational GRID, splitting the process into many sub-processes and joining the calculation and the data on the same machine and therefore completing the whole search in about 3 days instead of occupying one single machine for more than 5 CPU years. The results of the functional comparison contain potential functional analogues for more than 79000 gene products from the most important species. 46% of the analyzed gene products are well enough described for such an analysis to individuate functional analogues, such as well-known members of the same gene family, or gene products with similar functions which would never have been associated by standard methods. Conclusion ENGINE has produced a list of potential functionally analogous relations between gene products within and between species using, in place of the sequence, the gene description of the GO, thus demonstrating the potential of the GO. However, the current limiting factor is the quality of the associations of many gene products from non

  2. Functional Module Analysis for Gene Coexpression Networks with Network Integration.

    Science.gov (United States)

    Zhang, Shuqin; Zhao, Hongyu; Ng, Michael K

    2015-01-01

    Network has been a general tool for studying the complex interactions between different genes, proteins, and other small molecules. Module as a fundamental property of many biological networks has been widely studied and many computational methods have been proposed to identify the modules in an individual network. However, in many cases, a single network is insufficient for module analysis due to the noise in the data or the tuning of parameters when building the biological network. The availability of a large amount of biological networks makes network integration study possible. By integrating such networks, more informative modules for some specific disease can be derived from the networks constructed from different tissues, and consistent factors for different diseases can be inferred. In this paper, we have developed an effective method for module identification from multiple networks under different conditions. The problem is formulated as an optimization model, which combines the module identification in each individual network and alignment of the modules from different networks together. An approximation algorithm based on eigenvector computation is proposed. Our method outperforms the existing methods, especially when the underlying modules in multiple networks are different in simulation studies. We also applied our method to two groups of gene coexpression networks for humans, which include one for three different cancers, and one for three tissues from the morbidly obese patients. We identified 13 modules with three complete subgraphs, and 11 modules with two complete subgraphs, respectively. The modules were validated through Gene Ontology enrichment and KEGG pathway enrichment analysis. We also showed that the main functions of most modules for the corresponding disease have been addressed by other researchers, which may provide the theoretical basis for further studying the modules experimentally.

  3. A meta-analysis based method for prioritizing candidate genes involved in a pre-specific function

    Directory of Open Access Journals (Sweden)

    Jingjing Zhai

    2016-12-01

    Full Text Available The identification of genes associated with a given biological function in plants remains a challenge, although network-based gene prioritization algorithms have been developed for Arabidopsis thaliana and many non-model plant species. Nevertheless, these network-based gene prioritization algorithms have encountered several problems; one in particular is that of unsatisfactory prediction accuracy due to limited network coverage, varying link quality, and/or uncertain network connectivity. Thus a model that integrates complementary biological data may be expected to increase the prediction accuracy of gene prioritization. Towards this goal, we developed a novel gene prioritization method named RafSee, to rank candidate genes using a random forest algorithm that integrates sequence, evolutionary, and epigenetic features of plants. Subsequently, we proposed an integrative approach named RAP (Rank Aggregation-based data fusion for gene Prioritization, in which an order statistics-based meta-analysis was used to aggregate the rank of the network-based gene prioritization method and RafSee, for accurately prioritizing candidate genes involved in a pre-specific biological function. Finally, we showcased the utility of RAP by prioritizing 380 flowering-time genes in Arabidopsis. The ‘leave-one-out’ cross-validation experiment showed that RafSee could work as a complement to a current state-of-art network-based gene prioritization system (AraNet v2. Moreover, RAP ranked 53.68% (204/380 flowering-time genes higher than AraNet v2, resulting in an 39.46% improvement in term of the first quartile rank. Further evaluations also showed that RAP was effective in prioritizing genes-related to different abiotic stresses. To enhance the usability of RAP for Arabidopsis and non-model plant species, an R package implementing the method is freely available at http://bioinfo.nwafu.edu.cn/software.

  4. Gain-of-function analysis of poplar CLE genes in Arabidopsis by exogenous application and over-expression assays.

    Science.gov (United States)

    Liu, Yisen; Yang, Shaohui; Song, Yingjin; Men, Shuzhen; Wang, Jiehua

    2016-04-01

    Among 50 CLE gene family members in the Populus trichocarpa genome, three and six PtCLE genes encode a CLE motif sequence highly homologous to Arabidopsis CLV3 and TDIF peptides, respectively, which potentially make them functional equivalents. To test and compare their biological activity, we first chemically synthesized each dodecapeptide and analysed itsi n vitro bioactivity on Arabidopsis seedlings. Similarly, but to a different extent, three types of poplar CLV3-related peptides caused root meristem consumption, phyllotaxis disorder, anthocyanin accumulation and failure to enter the bolting stage. In comparison, application of two poplar TDIF-related peptides led to root length promotion in a dose-dependent manner with an even stronger effect observed for poplar TDIF-like peptide than TDIF. Next, we constructed CaMV35S:PtCLE transgenic plants for each of the nine PtCLE genes. Phenotypic abnormalities exemplified by arrested shoot apical meristem and abnormal flower structure were found to be more dominant and severe in 35S:PtCLV3 and 35S:PtCLV3-like2 lines than in the 35S:PtCLV3-like line. Disordered vasculature was detected in both stem and hypocotyl cross-sections in Arabidopsis plants over-expressing poplar TDIF-related genes with the most defective vascular patterning observed for TDIF2 and two TDIF-like genes. Phenotypic difference consistently observed in peptide application assay and transgenic analysis indicated the functional diversity of nine poplar PtCLE genes under investigation. This work represents the first report on the functional analysis of CLE genes in a tree species and constitutes a basis for further study of the CLE peptide signalling pathway in tree development. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  5. Functional analysis of mating type genes and transcriptome analysis during fruiting body development of botrytis cinerea

    NARCIS (Netherlands)

    Rodenburg, Sander Y.A.; Terhem, Razak B.; Veloso, Javier; Stassen, Joost H.M.; Kan, van Jan A.L.

    2018-01-01

    Botrytis cinerea is a plant-pathogenic fungus producing apothecia as sexual fruiting bodies. To study the function of mating type (MAT) genes, single-gene deletion mutants were generated in both genes of the MAT1-1 locus and both genes of the MAT1-2 locus. Deletion mutants in two MAT genes were

  6. Genetic manipulation in Sulfolobus islandicus and functional analysis of DNA repair genes

    DEFF Research Database (Denmark)

    Zhang, Changyi; Tian, Bin; Li, Suming

    2013-01-01

    Recently, a novel gene-deletion method was developed for the crenarchaeal model Sulfolobus islandicus, which is a suitable tool for addressing gene essentiality in depth. Using this technique, we have investigated functions of putative DNA repair genes by constructing deletion mutants and studying...

  7. Functional Analysis of an ATP-Binding Cassette Transporter Gene in Botrytis cinerea by Gene Disruption

    OpenAIRE

    Masami, NAKAJIMA; Junko, SUZUKI; Takehiko, HOSAKA; Tadaaki, HIBI; Katsumi, AKUTSU; School of Agriculture, Ibaraki University; School of Agriculture, Ibaraki University; School of Agriculture, Ibaraki University; Department of Agriculture and Environmental Biology, The University of Tokyo; School of Agriculture, Ibaraki University

    2001-01-01

    The BMR1 gene encoding an ABC transporter was cloned from Botrytis cinerea. To examine the function of BMR1 in B.cinerea, we isolated BMR1-deficient mutants after gene disruption. Disruption vector pBcDF4 was constructed by replacing the BMR1-coding region with a hygromycin B phosphotransferase gene(hph)cassette. The BMR1 disruptants had an increased sensitivity to polyoxin and iprobenfos. Polyoxin and iprobenfos, structurally unrelated compounds, may therefore be substrates of BMR1.

  8. Target genes prediction and functional analysis of microRNAs differentially expressed in gastric cancer stem cells MKN-45

    Directory of Open Access Journals (Sweden)

    Zohreh Salehi

    2017-01-01

    Conclusions: Bioinformatics analysis such as DAVID database, GO biological process, GO molecular function, Kyoto encyclopedia of genes and genomes pathways, BioCarta pathway, Panther pathway, and Reactome pathway revealed that target genes of differentially expressed miRNAs in gastric CSCs were connected to pivotal biological pathways that involved in cell cycle regulation, stemness properties, and differentiation.

  9. Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

    Science.gov (United States)

    Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

    2018-02-23

    Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.

  10. Prosecutor: parameter-free inference of gene function for prokaryotes using DNA microarray data, genomic context and multiple gene annotation sources

    Directory of Open Access Journals (Sweden)

    van Hijum Sacha AFT

    2008-10-01

    Full Text Available Abstract Background Despite a plethora of functional genomic efforts, the function of many genes in sequenced genomes remains unknown. The increasing amount of microarray data for many species allows employing the guilt-by-association principle to predict function on a large scale: genes exhibiting similar expression patterns are more likely to participate in shared biological processes. Results We developed Prosecutor, an application that enables researchers to rapidly infer gene function based on available gene expression data and functional annotations. Our parameter-free functional prediction method uses a sensitive algorithm to achieve a high association rate of linking genes with unknown function to annotated genes. Furthermore, Prosecutor utilizes additional biological information such as genomic context and known regulatory mechanisms that are specific for prokaryotes. We analyzed publicly available transcriptome data sets and used literature sources to validate putative functions suggested by Prosecutor. We supply the complete results of our analysis for 11 prokaryotic organisms on a dedicated website. Conclusion The Prosecutor software and supplementary datasets available at http://www.prosecutor.nl allow researchers working on any of the analyzed organisms to quickly identify the putative functions of their genes of interest. A de novo analysis allows new organisms to be studied.

  11. Functional clustering of time series gene expression data by Granger causality

    Science.gov (United States)

    2012-01-01

    Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425

  12. Functional analysis of the Phycomyces carRA gene encoding the enzymes phytoene synthase and lycopene cyclase.

    Directory of Open Access Journals (Sweden)

    Catalina Sanz

    Full Text Available Phycomyces carRA gene encodes a protein with two domains. Domain R is characterized by red carR mutants that accumulate lycopene. Domain A is characterized by white carA mutants that do not accumulate significant amounts of carotenoids. The carRA-encoded protein was identified as the lycopene cyclase and phytoene synthase enzyme by sequence homology with other proteins. However, no direct data showing the function of this protein have been reported so far. Different Mucor circinelloides mutants altered at the phytoene synthase, the lycopene cyclase or both activities were transformed with the Phycomyces carRA gene. Fully transcribed carRA mRNA molecules were detected by Northern assays in the transformants and the correct processing of the carRA messenger was verified by RT-PCR. These results showed that Phycomyces carRA gene was correctly expressed in Mucor. Carotenoids analysis in these transformants showed the presence of ß-carotene, absent in the untransformed strains, providing functional evidence that the Phycomyces carRA gene complements the M. circinelloides mutations. Co-transformation of the carRA cDNA in E. coli with different combinations of the carotenoid structural genes from Erwinia uredovora was also performed. Newly formed carotenoids were accumulated showing that the Phycomyces CarRA protein does contain lycopene cyclase and phytoene synthase activities. The heterologous expression of the carRA gene and the functional complementation of the mentioned activities are not very efficient in E. coli. However, the simultaneous presence of both carRA and carB gene products from Phycomyces increases the efficiency of these enzymes, presumably due to an interaction mechanism.

  13. Spermatogenesis-related ring finger gene ZNF230 promoter: identification and functional analysis

    DEFF Research Database (Denmark)

    Xu, Wenming; Zhang, Sizhong; Qiu, Weimin

    2009-01-01

    reporter Plasmids. Overexpression and site-directed mutation test were used to characterize the cis-element. The results showed ZNF230 gene promoter to be GC rich and not contain a TATA box. Deletion analysis of the 5'-flanking region of ZNF230 in HEK293 cells indicated that the sequence encompassing from...... nt -131 to +152 has a basal transcriptional activity. Site-directed mutation test and mithramycin A treatment demonstrated that the ZNF230 promoter contained a functional Sp1 site. Overexpression of the Sox5 protein activated the promoter activity. A 312-bp fragment surrounding the transcription...

  14. miRNA-mediated functional changes through co-regulating function related genes.

    Directory of Open Access Journals (Sweden)

    Jie He

    Full Text Available BACKGROUND: MicroRNAs play important roles in various biological processes involving fairly complex mechanism. Analysis of genome-wide miRNA microarray demonstrate that a single miRNA can regulate hundreds of genes, but the regulative extent on most individual genes is surprisingly mild so that it is difficult to understand how a miRNA provokes detectable functional changes with such mild regulation. RESULTS: To explore the internal mechanism of miRNA-mediated regulation, we re-analyzed the data collected from genome-wide miRNA microarray with bioinformatics assay, and found that the transfection of miR-181b and miR-34a in Hela and HCT-116 tumor cells regulated large numbers of genes, among which, the genes related to cell growth and cell death demonstrated high Enrichment scores, suggesting that these miRNAs may be important in cell growth and cell death. MiR-181b induced changes in protein expression of most genes that were seemingly related to enhancing cell growth and decreasing cell death, while miR-34a mediated contrary changes of gene expression. Cell growth assays further confirmed this finding. In further study on miR-20b-mediated osteogenesis in hMSCs, miR-20b was found to enhance osteogenesis by activating BMPs/Runx2 signaling pathway in several stages by co-repressing of PPARγ, Bambi and Crim1. CONCLUSIONS: With its multi-target characteristics, miR-181b, miR-34a and miR-20b provoked detectable functional changes by co-regulating functionally-related gene groups or several genes in the same signaling pathway, and thus mild regulation from individual miRNA targeting genes could have contributed to an additive effect. This might also be one of the modes of miRNA-mediated gene regulation.

  15. Gene Ontology-Based Analysis of Zebrafish Omics Data Using the Web Tool Comparative Gene Ontology.

    Science.gov (United States)

    Ebrahimie, Esmaeil; Fruzangohar, Mario; Moussavi Nik, Seyyed Hani; Newman, Morgan

    2017-10-01

    Gene Ontology (GO) analysis is a powerful tool in systems biology, which uses a defined nomenclature to annotate genes/proteins within three categories: "Molecular Function," "Biological Process," and "Cellular Component." GO analysis can assist in revealing functional mechanisms underlying observed patterns in transcriptomic, genomic, and proteomic data. The already extensive and increasing use of zebrafish for modeling genetic and other diseases highlights the need to develop a GO analytical tool for this organism. The web tool Comparative GO was originally developed for GO analysis of bacterial data in 2013 ( www.comparativego.com ). We have now upgraded and elaborated this web tool for analysis of zebrafish genetic data using GOs and annotations from the Gene Ontology Consortium.

  16. Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data

    Directory of Open Access Journals (Sweden)

    Tintle Nathan L

    2012-08-01

    Full Text Available Abstract Background Statistical analyses of whole genome expression data require functional information about genes in order to yield meaningful biological conclusions. The Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG are common sources of functionally grouped gene sets. For bacteria, the SEED and MicrobesOnline provide alternative, complementary sources of gene sets. To date, no comprehensive evaluation of the data obtained from these resources has been performed. Results We define a series of gene set consistency metrics directly related to the most common classes of statistical analyses for gene expression data, and then perform a comprehensive analysis of 3581 Affymetrix® gene expression arrays across 17 diverse bacteria. We find that gene sets obtained from GO and KEGG demonstrate lower consistency than those obtained from the SEED and MicrobesOnline, regardless of gene set size. Conclusions Despite the widespread use of GO and KEGG gene sets in bacterial gene expression data analysis, the SEED and MicrobesOnline provide more consistent sets for a wide variety of statistical analyses. Increased use of the SEED and MicrobesOnline gene sets in the analysis of bacterial gene expression data may improve statistical power and utility of expression data.

  17. RoMo: An efficient strategy for functional mosaic analysis via stochastic Cre recombination and gene targeting in the ROSA26 locus.

    Science.gov (United States)

    Movahedi, Kiavash; Wiegmann, Robert; De Vlaminck, Karen; Van Ginderachter, Jo A; Nikolaev, Viacheslav O

    2018-07-01

    Functional mosaic analysis allows for the direct comparison of mutant cells with differentially marked control cells in the same organism. While this offers a powerful approach for elucidating the role of specific genes or signalling pathways in cell populations of interest, genetic strategies for generating functional mosaicism remain challenging. We describe a novel and streamlined approach for functional mosaic analysis, which combines stochastic Cre/lox recombination with gene targeting in the ROSA26 locus. With the RoMo strategy a cell population of interest is randomly split into a cyan fluorescent and red fluorescent subset, of which the latter overexpresses a chosen transgene. To integrate this approach into high-throughput gene targeting initiatives, we developed a procedure that utilizes Gateway cloning for the generation of new targeting vectors. RoMo can be used for gain-of-function experiments or for altering signaling pathways in a mosaic fashion. To demonstrate this, we developed RoMo-dnGs mice, in which Cre-recombined red fluorescent cells co-express a dominant-negative Gs protein. RoMo-dnGs mice allowed us to inhibit G protein-coupled receptor activation in a fraction of cells, which could then be directly compared to differentially marked control cells in the same animal. We demonstrate how RoMo-dnGs mice can be used to obtain mosaicism in the brain and in peripheral organs for various cell types. RoMo offers an efficient new approach for functional mosaic analysis that extends the current toolbox and may reveal important new insights into in vivo gene function. © 2018 Wiley Periodicals, Inc.

  18. Key Microbiota Identification Using Functional Gene Analysis during Pepper (Piper nigrum L.) Peeling.

    Science.gov (United States)

    Zhang, Jiachao; Hu, Qisong; Xu, Chuanbiao; Liu, Sixin; Li, Congfa

    2016-01-01

    Pepper pericarp microbiota plays an important role in the pepper peeling process for the production of white pepper. We collected pepper samples at different peeling time points from Hainan Province, China, and used a metagenomic approach to identify changes in the pericarp microbiota based on functional gene analysis. UniFrac distance-based principal coordinates analysis revealed significant changes in the pericarp microbiota structure during peeling, which were attributed to increases in bacteria from the genera Selenomonas and Prevotella. We identified 28 core operational taxonomic units at each time point, mainly belonging to Selenomonas, Prevotella, Megasphaera, Anaerovibrio, and Clostridium genera. The results were confirmed by quantitative polymerase chain reaction. At the functional level, we observed significant increases in microbial features related to acetyl xylan esterase and pectinesterase for pericarp degradation during peeling. These findings offer a new insight into biodegradation for pepper peeling and will promote the development of the white pepper industry.

  19. Functional Analysis and Marker Development of TaCRT-D Gene in Common Wheat (Triticum aestivum L.

    Directory of Open Access Journals (Sweden)

    Jiping Wang

    2017-09-01

    Full Text Available Calreticulin (CRT, an endoplasmic reticulum (ER-localized Ca2+-binding/buffering protein, is highly conserved and extensively expressed in animal and plant cells. To understand the function of CRTs in wheat (Triticum aestivum L., particularly their roles in stress tolerance, we cloned the full-length genomic sequence of the TaCRT-D isoform from D genome of common hexaploid wheat, and characterized its function by transgenic Arabidopsis system. TaCRT-D exhibited different expression patterns in wheat seedling under different abiotic stresses. Transgenic Arabidopsis plants overexpressing ORF of TaCRT-D displayed more tolerance to drought, cold, salt, mannitol, and other abiotic stresses at both seed germination and seedling stages, compared with the wild-type controls. Furthermore, DNA polymorphism analysis and gene mapping were employed to develop the functional markers of this gene for marker-assistant selection in wheat breeding program. One SNP, S440 (T→C was detected at the TaCRT-D locus by genotyping a wheat recombinant inbred line (RIL population (114 lines developed from Opata 85 × W7984. The TaCRT-D was then fine mapped between markers Xgwm645 and Xgwm664 on chromosome 3DL, corresponding to genetic distances of 3.5 and 4.4 cM, respectively, using the RIL population and Chinese Spring nulli-tetrasomic lines. Finally, the genome-specific and allele-specific markers were developed for the TaCRT-D gene. These findings indicate that TaCRT-D function importantly in plant stress responses, providing a gene target for genetic engineering to increase plant stress tolerance and the functional markers of TaCRT-D for marker-assistant selection in wheat breeding.

  20. Functional Analysis and Marker Development of TaCRT-D Gene in Common Wheat (Triticum aestivum L.).

    Science.gov (United States)

    Wang, Jiping; Li, Runzhi; Mao, Xinguo; Jing, Ruilian

    2017-01-01

    Calreticulin (CRT), an endoplasmic reticulum (ER)-localized Ca 2+ -binding/buffering protein, is highly conserved and extensively expressed in animal and plant cells. To understand the function of CRTs in wheat ( Triticum aestivum L.), particularly their roles in stress tolerance, we cloned the full-length genomic sequence of the TaCRT-D isoform from D genome of common hexaploid wheat, and characterized its function by transgenic Arabidopsis system. TaCRT-D exhibited different expression patterns in wheat seedling under different abiotic stresses. Transgenic Arabidopsis plants overexpressing ORF of TaCRT-D displayed more tolerance to drought, cold, salt, mannitol, and other abiotic stresses at both seed germination and seedling stages, compared with the wild-type controls. Furthermore, DNA polymorphism analysis and gene mapping were employed to develop the functional markers of this gene for marker-assistant selection in wheat breeding program. One SNP, S440 (T→C) was detected at the TaCRT-D locus by genotyping a wheat recombinant inbred line (RIL) population (114 lines) developed from Opata 85 × W7984. The TaCRT-D was then fine mapped between markers Xgwm645 and Xgwm664 on chromosome 3DL, corresponding to genetic distances of 3.5 and 4.4 cM, respectively, using the RIL population and Chinese Spring nulli-tetrasomic lines. Finally, the genome-specific and allele-specific markers were developed for the TaCRT-D gene. These findings indicate that TaCRT-D function importantly in plant stress responses, providing a gene target for genetic engineering to increase plant stress tolerance and the functional markers of TaCRT-D for marker-assistant selection in wheat breeding.

  1. Genes2FANs: connecting genes through functional association networks

    Science.gov (United States)

    2012-01-01

    Background Protein-protein, cell signaling, metabolic, and transcriptional interaction networks are useful for identifying connections between lists of experimentally identified genes/proteins. However, besides physical or co-expression interactions there are many ways in which pairs of genes, or their protein products, can be associated. By systematically incorporating knowledge on shared properties of genes from diverse sources to build functional association networks (FANs), researchers may be able to identify additional functional interactions between groups of genes that are not readily apparent. Results Genes2FANs is a web based tool and a database that utilizes 14 carefully constructed FANs and a large-scale protein-protein interaction (PPI) network to build subnetworks that connect lists of human and mouse genes. The FANs are created from mammalian gene set libraries where mouse genes are converted to their human orthologs. The tool takes as input a list of human or mouse Entrez gene symbols to produce a subnetwork and a ranked list of intermediate genes that are used to connect the query input list. In addition, users can enter any PubMed search term and then the system automatically converts the returned results to gene lists using GeneRIF. This gene list is then used as input to generate a subnetwork from the user’s PubMed query. As a case study, we applied Genes2FANs to connect disease genes from 90 well-studied disorders. We find an inverse correlation between the counts of links connecting disease genes through PPI and links connecting diseases genes through FANs, separating diseases into two categories. Conclusions Genes2FANs is a useful tool for interpreting the relationships between gene/protein lists in the context of their various functions and networks. Combining functional association interactions with physical PPIs can be useful for revealing new biology and help form hypotheses for further experimentation. Our finding that disease genes in

  2. Methods for transient assay of gene function in floral tissues

    Directory of Open Access Journals (Sweden)

    Pathirana Nilangani N

    2007-01-01

    Full Text Available Abstract Background There is considerable interest in rapid assays or screening systems for assigning gene function. However, analysis of gene function in the flowers of some species is restricted due to the difficulty of producing stably transformed transgenic plants. As a result, experimental approaches based on transient gene expression assays are frequently used. Biolistics has long been used for transient over-expression of genes of interest, but has not been exploited for gene silencing studies. Agrobacterium-infiltration has also been used, but the focus primarily has been on the transient transformation of leaf tissue. Results Two constructs, one expressing an inverted repeat of the Antirrhinum majus (Antirrhinum chalcone synthase gene (CHS and the other an inverted repeat of the Antirrhinum transcription factor gene Rosea1, were shown to effectively induce CHS and Rosea1 gene silencing, respectively, when introduced biolistically into petal tissue of Antirrhinum flowers developing in vitro. A high-throughput vector expressing the Antirrhinum CHS gene attached to an inverted repeat of the nos terminator was also shown to be effective. Silencing spread systemically to create large zones of petal tissue lacking pigmentation, with transmission of the silenced state spreading both laterally within the affected epidermal cell layer and into lower cell layers, including the epidermis of the other petal surface. Transient Agrobacterium-mediated transformation of petal tissue of tobacco and petunia flowers in situ or detached was also achieved, using expression of the reporter genes GUS and GFP to visualise transgene expression. Conclusion We demonstrate the feasibility of using biolistics-based transient RNAi, and transient transformation of petal tissue via Agrobacterium infiltration to study gene function in petals. We have also produced a vector for high throughput gene silencing studies, incorporating the option of using T-A cloning to

  3. Gene set analysis: limitations in popular existing methods and proposed improvements.

    Science.gov (United States)

    Mishra, Pashupati; Törönen, Petri; Leino, Yrjö; Holm, Liisa

    2014-10-01

    Gene set analysis is the analysis of a set of genes that collectively contribute to a biological process. Most popular gene set analysis methods are based on empirical P-value that requires large number of permutations. Despite numerous gene set analysis methods developed in the past decade, the most popular methods still suffer from serious limitations. We present a gene set analysis method (mGSZ) based on Gene Set Z-scoring function (GSZ) and asymptotic P-values. Asymptotic P-value calculation requires fewer permutations, and thus speeds up the gene set analysis process. We compare the GSZ-scoring function with seven popular gene set scoring functions and show that GSZ stands out as the best scoring function. In addition, we show improved performance of the GSA method when the max-mean statistics is replaced by the GSZ scoring function. We demonstrate the importance of both gene and sample permutations by showing the consequences in the absence of one or the other. A comparison of asymptotic and empirical methods of P-value estimation demonstrates a clear advantage of asymptotic P-value over empirical P-value. We show that mGSZ outperforms the state-of-the-art methods based on two different evaluations. We compared mGSZ results with permutation and rotation tests and show that rotation does not improve our asymptotic P-values. We also propose well-known asymptotic distribution models for three of the compared methods. mGSZ is available as R package from cran.r-project.org. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  4. Molecular Characterization and Functional Analysis of Three Pathogenesis-Related Cytochrome P450 Genes from Bursaphelenchus xylophilus (Tylenchida: Aphelenchoidoidea

    Directory of Open Access Journals (Sweden)

    Xiao-Lu Xu

    2015-03-01

    Full Text Available Bursaphelenchus xylophilus, the causal agent of pine wilt disease, causes huge economic losses in pine forests. The high expression of cytochrome P450 genes in B. xylophilus during infection in P. thunbergii indicated that these genes had a certain relationship with the pathogenic process of B. xylophilus. Thus, we attempted to identify the molecular characterization and functions of cytochrome P450 genes in B. xylophilus. In this study, full-length cDNA of three cytochrome P450 genes, BxCYP33C9, BxCYP33C4 and BxCYP33D3 were first cloned from B. xylophilus using 3' and 5' RACE PCR amplification. Sequence analysis showed that all of them contained a highly-conserved cytochrome P450 domain. The characteristics of the three putative proteins were analyzed with bioinformatic methods. RNA interference (RNAi was used to assess the functions of BxCYP33C9, BxCYP33C4 and BxCYP33D3. The results revealed that these cytochrome P450 genes were likely to be associated with the vitality, dispersal ability, reproduction, pathogenicity and pesticide metabolism of B. xylophilus. This discovery confirmed the molecular characterization and functions of three cytochrome P450 genes from B. xylophilus and provided fundamental information in elucidating the molecular interaction mechanism between B. xylophilus and its host plant.

  5. Functional gene array-based analysis of microbial community structure in groundwaters with a gradient of contaminant levels

    Energy Technology Data Exchange (ETDEWEB)

    Waldron, P.J.; Wu, L.; Van Nostrand, J.D.; Schadt, C.W.; Watson, D.B.; Jardine, P.M.; Palumbo, A.V.; Hazen, T.C.; Zhou, J.

    2009-06-15

    To understand how contaminants affect microbial community diversity, heterogeneity, and functional structure, six groundwater monitoring wells from the Field Research Center of the U.S. Department of Energy Environmental Remediation Science Program (ERSP; Oak Ridge, TN), with a wide range of pH, nitrate, and heavy metal contamination were investigated. DNA from the groundwater community was analyzed with a functional gene array containing 2006 probes to detect genes involved in metal resistance, sulfate reduction, organic contaminant degradation, and carbon and nitrogen cycling. Microbial diversity decreased in relation to the contamination levels of the wells. Highly contaminated wells had lower gene diversity but greater signal intensity than the pristine well. The microbial composition was heterogeneous, with 17-70% overlap between different wells. Metal-resistant and metal-reducing microorganisms were detected in both contaminated and pristine wells, suggesting the potential for successful bioremediation of metal-contaminated groundwaters. In addition, results of Mantel tests and canonical correspondence analysis indicate that nitrate, sulfate, pH, uranium, and technetium have a significant (p < 0.05) effect on microbial community structure. This study provides an overall picture of microbial community structure in contaminated environments with functional gene arrays by showing that diversity and heterogeneity can vary greatly in relation to contamination.

  6. Microbial Functional Gene Diversity Predicts Groundwater Contamination and Ecosystem Functioning.

    Science.gov (United States)

    He, Zhili; Zhang, Ping; Wu, Linwei; Rocha, Andrea M; Tu, Qichao; Shi, Zhou; Wu, Bo; Qin, Yujia; Wang, Jianjun; Yan, Qingyun; Curtis, Daniel; Ning, Daliang; Van Nostrand, Joy D; Wu, Liyou; Yang, Yunfeng; Elias, Dwayne A; Watson, David B; Adams, Michael W W; Fields, Matthew W; Alm, Eric J; Hazen, Terry C; Adams, Paul D; Arkin, Adam P; Zhou, Jizhong

    2018-02-20

    Contamination from anthropogenic activities has significantly impacted Earth's biosphere. However, knowledge about how environmental contamination affects the biodiversity of groundwater microbiomes and ecosystem functioning remains very limited. Here, we used a comprehensive functional gene array to analyze groundwater microbiomes from 69 wells at the Oak Ridge Field Research Center (Oak Ridge, TN), representing a wide pH range and uranium, nitrate, and other contaminants. We hypothesized that the functional diversity of groundwater microbiomes would decrease as environmental contamination (e.g., uranium or nitrate) increased or at low or high pH, while some specific populations capable of utilizing or resistant to those contaminants would increase, and thus, such key microbial functional genes and/or populations could be used to predict groundwater contamination and ecosystem functioning. Our results indicated that functional richness/diversity decreased as uranium (but not nitrate) increased in groundwater. In addition, about 5.9% of specific key functional populations targeted by a comprehensive functional gene array (GeoChip 5) increased significantly ( P contamination and ecosystem functioning. This study indicates great potential for using microbial functional genes to predict environmental contamination and ecosystem functioning. IMPORTANCE Disentangling the relationships between biodiversity and ecosystem functioning is an important but poorly understood topic in ecology. Predicting ecosystem functioning on the basis of biodiversity is even more difficult, particularly with microbial biomarkers. As an exploratory effort, this study used key microbial functional genes as biomarkers to provide predictive understanding of environmental contamination and ecosystem functioning. The results indicated that the overall functional gene richness/diversity decreased as uranium increased in groundwater, while specific key microbial guilds increased significantly as

  7. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  8. Statistical indicators of collective behavior and functional clusters in gene networks of yeast

    Science.gov (United States)

    Živković, J.; Tadić, B.; Wick, N.; Thurner, S.

    2006-03-01

    We analyze gene expression time-series data of yeast (S. cerevisiae) measured along two full cell-cycles. We quantify these data by using q-exponentials, gene expression ranking and a temporal mean-variance analysis. We construct gene interaction networks based on correlation coefficients and study the formation of the corresponding giant components and minimum spanning trees. By coloring genes according to their cell function we find functional clusters in the correlation networks and functional branches in the associated trees. Our results suggest that a percolation point of functional clusters can be identified on these gene expression correlation networks.

  9. Cross disease analysis of co-functional microRNA pairs on a reconstructed network of disease-gene-microRNA tripartite.

    Science.gov (United States)

    Peng, Hui; Lan, Chaowang; Zheng, Yi; Hutvagner, Gyorgy; Tao, Dacheng; Li, Jinyan

    2017-03-24

    MicroRNAs always function cooperatively in their regulation of gene expression. Dysfunctions of these co-functional microRNAs can play significant roles in disease development. We are interested in those multi-disease associated co-functional microRNAs that regulate their common dysfunctional target genes cooperatively in the development of multiple diseases. The research is potentially useful for human disease studies at the transcriptional level and for the study of multi-purpose microRNA therapeutics. We designed a computational method to detect multi-disease associated co-functional microRNA pairs and conducted cross disease analysis on a reconstructed disease-gene-microRNA (DGR) tripartite network. The construction of the DGR tripartite network is by the integration of newly predicted disease-microRNA associations with those relationships of diseases, microRNAs and genes maintained by existing databases. The prediction method uses a set of reliable negative samples of disease-microRNA association and a pre-computed kernel matrix instead of kernel functions. From this reconstructed DGR tripartite network, multi-disease associated co-functional microRNA pairs are detected together with their common dysfunctional target genes and ranked by a novel scoring method. We also conducted proof-of-concept case studies on cancer-related co-functional microRNA pairs as well as on non-cancer disease-related microRNA pairs. With the prioritization of the co-functional microRNAs that relate to a series of diseases, we found that the co-function phenomenon is not unusual. We also confirmed that the regulation of the microRNAs for the development of cancers is more complex and have more unique properties than those of non-cancer diseases.

  10. Functional microarray analysis of nitrogen and carbon cycling genes across an Antarctic latitudinal transect.

    NARCIS (Netherlands)

    Yergeau, E.; Kang, S.; He, Z.; Zhou, J.; Kowalchuk, G.A.

    2007-01-01

    Soil-borne microbial communities were examined via a functional gene microarray approach across a southern polar latitudinal gradient to gain insight into the environmental factors steering soil N- and C-cycling in terrestrial Antarctic ecosystems. The abundance and diversity of functional gene

  11. Functional analysis of human hematopoietic stem cell gene expression using zebrafish.

    Directory of Open Access Journals (Sweden)

    2005-08-01

    Full Text Available Although several reports have characterized the hematopoietic stem cell (HSC transcriptome, the roles of HSC-specific genes in hematopoiesis remain elusive. To identify candidate regulators of HSC fate decisions, we compared the transcriptome of human umbilical cord blood and bone marrow (CD34+(CD33-(CD38-Rho(lo(c-kit+ cells, enriched for hematopoietic stem/progenitor cells with (CD34+(CD33-(CD38-Rho(hi cells, enriched in committed progenitors. We identified 277 differentially expressed transcripts conserved in these ontogenically distinct cell sources. We next performed a morpholino antisense oligonucleotide (MO-based functional screen in zebrafish to determine the hematopoietic function of 61 genes that had no previously known function in HSC biology and for which a likely zebrafish ortholog could be identified. MO knock down of 14/61 (23% of the differentially expressed transcripts resulted in hematopoietic defects in developing zebrafish embryos, as demonstrated by altered levels of circulating blood cells at 30 and 48 h postfertilization and subsequently confirmed by quantitative RT-PCR for erythroid-specific hbae1 and myeloid-specific lcp1 transcripts. Recapitulating the knockdown phenotype using a second MO of independent sequence, absence of the phenotype using a mismatched MO sequence, and rescue of the phenotype by cDNA-based overexpression of the targeted transcript for zebrafish spry4 confirmed the specificity of MO targeting in this system. Further characterization of the spry4-deficient zebrafish embryos demonstrated that hematopoietic defects were not due to more widespread defects in the mesodermal development, and therefore represented primary defects in HSC specification, proliferation, and/or differentiation. Overall, this high-throughput screen for the functional validation of differentially expressed genes using a zebrafish model of hematopoiesis represents a major step toward obtaining meaningful information from global

  12. Identification and functional analysis of a new glyphosate resistance gene from a fungus cDNA library.

    Science.gov (United States)

    Tao, Bo; Shao, Bai-Hui; Qiao, Yu-Xin; Wang, Xiao-Qin; Chang, Shu-Jun; Qiu, Li-Juan

    2017-08-01

    Glyphosate is a widely used broad spectrum herbicide; however, this limits its use once crops are planted. If glyphosate-resistant crops are grown, glyphosate can be used for weed control in crops. While several glyphosate resistance genes are used in commercial glyphosate tolerant crops, there is interest in identifying additional genes for glyphosate tolerance. This research constructed a high-quality cDNA library form the glyphosate-resistant fungus Aspergillus oryzae RIB40 to identify genes that may confer resistance to glyphosate. Using a medium containing glyphosate (120mM), we screened several clones from the library. Based on a nucleotide sequence analysis, we identified a gene of unknown function (GenBank accession number: XM_001826835.2) that encoded a hypothetical 344-amino acid protein. The gene was named MFS40. Its ORF was amplified to construct an expression vector, pGEX-4T-1-MFS40, to express the protein in Escherichia coli BL21. The gene conferred glyphosate tolerance to E. coli ER2799 cells. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Functional analysis of the ATP-binding cassette (ABC) transporter gene family of Tribolium castaneum.

    Science.gov (United States)

    Broehan, Gunnar; Kroeger, Tobias; Lorenzen, Marcé; Merzendorfer, Hans

    2013-01-16

    The ATP-binding cassette (ABC) transporters belong to a large superfamily of proteins that have important physiological functions in all living organisms. Most are integral membrane proteins that transport a broad spectrum of substrates across lipid membranes. In insects, ABC transporters are of special interest because of their role in insecticide resistance. We have identified 73 ABC transporter genes in the genome of T. castaneum, which group into eight subfamilies (ABCA-H). This coleopteran ABC family is significantly larger than those reported for insects in other taxonomic groups. Phylogenetic analysis revealed that this increase is due to gene expansion within a single clade of subfamily ABCC. We performed an RNA interference (RNAi) screen to study the function of ABC transporters during development. In ten cases, injection of double-stranded RNA (dsRNA) into larvae caused developmental phenotypes, which included growth arrest and localized melanization, eye pigmentation defects, abnormal cuticle formation, egg-laying and egg-hatching defects, and mortality due to abortive molting and desiccation. Some of the ABC transporters we studied in closer detail to examine their role in lipid, ecdysteroid and eye pigment transport. The results from our study provide new insights into the physiological function of ABC transporters in T. castaneum, and may help to establish new target sites for insect control.

  14. Human Intellectual Disability Genes Form Conserved Functional Modules in Drosophila

    Science.gov (United States)

    Oortveld, Merel A. W.; Keerthikumar, Shivakumar; Oti, Martin; Nijhof, Bonnie; Fernandes, Ana Clara; Kochinke, Korinna; Castells-Nobau, Anna; van Engelen, Eva; Ellenkamp, Thijs; Eshuis, Lilian; Galy, Anne; van Bokhoven, Hans; Habermann, Bianca; Brunner, Han G.; Zweier, Christiane; Verstreken, Patrik; Huynen, Martijn A.; Schenck, Annette

    2013-01-01

    Intellectual Disability (ID) disorders, defined by an IQ below 70, are genetically and phenotypically highly heterogeneous. Identification of common molecular pathways underlying these disorders is crucial for understanding the molecular basis of cognition and for the development of therapeutic intervention strategies. To systematically establish their functional connectivity, we used transgenic RNAi to target 270 ID gene orthologs in the Drosophila eye. Assessment of neuronal function in behavioral and electrophysiological assays and multiparametric morphological analysis identified phenotypes associated with knockdown of 180 ID gene orthologs. Most of these genotype-phenotype associations were novel. For example, we uncovered 16 genes that are required for basal neurotransmission and have not previously been implicated in this process in any system or organism. ID gene orthologs with morphological eye phenotypes, in contrast to genes without phenotypes, are relatively highly expressed in the human nervous system and are enriched for neuronal functions, suggesting that eye phenotyping can distinguish different classes of ID genes. Indeed, grouping genes by Drosophila phenotype uncovered 26 connected functional modules. Novel links between ID genes successfully predicted that MYCN, PIGV and UPF3B regulate synapse development. Drosophila phenotype groups show, in addition to ID, significant phenotypic similarity also in humans, indicating that functional modules are conserved. The combined data indicate that ID disorders, despite their extreme genetic diversity, are caused by disruption of a limited number of highly connected functional modules. PMID:24204314

  15. Cloning and functional analysis of 5'-upstream region of the Pokemon gene.

    Science.gov (United States)

    Yang, Yutao; Zhou, Xiaowei; Zhu, Xudong; Zhang, Chuanfu; Yang, Zhixin; Xu, Long; Huang, Peitang

    2008-04-01

    Pokemon, the POK erythroid myeloid ontogenic factor, not only regulates the expression of many genes, but also plays an important role in cell tumorigenesis. To investigate the molecular mechanism regulating expression of the Pokemon gene in humans, its 5'-upstream region was cloned and analyzed. Transient analysis revealed that the Pokemon promoter is constitutive. Deletion analysis and a DNA decoy assay indicated that the NEG-U and NEG-D elements were involved in negative regulation of the Pokemon promoter, whereas the POS-D element was mainly responsible for its strong activity. Electrophoretic mobility shift assays suggested that the NEG-U, NEG-D and POS-D elements were specifically bound by the nuclear extract from A549 cells in vitro. Mutation analysis demonstrated that cooperation of the NEG-U and NEG-D elements led to negative regulation of the Pokemon promoter. Moreover, the NEG-U and NEG-D elements needed to be an appropriate distance apart in the Pokemon promoter in order to cooperate. Taken together, our results elucidate the mechanism underlying the regulation of Pokemon gene transcription, and also define a novel regulatory sequence that may be used to decrease expression of the Pokemon gene in cancer gene therapy.

  16. New Dimensions in Microbial Ecology—Functional Genes in Studies to Unravel the Biodiversity and Role of Functional Microbial Groups in the Environment

    Science.gov (United States)

    Imhoff, Johannes F.

    2016-01-01

    During the past decades, tremendous advances have been made in the possibilities to study the diversity of microbial communities in the environment. The development of methods to study these communities on the basis of 16S rRNA gene sequences analysis was a first step into the molecular analysis of environmental communities and the study of biodiversity in natural habitats. A new dimension in this field was reached with the introduction of functional genes of ecological importance and the establishment of genetic tools to study the diversity of functional microbial groups and their responses to environmental factors. Functional gene approaches are excellent tools to study the diversity of a particular function and to demonstrate changes in the composition of prokaryote communities contributing to this function. The phylogeny of many functional genes largely correlates with that of the 16S rRNA gene, and microbial species may be identified on the basis of functional gene sequences. Functional genes are perfectly suited to link culture-based microbiological work with environmental molecular genetic studies. In this review, the development of functional gene studies in environmental microbiology is highlighted with examples of genes relevant for important ecophysiological functions. Examples are presented for bacterial photosynthesis and two types of anoxygenic phototrophic bacteria, with genes of the Fenna-Matthews-Olson-protein (fmoA) as target for the green sulfur bacteria and of two reaction center proteins (pufLM) for the phototrophic purple bacteria, with genes of adenosine-5′phosphosulfate (APS) reductase (aprA), sulfate thioesterase (soxB) and dissimilatory sulfite reductase (dsrAB) for sulfur oxidizing and sulfate reducing bacteria, with genes of ammonia monooxygenase (amoA) for nitrifying/ammonia-oxidizing bacteria, with genes of particulate nitrate reductase and nitrite reductases (narH/G, nirS, nirK) for denitrifying bacteria and with genes of methane

  17. New Dimensions in Microbial Ecology—Functional Genes in Studies to Unravel the Biodiversity and Role of Functional Microbial Groups in the Environment

    Directory of Open Access Journals (Sweden)

    Johannes F. Imhoff

    2016-05-01

    Full Text Available During the past decades, tremendous advances have been made in the possibilities to study the diversity of microbial communities in the environment. The development of methods to study these communities on the basis of 16S rRNA gene sequences analysis was a first step into the molecular analysis of environmental communities and the study of biodiversity in natural habitats. A new dimension in this field was reached with the introduction of functional genes of ecological importance and the establishment of genetic tools to study the diversity of functional microbial groups and their responses to environmental factors. Functional gene approaches are excellent tools to study the diversity of a particular function and to demonstrate changes in the composition of prokaryote communities contributing to this function. The phylogeny of many functional genes largely correlates with that of the 16S rRNA gene, and microbial species may be identified on the basis of functional gene sequences. Functional genes are perfectly suited to link culture-based microbiological work with environmental molecular genetic studies. In this review, the development of functional gene studies in environmental microbiology is highlighted with examples of genes relevant for important ecophysiological functions. Examples are presented for bacterial photosynthesis and two types of anoxygenic phototrophic bacteria, with genes of the Fenna-Matthews-Olson-protein (fmoA as target for the green sulfur bacteria and of two reaction center proteins (pufLM for the phototrophic purple bacteria, with genes of adenosine-5′phosphosulfate (APS reductase (aprA, sulfate thioesterase (soxB and dissimilatory sulfite reductase (dsrAB for sulfur oxidizing and sulfate reducing bacteria, with genes of ammonia monooxygenase (amoA for nitrifying/ammonia-oxidizing bacteria, with genes of particulate nitrate reductase and nitrite reductases (narH/G, nirS, nirK for denitrifying bacteria and with genes

  18. Type 1 plaminogen activator inhibitor gene: Functional analysis and glucocorticoid regulation of its promoter

    International Nuclear Information System (INIS)

    Van Zonneveld, A.J.; Curriden, S.A.; Loskutoff, D.J.

    1988-01-01

    Plasminogen activator inhibitor type 1 is an important component of the fibrinolytic system and its biosynthesis is subject to complex regulation. To study this regulation at the level of transcription, the authors have identified and sequenced the promoter of the human plasminogen activator inhibitor type 1 gene. Nuclease protection experiments were performed by using endothelial cell mRNA and the transcription initiation (cap) site was established. Sequence analysis of the 5' flanking region of the gene revealed a perfect TATA box at position -28 to position -23, the conserved distance from the cap site. Comparative functional studies with the firefly luciferase gene as a reporter gene showed that fragments derived from this 5' flanking region exhibited high promoter activity when transfected into bovine aortic endothelial cells and mouse Ltk - fibroblasts but were inactive when introduced into HeLa cells. These studies indicate that the fragments contain the plasminogen activator inhibitor type 1 promoter and that it is expressed in a tissue-specific manner. Although the fragments were also silent in rat FTO2B hepatoma cells, their promoter activity could be induced up to 40-fold with the synthetic glucocorticoid dexamethasone. Promoter deletion mapping experiments and studies involving the fusion of promoter fragments to a heterologous gene indicated that dexamethasone induction is mediated by a glucocorticoid responsive element with enhancer-like properties located within the region between nucleotides -305 and +75 of the plasminogen activator inhibitor type 1 gene

  19. Text analysis of MEDLINE for discovering functional relationships among genes: evaluation of keyword extraction weighting schemes.

    Science.gov (United States)

    Liu, Ying; Navathe, Shamkant B; Pivoshenko, Alex; Dasigi, Venu G; Dingledine, Ray; Ciliax, Brian J

    2006-01-01

    One of the key challenges of microarray studies is to derive biological insights from the gene-expression patterns. Clustering genes by functional keyword association can provide direct information about the functional links among genes. However, the quality of the keyword lists significantly affects the clustering results. We compared two keyword weighting schemes: normalised z-score and term frequency-inverse document frequency (TFIDF). Two gene sets were tested to evaluate the effectiveness of the weighting schemes for keyword extraction for gene clustering. Using established measures of cluster quality, the results produced from TFIDF-weighted keywords outperformed those produced from normalised z-score weighted keywords. The optimised algorithms should be useful for partitioning genes from microarray lists into functionally discrete clusters.

  20. Proteome Profiling Outperforms Transcriptome Profiling for Coexpression Based Gene Function Prediction

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Jing; Ma, Zihao; Carr, Steven A.; Mertins, Philipp; Zhang, Hui; Zhang, Zhen; Chan, Daniel W.; Ellis, Matthew J. C.; Townsend, R. Reid; Smith, Richard D.; McDermott, Jason E.; Chen, Xian; Paulovich, Amanda G.; Boja, Emily S.; Mesri, Mehdi; Kinsinger, Christopher R.; Rodriguez, Henry; Rodland, Karin D.; Liebler, Daniel C.; Zhang, Bing

    2016-11-11

    Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies

  1. Recent adaptive events in human brain revealed by meta-analysis of positively selected genes.

    Directory of Open Access Journals (Sweden)

    Yue Huang

    Full Text Available BACKGROUND AND OBJECTIVES: Analysis of positively-selected genes can help us understand how human evolved, especially the evolution of highly developed cognitive functions. However, previous works have reached conflicting conclusions regarding whether human neuronal genes are over-represented among genes under positive selection. METHODS AND RESULTS: We divided positively-selected genes into four groups according to the identification approaches, compiling a comprehensive list from 27 previous studies. We showed that genes that are highly expressed in the central nervous system are enriched in recent positive selection events in human history identified by intra-species genomic scan, especially in brain regions related to cognitive functions. This pattern holds when different datasets, parameters and analysis pipelines were used. Functional category enrichment analysis supported these findings, showing that synapse-related functions are enriched in genes under recent positive selection. In contrast, immune-related functions, for instance, are enriched in genes under ancient positive selection revealed by inter-species coding region comparison. We further demonstrated that most of these patterns still hold even after controlling for genomic characteristics that might bias genome-wide identification of positively-selected genes including gene length, gene density, GC composition, and intensity of negative selection. CONCLUSION: Our rigorous analysis resolved previous conflicting conclusions and revealed recent adaptation of human brain functions.

  2. In Silico Analysis of FMR1 Gene Missense SNPs.

    Science.gov (United States)

    Tekcan, Akin

    2016-06-01

    The FMR1 gene, a member of the fragile X-related gene family, is responsible for fragile X syndrome (FXS). Missense single-nucleotide polymorphisms (SNPs) are responsible for many complex diseases. The effect of FMR1 gene missense SNPs is unknown. The aim of this study, using in silico techniques, was to analyze all known missense mutations that can affect the functionality of the FMR1 gene, leading to mental retardation (MR) and FXS. Data on the human FMR1 gene were collected from the Ensembl database (release 81), National Centre for Biological Information dbSNP Short Genetic Variations database, 1000 Genomes Browser, and NHLBI Exome Sequencing Project Exome Variant Server. In silico analysis was then performed. One hundred-twenty different missense SNPs of the FMR1 gene were determined. Of these, 11.66 % of the FMR1 gene missense SNPs were in highly conserved domains, and 83.33 % were in domains with high variety. The results of the in silico prediction analysis showed that 31.66 % of the FMR1 gene SNPs were disease related and that 50 % of SNPs had a pathogenic effect. The results of the structural and functional analysis revealed that although the R138Q mutation did not seem to have a damaging effect on the protein, the G266E and I304N SNPs appeared to disturb the interaction between the domains and affect the function of the protein. This is the first study to analyze all missense SNPs of the FMR1 gene. The results indicate the applicability of a bioinformatics approach to FXS and other FMR1-related diseases. I think that the analysis of FMR1 gene missense SNPs using bioinformatics methods would help diagnosis of FXS and other FMR1-related diseases.

  3. EvoCor: a platform for predicting functionally related genes using phylogenetic and expression profiles.

    Science.gov (United States)

    Dittmar, W James; McIver, Lauren; Michalak, Pawel; Garner, Harold R; Valdez, Gregorio

    2014-07-01

    The wealth of publicly available gene expression and genomic data provides unique opportunities for computational inference to discover groups of genes that function to control specific cellular processes. Such genes are likely to have co-evolved and be expressed in the same tissues and cells. Unfortunately, the expertise and computational resources required to compare tens of genomes and gene expression data sets make this type of analysis difficult for the average end-user. Here, we describe the implementation of a web server that predicts genes involved in affecting specific cellular processes together with a gene of interest. We termed the server 'EvoCor', to denote that it detects functional relationships among genes through evolutionary analysis and gene expression correlation. This web server integrates profiles of sequence divergence derived by a Hidden Markov Model (HMM) and tissue-wide gene expression patterns to determine putative functional linkages between pairs of genes. This server is easy to use and freely available at http://pilot-hmm.vbi.vt.edu/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Drosha regulates gene expression independently of RNA cleavage function

    DEFF Research Database (Denmark)

    Gromak, Natalia; Dienstbier, Martin; Macias, Sara

    2013-01-01

    Drosha is the main RNase III-like enzyme involved in the process of microRNA (miRNA) biogenesis in the nucleus. Using whole-genome ChIP-on-chip analysis, we demonstrate that, in addition to miRNA sequences, Drosha specifically binds promoter-proximal regions of many human genes in a transcription......-dependent manner. This binding is not associated with miRNA production or RNA cleavage. Drosha knockdown in HeLa cells downregulated nascent gene transcription, resulting in a reduction of polyadenylated mRNA produced from these gene regions. Furthermore, we show that this function of Drosha is dependent on its N......-terminal protein-interaction domain, which associates with the RNA-binding protein CBP80 and RNA Polymerase II. Consequently, we uncover a previously unsuspected RNA cleavage-independent function of Drosha in the regulation of human gene expression....

  5. FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

    Science.gov (United States)

    Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2018-02-09

    Estimation of functional connectivity in gene sets derived from genome-wide or other biological experiments is one of the essential tasks of bioinformatics. A promising approach for solving this problem is to compare gene networks built using experimental gene sets with random networks. One of the resources that make such an analysis possible is CrossTalkZ, which uses the FunCoup database. However, existing methods, including CrossTalkZ, do not take into account individual types of interactions, such as protein/protein interactions, expression regulation, transport regulation, catalytic reactions, etc., but rather work with generalized types characterizing the existence of any connection between network members. We developed the online tool FunGeneNet, which utilizes the ANDSystem and STRING to reconstruct gene networks using experimental gene sets and to estimate their difference from random networks. To compare the reconstructed networks with random ones, the node permutation algorithm implemented in CrossTalkZ was taken as a basis. To study the FunGeneNet applicability, the functional connectivity analysis of networks constructed for gene sets involved in the Gene Ontology biological processes was conducted. We showed that the method sensitivity exceeds 0.8 at a specificity of 0.95. We found that the significance level of the difference between gene networks of biological processes and random networks is determined by the type of connections considered between objects. At the same time, the highest reliability is achieved for the generalized form of connections that takes into account all the individual types of connections. By taking examples of the thyroid cancer networks and the apoptosis network, it is demonstrated that key participants in these processes are involved in the interactions of those types by which these networks differ from random ones. FunGeneNet is a web tool aimed at proving the functionality of networks in a wide range of sizes of

  6. Annotating the Function of the Human Genome with Gene Ontology and Disease Ontology.

    Science.gov (United States)

    Hu, Yang; Zhou, Wenyang; Ren, Jun; Dong, Lixiang; Wang, Yadong; Jin, Shuilin; Cheng, Liang

    2016-01-01

    Increasing evidences indicated that function annotation of human genome in molecular level and phenotype level is very important for systematic analysis of genes. In this study, we presented a framework named Gene2Function to annotate Gene Reference into Functions (GeneRIFs), in which each functional description of GeneRIFs could be annotated by a text mining tool Open Biomedical Annotator (OBA), and each Entrez gene could be mapped to Human Genome Organisation Gene Nomenclature Committee (HGNC) gene symbol. After annotating all the records about human genes of GeneRIFs, 288,869 associations between 13,148 mRNAs and 7,182 terms, 9,496 associations between 948 microRNAs and 533 terms, and 901 associations between 139 long noncoding RNAs (lncRNAs) and 297 terms were obtained as a comprehensive annotation resource of human genome. High consistency of term frequency of individual gene (Pearson correlation = 0.6401, p = 2.2e - 16) and gene frequency of individual term (Pearson correlation = 0.1298, p = 3.686e - 14) in GeneRIFs and GOA shows our annotation resource is very reliable.

  7. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean) Cattle.

    Science.gov (United States)

    Lim, Dajeong; Lee, Seung-Hwan; Kim, Nam-Kuk; Cho, Yong-Min; Chai, Han-Ha; Seong, Hwan-Hoo; Kim, Heebal

    2013-01-01

    Marbling (intramuscular fat) is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the 'marbling score' trait and systemically analyzed the network topology in Hanwoo (Korean cattle). As a result, we determined 3 modules (gene groups) that showed statistically significant results for marbling score. In particular, one module (denoted as red) has a statistically significant result for marbling score (p = 0.008) and intramuscular fat (p = 0.02) and water capacity (p = 0.006). From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA) have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  8. Gene Co-expression Analysis to Characterize Genes Related to Marbling Trait in Hanwoo (Korean Cattle

    Directory of Open Access Journals (Sweden)

    Dajeong Lim

    2013-01-01

    Full Text Available Marbling (intramuscular fat is an important trait that affects meat quality and is a casual factor determining the price of beef in the Korean beef market. It is a complex trait and has many biological pathways related to muscle and fat. There is a need to identify functional modules or genes related to marbling traits and investigate their relationships through a weighted gene co-expression network analysis based on the system level. Therefore, we investigated the co-expression relationships of genes related to the ‘marbling score’ trait and systemically analyzed the network topology in Hanwoo (Korean cattle. As a result, we determined 3 modules (gene groups that showed statistically significant results for marbling score. In particular, one module (denoted as red has a statistically significant result for marbling score (p = 0.008 and intramuscular fat (p = 0.02 and water capacity (p = 0.006. From functional enrichment and relationship analysis of the red module, the pathway hub genes (IL6, CHRNE, RB1, INHBA and NPPA have a direct interaction relationship and share the biological functions related to fat or muscle, such as adipogenesis or muscle growth. This is the first gene network study with m.logissimus in Hanwoo to observe co-expression patterns in divergent marbling phenotypes. It may provide insights into the functional mechanisms of the marbling trait.

  9. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members

    Directory of Open Access Journals (Sweden)

    Tianyu Zhou

    2015-01-01

    Full Text Available Peroxisome proliferators-activated receptor (PPAR gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3′ UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3′ UTR are essential for PPARs evolution and diversity functions acquired.

  10. Evolutionary Pattern and Regulation Analysis to Support Why Diversity Functions Existed within PPAR Gene Family Members.

    Science.gov (United States)

    Zhou, Tianyu; Yan, Xiping; Wang, Guosong; Liu, Hehe; Gan, Xiang; Zhang, Tao; Wang, Jiwen; Li, Liang

    2015-01-01

    Peroxisome proliferators-activated receptor (PPAR) gene family members exhibit distinct patterns of distribution in tissues and differ in functions. The purpose of this study is to investigate the evolutionary impacts on diversity functions of PPAR members and the regulatory differences on gene expression patterns. 63 homology sequences of PPAR genes from 31 species were collected and analyzed. The results showed that three isolated types of PPAR gene family may emerge from twice times of gene duplication events. The conserved domains of HOLI (ligand binding domain of hormone receptors) domain and ZnF_C4 (C4 zinc finger in nuclear in hormone receptors) are essential for keeping basic roles of PPAR gene family, and the variant domains of LCRs may be responsible for their divergence in functions. The positive selection sites in HOLI domain are benefit for PPARs to evolve towards diversity functions. The evolutionary variants in the promoter regions and 3' UTR regions of PPARs result into differential transcription factors and miRNAs involved in regulating PPAR members, which may eventually affect their expressions and tissues distributions. These results indicate that gene duplication event, selection pressure on HOLI domain, and the variants on promoter and 3' UTR are essential for PPARs evolution and diversity functions acquired.

  11. Chronic obstructive pulmonary disease candidate gene prioritization based on metabolic networks and functional information.

    Directory of Open Access Journals (Sweden)

    Xinyan Wang

    Full Text Available Chronic obstructive pulmonary disease (COPD is a multi-factor disease, in which metabolic disturbances played important roles. In this paper, functional information was integrated into a COPD-related metabolic network to assess similarity between genes. Then a gene prioritization method was applied to the COPD-related metabolic network to prioritize COPD candidate genes. The gene prioritization method was superior to ToppGene and ToppNet in both literature validation and functional enrichment analysis. Top-ranked genes prioritized from the metabolic perspective with functional information could promote the better understanding about the molecular mechanism of this disease. Top 100 genes might be potential markers for diagnostic and effective therapies.

  12. Microbial Functional Gene Diversity Predicts Groundwater Contamination and Ecosystem Functioning

    Science.gov (United States)

    Zhang, Ping; Wu, Linwei; Rocha, Andrea M.; Shi, Zhou; Wu, Bo; Qin, Yujia; Wang, Jianjun; Yan, Qingyun; Curtis, Daniel; Ning, Daliang; Van Nostrand, Joy D.; Wu, Liyou; Watson, David B.; Adams, Michael W. W.; Alm, Eric J.; Adams, Paul D.; Arkin, Adam P.

    2018-01-01

    ABSTRACT Contamination from anthropogenic activities has significantly impacted Earth’s biosphere. However, knowledge about how environmental contamination affects the biodiversity of groundwater microbiomes and ecosystem functioning remains very limited. Here, we used a comprehensive functional gene array to analyze groundwater microbiomes from 69 wells at the Oak Ridge Field Research Center (Oak Ridge, TN), representing a wide pH range and uranium, nitrate, and other contaminants. We hypothesized that the functional diversity of groundwater microbiomes would decrease as environmental contamination (e.g., uranium or nitrate) increased or at low or high pH, while some specific populations capable of utilizing or resistant to those contaminants would increase, and thus, such key microbial functional genes and/or populations could be used to predict groundwater contamination and ecosystem functioning. Our results indicated that functional richness/diversity decreased as uranium (but not nitrate) increased in groundwater. In addition, about 5.9% of specific key functional populations targeted by a comprehensive functional gene array (GeoChip 5) increased significantly (P contamination and ecosystem functioning. This study indicates great potential for using microbial functional genes to predict environmental contamination and ecosystem functioning. PMID:29463661

  13. Functionally enigmatic genes: a case study of the brain ignorome.

    Directory of Open Access Journals (Sweden)

    Ashutosh K Pandey

    Full Text Available What proportion of genes with intense and selective expression in specific tissues, cells, or systems are still almost completely uncharacterized with respect to biological function? In what ways do these functionally enigmatic genes differ from well-studied genes? To address these two questions, we devised a computational approach that defines so-called ignoromes. As proof of principle, we extracted and analyzed a large subset of genes with intense and selective expression in brain. We find that publications associated with this set are highly skewed--the top 5% of genes absorb 70% of the relevant literature. In contrast, approximately 20% of genes have essentially no neuroscience literature. Analysis of the ignorome over the past decade demonstrates that it is stubbornly persistent, and the rapid expansion of the neuroscience literature has not had the expected effect on numbers of these genes. Surprisingly, ignorome genes do not differ from well-studied genes in terms of connectivity in coexpression networks. Nor do they differ with respect to numbers of orthologs, paralogs, or protein domains. The major distinguishing characteristic between these sets of genes is date of discovery, early discovery being associated with greater research momentum--a genomic bandwagon effect. Finally we ask to what extent massive genomic, imaging, and phenotype data sets can be used to provide high-throughput functional annotation for an entire ignorome. In a majority of cases we have been able to extract and add significant information for these neglected genes. In several cases--ELMOD1, TMEM88B, and DZANK1--we have exploited sequence polymorphisms, large phenome data sets, and reverse genetic methods to evaluate the function of ignorome genes.

  14. Functionally enigmatic genes: a case study of the brain ignorome.

    Science.gov (United States)

    Pandey, Ashutosh K; Lu, Lu; Wang, Xusheng; Homayouni, Ramin; Williams, Robert W

    2014-01-01

    What proportion of genes with intense and selective expression in specific tissues, cells, or systems are still almost completely uncharacterized with respect to biological function? In what ways do these functionally enigmatic genes differ from well-studied genes? To address these two questions, we devised a computational approach that defines so-called ignoromes. As proof of principle, we extracted and analyzed a large subset of genes with intense and selective expression in brain. We find that publications associated with this set are highly skewed--the top 5% of genes absorb 70% of the relevant literature. In contrast, approximately 20% of genes have essentially no neuroscience literature. Analysis of the ignorome over the past decade demonstrates that it is stubbornly persistent, and the rapid expansion of the neuroscience literature has not had the expected effect on numbers of these genes. Surprisingly, ignorome genes do not differ from well-studied genes in terms of connectivity in coexpression networks. Nor do they differ with respect to numbers of orthologs, paralogs, or protein domains. The major distinguishing characteristic between these sets of genes is date of discovery, early discovery being associated with greater research momentum--a genomic bandwagon effect. Finally we ask to what extent massive genomic, imaging, and phenotype data sets can be used to provide high-throughput functional annotation for an entire ignorome. In a majority of cases we have been able to extract and add significant information for these neglected genes. In several cases--ELMOD1, TMEM88B, and DZANK1--we have exploited sequence polymorphisms, large phenome data sets, and reverse genetic methods to evaluate the function of ignorome genes.

  15. Identification, characterization and functional analysis of regulatory region of nanos gene from half-smooth tongue sole (Cynoglossus semilaevis).

    Science.gov (United States)

    Huang, Jinqiang; Li, Yongjuan; Shao, Changwei; Wang, Na; Chen, Songlin

    2017-06-20

    The nanos gene encodes an RNA-binding zinc finger protein, which is required in the development and maintenance of germ cells. However, there is very limited information about nanos in flatfish, which impedes its application in fish breeding. In this study, we report the molecular cloning, characterization and functional analysis of the 3'-untranslated region of the nanos gene (Csnanos) from half-smooth tongue sole (Cynoglossus semilaevis), which is an economically important flatfish in China. The 1233-bp cDNA sequence, 1709-bp genomic sequence and flanking sequences (2.8-kb 5'- and 1.6-kb 3'-flanking regions) of Csnanos were cloned and characterized. Sequence analysis revealed that CsNanos shares low homology with Nanos in other species, but the zinc finger domain of CsNanos is highly similar. Phylogenetic analysis indicated that CsNanos belongs to the Nanos2 subfamily. Csnanos expression was widely detected in various tissues, but the expression level was higher in testis and ovary. During early development and sex differentiation, Csnanos expression exhibited a clear sexually dimorphic pattern, suggesting its different roles in the migration and differentiation of primordial germ cells (PGCs). Higher expression levels of Csnanos mRNA in normal females and males than in neomales indicated that the nanos gene may play key roles in maintaining the differentiation of gonad. Moreover, medaka PGCs were successfully labeled by the microinjection of synthesized mRNA consisting of green fluorescence protein and the 3'-untranslated region of Csnanos. These findings provide new insights into nanos gene expression and function, and lay the foundation for further study of PGC development and applications in tongue sole breeding. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Identification of a novel uromodulin-like gene related to predator-induced bulgy morph in anuran tadpoles by functional microarray analysis.

    Directory of Open Access Journals (Sweden)

    Tsukasa Mori

    2009-06-01

    Full Text Available Tadpoles of the anuran species Rana pirica can undergo predator-specific morphological responses. Exposure to a predation threat by larvae of the salamander Hynobius retardatus results in formation of a bulgy body (bulgy morph with a higher tail. The tadpoles revert to a normal phenotype upon removal of the larval salamander threat. Although predator-induced phenotypic plasticity is of major interest to evolutionary ecologists, the molecular and physiological mechanisms that control this response have yet to be elucidated. In a previous study, we identified various genes that are expressed in the skin of the bulgy morph. However, it proved difficult to determine which of these were key genes in the control of gene expression associated with the bulgy phenotype. Here, we show that a novel gene plays an important role in the phenotypic plasticity producing the bulgy morph. A functional microarray analysis using facial tissue samples of control and bulgy morph tadpoles identified candidate functional genes for predator-specific morphological responses. A larger functional microarray was prepared than in the previous study and used to analyze mRNAs extracted from facial and brain tissues of tadpoles from induction-reversion experiments. We found that a novel uromodulin-like gene, which we name here pirica, was up-regulated and that keratin genes were down-regulated as the period of exposure to larval salamanders increased. Pirica consists of a 1296 bp open reading frame, which is putatively translated into a protein of 432 amino acids. The protein contains a zona pellucida domain similar to that of proteins that function to control water permeability. We found that the gene was expressed in the superficial epidermis of the tadpole skin.

  17. Analysis of multiplex gene expression maps obtained by voxelation

    Directory of Open Access Journals (Sweden)

    Smith Desmond J

    2009-04-01

    Full Text Available Abstract Background Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. Results To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in

  18. Analysis of multiplex gene expression maps obtained by voxelation.

    Science.gov (United States)

    An, Li; Xie, Hongbo; Chin, Mark H; Obradovic, Zoran; Smith, Desmond J; Megalooikonomou, Vasileios

    2009-04-29

    Gene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions. To analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum. The experimental

  19. Separate enrichment analysis of pathways for up- and downregulated genes.

    Science.gov (United States)

    Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng

    2014-03-06

    Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.

  20. Digital Gene Expression Profiling Analysis of Aged Mice under Moxibustion Treatment

    Directory of Open Access Journals (Sweden)

    Nan Liu

    2018-01-01

    Full Text Available Aging is closely connected with death, progressive physiological decline, and increased risk of diseases, such as cancer, arteriosclerosis, heart disease, hypertension, and neurodegenerative diseases. It is reported that moxibustion can treat more than 300 kinds of diseases including aging related problems and can improve immune function and physiological functions. The digital gene expression profiling of aged mice with or without moxibustion treatment was investigated and the mechanisms of moxibustion in aged mice were speculated by gene ontology and pathway analysis in the study. Almost 145 million raw reads were obtained by digital gene expression analysis and about 140 million (96.55% were clean reads. Five differentially expressed genes with an adjusted P value 1 were identified between the control and moxibustion groups. They were Gm6563, Gm8116, Rps26-ps1, Nat8f4, and Igkv3-12. Gene ontology analysis was carried out by the GOseq R package and functional annotations of the differentially expressed genes related to translation, mRNA export from nucleus, mRNA transport, nuclear body, acetyltransferase activity, and so on. Kyoto Encyclopedia of Genes and Genomes database was used for pathway analysis and ribosome was the most significantly enriched pathway term.

  1. Genome-wide analysis of the GRAS gene family in Prunus mume.

    Science.gov (United States)

    Lu, Jiuxing; Wang, Tao; Xu, Zongda; Sun, Lidan; Zhang, Qixiang

    2015-02-01

    Prunus mume is an ornamental flower and fruit tree in Rosaceae. We investigated the GRAS gene family to improve the breeding and cultivation of P. mume and other Rosaceae fruit trees. The GRAS gene family encodes transcriptional regulators that have diverse functions in plant growth and development, such as gibberellin and phytochrome A signal transduction, root radial patterning, and axillary meristem formation and gametogenesis in the P. mume genome. Despite the important roles of these genes in plant growth regulation, no findings on the GRAS genes of P. mume have been reported. In this study, we discerned phylogenetic relationships of P. mume GRAS genes, and their locations, structures in the genome and expression levels of different tissues. Out of 46 identified GRAS genes, 45 were located on the 8 P. mume chromosomes. Phylogenetic results showed that these genes could be classified into 11 groups. We found that Group X was P. mume-specific, and three genes of Group IX clustered with the rice-specific gene Os4. We speculated that these genes existed before the divergence of dicotyledons and monocotyledons and were lost in Arabidopsis. Tissue expression analysis indicated that 13 genes showed high expression levels in roots, stems, leaves, flowers and fruits, and were related to plant growth and development. Functional analysis of 24 GRAS genes and an orthologous relationship analysis indicated that many functioned during plant growth and flower and fruit development. Our bioinformatics analysis provides valuable information to improve the economic, agronomic and ecological benefits of P. mume and other Rosaceae fruit trees.

  2. Genome-wide analysis of the WRKY gene family in cotton.

    Science.gov (United States)

    Dou, Lingling; Zhang, Xiaohong; Pang, Chaoyou; Song, Meizhen; Wei, Hengling; Fan, Shuli; Yu, Shuxun

    2014-12-01

    WRKY proteins are major transcription factors involved in regulating plant growth and development. Although many studies have focused on the functional identification of WRKY genes, our knowledge concerning many areas of WRKY gene biology is limited. For example, in cotton, the phylogenetic characteristics, global expression patterns, molecular mechanisms regulating expression, and target genes/pathways of WRKY genes are poorly characterized. Therefore, in this study, we present a genome-wide analysis of the WRKY gene family in cotton (Gossypium raimondii and Gossypium hirsutum). We identified 116 WRKY genes in G. raimondii from the completed genome sequence, and we cloned 102 WRKY genes in G. hirsutum. Chromosomal location analysis indicated that WRKY genes in G. raimondii evolved mainly from segmental duplication followed by tandem amplifications. Phylogenetic analysis of alga, bryophyte, lycophyta, monocot and eudicot WRKY domains revealed family member expansion with increasing complexity of the plant body. Microarray, expression profiling and qRT-PCR data revealed that WRKY genes in G. hirsutum may regulate the development of fibers, anthers, tissues (roots, stems, leaves and embryos), and are involved in the response to stresses. Expression analysis showed that most group II and III GhWRKY genes are highly expressed under diverse stresses. Group I members, representing the ancestral form, seem to be insensitive to abiotic stress, with low expression divergence. Our results indicate that cotton WRKY genes might have evolved by adaptive duplication, leading to sensitivity to diverse stresses. This study provides fundamental information to inform further analysis and understanding of WRKY gene functions in cotton species.

  3. Functional validation of GWAS gene candidates for abnormal liver function during zebrafish liver development

    Directory of Open Access Journals (Sweden)

    Leah Y. Liu

    2013-09-01

    Genome-wide association studies (GWAS have revealed numerous associations between many phenotypes and gene candidates. Frequently, however, further elucidation of gene function has not been achieved. A recent GWAS identified 69 candidate genes associated with elevated liver enzyme concentrations, which are clinical markers of liver disease. To investigate the role of these genes in liver homeostasis, we narrowed down this list to 12 genes based on zebrafish orthology, zebrafish liver expression and disease correlation. To assess the function of gene candidates during liver development, we assayed hepatic progenitors at 48 hours post fertilization (hpf and hepatocytes at 72 hpf using in situ hybridization following morpholino knockdown in zebrafish embryos. Knockdown of three genes (pnpla3, pklr and mapk10 decreased expression of hepatic progenitor cells, whereas knockdown of eight genes (pnpla3, cpn1, trib1, fads2, slc2a2, pklr, mapk10 and samm50 decreased cell-specific hepatocyte expression. We then induced liver injury in zebrafish embryos using acetaminophen exposure and observed changes in liver toxicity incidence in morphants. Prioritization of GWAS candidates and morpholino knockdown expedites the study of newly identified genes impacting liver development and represents a feasible method for initial assessment of candidate genes to instruct further mechanistic analyses. Our analysis can be extended to GWAS for additional disease-associated phenotypes.

  4. Knowledge Enrichment Analysis for Human Tissue- Specific Genes Uncover New Biological Insights

    Directory of Open Access Journals (Sweden)

    Gong Xiu-Jun

    2012-06-01

    Full Text Available The expression and regulation of genes in different tissues are fundamental questions to be answered in biology. Knowledge enrichment analysis for tissue specific (TS and housekeeping (HK genes may help identify their roles in biological process or diseases and gain new biological insights.In this paper, we performed the knowledge enrichment analysis for 17,343 genes in 84 human tissues using Gene Set Enrichment Analysis (GSEA and Hypergeometric Analysis (HA against three biological ontologies: Gene Ontology (GO, KEGG pathways and Disease Ontology (DO respectively.The analyses results demonstrated that the functions of most gene groups are consistent with their tissue origins. Meanwhile three interesting new associations for HK genes and the skeletal muscle tissuegenes are found. Firstly, Hypergeometric analysis against KEGG database for HK genes disclosed that three disease terms (Parkinson’s disease, Huntington’s disease, Alzheimer’s disease are intensively enriched.Secondly, Hypergeometric analysis against the KEGG database for Skeletal Muscle tissue genes shows that two cardiac diseases of “Hypertrophic cardiomyopathy (HCM” and “Arrhythmogenic right ventricular cardiomyopathy (ARVC” are heavily enriched, which are also considered as no relationship with skeletal functions.Thirdly, “Prostate cancer” is intensively enriched in Hypergeometric analysis against the disease ontology (DO for the Skeletal Muscle tissue genes, which is a much unexpected phenomenon.

  5. Singular Perturbation Analysis and Gene Regulatory Networks with Delay

    Science.gov (United States)

    Shlykova, Irina; Ponosov, Arcady

    2009-09-01

    There are different ways of how to model gene regulatory networks. Differential equations allow for a detailed description of the network's dynamics and provide an explicit model of the gene concentration changes over time. Production and relative degradation rate functions used in such models depend on the vector of steeply sloped threshold functions which characterize the activity of genes. The most popular example of the threshold functions comes from the Boolean network approach, where the threshold functions are given by step functions. The system of differential equations becomes then piecewise linear. The dynamics of this system can be described very easily between the thresholds, but not in the switching domains. For instance this approach fails to analyze stationary points of the system and to define continuous solutions in the switching domains. These problems were studied in [2], [3], but the proposed model did not take into account a time delay in cellular systems. However, analysis of real gene expression data shows a considerable number of time-delayed interactions suggesting that time delay is essential in gene regulation. Therefore, delays may have a great effect on the dynamics of the system presenting one of the critical factors that should be considered in reconstruction of gene regulatory networks. The goal of this work is to apply the singular perturbation analysis to certain systems with delay and to obtain an analog of Tikhonov's theorem, which provides sufficient conditions for constracting the limit system in the delay case.

  6. Gene mapping and functional analysis of the novel leaf color gene SiYGL1 in foxtail millet [Setaria italica (L.) P. Beauv].

    Science.gov (United States)

    Li, Wen; Tang, Sha; Zhang, Shuo; Shan, Jianguo; Tang, Chanjuan; Chen, Qiannan; Jia, Guanqing; Han, Yuanhuai; Zhi, Hui; Diao, Xianmin

    2016-05-01

    Setaria italica and its wild ancestor Setaria viridis are emerging as model systems for genetics and functional genomics research. However, few systematic gene mapping or functional analyses have been reported in these promising C4 models. We herein isolated the yellow-green leaf mutant (siygl1) in S. italica using forward genetics approaches. Map-based cloning revealed that SiYGL1, which is a recessive nuclear gene encoding a magnesium-chelatase D subunit (CHLD), is responsible for the mutant phenotype. A single Phe to Leu amino acid change occurring near the ATPase-conserved domain resulted in decreased chlorophyll (Chl) accumulation and modified chloroplast ultrastructure. However, the mutation enhanced the light-use efficiency of the siygl1 mutant, suggesting that the mutated CHLD protein does not completely lose its original activity, but instead, gains novel features. A transcriptional analysis of Chl a oxygenase revealed that there is a strong negative feedback control of Chl b biosynthesis in S. italica. The SiYGL1 mRNA was expressed in all examined tissues, with higher expression observed in the leaves. Comparison of gene expression profiles in wild-type and siygl1 mutant plants indicated that SiYGL1 regulates a subset of genes involved in photosynthesis (rbcL and LHCB1), thylakoid development (DEG2) and chloroplast signaling (SRP54CP). These results provide information regarding the mutant phenotype at the transcriptional level. This study demonstrated that the genetic material of a Setaria species could be ideal for gene discovery investigations using forward genetics approaches and may help to explain the molecular mechanisms associated with leaf color variation. © 2015 Scandinavian Plant Physiology Society.

  7. Microbial Functional Gene Diversity Predicts Groundwater Contamination and Ecosystem Functioning

    Directory of Open Access Journals (Sweden)

    Zhili He

    2018-02-01

    Full Text Available Contamination from anthropogenic activities has significantly impacted Earth’s biosphere. However, knowledge about how environmental contamination affects the biodiversity of groundwater microbiomes and ecosystem functioning remains very limited. Here, we used a comprehensive functional gene array to analyze groundwater microbiomes from 69 wells at the Oak Ridge Field Research Center (Oak Ridge, TN, representing a wide pH range and uranium, nitrate, and other contaminants. We hypothesized that the functional diversity of groundwater microbiomes would decrease as environmental contamination (e.g., uranium or nitrate increased or at low or high pH, while some specific populations capable of utilizing or resistant to those contaminants would increase, and thus, such key microbial functional genes and/or populations could be used to predict groundwater contamination and ecosystem functioning. Our results indicated that functional richness/diversity decreased as uranium (but not nitrate increased in groundwater. In addition, about 5.9% of specific key functional populations targeted by a comprehensive functional gene array (GeoChip 5 increased significantly (P < 0.05 as uranium or nitrate increased, and their changes could be used to successfully predict uranium and nitrate contamination and ecosystem functioning. This study indicates great potential for using microbial functional genes to predict environmental contamination and ecosystem functioning.

  8. Expressed sequence tag analysis of functional genes associated with adventitious rooting in Liriodendron hybrids.

    Science.gov (United States)

    Zhong, Y D; Sun, X Y; Liu, E Y; Li, Y Q; Gao, Z; Yu, F X

    2016-06-24

    Liriodendron hybrids (Liriodendron chinense x L. tulipifera) are important landscaping and afforestation hardwood trees. To date, little genomic research on adventitious rooting has been reported in these hybrids, as well as in the genus Liriodendron. In the present study, we used adventitious roots to construct the first cDNA library for Liriodendron hybrids. A total of 5176 expressed sequence tags (ESTs) were generated and clustered into 2921 unigenes. Among these unigenes, 2547 had significant homology to the non-redundant protein database representing a wide variety of putative functions. Homologs of these genes regulated many aspects of adventitious rooting, including those for auxin signal transduction and root hair development. Results of quantitative real-time polymerase chain reaction showed that AUX1, IRE, and FB1 were highly expressed in adventitious roots and the expression of AUX1, ARF1, NAC1, RHD1, and IRE increased during the development of adventitious roots. Additionally, 181 simple sequence repeats were identified from 166 ESTs and more than 91.16% of these were dinucleotide and trinucleotide repeats. To the best of our knowledge, the present study reports the identification of the genes associated with adventitious rooting in the genus Liriodendron for the first time and provides a valuable resource for future genomic studies. Expression analysis of selected genes could allow us to identify regulatory genes that may be essential for adventitious rooting.

  9. ADAGE signature analysis: differential expression analysis with data-defined gene sets.

    Science.gov (United States)

    Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S

    2017-11-22

    Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed

  10. Genome-wide analysis of immune system genes by EST profiling

    Science.gov (United States)

    Giallourakis, Cosmas; Benita, Yair; Molinie, Benoit; Cao, Zhifang; Despo, Orion; Pratt, Henry E.; Zukerberg, Lawrence R.; Daly, Mark J.; Rioux, John D.; Xavier, Ramnik J.

    2013-01-01

    Profiling studies of mRNA and miRNA, particularly microarray-based studies, have been extensively used to create compendia of genes that are preferentially expressed in the immune system. In some instances, functional studies have been subsequently pursued. Recent efforts such as ENCODE have demonstrated the benefit of coupling RNA-Seq analysis with information from expressed sequence tags (ESTs) for transcriptomic analysis. However, the full characterization and identification of transcripts that function as modulators of human immune responses remains incomplete. In this study, we demonstrate that an integrated analysis of human ESTs provides a robust platform to identify the immune transcriptome. Beyond recovering a reference set of immune-enriched genes and providing large-scale cross-validation of previous microarray studies, we discovered hundreds of novel genes preferentially expressed in the immune system, including non-coding RNAs. As a result, we have established the Immunogene database, representing an integrated EST “road map” of gene expression in human immune cells, which can be used to further investigate the function of coding and non-coding genes in the immune system. Using this approach, we have uncovered a unique metabolic gene signature of human macrophages and identified PRDM15 as a novel overexpressed gene in human lymphomas. Thus we demonstrate the utility of EST profiling as a basis for further deconstruction of physiologic and pathologic immune processes. PMID:23616578

  11. ProbFAST: Probabilistic Functional Analysis System Tool

    Directory of Open Access Journals (Sweden)

    Oliveira Thiago YK

    2010-03-01

    Full Text Available Abstract Background The post-genomic era has brought new challenges regarding the understanding of the organization and function of the human genome. Many of these challenges are centered on the meaning of differential gene regulation under distinct biological conditions and can be performed by analyzing the Multiple Differential Expression (MDE of genes associated with normal and abnormal biological processes. Currently MDE analyses are limited to usual methods of differential expression initially designed for paired analysis. Results We proposed a web platform named ProbFAST for MDE analysis which uses Bayesian inference to identify key genes that are intuitively prioritized by means of probabilities. A simulated study revealed that our method gives a better performance when compared to other approaches and when applied to public expression data, we demonstrated its flexibility to obtain relevant genes biologically associated with normal and abnormal biological processes. Conclusions ProbFAST is a free accessible web-based application that enables MDE analysis on a global scale. It offers an efficient methodological approach for MDE analysis of a set of genes that are turned on and off related to functional information during the evolution of a tumor or tissue differentiation. ProbFAST server can be accessed at http://gdm.fmrp.usp.br/probfast.

  12. ProbFAST: Probabilistic functional analysis system tool.

    Science.gov (United States)

    Silva, Israel T; Vêncio, Ricardo Z N; Oliveira, Thiago Y K; Molfetta, Greice A; Silva, Wilson A

    2010-03-30

    The post-genomic era has brought new challenges regarding the understanding of the organization and function of the human genome. Many of these challenges are centered on the meaning of differential gene regulation under distinct biological conditions and can be performed by analyzing the Multiple Differential Expression (MDE) of genes associated with normal and abnormal biological processes. Currently MDE analyses are limited to usual methods of differential expression initially designed for paired analysis. We proposed a web platform named ProbFAST for MDE analysis which uses Bayesian inference to identify key genes that are intuitively prioritized by means of probabilities. A simulated study revealed that our method gives a better performance when compared to other approaches and when applied to public expression data, we demonstrated its flexibility to obtain relevant genes biologically associated with normal and abnormal biological processes. ProbFAST is a free accessible web-based application that enables MDE analysis on a global scale. It offers an efficient methodological approach for MDE analysis of a set of genes that are turned on and off related to functional information during the evolution of a tumor or tissue differentiation. ProbFAST server can be accessed at http://gdm.fmrp.usp.br/probfast.

  13. Association of functional MMP-2 gene variant with intracranial aneurysms: case-control genetic association study and meta-analysis.

    Science.gov (United States)

    Alg, Varinder S; Ke, Xiayi; Grieve, Joan; Bonner, Stephen; Walsh, Daniel C; Bulters, Diederik; Kitchen, Neil; Houlden, Henry; Werring, David J

    2018-01-15

    Abnormalities in Matrix Metalloproteinase (MMP) genes, which are important in extracellular matrix (ECM) maintenance and therefore arterial wall integrity are a plausible underlying mechanism of intracranial aneurysm (IA) formation, growth and subsequent rupture. We investigated whether the rs243865 C > T SNP (single nucleotide polymorphism) within the MMP-2 gene (which influences gene transcription) is associated with IA compared to matched controls. We conducted a case-control genetic association study, adjusted for known IA risk factors (smoking and hypertension), in a UK Caucasian population of 1409 patients with intracranial aneurysms (IA), and 1290 matched controls, to determine the association of the rs243865 C > T functional MMP-2 gene SNP with IA (overall, and classified as ruptured and unruptured). We also undertook a meta-analysis of two previous studies examining this SNP. The rs243865 T allele was associated with IA presence in univariate (OR 1.18 [95% CI 1.04-1.33], p = .01) and in multi-variable analyses adjusted for smoking and hypertension status (OR 1.16 [95% CI 1.01-1.35], p = .042). Subgroup analysis demonstrated an association of the rs243865 SNP with ruptured IA (OR 1.18 [95% CI 1.03-1.34] p = .017), but, not unruptured IA (OR 1.17 [95% CI 0.97-1.42], p = .11). Our study demonstrated an association between the functional MMP-2 rs243865 variant and IAs. Our findings suggest a genetic role for altered extracellular matrix integrity in the pathogenesis of IA development and rupture.

  14. GSMA: Gene Set Matrix Analysis, An Automated Method for Rapid Hypothesis Testing of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Chris Cheadle

    2007-01-01

    Full Text Available Background: Microarray technology has become highly valuable for identifying complex global changes in gene expression patterns. The assignment of functional information to these complex patterns remains a challenging task in effectively interpreting data and correlating results from across experiments, projects and laboratories. Methods which allow the rapid and robust evaluation of multiple functional hypotheses increase the power of individual researchers to data mine gene expression data more efficiently.Results: We have developed (gene set matrix analysis GSMA as a useful method for the rapid testing of group-wise up- or downregulation of gene expression simultaneously for multiple lists of genes (gene sets against entire distributions of gene expression changes (datasets for single or multiple experiments. The utility of GSMA lies in its flexibility to rapidly poll gene sets related by known biological function or as designated solely by the end-user against large numbers of datasets simultaneously.Conclusions: GSMA provides a simple and straightforward method for hypothesis testing in which genes are tested by groups across multiple datasets for patterns of expression enrichment.

  15. GeneViTo: Visualizing gene-product functional and structural features in genomic datasets

    Directory of Open Access Journals (Sweden)

    Promponas Vasilis J

    2003-10-01

    Full Text Available Abstract Background The availability of increasing amounts of sequence data from completely sequenced genomes boosts the development of new computational methods for automated genome annotation and comparative genomics. Therefore, there is a need for tools that facilitate the visualization of raw data and results produced by bioinformatics analysis, providing new means for interactive genome exploration. Visual inspection can be used as a basis to assess the quality of various analysis algorithms and to aid in-depth genomic studies. Results GeneViTo is a JAVA-based computer application that serves as a workbench for genome-wide analysis through visual interaction. The application deals with various experimental information concerning both DNA and protein sequences (derived from public sequence databases or proprietary data sources and meta-data obtained by various prediction algorithms, classification schemes or user-defined features. Interaction with a Graphical User Interface (GUI allows easy extraction of genomic and proteomic data referring to the sequence itself, sequence features, or general structural and functional features. Emphasis is laid on the potential comparison between annotation and prediction data in order to offer a supplement to the provided information, especially in cases of "poor" annotation, or an evaluation of available predictions. Moreover, desired information can be output in high quality JPEG image files for further elaboration and scientific use. A compilation of properly formatted GeneViTo input data for demonstration is available to interested readers for two completely sequenced prokaryotes, Chlamydia trachomatis and Methanococcus jannaschii. Conclusions GeneViTo offers an inspectional view of genomic functional elements, concerning data stemming both from database annotation and analysis tools for an overall analysis of existing genomes. The application is compatible with Linux or Windows ME-2000-XP operating

  16. A new measure for functional similarity of gene products based on Gene Ontology

    Directory of Open Access Journals (Sweden)

    Lengauer Thomas

    2006-06-01

    Full Text Available Abstract Background Gene Ontology (GO is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role. Results We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures; simRel and funSim. One measure (simRel is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families. Conclusion The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.

  17. Evolutionary Fates and Dynamic Functionalization of Young Duplicate Genes in Arabidopsis Genomes.

    Science.gov (United States)

    Wang, Jun; Tao, Feng; Marowsky, Nicholas C; Fan, Chuanzhu

    2016-09-01

    Gene duplication is a primary means to generate genomic novelties, playing an essential role in speciation and adaptation. Particularly in plants, a high abundance of duplicate genes has been maintained for significantly long periods of evolutionary time. To address the manner in which young duplicate genes were derived primarily from small-scale gene duplication and preserved in plant genomes and to determine the underlying driving mechanisms, we generated transcriptomes to produce the expression profiles of five tissues in Arabidopsis thaliana and the closely related species Arabidopsis lyrata and Capsella rubella Based on the quantitative analysis metrics, we investigated the evolutionary processes of young duplicate genes in Arabidopsis. We determined that conservation, neofunctionalization, and specialization are three main evolutionary processes for Arabidopsis young duplicate genes. We explicitly demonstrated the dynamic functionalization of duplicate genes along the evolutionary time scale. Upon origination, duplicates tend to maintain their ancestral functions; but as they survive longer, they might be likely to develop distinct and novel functions. The temporal evolutionary processes and functionalization of plant duplicate genes are associated with their ancestral functions, dynamic DNA methylation levels, and histone modification abundances. Furthermore, duplicate genes tend to be initially expressed in pollen and then to gain more interaction partners over time. Altogether, our study provides novel insights into the dynamic retention processes of young duplicate genes in plant genomes. © 2016 American Society of Plant Biologists. All rights reserved.

  18. Identification and Functional Analysis of Gene Regulatory Sequences Interacting with Colorectal Tumor Suppressors

    DEFF Research Database (Denmark)

    Dahlgaard, Katja; Troelsen, Jesper

    2018-01-01

    Several tumor suppressors possess gene regulatory activity. Here, we describe how promoter and promoter/enhancer reporter assays can be used to characterize a colorectal tumor suppressor proteins’ gene regulatory activity of possible target genes. In the first part, a bioinformatic approach...... of the quick and efficient In-Fusion cloning method, and how to carry out transient transfections of Caco-2 colon cancer cells with the produced luciferase reporter plasmids using polyethyleneimine (PEI). A plan describing how to set up and carry out the luciferase expression assay is presented. The luciferase...... to identify relevant gene regulatory regions of potential target genes is presented. In the second part, it is demonstrated how to prepare and carry out the functional assay. We explain how to clone the bioinformatically identified gene regulatory regions into luciferase reporter plasmids by the use...

  19. Validation of suitable reference genes for quantitative gene expression analysis in Panax ginseng

    Directory of Open Access Journals (Sweden)

    Meizhen eWang

    2016-01-01

    Full Text Available Reverse transcription-qPCR (RT-qPCR has become a popular method for gene expression studies. Its results require data normalization by housekeeping genes. No single gene is proved to be stably expressed under all experimental conditions. Therefore, systematic evaluation of reference genes is necessary. With the aim to identify optimum reference genes for RT-qPCR analysis of gene expression in different tissues of Panax ginseng and the seedlings grown under heat stress, we investigated the expression stability of eight candidate reference genes, including elongation factor 1-beta (EF1-β, elongation factor 1-gamma (EF1-γ, eukaryotic translation initiation factor 3G (IF3G, eukaryotic translation initiation factor 3B (IF3B, actin (ACT, actin11 (ACT11, glyceraldehyde-3-phosphate dehydrogenase (GAPDH and cyclophilin ABH-like protein (CYC, using four widely used computational programs: geNorm, Normfinder, BestKeeper, and the comparative ΔCt method. The results were then integrated using the web-based tool RefFinder. As a result, EF1-γ, IF3G and EF1-β were the three most stable genes in different tissues of P. ginseng, while IF3G, ACT11 and GAPDH were the top three-ranked genes in seedlings treated with heat. Using three better reference genes alone or in combination as internal control, we examined the expression profiles of MAR, a multiple function-associated mRNA-like non-coding RNA (mlncRNA in P. ginseng. Taken together, we recommended EF1-γ/IF3G and IF3G/ACT11 as the suitable pair of reference genes for RT-qPCR analysis of gene expression in different tissues of P. ginseng and the seedlings grown under heat stress, respectively. The results serve as a foundation for future studies on P. ginseng functional genomics.

  20. Microbial functional genes enriched in the Xiangjiang River sediments with heavy metal contamination.

    Science.gov (United States)

    Jie, Shiqi; Li, Mingming; Gan, Min; Zhu, Jianyu; Yin, Huaqun; Liu, Xueduan

    2016-08-08

    Xiangjiang River (Hunan, China) has been contaminated with heavy metal for several decades by surrounding factories. However, little is known about the influence of a gradient of heavy metal contamination on the diversity, structure of microbial functional gene in sediment. To deeply understand the impact of heavy metal contamination on microbial community, a comprehensive functional gene array (GeoChip 5.0) has been used to study the functional genes structure, composition, diversity and metabolic potential of microbial community from three heavy metal polluted sites of Xiangjiang River. A total of 25595 functional genes involved in different biogeochemical processes have been detected in three sites, and different diversities and structures of microbial functional genes were observed. The analysis of gene overlapping, unique genes, and various diversity indices indicated a significant correlation between the level of heavy metal contamination and the functional diversity. Plentiful resistant genes related to various metal were detected, such as copper, arsenic, chromium and mercury. The results indicated a significantly higher abundance of genes involved in metal resistance including sulfate reduction genes (dsr) in studied site with most serious heavy metal contamination, such as cueo, mer, metc, merb, tehb and terc gene. With regard to the relationship between the environmental variables and microbial functional structure, S, Cu, Cd, Hg and Cr were the dominating factor shaping the microbial distribution pattern in three sites. This study suggests that high level of heavy metal contamination resulted in higher functional diversity and the abundance of metal resistant genes. These variation therefore significantly contribute to the resistance, resilience and stability of the microbial community subjected to the gradient of heavy metals contaminant in Xiangjiang River.

  1. Gene Overexpression Resources in Cereals for Functional Genomics and Discovery of Useful Genes

    Directory of Open Access Journals (Sweden)

    Kiyomi Abe

    2016-09-01

    Full Text Available Identification and elucidation of functions of plant genes is valuable for both basic and applied research. In addition to natural variation in model plants, numerous loss-of-function resources have been produced by mutagenesis with chemicals, irradiation, or insertions of transposable elements or T-DNA. However, we may be unable to observe loss-of-function phenotypes for genes with functionally redundant homologs, and for those essential for growth and development. To offset such disadvantages, gain-of-function transgenic resources have been exploited. Activation-tagged lines have been generated using obligatory overexpression of endogenous genes by random insertion of an enhancer. Recent progress in DNA sequencing technology and bioinformatics has enabled the preparation of genomewide collections of full-length cDNAs (fl-cDNAs in some model species. Using the fl-cDNA clones, a novel gain-of-function strategy, Fl-cDNA OvereXpressor gene (FOX-hunting system, has been developed. A mutant phenotype in a FOX line can be directly attributed to the overexpressed fl-cDNA. Investigating a large population of FOX lines could reveal important genes conferring favorable phenotypes for crop breeding. Alternatively, a unique loss-of-function approach Chimeric REpressor gene Silencing Technology (CRES-T has been developed. In CRES-T, overexpression of a chimeric repressor, composed of the coding sequence of a transcription factor (TF and short peptide designated as the repression domain, could interfere with the action of endogenous TF in plants. Although plant TFs usually consist of gene families, CRES-T is effective, in principle, even for the TFs with functional redundancy. In this review, we focus on the current status of the gene-overexpression strategies and resources for identifying and elucidating novel functions of cereal genes. We discuss the potential of these research tools for identifying useful genes and phenotypes for application in crop

  2. Functional Analysis of Developmentally Regulated Genes chs7 and sec22 in the Ascomycete Sordaria macrospora.

    Science.gov (United States)

    Traeger, Stefanie; Nowrousian, Minou

    2015-04-14

    During sexual development, filamentous ascomycetes form complex, three-dimensional fruiting bodies for the generation and dispersal of spores. In previous studies, we identified genes with evolutionary conserved expression patterns during fruiting body formation in several fungal species. Here, we present the functional analysis of two developmentally up-regulated genes, chs7 and sec22, in the ascomycete Sordaria macrospora. The genes encode a class VII (division III) chitin synthase and a soluble N-ethylmaleimide-sensitive-factor attachment protein receptor (SNARE) protein, respectively. Deletion mutants of chs7 had normal vegetative growth and were fully fertile but showed sensitivity toward cell wall stress. Deletion of sec22 resulted in a reduced number of ascospores and in defects in ascospore pigmentation and germination, whereas vegetative growth was normal in the mutant. A SEC22-EGFP fusion construct under control of the native sec22 promoter and terminator regions was expressed during different stages of sexual development. Expression of several development-related genes was deregulated in the sec22 mutant, including three genes involved in melanin biosynthesis. Our data indicate that chs7 is dispensable for fruiting body formation in S. macrospora, whereas sec22 is required for ascospore maturation and germination and thus involved in late stages of sexual development. Copyright © 2015 Traeger and Nowrousian.

  3. Identification of functionally related genes using data mining and data integration: a breast cancer case study

    Directory of Open Access Journals (Sweden)

    Zucchi Ileana

    2009-10-01

    Full Text Available Abstract Background The identification of the organisation and dynamics of molecular pathways is crucial for the understanding of cell function. In order to reconstruct the molecular pathways in which a gene of interest is involved in regulating a cell, it is important to identify the set of genes to which it interacts with to determine cell function. In this context, the mining and the integration of a large amount of publicly available data, regarding the transcriptome and the proteome states of a cell, are a useful resource to complement biological research. Results We describe an approach for the identification of genes that interact with each other to regulate cell function. The strategy relies on the analysis of gene expression profile similarity, considering large datasets of expression data. During the similarity evaluation, the methodology determines the most significant subset of samples in which the evaluated genes are highly correlated. Hence, the strategy enables the exclusion of samples that are not relevant for each gene pair analysed. This feature is important when considering a large set of samples characterised by heterogeneous experimental conditions where different pools of biological processes can be active across the samples. The putative partners of the studied gene are then further characterised, analysing the distribution of the Gene Ontology terms and integrating the protein-protein interaction (PPI data. The strategy was applied for the analysis of the functional relationships of a gene of known function, Pyruvate Kinase, and for the prediction of functional partners of the human transcription factor TBX3. In both cases the analysis was done on a dataset composed by breast primary tumour expression data derived from the literature. Integration and analysis of PPI data confirmed the prediction of the methodology, since the genes identified to be functionally related were associated to proteins close in the PPI network

  4. First Comprehensive In Silico Analysis of the Functional and Structural Consequences of SNPs in Human GalNAc-T1 Gene

    Directory of Open Access Journals (Sweden)

    Hussein Sheikh Ali Mohamoud

    2014-01-01

    Full Text Available GalNAc-T1, a key candidate of GalNac-transferases genes family that is involved in mucin-type O-linked glycosylation pathway, is expressed in most biological tissues and cell types. Despite the reported association of GalNAc-T1 gene mutations with human disease susceptibility, the comprehensive computational analysis of coding, noncoding and regulatory SNPs, and their functional impacts on protein level, still remains unknown. Therefore, sequence- and structure-based computational tools were employed to screen the entire listed coding SNPs of GalNAc-T1 gene in order to identify and characterize them. Our concordant in silico analysis by SIFT, PolyPhen-2, PANTHER-cSNP, and SNPeffect tools, identified the potential nsSNPs (S143P, G258V, and Y414D variants from 18 nsSNPs of GalNAc-T1. Additionally, 2 regulatory SNPs (rs72964406 and #x26; rs34304568 were also identified in GalNAc-T1 by using FastSNP tool. Using multiple computational approaches, we have systematically classified the functional mutations in regulatory and coding regions that can modify expression and function of GalNAc-T1 enzyme. These genetic variants can further assist in better understanding the wide range of disease susceptibility associated with the mucin-based cell signalling and pathogenic binding, and may help to develop novel therapeutic elements for associated diseases.

  5. Digital gene expression analysis of Microsporum canis exposed to berberine chloride.

    Directory of Open Access Journals (Sweden)

    Chen-Wen Xiao

    Full Text Available Berberine, a natural isoquinoline alkaloid of many medicinal herbs, has an active function against a variety of microbial infections including Microsporum canis (M. canis. However, the underlying mechanisms are poorly understood. To study the effect of berberine chloride on M. canis infection, a Digital Gene Expression (DGE tag profiling was constructed and a transcriptome analysis of the M. canis cellular responses upon berberine treatment was performed. Illumina/Hisseq sequencing technique was used to generate the data of gene expression profile, and the following enrichment analysis of Gene Ontology (GO and Pathway function were conducted based on the data of transcriptome. The results of DGE showed that there were 8476945, 14256722, 7708575, 5669955, 6565513 and 9303468 tags respectively, which was obtained from M. canis incubated with berberine or control DMSO. 8,783 genes were totally mapped, and 1,890 genes have shown significant changes between the two groups. 1,030 genes were up-regulated and 860 genes were down-regulated (P<0.05 in berberine treated group compared to the control group. Besides, twenty-three GO terms were identified by Gene Ontology functional enrichment analysis, such as calcium-transporting ATPase activity, 2-oxoglutarate metabolic process, valine catabolic process, peroxisome and unfolded protein binding. Pathway significant enrichment analysis indicated 6 signaling pathways that are significant, including steroid biosynthesis, steroid hormone biosynthesis, Parkinson's disease, 2,4-Dichlorobenzoate degradation, and tropane, piperidine and Isoquinoline alkaloid biosynthesis. Among these, eleven selected genes were further verified by qRT-PCR. Our findings provide a comprehensive view on the gene expression profile of M. canis upon berberine treatment, and shed light on its complicated effects on M. canis.

  6. Comparative genomic analysis of Drosophila melanogaster and vector mosquito developmental genes.

    Directory of Open Access Journals (Sweden)

    Susanta K Behura

    Full Text Available Genome sequencing projects have presented the opportunity for analysis of developmental genes in three vector mosquito species: Aedes aegypti, Culex quinquefasciatus, and Anopheles gambiae. A comparative genomic analysis of developmental genes in Drosophila melanogaster and these three important vectors of human disease was performed in this investigation. While the study was comprehensive, special emphasis centered on genes that 1 are components of developmental signaling pathways, 2 regulate fundamental developmental processes, 3 are critical for the development of tissues of vector importance, 4 function in developmental processes known to have diverged within insects, and 5 encode microRNAs (miRNAs that regulate developmental transcripts in Drosophila. While most fruit fly developmental genes are conserved in the three vector mosquito species, several genes known to be critical for Drosophila development were not identified in one or more mosquito genomes. In other cases, mosquito lineage-specific gene gains with respect to D. melanogaster were noted. Sequence analyses also revealed that numerous repetitive sequences are a common structural feature of Drosophila and mosquito developmental genes. Finally, analysis of predicted miRNA binding sites in fruit fly and mosquito developmental genes suggests that the repertoire of developmental genes targeted by miRNAs is species-specific. The results of this study provide insight into the evolution of developmental genes and processes in dipterans and other arthropods, serve as a resource for those pursuing analysis of mosquito development, and will promote the design and refinement of functional analysis experiments.

  7. [FANCA gene mutation analysis in Fanconi anemia patients].

    Science.gov (United States)

    Chen, Fei; Peng, Guang-Jie; Zhang, Kejian; Hu, Qun; Zhang, Liu-Qing; Liu, Ai-Guo

    2005-10-01

    To screen the FANCA gene mutation and explore the FANCA protein function in Fanconi anemia (FA) patients. FANCA protein expression and its interaction with FANCF were analyzed using Western blot and immunoprecipitation in 3 cases of FA-A. Genomic DNA was used for MLPA analysis followed by sequencing. FANCA protein was undetectable and FANCA and FANCF protein interaction was impaired in these 3 cases of FA-A. Each case of FA-A contained biallelic pathogenic mutations in FANCA gene. No functional FANCA protein was found in these 3 cases of FA-A, and intragenic deletion, frame shift and splice site mutation were the major pathogenic mutations found in FANCA gene.

  8. Analysis of the robustness of network-based disease-gene prioritization methods reveals redundancy in the human interactome and functional diversity of disease-genes.

    Directory of Open Access Journals (Sweden)

    Emre Guney

    Full Text Available Complex biological systems usually pose a trade-off between robustness and fragility where a small number of perturbations can substantially disrupt the system. Although biological systems are robust against changes in many external and internal conditions, even a single mutation can perturb the system substantially, giving rise to a pathophenotype. Recent advances in identifying and analyzing the sequential variations beneath human disorders help to comprehend a systemic view of the mechanisms underlying various disease phenotypes. Network-based disease-gene prioritization methods rank the relevance of genes in a disease under the hypothesis that genes whose proteins interact with each other tend to exhibit similar phenotypes. In this study, we have tested the robustness of several network-based disease-gene prioritization methods with respect to the perturbations of the system using various disease phenotypes from the Online Mendelian Inheritance in Man database. These perturbations have been introduced either in the protein-protein interaction network or in the set of known disease-gene associations. As the network-based disease-gene prioritization methods are based on the connectivity between known disease-gene associations, we have further used these methods to categorize the pathophenotypes with respect to the recoverability of hidden disease-genes. Our results have suggested that, in general, disease-genes are connected through multiple paths in the human interactome. Moreover, even when these paths are disturbed, network-based prioritization can reveal hidden disease-gene associations in some pathophenotypes such as breast cancer, cardiomyopathy, diabetes, leukemia, parkinson disease and obesity to a greater extend compared to the rest of the pathophenotypes tested in this study. Gene Ontology (GO analysis highlighted the role of functional diversity for such diseases.

  9. Saponin determination, expression analysis and functional characterization of saponin biosynthetic genes in Chenopodium quinoa leaves.

    Science.gov (United States)

    Fiallos-Jurado, Jennifer; Pollier, Jacob; Moses, Tessa; Arendt, Philipp; Barriga-Medina, Noelia; Morillo, Eduardo; Arahana, Venancio; de Lourdes Torres, Maria; Goossens, Alain; Leon-Reyes, Antonio

    2016-09-01

    Quinoa (Chenopodium quinoa Willd.) is a highly nutritious pseudocereal with an outstanding protein, vitamin, mineral and nutraceutical content. The leaves, flowers and seed coat of quinoa contain triterpenoid saponins, which impart bitterness to the grain and make them unpalatable without postharvest removal of the saponins. In this study, we quantified saponin content in quinoa leaves from Ecuadorian sweet and bitter genotypes and assessed the expression of saponin biosynthetic genes in leaf samples elicited with methyl jasmonate. We found saponin accumulation in leaves after MeJA treatment in both ecotypes tested. As no reference genes were available to perform qPCR in quinoa, we mined publicly available RNA-Seq data for orthologs of 22 genes known to be stably expressed in Arabidopsis thaliana using geNorm, NormFinder and BestKeeper algorithms. The quinoa ortholog of At2g28390 (Monensin Sensitivity 1, MON1) was stably expressed and chosen as a suitable reference gene for qPCR analysis. Candidate saponin biosynthesis genes were screened in the quinoa RNA-Seq data and subsequent functional characterization in yeast led to the identification of CqbAS1, CqCYP716A78 and CqCYP716A79. These genes were found to be induced by MeJA, suggesting this phytohormone might also modulate saponin biosynthesis in quinoa leaves. Knowledge of the saponin biosynthesis and its regulation in quinoa may aid the further development of sweet cultivars that do not require postharvest processing. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  10. Genome-wide analysis of the Hsp70 family genes in pepper (Capsicum annuum L.) and functional identification of CaHsp70-2 involvement in heat stress.

    Science.gov (United States)

    Guo, Meng; Liu, Jin-Hong; Ma, Xiao; Zhai, Yu-Fei; Gong, Zhen-Hui; Lu, Ming-Hui

    2016-11-01

    Hsp70s function as molecular chaperones and are encoded by a multi-gene family whose members play a crucial role in plant response to stress conditions, and in plant growth and development. Pepper (Capsicum annuum L.) is an important vegetable crop whose genome has been sequenced. Nonetheless, no overall analysis of the Hsp70 gene family is reported in this crop plant to date. To assess the functionality of Capsicum annuum Hsp70 (CaHsp70) genes, pepper genome database was analyzed in this research. A total of 21 CaHsp70 genes were identified and their characteristics were also described. The promoter and transcript expression analysis revealed that CaHsp70s were involved in pepper growth and development, and heat stress response. Ectopic expression of a cytosolic gene, CaHsp70-2, regulated expression of stress-related genes and conferred increased thermotolerance in transgenic Arabidopsis. Taken together, our results provide the basis for further studied to dissect CaHsp70s' function in response to heat stress as well as other environmental stresses. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  11. Gene expression analysis of zebrafish melanocytes, iridophores, and retinal pigmented epithelium reveals indicators of biological function and developmental origin.

    Directory of Open Access Journals (Sweden)

    Charles W Higdon

    Full Text Available In order to facilitate understanding of pigment cell biology, we developed a method to concomitantly purify melanocytes, iridophores, and retinal pigmented epithelium from zebrafish, and analyzed their transcriptomes. Comparing expression data from these cell types and whole embryos allowed us to reveal gene expression co-enrichment in melanocytes and retinal pigmented epithelium, as well as in melanocytes and iridophores. We found 214 genes co-enriched in melanocytes and retinal pigmented epithelium, indicating the shared functions of melanin-producing cells. We found 62 genes significantly co-enriched in melanocytes and iridophores, illustrative of their shared developmental origins from the neural crest. This is also the first analysis of the iridophore transcriptome. Gene expression analysis for iridophores revealed extensive enrichment of specific enzymes to coordinate production of their guanine-based reflective pigment. We speculate the coordinated upregulation of specific enzymes from several metabolic pathways recycles the rate-limiting substrate for purine synthesis, phosphoribosyl pyrophosphate, thus constituting a guanine cycle. The purification procedure and expression analysis described here, along with the accompanying transcriptome-wide expression data, provide the first mRNA sequencing data for multiple purified zebrafish pigment cell types, and will be a useful resource for further studies of pigment cell biology.

  12. MicroScope-an integrated resource for community expertise of gene functions and comparative analysis of microbial genomic and metabolic data.

    Science.gov (United States)

    Médigue, Claudine; Calteau, Alexandra; Cruveiller, Stéphane; Gachet, Mathieu; Gautreau, Guillaume; Josso, Adrien; Lajus, Aurélie; Langlois, Jordan; Pereira, Hugo; Planel, Rémi; Roche, David; Rollin, Johan; Rouy, Zoe; Vallenet, David

    2017-09-12

    The overwhelming list of new bacterial genomes becoming available on a daily basis makes accurate genome annotation an essential step that ultimately determines the relevance of thousands of genomes stored in public databanks. The MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Starting from the results of our syntactic, functional and relational annotation pipelines, MicroScope provides an integrated environment for the expert annotation and comparative analysis of prokaryotic genomes. It combines tools and graphical interfaces to analyze genomes and to perform the manual curation of gene function in a comparative genomics and metabolic context. In this article, we describe the free-of-charge MicroScope services for the annotation and analysis of microbial (meta)genomes, transcriptomic and re-sequencing data. Then, the functionalities of the platform are presented in a way providing practical guidance and help to the nonspecialists in bioinformatics. Newly integrated analysis tools (i.e. prediction of virulence and resistance genes in bacterial genomes) and original method recently developed (the pan-genome graph representation) are also described. Integrated environments such as MicroScope clearly contribute, through the user community, to help maintaining accurate resources. © The Author 2017. Published by Oxford University Press.

  13. Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.

    Science.gov (United States)

    Feuermann, Marc; Gaudet, Pascale; Mi, Huaiyu; Lewis, Suzanna E; Thomas, Paul D

    2016-01-01

    We previously reported a paradigm for large-scale phylogenomic analysis of gene families that takes advantage of the large corpus of experimentally supported Gene Ontology (GO) annotations. This 'GO Phylogenetic Annotation' approach integrates GO annotations from evolutionarily related genes across ∼100 different organisms in the context of a gene family tree, in which curators build an explicit model of the evolution of gene functions. GO Phylogenetic Annotation models the gain and loss of functions in a gene family tree, which is used to infer the functions of uncharacterized (or incompletely characterized) gene products, even for human proteins that are relatively well studied. Here, we report our results from applying this paradigm to two well-characterized cellular processes, apoptosis and autophagy. This revealed several important observations with respect to GO annotations and how they can be used for function inference. Notably, we applied only a small fraction of the experimentally supported GO annotations to infer function in other family members. The majority of other annotations describe indirect effects, phenotypes or results from high throughput experiments. In addition, we show here how feedback from phylogenetic annotation leads to significant improvements in the PANTHER trees, the GO annotations and GO itself. Thus GO phylogenetic annotation both increases the quantity and improves the accuracy of the GO annotations provided to the research community. We expect these phylogenetically based annotations to be of broad use in gene enrichment analysis as well as other applications of GO annotations.Database URL: http://amigo.geneontology.org/amigo. © The Author(s) 2016. Published by Oxford University Press.

  14. GOMA: functional enrichment analysis tool based on GO modules

    Institute of Scientific and Technical Information of China (English)

    Qiang Huang; Ling-Yun Wu; Yong Wang; Xiang-Sun Zhang

    2013-01-01

    Analyzing the function of gene sets is a critical step in interpreting the results of high-throughput experiments in systems biology.A variety of enrichment analysis tools have been developed in recent years,but most output a long list of significantly enriched terms that are often redundant,making it difficult to extract the most meaningful functions.In this paper,we present GOMA,a novel enrichment analysis method based on the new concept of enriched functional Gene Ontology (GO) modules.With this method,we systematically revealed functional GO modules,i.e.,groups of functionally similar GO terms,via an optimization model and then ranked them by enrichment scores.Our new method simplifies enrichment analysis results by reducing redundancy,thereby preventing inconsistent enrichment results among functionally similar terms and providing more biologically meaningful results.

  15. Digital gene expression analysis of gene expression differences within Brassica diploids and allopolyploids.

    Science.gov (United States)

    Jiang, Jinjin; Wang, Yue; Zhu, Bao; Fang, Tingting; Fang, Yujie; Wang, Youping

    2015-01-27

    Brassica includes many successfully cultivated crop species of polyploid origin, either by ancestral genome triplication or by hybridization between two diploid progenitors, displaying complex repetitive sequences and transposons. The U's triangle, which consists of three diploids and three amphidiploids, is optimal for the analysis of complicated genomes after polyploidization. Next-generation sequencing enables the transcriptome profiling of polyploids on a global scale. We examined the gene expression patterns of three diploids (Brassica rapa, B. nigra, and B. oleracea) and three amphidiploids (B. napus, B. juncea, and B. carinata) via digital gene expression analysis. In total, the libraries generated between 5.7 and 6.1 million raw reads, and the clean tags of each library were mapped to 18547-21995 genes of B. rapa genome. The unambiguous tag-mapped genes in the libraries were compared. Moreover, the majority of differentially expressed genes (DEGs) were explored among diploids as well as between diploids and amphidiploids. Gene ontological analysis was performed to functionally categorize these DEGs into different classes. The Kyoto Encyclopedia of Genes and Genomes analysis was performed to assign these DEGs into approximately 120 pathways, among which the metabolic pathway, biosynthesis of secondary metabolites, and peroxisomal pathway were enriched. The non-additive genes in Brassica amphidiploids were analyzed, and the results indicated that orthologous genes in polyploids are frequently expressed in a non-additive pattern. Methyltransferase genes showed differential expression pattern in Brassica species. Our results provided an understanding of the transcriptome complexity of natural Brassica species. The gene expression changes in diploids and allopolyploids may help elucidate the morphological and physiological differences among Brassica species.

  16. The Reconstruction and Analysis of Gene Regulatory Networks.

    Science.gov (United States)

    Zheng, Guangyong; Huang, Tao

    2018-01-01

    In post-genomic era, an important task is to explore the function of individual biological molecules (i.e., gene, noncoding RNA, protein, metabolite) and their organization in living cells. For this end, gene regulatory networks (GRNs) are constructed to show relationship between biological molecules, in which the vertices of network denote biological molecules and the edges of network present connection between nodes (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). Biologists can understand not only the function of biological molecules but also the organization of components of living cells through interpreting the GRNs, since a gene regulatory network is a comprehensively physiological map of living cells and reflects influence of genetic and epigenetic factors (Strogatz, Nature 410:268-276, 2001; Bray, Science 301:1864-1865, 2003). In this paper, we will review the inference methods of GRN reconstruction and analysis approaches of network structure. As a powerful tool for studying complex diseases and biological processes, the applications of the network method in pathway analysis and disease gene identification will be introduced.

  17. Microarray analysis of the gene expression profile in triethylene ...

    African Journals Online (AJOL)

    Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells. ... Conclusions: Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  18. Functional Interaction Network Construction and Analysis for Disease Discovery.

    Science.gov (United States)

    Wu, Guanming; Haw, Robin

    2017-01-01

    Network-based approaches project seemingly unrelated genes or proteins onto a large-scale network context, therefore providing a holistic visualization and analysis platform for genomic data generated from high-throughput experiments, reducing the dimensionality of data via using network modules and increasing the statistic analysis power. Based on the Reactome database, the most popular and comprehensive open-source biological pathway knowledgebase, we have developed a highly reliable protein functional interaction network covering around 60 % of total human genes and an app called ReactomeFIViz for Cytoscape, the most popular biological network visualization and analysis platform. In this chapter, we describe the detailed procedures on how this functional interaction network is constructed by integrating multiple external data sources, extracting functional interactions from human curated pathway databases, building a machine learning classifier called a Naïve Bayesian Classifier, predicting interactions based on the trained Naïve Bayesian Classifier, and finally constructing the functional interaction database. We also provide an example on how to use ReactomeFIViz for performing network-based data analysis for a list of genes.

  19. Functional Analysis of the Nitrogen Metabolite Repression Regulator Gene nmrA in Aspergillus flavus

    Directory of Open Access Journals (Sweden)

    Xiaoyun Han

    2016-11-01

    Full Text Available In Aspergillus nidulans, the nitrogen metabolite repression regulator NmrA plays a major role in regulating the activity of the GATA transcription factor AreA during nitrogen metabolism. However, the function of nmrA in Aspergillus flavus has notbeen previously studied. Here, we report the identification and functional analysis of nmrA in A. flavus. Our work showed that the amino acid sequences of NmrA are highly conserved among Aspergillus species and that A. flavus NmrA protein contains a canonical Rossmann fold motif. Deletion of nmrA slowed the growth of A. flavus but significantly increased conidiation and sclerotia production. Moreover, seed infection experiments indicated that nmrA is required for the invasive virulence of A. flavus. In addition, the ΔnmrA mutant showed increased sensitivity to rapamycin and methyl methanesulfonate, suggesting that nmrA could be responsive to target of rapamycin signaling and DNA damage. Furthermore, quantitative real-time reverse transcription polymerase chain reaction analysis suggested that nmrA might interact with other nitrogen regulatory and catabolic genes. Our study provides a better understanding of nitrogen metabolite repression and the nitrogen metabolism network in fungi.

  20. Analysis of TIR- and non-TIR-NBS-LRR disease resistance gene analogous in pepper: characterization, genetic variation, functional divergence and expression patterns

    Directory of Open Access Journals (Sweden)

    Wan Hongjian

    2012-09-01

    Full Text Available Abstract Background Pepper (Capsicum annuum L. is one of the most important vegetable crops worldwide. However, its yield and fruit quality can be severely threatened by several pathogens. The plant nucleotide-binding site (NBS-leucine-rich repeat (LRR gene family is the largest class of known disease resistance genes (R genes effective against such pathogens. Therefore, the isolation and identification of such R gene homologues from pepper will provide a critical foundation for improving disease resistance breeding programs. Results A total of 78 R gene analogues (CaRGAs were identified in pepper by degenerate PCR amplification and database mining. Phylogenetic tree analysis of the deduced amino acid sequences for 51 of these CaRGAs with typically conserved motifs ( P-loop, kinase-2 and GLPL along with some known R genes from Arabidopsis and tomato grouped these CaRGAs into the non-Toll interleukin-1 receptor (TIR-NBS-LRR (CaRGAs I to IV and TIR-NBS-LRR (CaRGAs V to VII subfamilies. The presence of consensus motifs (i.e. P-loop, kinase-2 and hydrophobic domain is typical of the non-TIR- and TIR-NBS-LRR gene subfamilies. This finding further supports the view that both subfamilies are widely distributed in dicot species. Functional divergence analysis provided strong statistical evidence of altered selective constraints during protein evolution between the two subfamilies. Thirteen critical amino acid sites involved in this divergence were also identified using DIVERGE version 2 software. Analyses of non-synonymous and synonymous substitutions per site showed that purifying selection can play a critical role in the evolutionary processes of non-TIR- and TIR-NBS-LRR RGAs in pepper. In addition, four specificity-determining positions were predicted to be responsible for functional specificity. qRT-PCR analysis showed that both salicylic and abscisic acids induce the expression of CaRGA genes, suggesting that they may primarily be involved in

  1. SITEX 2.0: Projections of protein functional sites on eukaryotic genes. Extension with orthologous genes.

    Science.gov (United States)

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2017-04-01

    Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .

  2. Genome-wide identification and comparative expression analysis reveal a rapid expansion and functional divergence of duplicated genes in the WRKY gene family of cabbage, Brassica oleracea var. capitata.

    Science.gov (United States)

    Yao, Qiu-Yang; Xia, En-Hua; Liu, Fei-Hu; Gao, Li-Zhi

    2015-02-15

    WRKY transcription factors (TFs), one of the ten largest TF families in higher plants, play important roles in regulating plant development and resistance. To date, little is known about the WRKY TF family in Brassica oleracea. Recently, the completed genome sequence of cabbage (B. oleracea var. capitata) allows us to systematically analyze WRKY genes in this species. A total of 148 WRKY genes were characterized and classified into seven subgroups that belong to three major groups. Phylogenetic and synteny analyses revealed that the repertoire of cabbage WRKY genes was derived from a common ancestor shared with Arabidopsis thaliana. The B. oleracea WRKY genes were found to be preferentially retained after the whole-genome triplication (WGT) event in its recent ancestor, suggesting that the WGT event had largely contributed to a rapid expansion of the WRKY gene family in B. oleracea. The analysis of RNA-Seq data from various tissues (i.e., roots, stems, leaves, buds, flowers and siliques) revealed that most of the identified WRKY genes were positively expressed in cabbage, and a large portion of them exhibited patterns of differential and tissue-specific expression, demonstrating that these gene members might play essential roles in plant developmental processes. Comparative analysis of the expression level among duplicated genes showed that gene expression divergence was evidently presented among cabbage WRKY paralogs, indicating functional divergence of these duplicated WRKY genes. Copyright © 2014 Elsevier B.V. All rights reserved.

  3. Bioinformatics Analysis of MAPKKK Family Genes in Medicago truncatula

    Directory of Open Access Journals (Sweden)

    Wei Li

    2016-04-01

    Full Text Available Mitogen‐activated protein kinase kinase kinase (MAPKKK is a component of the MAPK cascade pathway that plays an important role in plant growth, development, and response to abiotic stress, the functions of which have been well characterized in several plant species, such as Arabidopsis, rice, and maize. In this study, we performed genome‐wide and systemic bioinformatics analysis of MAPKKK family genes in Medicago truncatula. In total, there were 73 MAPKKK family members identified by search of homologs, and they were classified into three subfamilies, MEKK, ZIK, and RAF. Based on the genomic duplication function, 72 MtMAPKKK genes were located throughout all chromosomes, but they cluster in different chromosomes. Using microarray data and high‐throughput sequencing‐data, we assessed their expression profiles in growth and development processes; these results provided evidence for exploring their important functions in developmental regulation, especially in the nodulation process. Furthermore, we investigated their expression in abiotic stresses by RNA‐seq, which confirmed their critical roles in signal transduction and regulation processes under stress. In summary, our genome‐wide, systemic characterization and expressional analysis of MtMAPKKK genes will provide insights that will be useful for characterizing the molecular functions of these genes in M. truncatula.

  4. Gene expression and functional annotation of the human ciliary body epithelia.

    Directory of Open Access Journals (Sweden)

    Sarah F Janssen

    Full Text Available PURPOSE: The ciliary body (CB of the human eye consists of the non-pigmented (NPE and pigmented (PE neuro-epithelia. We investigated the gene expression of NPE and PE, to shed light on the molecular mechanisms underlying the most important functions of the CB. We also developed molecular signatures for the NPE and PE and studied possible new clues for glaucoma. METHODS: We isolated NPE and PE cells from seven healthy human donor eyes using laser dissection microscopy. Next, we performed RNA isolation, amplification, labeling and hybridization against 44×k Agilent microarrays. For microarray conformations, we used a literature study, RT-PCRs, and immunohistochemical stainings. We analyzed the gene expression data with R and with the knowledge database Ingenuity. RESULTS: The gene expression profiles and functional annotations of the NPE and PE were highly similar. We found that the most important functionalities of the NPE and PE were related to developmental processes, neural nature of the tissue, endocrine and metabolic signaling, and immunological functions. In total 1576 genes differed statistically significantly between NPE and PE. From these genes, at least 3 were cell-specific for the NPE and 143 for the PE. Finally, we observed high expression in the (NPE of 35 genes previously implicated in molecular mechanisms related to glaucoma. CONCLUSION: Our gene expression analysis suggested that the NPE and PE of the CB were quite similar. Nonetheless, cell-type specific differences were found. The molecular machineries of the human NPE and PE are involved in a range of neuro-endocrinological, developmental and immunological functions, and perhaps glaucoma.

  5. Functional dissection of drought-responsive gene expression patterns in Cynodon dactylon L.

    Science.gov (United States)

    Kim, Changsoo; Lemke, Cornelia; Paterson, Andrew H

    2009-05-01

    Water deficit is one of the main abiotic factors that affect plant productivity in subtropical regions. To identify genes induced during the water stress response in Bermudagrass (Cynodon dactylon), cDNA macroarrays were used. The macroarray analysis identified 189 drought-responsive candidate genes from C. dactylon, of which 120 were up-regulated and 69 were down-regulated. The candidate genes were classified into seven groups by cluster analysis of expression levels across two intensities and three durations of imposed stress. Annotation using BLASTX suggested that up-regulated genes may be involved in proline biosynthesis, signal transduction pathways, protein repair systems, and removal of toxins, while down-regulated genes were mostly related to basic plant metabolism such as photosynthesis and glycolysis. The functional classification of gene ontology (GO) was consistent with the BLASTX results, also suggesting some crosstalk between abiotic and biotic stress. Comparative analysis of cis-regulatory elements from the candidate genes implicated specific elements in drought response in Bermudagrass. Although only a subset of genes was studied, Bermudagrass shared many drought-responsive genes and cis-regulatory elements with other botanical models, supporting a strategy of cross-taxon application of drought-responsive genes, regulatory cues, and physiological-genetic information.

  6. ANLN functions as a key candidate gene in cervical cancer as determined by integrated bioinformatic analysis

    Directory of Open Access Journals (Sweden)

    Xia L

    2018-04-01

    Full Text Available Leilei Xia,1,* Xiaoling Su,1,2,* Jizi Shen,1,* Qi Meng,1 Jiuqiong Yan,1 Caihong Zhang,1 Yu Chen,1 Han Wang,3 Mingjuan Xu,1 1Department of Obstetrics and Gynecology, Changhai Hospital, Second Military Medical University, Shanghai, People’s Republic of China; 2Department of Obstetrics and Gynecology, No. 455 Hospital, Shanghai, People’s Republic of China; 3Department of Pathology, Eastern Hepatobiliary Surgery Hospital, Second Military Medical University, Shanghai, People’s Republic of China *These authors contributed equally to this work Background: Cervical cancer, one of the leading causes of female deaths, remains a top cause of mortality in gynecologic oncology and tends to affect younger individuals. However, the pathogenesis of cervical cancer is still far from clear. Given the high incidence and mortality of cervical cancer, uncovering the causes and pathogenesis as well as identifying novel biomarkers are of great significance and are desperately needed.Materials and methods: First, raw data were downloaded from the Gene Expression Omnibus database. The Robuse Multi-Array Average algorithm and combat function of the sva package were subsequently applied to preprocess and remove batch effects. Differentially expressed genes (DEGs analyzed with the limma package were followed by gene ontology and pathway analysis, and a protein–protein interaction (PPI network based on the STRING website and the Cytoscape software was constructed. Weighted Correlation Network Analysis (WGCNA was utilized to build the coexpression network. Subsequently, UALCAN websites were employed to conduct survival analysis. Finally, the oncomine database was used to validate the expression of ANLN in other datasets.Results: GSE29570 and GSE89657, including 49 cervical cancer tissues and 20 normal cervical tissues, were screened as the datasets. Three-hundred-twenty-four DEGs were identified and, among them, 123 were upregulated, while 201 were downregulated. The

  7. Functional analysis of PI-like gene in relation to flower development ...

    Indian Academy of Sciences (India)

    lying flower development in bamboo, a petal-identity gene was identified as a ... 35S::BoPI fully rescued the defective petal forma- tion in the ... Arabidopsis converted sepals to petals; BoPI-C interacted with BoAP3 on yeast two-hybrid assay, just like the full-length ... PI homologue function in regulating perianth organ forma-.

  8. Gene organization in rice revealed by full-length cDNA mapping and gene expression analysis through microarray.

    Directory of Open Access Journals (Sweden)

    Kouji Satoh

    Full Text Available Rice (Oryza sativa L. is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE genes, 33K annotated non-expressed (ANE genes, and 5.5K non-annotated expressed (NAE genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.

  9. Gene set analysis of purine and pyrimidine antimetabolites cancer therapies.

    Science.gov (United States)

    Fridley, Brooke L; Batzler, Anthony; Li, Liang; Li, Fang; Matimba, Alice; Jenkins, Gregory D; Ji, Yuan; Wang, Liewei; Weinshilboum, Richard M

    2011-11-01

    Responses to therapies, either with regard to toxicities or efficacy, are expected to involve complex relationships of gene products within the same molecular pathway or functional gene set. Therefore, pathways or gene sets, as opposed to single genes, may better reflect the true underlying biology and may be more appropriate units for analysis of pharmacogenomic studies. Application of such methods to pharmacogenomic studies may enable the detection of more subtle effects of multiple genes in the same pathway that may be missed by assessing each gene individually. A gene set analysis of 3821 gene sets is presented assessing the association between basal messenger RNA expression and drug cytotoxicity using ethnically defined human lymphoblastoid cell lines for two classes of drugs: pyrimidines [gemcitabine (dFdC) and arabinoside] and purines [6-thioguanine and 6-mercaptopurine]. The gene set nucleoside-diphosphatase activity was found to be significantly associated with both dFdC and arabinoside, whereas gene set γ-aminobutyric acid catabolic process was associated with dFdC and 6-thioguanine. These gene sets were significantly associated with the phenotype even after adjusting for multiple testing. In addition, five associated gene sets were found in common between the pyrimidines and two gene sets for the purines (3',5'-cyclic-AMP phosphodiesterase activity and γ-aminobutyric acid catabolic process) with a P value of less than 0.0001. Functional validation was attempted with four genes each in gene sets for thiopurine and pyrimidine antimetabolites. All four genes selected from the pyrimidine gene sets (PSME3, CANT1, ENTPD6, ADRM1) were validated, but only one (PDE4D) was validated for the thiopurine gene sets. In summary, results from the gene set analysis of pyrimidine and purine therapies, used often in the treatment of various cancers, provide novel insight into the relationship between genomic variation and drug response.

  10. DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.

    Science.gov (United States)

    Mazandu, Gaston K; Mulder, Nicola J

    2013-09-25

    The use of Gene Ontology (GO) data in protein analyses have largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications. We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries. The DaGO-Fun online tool presents the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis.

  11. Functional understanding of the diverse exon-intron structures of human GPCR genes.

    Science.gov (United States)

    Hammond, Dorothy A; Olman, Victor; Xu, Ying

    2014-02-01

    The GPCR genes have a variety of exon-intron structures even though their proteins are all structurally homologous. We have examined all human GPCR genes with at least two functional protein isoforms, totaling 199, aiming to gain an understanding of what may have contributed to the large diversity of the exon-intron structures of the GPCR genes. The 199 genes have a total of 808 known protein splicing isoforms with experimentally verified functions. Our analysis reveals that 1301 (80.6%) adjacent exon-exon pairs out of the total of 1,613 in the 199 genes have either exactly one exon skipped or the intron in-between retained in at least one of the 808 protein splicing isoforms. This observation has a statistical significance p-value of 2.051762 * e(-09), assuming that the observed splicing isoforms are independent of the exon-intron structures. Our interpretation of this observation is that the exon boundaries of the GPCR genes are not randomly determined; instead they may be selected to facilitate specific alternative splicing for functional purposes.

  12. Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects.

    Science.gov (United States)

    Jiang, Feng; Liu, Qing; Wang, Yanli; Zhang, Jie; Wang, Huimin; Song, Tianqi; Yang, Meiling; Wang, Xianhui; Kang, Le

    2017-06-01

    The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain-containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution. © The Authors 2017. Published by Oxford University Press.

  13. Combining many interaction networks to predict gene function and analyze gene lists.

    Science.gov (United States)

    Mostafavi, Sara; Morris, Quaid

    2012-05-01

    In this article, we review how interaction networks can be used alone or in combination in an automated fashion to provide insight into gene and protein function. We describe the concept of a "gene-recommender system" that can be applied to any large collection of interaction networks to make predictions about gene or protein function based on a query list of proteins that share a function of interest. We discuss these systems in general and focus on one specific system, GeneMANIA, that has unique features and uses different algorithms from the majority of other systems. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Pleiotropy analysis of quantitative traits at gene level by multivariate functional linear models.

    Science.gov (United States)

    Wang, Yifan; Liu, Aiyi; Mills, James L; Boehnke, Michael; Wilson, Alexander F; Bailey-Wilson, Joan E; Xiong, Momiao; Wu, Colin O; Fan, Ruzong

    2015-05-01

    In genetics, pleiotropy describes the genetic effect of a single gene on multiple phenotypic traits. A common approach is to analyze the phenotypic traits separately using univariate analyses and combine the test results through multiple comparisons. This approach may lead to low power. Multivariate functional linear models are developed to connect genetic variant data to multiple quantitative traits adjusting for covariates for a unified analysis. Three types of approximate F-distribution tests based on Pillai-Bartlett trace, Hotelling-Lawley trace, and Wilks's Lambda are introduced to test for association between multiple quantitative traits and multiple genetic variants in one genetic region. The approximate F-distribution tests provide much more significant results than those of F-tests of univariate analysis and optimal sequence kernel association test (SKAT-O). Extensive simulations were performed to evaluate the false positive rates and power performance of the proposed models and tests. We show that the approximate F-distribution tests control the type I error rates very well. Overall, simultaneous analysis of multiple traits can increase power performance compared to an individual test of each trait. The proposed methods were applied to analyze (1) four lipid traits in eight European cohorts, and (2) three biochemical traits in the Trinity Students Study. The approximate F-distribution tests provide much more significant results than those of F-tests of univariate analysis and SKAT-O for the three biochemical traits. The approximate F-distribution tests of the proposed functional linear models are more sensitive than those of the traditional multivariate linear models that in turn are more sensitive than SKAT-O in the univariate case. The analysis of the four lipid traits and the three biochemical traits detects more association than SKAT-O in the univariate case. © 2015 WILEY PERIODICALS, INC.

  15. Microarray analysis reveals key genes and pathways in Tetralogy of Fallot

    Science.gov (United States)

    He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai

    2017-01-01

    The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test, and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, which were all comprised of upregulated genes. Genes enriched in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were dominantly enriched in pathways associated with ribosomes and protein translation. The Xbox binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF

  16. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E.; Re, Matteo

    2014-01-01

    Objective In the context of “network medicine”, gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. Materials and methods We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. Results The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different “informativeness” embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Conclusions Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further

  17. Functional characterization of a Penicillium chrysogenum mutanase gene induced upon co-cultivation with Bacillus subtilis

    NARCIS (Netherlands)

    Bajaj, I.; Veiga, T.; Van Dissel, D.; Pronk, J.T.; Daran, J.M.

    2014-01-01

    Background Microbial gene expression is strongly influenced by environmental growth conditions. Comparison of gene expression under different conditions is frequently used for functional analysis and to unravel regulatory networks, however, gene expression responses to co-cultivation with other

  18. Processes of fungal proteome evolution and gain of function: gene duplication and domain rearrangement

    International Nuclear Information System (INIS)

    Cohen-Gihon, Inbar; Nussinov, Ruth; Sharan, Roded

    2011-01-01

    During evolution, organisms have gained functional complexity mainly by modifying and improving existing functioning systems rather than creating new ones ab initio. Here we explore the interplay between two processes which during evolution have had major roles in the acquisition of new functions: gene duplication and protein domain rearrangements. We consider four possible evolutionary scenarios: gene families that have undergone none of these event types; only gene duplication; only domain rearrangement, or both events. We characterize each of the four evolutionary scenarios by functional attributes. Our analysis of ten fungal genomes indicates that at least for the fungi clade, species significantly appear to gain complexity by gene duplication accompanied by the expansion of existing domain architectures via rearrangements. We show that paralogs gaining new domain architectures via duplication tend to adopt new functions compared to paralogs that preserve their domain architectures. We conclude that evolution of protein families through gene duplication and domain rearrangement is correlated with their functional properties. We suggest that in general, new functions are acquired via the integration of gene duplication and domain rearrangements rather than each process acting independently

  19. Functional analysis and tissue-differential expression of four FAD2 genes in amphidiploid Brassica napus derived from Brassica rapa and Brassica oleracea.

    Science.gov (United States)

    Lee, Kyeong-Ryeol; In Sohn, Soo; Jung, Jin Hee; Kim, Sun Hee; Roh, Kyung Hee; Kim, Jong-Bum; Suh, Mi Chung; Kim, Hyun Uk

    2013-12-01

    Fatty acid desaturase 2 (FAD2), which resides in the endoplasmic reticulum (ER), plays a crucial role in producing linoleic acid (18:2) through catalyzing the desaturation of oleic acid (18:1) by double bond formation at the delta 12 position. FAD2 catalyzes the first step needed for the production of polyunsaturated fatty acids found in the glycerolipids of cell membranes and the triacylglycerols in seeds. In this study, four FAD2 genes from amphidiploid Brassica napus genome were isolated by PCR amplification, with their enzymatic functions predicted by sequence analysis of the cDNAs. Fatty acid analysis of budding yeast transformed with each of the FAD2 genes showed that whereas BnFAD2-1, BnFAD2-2, and BnFAD2-4 are functional enzymes, and BnFAD2-3 is nonfunctional. The four FAD2 genes of B. napus originated from synthetic hybridization of its diploid progenitors Brassica rapa and Brassica oleracea, each of which has two FAD2 genes identical to those of B. napus. The BnFAD2-3 gene of B. napus, a nonfunctional pseudogene mutated by multiple nucleotide deletions and insertions, was inherited from B. rapa. All BnFAD2 isozymes except BnFAD2-3 localized to the ER. Nonfunctional BnFAD2-3 localized to the nucleus and chloroplasts. Four BnFAD2 genes can be classified on the basis of their expression patterns. © 2013.

  20. Effects of in ovo electroporation on endogenous gene expression: genome-wide analysis

    Directory of Open Access Journals (Sweden)

    Chambers David

    2011-04-01

    Full Text Available Abstract Background In ovo electroporation is a widely used technique to study gene function in developmental biology. Despite the widespread acceptance of this technique, no genome-wide analysis of the effects of in ovo electroporation, principally the current applied across the tissue and exogenous vector DNA introduced, on endogenous gene expression has been undertaken. Here, the effects of electric current and expression of a GFP-containing construct, via electroporation into the midbrain of Hamburger-Hamilton stage 10 chicken embryos, are analysed by microarray. Results Both current alone and in combination with exogenous DNA expression have a small but reproducible effect on endogenous gene expression, changing the expression of the genes represented on the array by less than 0.1% (current and less than 0.5% (current + DNA, respectively. The subset of genes regulated by electric current and exogenous DNA span a disparate set of cellular functions. However, no genes involved in the regional identity were affected. In sharp contrast to this, electroporation of a known transcription factor, Dmrt5, caused a much greater change in gene expression. Conclusions These findings represent the first systematic genome-wide analysis of the effects of in ovo electroporation on gene expression during embryonic development. The analysis reveals that this process has minimal impact on the genetic basis of cell fate specification. Thus, the study demonstrates the validity of the in ovo electroporation technique to study gene function and expression during development. Furthermore, the data presented here can be used as a resource to refine the set of transcriptional responders in future in ovo electroporation studies of specific gene function.

  1. GeoChips for Analysis of Microbial Functional Communities

    Energy Technology Data Exchange (ETDEWEB)

    Van Nostrand, Joy D.; Wu, Liyou; He, Zhili; Zhou, Jizhong

    2008-09-30

    Functional gene arrays (FGA) are microarrays that contain probes for genes encoding proteins or enzymes involved in functions of interest and allow for the study of thousands of genes at one time. The most comprehensive FGA to date is the GeoChip, which contains ~;;24,000 probes for ~;;10,000 genes involved in the geochemical cycling of C, N, P, and S, as well as genes involved in metal resistance and reduction and contaminant degradation. This chapter details the methods necessary for GeoChip analysis. Methods covered include preparation of DNA (whole community genome amplification and labeling), array setup (prehybridization steps), hybridization (sample and hybridization buffers), and post hybridization steps (slide washing and array scanning).

  2. FMAP: Functional Mapping and Analysis Pipeline for metagenomics and metatranscriptomics studies.

    Science.gov (United States)

    Kim, Jiwoong; Kim, Min Soo; Koh, Andrew Y; Xie, Yang; Zhan, Xiaowei

    2016-10-10

    Given the lack of a complete and comprehensive library of microbial reference genomes, determining the functional profile of diverse microbial communities is challenging. The available functional analysis pipelines lack several key features: (i) an integrated alignment tool, (ii) operon-level analysis, and (iii) the ability to process large datasets. Here we introduce our open-sourced, stand-alone functional analysis pipeline for analyzing whole metagenomic and metatranscriptomic sequencing data, FMAP (Functional Mapping and Analysis Pipeline). FMAP performs alignment, gene family abundance calculations, and statistical analysis (three levels of analyses are provided: differentially-abundant genes, operons and pathways). The resulting output can be easily visualized with heatmaps and functional pathway diagrams. FMAP functional predictions are consistent with currently available functional analysis pipelines. FMAP is a comprehensive tool for providing functional analysis of metagenomic/metatranscriptomic sequencing data. With the added features of integrated alignment, operon-level analysis, and the ability to process large datasets, FMAP will be a valuable addition to the currently available functional analysis toolbox. We believe that this software will be of great value to the wider biology and bioinformatics communities.

  3. Assessing gene function in the ruminant placenta.

    Science.gov (United States)

    Anthony, R V; Cantlon, J D; Gates, K C; Purcell, S H; Clay, C M

    2010-01-01

    The placenta provides the means for nutrient transfer from the mother to the fetus, waste transfer from the fetus to the mother, protection of the fetus from the maternal immune system, and is an active endocrine organ. While many placental functions have been defined and investigated, assessing the function of specific genes expressed by the placenta has been problematic, since classical ablation-replacement methods are not feasible with the placenta. The pregnant sheep has been a long-standing animal model for assessing in vivo physiology during pregnancy, since surgical placement of indwelling catheters into both maternal and fetal vasculature has allowed the assessment of placental nutrient transfer and utilization, as well as placental hormone secretion, under unanesthetized-unstressed steady state sampling conditions. However, in ruminants the lack of well-characterized trophoblast cell lines and the inefficiency of creating transgenic pregnancies in ruminants have inhibited our ability to assess specific gene function. Recently, sheep and cattle primary trophoblast cell lines have been reported, and may further our ability to investigate trophoblast function and transcriptional regulation of genes expressed by the placenta. Furthermore, viral infection of the trophoectoderm layer of hatched blastocysts, as a means for placenta-specific transgenesis, holds considerable potential to assess gene function in the ruminant placenta. This approach has been used successfully to "knockdown" gene expression in the developing sheep conceptus, and has the potential for gain-of-function experiments as well. While this technology is still being developed, it may provide an efficient approach to assess specific gene function in the ruminant placenta.

  4. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.).

    Science.gov (United States)

    Deokar, Amit A; Tar'an, Bunyamin

    2016-01-01

    Aquaporins (AQPs) are essential membrane proteins that play critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea ( Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP's evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis -acting regulatory elements revealed enrichment of cis -elements involved in circadian control, light response, defense and stress responsiveness

  5. Dynamic gene expression analysis in a H1N1 influenza virus mouse pneumonia model.

    Science.gov (United States)

    Bao, Yanyan; Gao, Yingjie; Shi, Yujing; Cui, Xiaolan

    2017-06-01

    H1N1, a major pathogenic subtype of influenza A virus, causes a respiratory infection in humans and livestock that can range from a mild infection to more severe pneumonia associated with acute respiratory distress syndrome. Understanding the dynamic changes in the genome and the related functional changes induced by H1N1 influenza virus infection is essential to elucidating the pathogenesis of this virus and thereby determining strategies to prevent future outbreaks. In this study, we filtered the significantly expressed genes in mouse pneumonia using mRNA microarray analysis. Using STC analysis, seven significant gene clusters were revealed, and using STC-GO analysis, we explored the significant functions of these seven gene clusters. The results revealed GOs related to H1N1 virus-induced inflammatory and immune functions, including innate immune response, inflammatory response, specific immune response, and cellular response to interferon-beta. Furthermore, the dynamic regulation relationships of the key genes in mouse pneumonia were revealed by dynamic gene network analysis, and the most important genes were filtered, including Dhx58, Cxcl10, Cxcl11, Zbp1, Ifit1, Ifih1, Trim25, Mx2, Oas2, Cd274, Irgm1, and Irf7. These results suggested that during mouse pneumonia, changes in the expression of gene clusters and the complex interactions among genes lead to significant changes in function. Dynamic gene expression analysis revealed key genes that performed important functions. These results are a prelude to advancements in mouse H1N1 influenza virus infection biology, as well as the use of mice as a model organism for human H1N1 influenza virus infection studies.

  6. The duplicated genes database: identification and functional annotation of co-localised duplicated genes across genomes.

    Directory of Open Access Journals (Sweden)

    Marion Ouedraogo

    Full Text Available BACKGROUND: There has been a surge in studies linking genome structure and gene expression, with special focus on duplicated genes. Although initially duplicated from the same sequence, duplicated genes can diverge strongly over evolution and take on different functions or regulated expression. However, information on the function and expression of duplicated genes remains sparse. Identifying groups of duplicated genes in different genomes and characterizing their expression and function would therefore be of great interest to the research community. The 'Duplicated Genes Database' (DGD was developed for this purpose. METHODOLOGY: Nine species were included in the DGD. For each species, BLAST analyses were conducted on peptide sequences corresponding to the genes mapped on a same chromosome. Groups of duplicated genes were defined based on these pairwise BLAST comparisons and the genomic location of the genes. For each group, Pearson correlations between gene expression data and semantic similarities between functional GO annotations were also computed when the relevant information was available. CONCLUSIONS: The Duplicated Gene Database provides a list of co-localised and duplicated genes for several species with the available gene co-expression level and semantic similarity value of functional annotation. Adding these data to the groups of duplicated genes provides biological information that can prove useful to gene expression analyses. The Duplicated Gene Database can be freely accessed through the DGD website at http://dgd.genouest.org.

  7. Characterization, expression patterns and functional analysis of the MAPK and MAPKK genes in watermelon (Citrullus lanatus).

    Science.gov (United States)

    Song, Qiuming; Li, Dayong; Dai, Yi; Liu, Shixia; Huang, Lei; Hong, Yongbo; Zhang, Huijuan; Song, Fengming

    2015-12-23

    Mitogen-activated protein kinase (MAPK) cascades, which consist of three functionally associated protein kinases, namely MEKKs, MKKs and MPKs, are universal signaling modules in all eukaryotes and have been shown to play critical roles in many physiological and biochemical processes in plants. However, little or nothing is known about the MPK and MKK families in watermelon. In the present study, we performed a systematic characterization of the ClMPK and ClMKK families including the identification and nomenclature, chromosomal localization, phylogenetic relationships, ClMPK-ClMKK interactions, expression patterns in different tissues and in response to abiotic and biotic stress and transient expression-based functional analysis for their roles in disease resistance. Genome-wide survey identified fifteen ClMPK and six ClMKK genes in watermelon genome and phylogenetic analysis revealed that both of the ClMPK and ClMKK families can be classified into four distinct groups. Yeast two-hybrid assays demonstrated significant interactions between members of the ClMPK and ClMKK families, defining putative ClMKK2-1/ClMKK6-ClMPK4-1/ClMPK4-2/ClMPK13 and ClMKK5-ClMPK6 cascades. Most of the members in the ClMPK and ClMKK families showed differential expression patterns in different tissues and in response to abiotic (e.g. drought, salt, cold and heat treatments) and biotic (e.g. infection of Fusarium oxysporum f. sp. niveum) stresses. Transient expression of ClMPK1, ClMPK4-2 and ClMPK7 in Nicotiana benthamiana resulted in enhanced resistance to Botrytis cinerea and upregulated expression of defense genes while transient expression of ClMPK6 and ClMKK2-2 led to increased susceptibility to B. cinerea. Furthermore, transient expression of ClMPK7 also led to hypersensitive response (HR)-like cell death and significant accumulation of H2O2 in N. benthamiana. We identified fifteen ClMPK and six ClMKK genes from watermelon and analyzed their phylogenetic relationships, expression

  8. Integrative mining of traditional Chinese medicine literature and MEDLINE for functional gene networks.

    Science.gov (United States)

    Zhou, Xuezhong; Liu, Baoyan; Wu, Zhaohui; Feng, Yi

    2007-10-01

    The amount of biomedical data in different disciplines is growing at an exponential rate. Integrating these significant knowledge sources to generate novel hypotheses for systems biology research is difficult. Traditional Chinese medicine (TCM) is a completely different discipline, and is a complementary knowledge system to modern biomedical science. This paper uses a significant TCM bibliographic literature database in China, together with MEDLINE, to help discover novel gene functional knowledge. We present an integrative mining approach to uncover the functional gene relationships from MEDLINE and TCM bibliographic literature. This paper introduces TCM literature (about 50,000 records) as one knowledge source for constructing literature-based gene networks. We use the TCM diagnosis, TCM syndrome, to automatically congregate the related genes. The syndrome-gene relationships are discovered based on the syndrome-disease relationships extracted from TCM literature and the disease-gene relationships in MEDLINE. Based on the bubble-bootstrapping and relation weight computing methods, we have developed a prototype system called MeDisco/3S, which has name entity and relation extraction, and online analytical processing (OLAP) capabilities, to perform the integrative mining process. We have got about 200,000 syndrome-gene relations, which could help generate syndrome-based gene networks, and help analyze the functional knowledge of genes from syndrome perspective. We take the gene network of Kidney-Yang Deficiency syndrome (KYD syndrome) and the functional analysis of some genes, such as CRH (corticotropin releasing hormone), PTH (parathyroid hormone), PRL (prolactin), BRCA1 (breast cancer 1, early onset) and BRCA2 (breast cancer 2, early onset), to demonstrate the preliminary results. The underlying hypothesis is that the related genes of the same syndrome will have some biological functional relationships, and will constitute a functional network. This paper presents

  9. Evolutionary Fates and Dynamic Functionalization of Young Duplicate Genes in Arabidopsis Genomes1[OPEN

    Science.gov (United States)

    Wang, Jun; Tao, Feng; Marowsky, Nicholas C.; Fan, Chuanzhu

    2016-01-01

    Gene duplication is a primary means to generate genomic novelties, playing an essential role in speciation and adaptation. Particularly in plants, a high abundance of duplicate genes has been maintained for significantly long periods of evolutionary time. To address the manner in which young duplicate genes were derived primarily from small-scale gene duplication and preserved in plant genomes and to determine the underlying driving mechanisms, we generated transcriptomes to produce the expression profiles of five tissues in Arabidopsis thaliana and the closely related species Arabidopsis lyrata and Capsella rubella. Based on the quantitative analysis metrics, we investigated the evolutionary processes of young duplicate genes in Arabidopsis. We determined that conservation, neofunctionalization, and specialization are three main evolutionary processes for Arabidopsis young duplicate genes. We explicitly demonstrated the dynamic functionalization of duplicate genes along the evolutionary time scale. Upon origination, duplicates tend to maintain their ancestral functions; but as they survive longer, they might be likely to develop distinct and novel functions. The temporal evolutionary processes and functionalization of plant duplicate genes are associated with their ancestral functions, dynamic DNA methylation levels, and histone modification abundances. Furthermore, duplicate genes tend to be initially expressed in pollen and then to gain more interaction partners over time. Altogether, our study provides novel insights into the dynamic retention processes of young duplicate genes in plant genomes. PMID:27485883

  10. New Genes and Functional Innovation in Mammals.

    Science.gov (United States)

    Luis Villanueva-Cañas, José; Ruiz-Orera, Jorge; Agea, M Isabel; Gallo, Maria; Andreu, David; Albà, M Mar

    2017-07-01

    The birth of genes that encode new protein sequences is a major source of evolutionary innovation. However, we still understand relatively little about how these genes come into being and which functions they are selected for. To address these questions, we have obtained a large collection of mammalian-specific gene families that lack homologues in other eukaryotic groups. We have combined gene annotations and de novo transcript assemblies from 30 different mammalian species, obtaining ∼6,000 gene families. In general, the proteins in mammalian-specific gene families tend to be short and depleted in aromatic and negatively charged residues. Proteins which arose early in mammalian evolution include milk and skin polypeptides, immune response components, and proteins involved in reproduction. In contrast, the functions of proteins which have a more recent origin remain largely unknown, despite the fact that these proteins also have extensive proteomics support. We identify several previously described cases of genes originated de novo from noncoding genomic regions, supporting the idea that this mechanism frequently underlies the evolution of new protein-coding genes in mammals. Finally, we show that most young mammalian genes are preferentially expressed in testis, suggesting that sexual selection plays an important role in the emergence of new functional genes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  11. Live imaging of muscles in Drosophila metamorphosis: Towards high-throughput gene identification and function analysis.

    Science.gov (United States)

    Puah, Wee Choo; Wasser, Martin

    2016-03-01

    Time-lapse microscopy in developmental biology is an emerging tool for functional genomics. Phenotypic effects of gene perturbations can be studied non-invasively at multiple time points in chronological order. During metamorphosis of Drosophila melanogaster, time-lapse microscopy using fluorescent reporters allows visualization of alternative fates of larval muscles, which are a model for the study of genes related to muscle wasting. While doomed muscles enter hormone-induced programmed cell death, a smaller population of persistent muscles survives to adulthood and undergoes morphological remodeling that involves atrophy in early, and hypertrophy in late pupation. We developed a method that combines in vivo imaging, targeted gene perturbation and image analysis to identify and characterize genes involved in muscle development. Macrozoom microscopy helps to screen for interesting muscle phenotypes, while confocal microscopy in multiple locations over 4-5 days produces time-lapse images that are used to quantify changes in cell morphology. Performing a similar investigation using fixed pupal tissues would be too time-consuming and therefore impractical. We describe three applications of our pipeline. First, we show how quantitative microscopy can track and measure morphological changes of muscle throughout metamorphosis and analyze genes involved in atrophy. Second, our assay can help to identify genes that either promote or prevent histolysis of abdominal muscles. Third, we apply our approach to test new fluorescent proteins as live markers for muscle development. We describe mKO2 tagged Cysteine proteinase 1 (Cp1) and Troponin-I (TnI) as examples of proteins showing developmental changes in subcellular localization. Finally, we discuss strategies to improve throughput of our pipeline to permit genome-wide screens in the future. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  12. Identification of pathogenic genes related to rheumatoid arthritis through integrated analysis of DNA methylation and gene expression profiling.

    Science.gov (United States)

    Zhang, Lei; Ma, Shiyun; Wang, Huailiang; Su, Hang; Su, Ke; Li, Longjie

    2017-11-15

    The purpose of our study was to identify new pathogenic genes used for exploring the pathogenesis of rheumatoid arthritis (RA). To screen pathogenic genes of RA, an integrated analysis was performed by using the microarray datasets in RA derived from the Gene Expression Omnibus (GEO) database. The functional annotation and potential pathways of differentially expressed genes (DEGs) were further discovered by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis. Afterwards, the integrated analysis of DNA methylation and gene expression profiling was used to screen crucial genes. In addition, we used RT-PCR and MSP to verify the expression levels and methylation status of these crucial genes in 20 synovial biopsy samples obtained from 10 RA model mice and 10 normal mice. BCL11B, CCDC88C, FCRLA and APOL6 were both up-regulated and hypomethylated in RA according to integrated analysis, RT-PCR and MSP verification. Four crucial genes (BCL11B, CCDC88C, FCRLA and APOL6) identified and analyzed in this study might be closely connected with the pathogenesis of RA. Copyright © 2017. Published by Elsevier B.V.

  13. Investigating Gene Function in Cereal Rust Fungi by Plant-Mediated Virus-Induced Gene Silencing.

    Science.gov (United States)

    Panwar, Vinay; Bakkeren, Guus

    2017-01-01

    Cereal rust fungi are destructive pathogens, threatening grain production worldwide. Targeted breeding for resistance utilizing host resistance genes has been effective. However, breakdown of resistance occurs frequently and continued efforts are needed to understand how these fungi overcome resistance and to expand the range of available resistance genes. Whole genome sequencing, transcriptomic and proteomic studies followed by genome-wide computational and comparative analyses have identified large repertoire of genes in rust fungi among which are candidates predicted to code for pathogenicity and virulence factors. Some of these genes represent defence triggering avirulence effectors. However, functions of most genes still needs to be assessed to understand the biology of these obligate biotrophic pathogens. Since genetic manipulations such as gene deletion and genetic transformation are not yet feasible in rust fungi, performing functional gene studies is challenging. Recently, Host-induced gene silencing (HIGS) has emerged as a useful tool to characterize gene function in rust fungi while infecting and growing in host plants. We utilized Barley stripe mosaic virus-mediated virus induced gene silencing (BSMV-VIGS) to induce HIGS of candidate rust fungal genes in the wheat host to determine their role in plant-fungal interactions. Here, we describe the methods for using BSMV-VIGS in wheat for functional genomics study in cereal rust fungi.

  14. FY 1999 report on the results on analysis of protein functions; 1999 nendo tanpakushitsu kino kaiseki seika hokokusho

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2001-03-01

    This project is aimed at construction of the intellectual infrastructures for biotechnologies, in order to accelerate development of the Japanese technologies and activate their application to industries. Described herein are the FY 1999 results. These infrastructures are for functional analysis of protein which will be one of the key issues in genome analysis, and collection and analysis of biological information. This project includes a total of 9 research and development themes for four research categories: frequency analysis of gene expression (development of the gene expression profile database system for functional analysis of human genome, and analysis of the gene expression and protein functions by the ECA chip technology), function analysis by the biological model (high-performance analysis by the bio-project, database system for drug metabolizing enzymes, analysis of gene functions using mutant mice, and simple genome function analysis of murine individuals using the RNAi effect), protein expression (function validation of unknown human genes based on the useful biological model, and protein function analysis using multi-purpose destination vectors), and protein function prediction by the information science method. (NEDO)

  15. QTL global meta-analysis: are trait determining genes clustered?

    Directory of Open Access Journals (Sweden)

    Adelson David L

    2009-04-01

    Full Text Available Abstract Background A key open question in biology is if genes are physically clustered with respect to their known functions or phenotypic effects. This is of particular interest for Quantitative Trait Loci (QTL where a QTL region could contain a number of genes that contribute to the trait being measured. Results We observed a significant increase in gene density within QTL regions compared to non-QTL regions and/or the entire bovine genome. By grouping QTL from the Bovine QTL Viewer database into 8 categories of non-redundant regions, we have been able to analyze gene density and gene function distribution, based on Gene Ontology (GO with relation to their location within QTL regions, outside of QTL regions and across the entire bovine genome. We identified a number of GO terms that were significantly over represented within particular QTL categories. Furthermore, select GO terms expected to be associated with the QTL category based on common biological knowledge have also proved to be significantly over represented in QTL regions. Conclusion Our analysis provides evidence of over represented GO terms in QTL regions. This increased GO term density indicates possible clustering of gene functions within QTL regions of the bovine genome. Genes with similar functions may be grouped in specific locales and could be contributing to QTL traits. Moreover, we have identified over-represented GO terminology that from a biological standpoint, makes sense with respect to QTL category type.

  16. Genetic and functional analysis of the gene encoding GAP-43 in schizophrenia.

    Science.gov (United States)

    Shen, Yu-Chih; Tsai, Ho-Min; Cheng, Min-Chih; Hsu, Shih-Hsin; Chen, Shih-Fen; Chen, Chia-Hsiang

    2012-02-01

    In earlier reports, growth-associated protein 43 (GAP-43) has been shown to be critical for initial establishment or reorganization of synaptic connections, a process thought to be disrupted in schizophrenia. Additionally, abnormal GAP-43 expression in different brain regions has been linked to this disorder in postmortem brain studies. In this study, we investigated the involvement of the gene encoding GAP-43 in the susceptibility to schizophrenia. We searched for genetic variants in the promoter region and 3 exons (including both UTR ends) of the GAP-43 gene using direct sequencing in a sample of patients with schizophrenia (n=586) and non-psychotic controls (n=576), both being Han Chinese from Taiwan, and conducted an association and functional study. We identified 11 common polymorphisms in the GAP-43 gene. SNP and haplotype-based analyses displayed no associations with schizophrenia. Additionally, we identified 4 rare variants in 5 out of 586 patients, including 1 variant located at the promoter region (c.-258-4722G>T) and 1 synonymous (V110V) and 2 missense (G150R and P188L) variants located at exon 2. No rare variants were found in the control subjects. The results of the reporter gene assay demonstrated that the regulatory activity of construct containing c.-258-4722T was significantly lower as compared to the wild type construct (c.-258-4722G; panalysis also demonstrated the functional relevance of other rare variants. Our study lends support to the hypothesis of multiple rare mutations in schizophrenia, and it provides genetic clues that indicate the involvement of GAP-43 in this disorder. Copyright © 2011 Elsevier B.V. All rights reserved.

  17. Comparative expression profiling reveals gene functions in female meiosis and gametophyte development in Arabidopsis.

    Science.gov (United States)

    Zhao, Lihua; He, Jiangman; Cai, Hanyang; Lin, Haiyan; Li, Yanqiang; Liu, Renyi; Yang, Zhenbiao; Qin, Yuan

    2014-11-01

    Megasporogenesis is essential for female fertility, and requires the accomplishment of meiosis and the formation of functional megaspores. The inaccessibility and low abundance of female meiocytes make it particularly difficult to elucidate the molecular basis underlying megasporogenesis. We used high-throughput tag-sequencing analysis to identify genes expressed in female meiocytes (FMs) by comparing gene expression profiles from wild-type ovules undergoing megasporogenesis with those from the spl mutant ovules, which lack megasporogenesis. A total of 862 genes were identified as FMs, with levels that are consistently reduced in spl ovules in two biological replicates. Fluorescence-assisted cell sorting followed by RNA-seq analysis of DMC1:GFP-labeled female meiocytes confirmed that 90% of the FMs are indeed detected in the female meiocyte protoplast profiling. We performed reverse genetic analysis of 120 candidate genes and identified four FM genes with a function in female meiosis progression in Arabidopsis. We further revealed that KLU, a putative cytochrome P450 monooxygenase, is involved in chromosome pairing during female meiosis, most likely by affecting the normal expression pattern of DMC1 in ovules during female meiosis. Our studies provide valuable information for functional genomic analyses of plant germline development as well as insights into meiosis. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  18. Gene Circuit Analysis of the Terminal Gap Gene huckebein

    Science.gov (United States)

    Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

    2009-01-01

    The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network. PMID:19876378

  19. Interconnection of Key Microbial Functional Genes for Enhanced Benzo[a]pyrene Biodegradation in Sediments by Microbial Electrochemistry.

    Science.gov (United States)

    Yan, Zaisheng; He, Yuhong; Cai, Haiyuan; Van Nostrand, Joy D; He, Zhili; Zhou, Jizhong; Krumholz, Lee R; Jiang, He-Long

    2017-08-01

    Sediment microbial fuel cells (SMFCs) can stimulate the degradation of polycyclic aromatic hydrocarbons in sediments, but the mechanism of this process is poorly understood at the microbial functional gene level. Here, the use of SMFC resulted in 92% benzo[a]pyrene (BaP) removal over 970 days relative to 54% in the controls. Sediment functions, microbial community structure, and network interactions were dramatically altered by the SMFC employment. Functional gene analysis showed that c-type cytochrome genes for electron transfer, aromatic degradation genes, and extracellular ligninolytic enzymes involved in lignin degradation were significantly enriched in bulk sediments during SMFC operation. Correspondingly, chemical analysis of the system showed that these genetic changes resulted in increases in the levels of easily oxidizable organic carbon and humic acids which may have resulted in increased BaP bioavailability and increased degradation rates. Tracking microbial functional genes and corresponding organic matter responses should aid mechanistic understanding of BaP enhanced biodegradation by microbial electrochemistry and development of sustainable bioremediation strategies.

  20. Functional requirements for bacteriophage growth: gene essentiality and expression in mycobacteriophage Giles.

    Science.gov (United States)

    Dedrick, Rebekah M; Marinelli, Laura J; Newton, Gerald L; Pogliano, Kit; Pogliano, Joseph; Hatfull, Graham F

    2013-05-01

    Bacteriophages represent a majority of all life forms, and the vast, dynamic population with early origins is reflected in their enormous genetic diversity. A large number of bacteriophage genomes have been sequenced. They are replete with novel genes without known relatives. We know little about their functions, which genes are required for lytic growth, and how they are expressed. Furthermore, the diversity is such that even genes with required functions - such as virion proteins and repressors - cannot always be recognized. Here we describe a functional genomic dissection of mycobacteriophage Giles, in which the virion proteins are identified, genes required for lytic growth are determined, the repressor is identified, and the transcription patterns determined. We find that although all of the predicted phage genes are expressed either in lysogeny or in lytic growth, 45% of the predicted genes are non-essential for lytic growth. We also describe genes required for DNA replication, show that recombination is required for lytic growth, and that Giles encodes a novel repressor. RNAseq analysis reveals abundant expression of a small non-coding RNA in a lysogen and in late lytic growth, although it is non-essential for lytic growth and does not alter lysogeny. © 2013 Blackwell Publishing Ltd.

  1. A-DaGO-Fun: an adaptable Gene Ontology semantic similarity-based functional analysis tool.

    Science.gov (United States)

    Mazandu, Gaston K; Chimusa, Emile R; Mbiyavanga, Mamana; Mulder, Nicola J

    2016-02-01

    Gene Ontology (GO) semantic similarity measures are being used for biological knowledge discovery based on GO annotations by integrating biological information contained in the GO structure into data analyses. To empower users to quickly compute, manipulate and explore these measures, we introduce A-DaGO-Fun (ADaptable Gene Ontology semantic similarity-based Functional analysis). It is a portable software package integrating all known GO information content-based semantic similarity measures and relevant biological applications associated with these measures. A-DaGO-Fun has the advantage not only of handling datasets from the current high-throughput genome-wide applications, but also allowing users to choose the most relevant semantic similarity approach for their biological applications and to adapt a given module to their needs. A-DaGO-Fun is freely available to the research community at http://web.cbio.uct.ac.za/ITGOM/adagofun. It is implemented in Linux using Python under free software (GNU General Public Licence). gmazandu@cbio.uct.ac.za or Nicola.Mulder@uct.ac.za Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  2. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods.

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo

    2014-06-01

    In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both

  3. Metagenomes reveal microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor.

    Science.gov (United States)

    Ma, Jinxing; Wang, Zhiwei; Li, Huan; Park, Hee-Deung; Wu, Zhichao

    2016-06-01

    Metagenomic sequencing was used to investigate the microbial structures, functional potentials, and biofouling-related genes in a membrane bioreactor (MBR). The results showed that the microbial community in the MBR was highly diverse. Notably, function analysis of the dominant genera indicated that common genes from different phylotypes were identified for important functional potentials with the observation of variation of abundances of genes in a certain taxon (e.g., Dechloromonas). Despite maintaining similar metabolic functional potentials with a parallel full-scale conventional activated sludge (CAS) system due to treating the identical wastewater, the MBR had more abundant nitrification-related bacteria and coding genes of ammonia monooxygenase, which could well explain its excellent ammonia removal in the low-temperature period. Furthermore, according to quantification of the genes involved in exopolysaccharide and extracellular polymeric substance (EPS) protein metabolism, the MBR did not show a much different potential in producing EPS compared to the CAS system, and bacteria from the membrane biofilm had lower abundances of genes associated with EPS biosynthesis and transport compared to the activated sludge in the MBR.

  4. Discovering functions of unannotated genes from a transcriptome survey of wild fungal isolates.

    Science.gov (United States)

    Ellison, Christopher E; Kowbel, David; Glass, N Louise; Taylor, John W; Brem, Rachel B

    2014-04-01

    Most fungal genomes are poorly annotated, and many fungal traits of industrial and biomedical relevance are not well suited to classical genetic screens. Assigning genes to phenotypes on a genomic scale thus remains an urgent need in the field. We developed an approach to infer gene function from expression profiles of wild fungal isolates, and we applied our strategy to the filamentous fungus Neurospora crassa. Using transcriptome measurements in 70 strains from two well-defined clades of this microbe, we first identified 2,247 cases in which the expression of an unannotated gene rose and fell across N. crassa strains in parallel with the expression of well-characterized genes. We then used image analysis of hyphal morphologies, quantitative growth assays, and expression profiling to test the functions of four genes predicted from our population analyses. The results revealed two factors that influenced regulation of metabolism of nonpreferred carbon and nitrogen sources, a gene that governed hyphal architecture, and a gene that mediated amino acid starvation resistance. These findings validate the power of our population-transcriptomic approach for inference of novel gene function, and we suggest that this strategy will be of broad utility for genome-scale annotation in many fungal systems. IMPORTANCE Some fungal species cause deadly infections in humans or crop plants, and other fungi are workhorses of industrial chemistry, including the production of biofuels. Advances in medical and industrial mycology require an understanding of the genes that control fungal traits. We developed a method to infer functions of uncharacterized genes by observing correlated expression of their mRNAs with those of known genes across wild fungal isolates. We applied this strategy to a filamentous fungus and predicted functions for thousands of unknown genes. In four cases, we experimentally validated the predictions from our method, discovering novel genes involved in the

  5. Functional Associations by Response Overlap (FARO, a functional genomics approach matching gene expression phenotypes.

    Directory of Open Access Journals (Sweden)

    Henrik Bjørn Nielsen

    2007-08-01

    Full Text Available The systematic comparison of transcriptional responses of organisms is a powerful tool in functional genomics. For example, mutants may be characterized by comparing their transcript profiles to those obtained in other experiments querying the effects on gene expression of many experimental factors including treatments, mutations and pathogen infections. Similarly, drugs may be discovered by the relationship between the transcript profiles effectuated or impacted by a candidate drug and by the target disease. The integration of such data enables systems biology to predict the interplay between experimental factors affecting a biological system. Unfortunately, direct comparisons of gene expression profiles obtained in independent, publicly available microarray experiments are typically compromised by substantial, experiment-specific biases. Here we suggest a novel yet conceptually simple approach for deriving 'Functional Association(s by Response Overlap' (FARO between microarray gene expression studies. The transcriptional response is defined by the set of differentially expressed genes independent from the magnitude or direction of the change. This approach overcomes the limited comparability between studies that is typical for methods that rely on correlation in gene expression. We apply FARO to a compendium of 242 diverse Arabidopsis microarray experimental factors, including phyto-hormones, stresses and pathogens, growth conditions/stages, tissue types and mutants. We also use FARO to confirm and further delineate the functions of Arabidopsis MAP kinase 4 in disease and stress responses. Furthermore, we find that a large, well-defined set of genes responds in opposing directions to different stress conditions and predict the effects of different stress combinations. This demonstrates the usefulness of our approach for exploiting public microarray data to derive biologically meaningful associations between experimental factors. Finally, our

  6. Studying Functions of All Yeast Genes Simultaneously

    Science.gov (United States)

    Stolc, Viktor; Eason, Robert G.; Poumand, Nader; Herman, Zelek S.; Davis, Ronald W.; Anthony Kevin; Jejelowo, Olufisayo

    2006-01-01

    A method of studying the functions of all the genes of a given species of microorganism simultaneously has been developed in experiments on Saccharomyces cerevisiae (commonly known as baker's or brewer's yeast). It is already known that many yeast genes perform functions similar to those of corresponding human genes; therefore, by facilitating understanding of yeast genes, the method may ultimately also contribute to the knowledge needed to treat some diseases in humans. Because of the complexity of the method and the highly specialized nature of the underlying knowledge, it is possible to give only a brief and sketchy summary here. The method involves the use of unique synthetic deoxyribonucleic acid (DNA) sequences that are denoted as DNA bar codes because of their utility as molecular labels. The method also involves the disruption of gene functions through deletion of genes. Saccharomyces cerevisiae is a particularly powerful experimental system in that multiple deletion strains easily can be pooled for parallel growth assays. Individual deletion strains recently have been created for 5,918 open reading frames, representing nearly all of the estimated 6,000 genetic loci of Saccharomyces cerevisiae. Tagging of each deletion strain with one or two unique 20-nucleotide sequences enables identification of genes affected by specific growth conditions, without prior knowledge of gene functions. Hybridization of bar-code DNA to oligonucleotide arrays can be used to measure the growth rate of each strain over several cell-division generations. The growth rate thus measured serves as an index of the fitness of the strain.

  7. Identifying arsenic trioxide (ATO) functions in leukemia cells by using time series gene expression profiles.

    Science.gov (United States)

    Yang, Hong; Lin, Shan; Cui, Jingru

    2014-02-10

    Arsenic trioxide (ATO) is presently the most active single agent in the treatment of acute promyelocytic leukemia (APL). In order to explore the molecular mechanism of ATO in leukemia cells with time series, we adopted bioinformatics strategy to analyze expression changing patterns and changes in transcription regulation modules of time series genes filtered from Gene Expression Omnibus database (GSE24946). We totally screened out 1847 time series genes for subsequent analysis. The KEGG (Kyoto encyclopedia of genes and genomes) pathways enrichment analysis of these genes showed that oxidative phosphorylation and ribosome were the top 2 significantly enriched pathways. STEM software was employed to compare changing patterns of gene expression with assigned 50 expression patterns. We screened out 7 significantly enriched patterns and 4 tendency charts of time series genes. The result of Gene Ontology showed that functions of times series genes mainly distributed in profiles 41, 40, 39 and 38. Seven genes with positive regulation of cell adhesion function were enriched in profile 40, and presented the same first increased model then decreased model as profile 40. The transcription module analysis showed that they mainly involved in oxidative phosphorylation pathway and ribosome pathway. Overall, our data summarized the gene expression changes in ATO treated K562-r cell lines with time and suggested that time series genes mainly regulated cell adhesive. Furthermore, our result may provide theoretical basis of molecular biology in treating acute promyelocytic leukemia. Copyright © 2013 Elsevier B.V. All rights reserved.

  8. Genomic structure and promoter functional analysis of GnRH3 gene in large yellow croaker (Larimichthys crocea).

    Science.gov (United States)

    Huang, Wei; Zhang, Jianshe; Liao, Zhi; Lv, Zhenming; Wu, Huifei; Zhu, Aiyi; Wu, Changwen

    2016-01-15

    Gonadotropin-releasing hormone III (GnRH3) is considered to be a key neurohormone in fish reproduction control. In the present study, the cDNA and genomic sequences of GnRH3 were cloned and characterized from large yellow croaker Larimichthys crocea. The cDNA encoded a protein of 99 amino acids with four functional motifs. The full-length genome sequence was composed of 3797 nucleotides, including four exons and three introns. Higher identities of amino acid sequences and conserved exon-intron organizations were found between LcGnRH3 and other GnRH3 genes. In addition, some special features of the sequences were detected in partial species. For example, two specific residues (V and A) were found in the family Sciaenidae, and the unique 75-72 bp type of the open reading frame 2 and 3 existed in the family Cyprinidae. Analysis of the 2576 bp promoter fragment of LcGnRH3 showed a number of transcription factor binding sites, such as AP1, CREB, GATA-1, HSF, FOXA2, and FOXL1. Promoter functional analysis using an EGFP reporter fusion in zebrafish larvae presented positive signals in the brain, including the olfactory region, the terminal nerve ganglion, the telencephalon, and the hypothalamus. The expression pattern was generally consistent with the endogenous GnRH3 GFP-expressing transgenic zebrafish lines, but the details were different. These results indicate that the structure and function of LcGnRH3 are generally similar to the other teleost GnRH3 genes, but there exist some distinctions among them. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. [Construction and functional identification of eukaryotic expression vector carrying Sprague-Dawley rat MSX-2 gene].

    Science.gov (United States)

    Yang, Xian-Xian; Zhang, Mei; Yan, Zhao-Wen; Zhang, Ru-Hong; Mu, Xiong-Zheng

    2008-01-01

    To construct a high effective eukaryotic expressing plasmid PcDNA 3.1-MSX-2 encoding Sprague-Dawley rat MSX-2 gene for the further study of MSX-2 gene function. The full length SD rat MSX-2 gene was amplified by PCR, and the full length DNA was inserted in the PMD1 8-T vector. It was isolated by restriction enzyme digest with BamHI and Xhol, then ligated into the cloning site of the PcDNA3.1 expression plasmid. The positive recombinant was identified by PCR analysis, restriction endonudease analysis and sequence analysis. Expression of RNA and protein was detected by RT-PCR and Western blot analysis in PcDNA3.1-MSX-2 transfected HEK293 cells. Sequence analysis and restriction endonudease analysis of PcDNA3.1-MSX-2 demonstrated that the position and size of MSX-2 cDNA insertion were consistent with the design. RT-PCR and Western blot analysis showed specific expression of mRNA and protein of MSX-2 in the transfected HEK293 cells. The high effective eukaryotic expression plasmid PcDNA3.1-MSX-2 encoding Sprague-Dawley Rat MSX-2 gene which is related to craniofacial development can be successfully reconstructed. It may serve as the basis for the further study of MSX-2 gene function.

  10. Identification and expression profiling analysis of TCP family genes involved in growth and development in maize.

    Science.gov (United States)

    Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu

    2017-10-01

    The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring bHLH structure, which is implicated in DNA binding and protein-protein interactions and known as the TCP domain. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants, however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize ( Z . mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. What's more, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes represses the growth of axillary organs and enables the formation of female inflorescences. Altogether, this study presents a thorough overview of TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in development stage in plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.

  11. Integration of multiple networks and pathways identifies cancer driver genes in pan-cancer analysis.

    Science.gov (United States)

    Cava, Claudia; Bertoli, Gloria; Colaprico, Antonio; Olsen, Catharina; Bontempi, Gianluca; Castiglioni, Isabella

    2018-01-06

    Modern high-throughput genomic technologies represent a comprehensive hallmark of molecular changes in pan-cancer studies. Although different cancer gene signatures have been revealed, the mechanism of tumourigenesis has yet to be completely understood. Pathways and networks are important tools to explain the role of genes in functional genomic studies. However, few methods consider the functional non-equal roles of genes in pathways and the complex gene-gene interactions in a network. We present a novel method in pan-cancer analysis that identifies de-regulated genes with a functional role by integrating pathway and network data. A pan-cancer analysis of 7158 tumour/normal samples from 16 cancer types identified 895 genes with a central role in pathways and de-regulated in cancer. Comparing our approach with 15 current tools that identify cancer driver genes, we found that 35.6% of the 895 genes identified by our method have been found as cancer driver genes with at least 2/15 tools. Finally, we applied a machine learning algorithm on 16 independent GEO cancer datasets to validate the diagnostic role of cancer driver genes for each cancer. We obtained a list of the top-ten cancer driver genes for each cancer considered in this study. Our analysis 1) confirmed that there are several known cancer driver genes in common among different types of cancer, 2) highlighted that cancer driver genes are able to regulate crucial pathways.

  12. Gene set analysis using variance component tests.

    Science.gov (United States)

    Huang, Yen-Tsung; Lin, Xihong

    2013-06-28

    Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.

  13. Functional regression method for whole genome eQTL epistasis analysis with sequencing data.

    Science.gov (United States)

    Xu, Kelin; Jin, Li; Xiong, Momiao

    2017-05-18

    Epistasis plays an essential rule in understanding the regulation mechanisms and is an essential component of the genetic architecture of the gene expressions. However, interaction analysis of gene expressions remains fundamentally unexplored due to great computational challenges and data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position level read count curves. A single number for measuring gene expression which is widely used for microarray measured gene expression analysis is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using the RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pair-wises SNPs, the FRGM takes a gene as a basic unit for epistasis analysis, which tests for the interaction of all possible pairs of genes and use all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genome Project. The numbers of pairs of significantly interacting genes after Bonferroni correction

  14. Serial Expression Analysis: a web tool for the analysis of serial gene expression data

    Science.gov (United States)

    Nueda, Maria José; Carbonell, José; Medina, Ignacio; Dopazo, Joaquín; Conesa, Ana

    2010-01-01

    Serial transcriptomics experiments investigate the dynamics of gene expression changes associated with a quantitative variable such as time or dosage. The statistical analysis of these data implies the study of global and gene-specific expression trends, the identification of significant serial changes, the comparison of expression profiles and the assessment of transcriptional changes in terms of cellular processes. We have created the SEA (Serial Expression Analysis) suite to provide a complete web-based resource for the analysis of serial transcriptomics data. SEA offers five different algorithms based on univariate, multivariate and functional profiling strategies framed within a user-friendly interface and a project-oriented architecture to facilitate the analysis of serial gene expression data sets from different perspectives. SEA is available at sea.bioinfo.cipf.es. PMID:20525784

  15. The normal function of a speciation gene, Odysseus, and its hybrid sterility effect.

    Science.gov (United States)

    Sun, Sha; Ting, Chau-Ti; Wu, Chung-I

    2004-07-02

    To understand how postmating isolation is connected to the normal process of species divergence and why hybrid male sterility is often the first sign of speciation, we analyzed the Odysseus (OdsH) gene of hybrid male sterility in Drosophila. We carried out expression analysis, transgenic study, and gene knockout. The combined evidence suggests that the sterility phenotype represents a novel manifestation of the gene function rather than the reduction or loss of the normal one. The gene knockout experiment identified the normal function of OdsH as a modest enhancement of sperm production in young males. The implication of a weak effect of OdsH on the normal phenotype but a strong influence on hybrid male sterility is discussed in light of Haldane's rule of postmating isolation.

  16. Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

    Science.gov (United States)

    Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

    2017-08-01

    This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.

  17. Comparative structural and functional analysis of genes encoding pectin methylesterases in Phytophthora spp.

    Science.gov (United States)

    Mingora, Christina; Ewer, Jason; Ospina-Giraldo, Manuel

    2014-03-15

    We have scanned the Phytophthora infestans, P. ramorum, and P. sojae genomes for the presence of putative pectin methylesterase genes and conducted a sequence analysis of all gene models found. We also searched for potential regulatory motifs in the promoter region of the proposed P. infestans models, and investigated the gene expression levels throughout the course of P. infestans infection on potato plants, using in planta and detached leaf assays. We found that genes located on contiguous chromosomal regions contain similar motifs in the promoter region, indicating the possibility of a shared regulatory mechanism. Results of our investigations also suggest that, during the pathogenicity process, the expression levels of some of the analyzed genes vary considerably when compared to basal expression observed in in vitro cultures of non-sporulating mycelium. These results were observed both in planta and in detached leaf assays. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. Genome-Wide Identification and Analysis of Genes Encoding PHD-Finger Protein in Tomato

    International Nuclear Information System (INIS)

    Hayat, S.; Cheng, Z.; Chen, X.

    2016-01-01

    The PHD-finger proteins are conserved in eukaryotic organisms and are involved in a variety of important functions in different biological processes in plants. However, the function of PHD fingers are poorly known in tomato (Solanum lycopersicum L.). In current study, we identified 45 putative genes coding Phd finger protein in tomato distributed on 11 chromosomes except for chromosome 8. Some of the genes encode other conserved key domains besides Phd-finger. Phylogenetic analysis of these 45 proteins resulted in seven clusters. Most Phd finger proteins were predicted to PML body location. These PHD-finger genes displayed differential expression either in various organs, at different development stages and under stresses in tomato. Our study provides the first systematic analysis of PHD-finger genes and proteins in tomato. This preliminary study provides a very useful reference information for Phd-finger proteins in tomato. They will be helpful for cloning and functional study of tomato PHD-finger genes. (author)

  19. Application of biclustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

    Directory of Open Access Journals (Sweden)

    Andrew Williams

    2015-12-01

    Full Text Available Background: The presence of diverse types of nanomaterials (NMs in commerce is growing at an exponential pace. As a result, human exposure to these materials in the environment is inevitable, necessitating the need for rapid and reliable toxicity testing methods to accurately assess the potential hazards associated with NMs. In this study, we applied biclustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related biclusters of genes showing similar expression profiles were identified. The identified biclusters were then used to conduct a gene set enrichment analysis on pulmonary gene expression profiles derived from mice exposed to nano-titanium dioxide (nano-TiO2, carbon black (CB or carbon nanotubes (CNTs to determine the disease significance of these data-driven gene sets.Results: Biclusters representing inflammation (chemokine activity, DNA binding, cell cycle, apoptosis, reactive oxygen species (ROS and fibrosis processes were identified. All of the NM studies were significant with respect to the bicluster related to chemokine activity (DAVID; FDR p-value = 0.032. The bicluster related to pulmonary fibrosis was enriched in studies where toxicity induced by CNT and CB studies was investigated, suggesting the potential for these materials to induce lung fibrosis. The pro-fibrogenic potential of CNTs is well established. Although CB has not been shown to induce fibrosis, it induces stronger inflammatory, oxidative stress and DNA damage responses than nano-TiO2 particles.Conclusion: The results of the analysis correctly identified all NMs to be inflammogenic and only CB and CNTs as potentially fibrogenic. In addition to identifying several

  20. [Fanconi anemia: genes and function(s) revisited].

    Science.gov (United States)

    Papadopoulo, Dora; Moustacchi, Ethel

    2005-01-01

    Fanconi anemia (FA), a rare inherited disorder, exhibits a complex phenotype including progressive bone marrow failure, congenital malformations and increased risk of cancers, mainly acute myeloid leukaemia. At the cellular level, FA is characterized by hypersensitivity to DNA cross-linking agents and by high frequencies of induced chromosomal aberrations, a property used for diagnosis. FA results from mutations in one of the eleven FANC (FANCA to FANCJ) genes. Nine of them have been identified. In addition, FANCD1 gene has been shown to be identical to BRCA2, one of the two breast cancer susceptibility genes. Seven of the FANC proteins form a complex, which exists in four different forms depending of its subcellular localisation. Four FANC proteins (D1(BRCA2), D2, I and J) are not associated to the complex. The presence of the nuclear form of the FA core complex is necessary for the mono-ubiquitinylation of FANCD2 protein, a modification required for its re-localization to nuclear foci, likely to be sites of DNA repair. A clue towards understanding the molecular function of the FANC genes comes from the recently identified connection of FANC to the BRCA1, ATM, NBS1 and ATR genes. Two of the FANC proteins (A and D2) directly interact with BRCA1, which in turn interacts with the MRE11/RAD50/NBS1 complex, which is one of the key components in the mechanisms involved in the cellular response to DNA double strand breaks (DSB). Moreover, ATM, a protein kinase that plays a central role in the network of DSB signalling, phosphorylates in vitro and in vivo FANCD2 in response to ionising radiations. Moreover, the NBS1 protein and the monoubiquitinated form of FANCD2 seem to act together in response to DNA crosslinking agents. Taken together with the previously reported impaired DSB and DNA interstrand crosslinks repair in FA cells, the connection of FANC genes to the ATM, ATR, NBS1 and BRCA1 links the FANC genes function to the finely orchestrated network involved in the

  1. [Differential gene expression profile in ischemic myocardium of Wistar rats with acute myocardial infarction: the study on gene construction, identification and function].

    Science.gov (United States)

    Guo, Chun Yu; Yin, Hui Jun; Jiang, Yue Rong; Xue, Mei; Zhang, Lu; Shi, Da Zhuo

    2008-06-18

    To construct the differential genes expressed profile in the ischemic myocardium tissue reduced from acute myocardial infarction(AMI), and determine the biological functions of target genes. AMI model was generated by ligation of the left anterior descending coronary artery in Wistar rats. Total RNA was extracted from the normal and the ischemic heart tissues under the ligation point 7 days after the operation. Differential gene expression profiles of the two samples were constructed using Long Serial Analysis of Gene Expression(LongSAGE). Real time fluorescence quantitative PCR was used to verify gene expression profile and to identify the expression of 2 functional genes. The activities of enzymes from functional genes were determined by histochemistry. A total of 15,966 tags were screened from the normal and the ischemic LongSAGE maps. The similarities of the sequences were compared using the BLAST algebra in NCBI and 7,665 novel tags were found. In the ischemic tissue 142 genes were significantly changed compared with those in the normal tissue (Ppathways of oxidation and phosphorylation, ATP synthesis and glycolysis. The partial genes identified by LongSAGE were confirmed using real time fluorescence quantitative PCR. Two genes related to energy metabolism, COX5a and ATP5e, were screened and quantified. Expression of two functional genes down-regulated at their mRNA levels and the activities of correlative functional enzymes decreased compared with those in the normal tissue. AMI causes a series of changes in gene expression, in which the abnormal expression of genes related to energy metabolism could be one of the molecular mechanisms of AMI. The intervention of the expressions of COX5a and ATP5e may be a new target for AMI therapy.

  2. Common variants in Mendelian kidney disease genes and their association with renal function.

    Science.gov (United States)

    Parsa, Afshin; Fuchsberger, Christian; Köttgen, Anna; O'Seaghdha, Conall M; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M; Borecki, Ingrid; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Bochud, Murielle; Heid, Iris M; Siscovick, David S; Fox, Caroline S; Kao, W Linda; Böger, Carsten A

    2013-12-01

    Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research.

  3. Available nitrogen is the key factor influencing soil microbial functional gene diversity in tropical rainforest.

    Science.gov (United States)

    Cong, Jing; Liu, Xueduan; Lu, Hui; Xu, Han; Li, Yide; Deng, Ye; Li, Diqiang; Zhang, Yuguang

    2015-08-20

    Tropical rainforests cover over 50% of all known plant and animal species and provide a variety of key resources and ecosystem services to humans, largely mediated by metabolic activities of soil microbial communities. A deep analysis of soil microbial communities and their roles in ecological processes would improve our understanding on biogeochemical elemental cycles. However, soil microbial functional gene diversity in tropical rainforests and causative factors remain unclear. GeoChip, contained almost all of the key functional genes related to biogeochemical cycles, could be used as a specific and sensitive tool for studying microbial gene diversity and metabolic potential. In this study, soil microbial functional gene diversity in tropical rainforest was analyzed by using GeoChip technology. Gene categories detected in the tropical rainforest soils were related to different biogeochemical processes, such as carbon (C), nitrogen (N) and phosphorus (P) cycling. The relative abundance of genes related to C and P cycling detected mostly derived from the cultured bacteria. C degradation gene categories for substrates ranging from labile C to recalcitrant C were all detected, and gene abundances involved in many recalcitrant C degradation gene categories were significantly (P rainforest. Soil available N could be the key factor in shaping the soil microbial functional gene structure and metabolic potential.

  4. Analysis of mammalian gene function through broad-based phenotypic screens across a consortium of mouse clinics

    DEFF Research Database (Denmark)

    de Angelis, Martin Hrabě; Nicholson, George; Selloum, Mohammed

    2015-01-01

    The function of the majority of genes in the mouse and human genomes remains unknown. The mouse embryonic stem cell knockout resource provides a basis for the characterization of relationships between genes and phenotypes. The EUMODIC consortium developed and validated robust methodologies...

  5. Genome-Wide Identification and Structural Analysis of bZIP Transcription Factor Genes in Brassica napus.

    Science.gov (United States)

    Zhou, Yan; Xu, Daixiang; Jia, Ledong; Huang, Xiaohu; Ma, Guoqiang; Wang, Shuxian; Zhu, Meichen; Zhang, Aoxiang; Guan, Mingwei; Lu, Kun; Xu, Xinfu; Wang, Rui; Li, Jiana; Qu, Cunmin

    2017-10-24

    The basic region/leucine zipper motif (bZIP) transcription factor family is one of the largest families of transcriptional regulators in plants. bZIP genes have been systematically characterized in some plants, but not in rapeseed ( Brassica napus ). In this study, we identified 247 BnbZIP genes in the rapeseed genome, which we classified into 10 subfamilies based on phylogenetic analysis of their deduced protein sequences. The BnbZIP genes were grouped into functional clades with Arabidopsis genes with similar putative functions, indicating functional conservation. Genome mapping analysis revealed that the BnbZIPs are distributed unevenly across all 19 chromosomes, and that some of these genes arose through whole-genome duplication and dispersed duplication events. All expression profiles of 247 bZIP genes were extracted from RNA-sequencing data obtained from 17 different B . napus ZS11 tissues with 42 various developmental stages. These genes exhibited different expression patterns in various tissues, revealing that these genes are differentially regulated. Our results provide a valuable foundation for functional dissection of the different BnbZIP homologs in B . napus and its parental lines and for molecular breeding studies of bZIP genes in B . napus .

  6. Functional characterization of endogenous siRNA target genes in Caenorhabditis elegans

    Directory of Open Access Journals (Sweden)

    Heikkinen Liisa

    2008-06-01

    Full Text Available Abstract Background Small interfering RNA (siRNA molecules mediate sequence specific silencing in RNA interference (RNAi, a gene regulatory phenomenon observed in almost all organisms. Large scale sequencing of small RNA libraries obtained from C. elegans has revealed that a broad spectrum of siRNAs is endogenously transcribed from genomic sequences. The biological role and molecular diversity of C. elegans endogenous siRNA (endo-siRNA molecules, nonetheless, remain poorly understood. In order to gain insight into their biological function, we annotated two large libraries of endo-siRNA sequences, identified their cognate targets, and performed gene ontology analysis to identify enriched functional categories. Results Systematic trends in categorization of target genes according to the specific length of siRNA sequences were observed: 18- to 22-mer siRNAs were associated with genes required for embryonic development; 23-mers were associated uniquely with post-embryonic development; 24–26-mers were associated with phosphorus metabolism or protein modification. Moreover, we observe that some argonaute related genes associate with siRNAs with multiple reads. Sequence frequency graphs suggest that different lengths of siRNAs share similarities in overall sequence structure: the 5' end begins with G, while the body predominates with U and C. Conclusion These results suggest that the lengths of endogenous siRNA molecules are consequential to their biological functions since the gene ontology categories for their cognate mRNA targets vary depending upon their lengths.

  7. Functional analysis of TamA, a coactivator of nitrogen-regulated gene expression in Aspergillus nidulans.

    Science.gov (United States)

    Small, A J; Todd, R B; Zanker, M C; Delimitrou, S; Hynes, M J; Davis, M A

    2001-06-01

    The tam A gene of Aspergillus nidulans encodes a 739-amino acid protein with similarity to Uga35p/Dal81p/DurLp of Saccharomyces cerevisiae. It has been proposed that TamA functions as a co-activator of AreA, the major nitrogen regulatory protein in A. nidulans. Because AreA functions as a transcriptional activator under nitrogen-limiting conditions, we investigated whether TamA was also present in the nucleus. We found that a GFP-TamA fusion protein was predominantly localised to the nucleus in the presence and absence of ammonium, and that AreA was not required for this distribution. As the predicted DNA-binding domain of TamA is not essential for function, we have used a number of approaches to further define functionally important regions. We have cloned the tamA gene of A. oryzae and compared its functional and sequence characteristics with those of A. nidulans tamA and S. cerevisiae UGA35/DAL81/DURL. The Aspergillus homologues are highly conserved and functionally interchangeable, whereas the S. cerevisiae gene does not complement a tamA mutant when expressed in A. nidulans. Uga35p/Dal81p/DurLp was also found to be unable to recruit AreA. The sequence changes in a number of tamA mutant alleles were determined, and altered versions of TamA were tested for tamA complementation and interaction with AreA. Changes in most regions of TamA appeared to destroy its function, suggesting that the overall conformation of the protein may be critical for its activity.

  8. Characterization of the global profile of genes expressed in cervical epithelium by Serial Analysis of Gene Expression (SAGE

    Directory of Open Access Journals (Sweden)

    Piña-Sanchez Patricia

    2005-09-01

    Full Text Available Abstract Background Serial Analysis of Gene Expression (SAGE is a new technique that allows a detailed and profound quantitative and qualitative knowledge of gene expression profile, without previous knowledge of sequence of analyzed genes. We carried out a modification of SAGE methodology (microSAGE, useful for the analysis of limited quantities of tissue samples, on normal human cervical tissue obtained from a donor without histopathological lesions. Cervical epithelium is constituted mainly by cervical keratinocytes which are the targets of human papilloma virus (HPV, where persistent HPV infection of cervical epithelium is associated with an increase risk for developing cervical carcinomas (CC. Results We report here a transcriptome analysis of cervical tissue by SAGE, derived from 30,418 sequenced tags that provide a wealth of information about the gene products involved in normal cervical epithelium physiology, as well as genes not previously found in uterine cervix tissue involved in the process of epidermal differentiation. Conclusion This first comprehensive and profound analysis of uterine cervix transcriptome, should be useful for the identification of genes involved in normal cervix uterine function, and candidate genes associated with cervical carcinoma.

  9. Functional and gene expression analysis of hTERT overexpressed endothelial cells

    Directory of Open Access Journals (Sweden)

    Haruna Takano

    2008-09-01

    Full Text Available Haruna Takano1, Satoshi Murasawa1,2, Takayuki Asahara1,2,31Institute of Biomedical Research and Innovation, Kobe, Japan; 2RIKEN Center for Developmental Biology, Kobe 650-0047, Japan; 3Tokai University of School of Medicine, Tokai, JapanAbstract: Telomerase dysfunction contributes to cellular senescence. Recent advances indicate the importance of senescence in maintaining vascular cell function in vitro. Human telomerase reverse transcriptase (hTERT overexpression is thought to lead to resistance to apoptosis and oxidative stress. However, the mechanism in endothelial lineage cells is unclear. We tried to generate an immortal endothelial cell line from human umbilical vein endothelial cells using a no-virus system and examine the functional mechanisms of hTERT overexpressed endothelial cell senescence in vitro. High levels of hTERT genes and endothelial cell-specific markers were expressed during long-term culture. Also, angiogenic responses were observed in hTERT overexpressed endothelial cell. These cells showed a delay in senescence and appeared more resistant to stressed conditions. PI3K/Akt-related gene levels were enhanced in hTERT overexpressed endothelial cells. An up-regulated PI3K/Akt pathway caused by hTERT overexpression might contribute to anti-apoptosis and survival effects in endothelial lineage cells.Keywords: endothelial, telomerase, senescence, oxidative stress, anti-apoptosis, PI3K/Akt pathway

  10. Global gene expression analysis of the zoonotic parasite Trichinella spiralis revealed novel genes in host parasite interaction.

    Directory of Open Access Journals (Sweden)

    Xiaolei Liu

    Full Text Available BACKGROUND: Trichinellosis is a typical food-borne zoonotic disease which is epidemic worldwide and the nematode Trichinella spiralis is the main pathogen. The life cycle of T. spiralis contains three developmental stages, i.e. adult worms, new borne larva (new borne L1 larva and muscular larva (infective L1 larva. Stage-specific gene expression in the parasites has been investigated with various immunological and cDNA cloning approaches, whereas the genome-wide transcriptome and expression features of the parasite have been largely unknown. The availability of the genome sequence information of T. spiralis has made it possible to deeply dissect parasite biology in association with global gene expression and pathogenesis. METHODOLOGY AND PRINCIPAL FINDINGS: In this study, we analyzed the global gene expression patterns in the three developmental stages of T. spiralis using digital gene expression (DGE analysis. Almost 15 million sequence tags were generated with the Illumina RNA-seq technology, producing expression data for more than 9,000 genes, covering 65% of the genome. The transcriptome analysis revealed thousands of differentially expressed genes within the genome, and importantly, a panel of genes encoding functional proteins associated with parasite invasion and immuno-modulation were identified. More than 45% of the genes were found to be transcribed from both strands, indicating the importance of RNA-mediated gene regulation in the development of the parasite. Further, based on gene ontological analysis, over 3000 genes were functionally categorized and biological pathways in the three life cycle stage were elucidated. CONCLUSIONS AND SIGNIFICANCE: The global transcriptome of T. spiralis in three developmental stages has been profiled, and most gene activity in the genome was found to be developmentally regulated. Many metabolic and biological pathways have been revealed. The findings of the differential expression of several protein

  11. Functional Analysis of Promoter Region from Eel Cytochrome P450 1A1 Gene in Transgenic Medaka.

    Science.gov (United States)

    Ogino; Itakura; Kato; Aoki; Sato

    1999-07-01

    : Transcription of the CYP1A1 genes in mammals and fish is stimulated by polyaromatic hydrocarbons. DNA sequencing analysis revealed that CYP1A1 gene in eel (Anguilla japonica) contains two kinds of putative cis-acting regulatory elements, XRE (xenobiotic-responsive element) and ERE (estrogen-responsive element). XRE is known as the enhancer that is responsible for the inducibility of the genes of CYP1A1 and some other drug-metabolizing enzymes. In the eel CYP1A1 gene, XRE motifs are distributed as follows: five times in the region from -2136 to -1125 bp, XRE(-6) to (-2); once in the proximal basal promoter region, XRE(-1); and once in the first intron, XRE(+1). The region between XRE(-2) and XRE(-1) contains three ERE motifs. To investigate the function of the cis-acting regulatory elements in the eel CYP1A1 gene, recombinant plasmids prepared with its 5' upstream sequence and the structural gene for luciferase were microinjected into fertilized eggs of medaka at the one-cell stage. Hatched fry were treated with 3-methylcholanthrene, and the transcription efficiency was assayed using competitive polymerase chain reaction analysis. Deletion of the region containing the five XREs, XRE(-6) to XRE(-2), and the point mutation of XRE(-1) reduced the inducible expressions by 75% and 56%, respectively, showing apparent dependency of the drug induction on the XREs. Constitutive expression, however, was not significantly affected by deletion or disruption of the XREs. When the region between XRE(-2) and XRE(-1) containing no XREs but three ERE motifs was internally deleted, the inducible expression and the constitutive expression were reduced by 88% and 75%, respectively. Replacement of this region with a partial fragment of eel CYP1A1 complementary DNA, with slight alteration of the distance between the five XREs and XRE(-1), reduced the inducible expression and the constitutive expression by 91% and 60%, respectively. These results strongly suggest that not only XRE but

  12. Functional genes reveal the intrinsic PAH biodegradation potential in creosote-contaminated groundwater following in situ biostimulation.

    Science.gov (United States)

    Nyyssönen, Mari; Kapanen, Anu; Piskonen, Reetta; Lukkari, Tuomas; Itävaara, Merja

    2009-08-01

    A small-scale functional gene array containing 15 functional gene probes targeting aliphatic and aromatic hydrocarbon biodegradation pathways was used to investigate the effect of a pilot-scale air sparging and nutrient infiltration treatment on hydrocarbon biodegradation in creosote-contaminated groundwater. Genes involved in the different phases of polycyclic aromatic hydrocarbon (PAH) biodegradation were detected with the functional gene array in the contaminant plume, thus indicating the presence of intrinsic biodegradation potential. However, the low aerobic fluorescein diacetate hydrolysis, the polymerase chain reaction (PCR) amplification of 16S rRNA genes closely similar to sulphate-reducing and denitrifying bacteria and the negligible decrease in contaminant concentrations showed that aerobic PAH biodegradation was limited in the anoxic groundwater. Increased abundance of PAH biodegradation genes was detected by functional gene array in the monitoring well located at the rear end of the biostimulated area, which indicated that air sparging and nutrient infiltration enhanced the intrinsic, aerobic PAH biodegradation. Furthermore, ten times higher naphthalene dioxygenase gene copy numbers were detected by real-time PCR in the biostimulated area, which was in good agreement with the functional gene array data. As a result, functional gene array analysis was demonstrated to provide a potential tool for evaluating the efficiency of the bioremediation treatment for enhancing hydrocarbon biodegradation in field-scale applications.

  13. Genome-Wide Distribution, Organisation and Functional Characterization of Disease Resistance and Defence Response Genes across Rice Species

    Science.gov (United States)

    Singh, Sangeeta; Chand, Suresh; Singh, N. K.; Sharma, Tilak Raj

    2015-01-01

    The resistance (R) genes and defense response (DR) genes have become very important resources for the development of disease resistant cultivars. In the present investigation, genome-wide identification, expression, phylogenetic and synteny analysis was done for R and DR-genes across three species of rice viz: Oryza sativa ssp indica cv 93-11, Oryza sativa ssp japonica and wild rice species, Oryza brachyantha. We used the in silico approach to identify and map 786 R -genes and 167 DR-genes, 672 R-genes and 142 DR-genes, 251 R-genes and 86 DR-genes in the japonica, indica and O. brachyanth a genomes, respectively. Our analysis showed that 60.5% and 55.6% of the R-genes are tandemly repeated within clusters and distributed over all the rice chromosomes in indica and japonica genomes, respectively. The phylogenetic analysis along with motif distribution shows high degree of conservation of R- and DR-genes in clusters. In silico expression analysis of R-genes and DR-genes showed more than 85% were expressed genes showing corresponding EST matches in the databases. This study gave special emphasis on mechanisms of gene evolution and duplication for R and DR genes across species. Analysis of paralogs across rice species indicated 17% and 4.38% R-genes, 29% and 11.63% DR-genes duplication in indica and Oryza brachyantha, as compared to 20% and 26% duplication of R-genes and DR-genes in japonica respectively. We found that during the course of duplication only 9.5% of R- and DR-genes changed their function and rest of the genes have maintained their identity. Syntenic relationship across three genomes inferred that more orthology is shared between indica and japonica genomes as compared to brachyantha genome. Genome wide identification of R-genes and DR-genes in the rice genome will help in allele mining and functional validation of these genes, and to understand molecular mechanism of disease resistance and their evolution in rice and related species. PMID:25902056

  14. Phylogenomic and functional domain analysis of polyketide synthases in Fusarium

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Daren W.; Butchko, Robert A.; Baker, Scott E.; Proctor, Robert H.

    2012-02-01

    Fusarium species are ubiquitous in nature, cause a range of plant diseases, and produce a variety of chemicals often referred to as secondary metabolites. Although some fungal secondary metabolites affect plant growth or protect plants from other fungi and bacteria, their presence in grain based food and feed is more often associated with a variety of diseases in plants and in animals. Many of these structurally diverse metabolites are derived from a family of related enzymes called polyketide synthases (PKSs). A search of genomic sequence of Fusarium verticillioides, F. graminearum, F. oxysporum and Nectria haematococca (anamorph F. solani) identified a total of 58 PKS genes. To gain insight into how this gene family evolved and to guide future studies, we conducted a phylogenomic and functional domain analysis. The resulting genealogy suggested that Fusarium PKSs represent 34 different groups responsible for synthesis of different core metabolites. The analyses indicate that variation in the Fusarium PKS gene family is due to gene duplication and loss events as well as enzyme gain-of-function due to the acquisition of new domains or of loss-of-function due to nucleotide mutations. Transcriptional analysis indicate that the 16 F. verticillioides PKS genes are expressed under a range of conditions, further evidence that they are functional genes that confer the ability to produce secondary metabolites.

  15. Multiscale Embedded Gene Co-expression Network Analysis.

    Directory of Open Access Journals (Sweden)

    Won-Min Song

    2015-11-01

    Full Text Available Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3, the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA by: i introducing quality control of co-expression similarities, ii parallelizing embedded network construction, and iii developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs. We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA. MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  16. Multiscale Embedded Gene Co-expression Network Analysis.

    Science.gov (United States)

    Song, Won-Min; Zhang, Bin

    2015-11-01

    Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.

  17. Non-functional genes repaired at the RNA level.

    Science.gov (United States)

    Burger, Gertraud

    2016-01-01

    Genomes and genes continuously evolve. Gene sequences undergo substitutions, deletions or nucleotide insertions; mobile genetic elements invade genomes and interleave in genes; chromosomes break, even within genes, and pieces reseal in reshuffled order. To maintain functional gene products and assure an organism's survival, two principal strategies are used - either repair of the gene itself or of its product. I will introduce common types of gene aberrations and how gene function is restored secondarily, and then focus on systematically fragmented genes found in a poorly studied protist group, the diplonemids. Expression of their broken genes involves restitching of pieces at the RNA-level, and substantial RNA editing, to compensate for point mutations. I will conclude with thoughts on how such a grotesquely unorthodox system may have evolved, and why this group of organisms persists and thrives since tens of millions of years. Copyright © 2016 Académie des sciences. Published by Elsevier SAS. All rights reserved.

  18. Heterologous gene expression and functional analysis of a type III polyketide synthase from Aspergillus niger NRRL 328

    Energy Technology Data Exchange (ETDEWEB)

    Kirimura, Kohtaro, E-mail: kkohtaro@waseda.jp; Watanabe, Shotaro; Kobayashi, Keiichi

    2016-05-13

    Type III polyketide synthases (PKSs) catalyze the formation of pyrone- and resorcinol-types aromatic polyketides. The genomic analysis of the filamentous fungus Aspergillus niger NRRL 328 revealed that this strain has a putative gene (chr-8-2: 2978617–2979847) encoding a type III PKS, although its functions are unknown. In this study, for functional analysis of this putative type III PKS designated as An-CsyA, cloning and heterologous expression of the An-CsyA gene (An-csyA) in Escherichia coli were performed. Recombinant His-tagged An-CsyA was successfully expressed in E. coli BL21 (DE3), purified by Ni{sup 2+}-affinity chromatography, and used for in vitro assay. Tests on the substrate specificity of the His-tagged An-CsyA with myriad acyl-CoAs as starter substrates and malonyl-CoA as extender substrate showed that His-tagged An-CsyA accepted fatty acyl-CoAs (C2-C14) and produced triketide pyrones (C2-C14), tetraketide pyrones (C2-C10), and pentaketide resorcinols (C10-C14). Furthermore, acetoacetyl-CoA, malonyl-CoA, isobutyryl-CoA, and benzoyl-CoA were also accepted as starter substrates, and both of triketide pyrones and tetraketide pyrones were produced. It is noteworthy that the His-tagged An-CsyA produced polyketides from malonyl-CoA as starter and extender substrates and produced tetraketide pyrones from short-chain fatty acyl-CoAs as starter substrates. Therefore, this is the first report showing the functional properties of An-CsyA different from those of other fungal type III PKSs. -- Highlights: •Type III PKS from Aspergillus niger NRRL 328, An-CsyA, was cloned and characterized. •An-CsyA produced triketide pyrones, tetraketide pyrones and pentaketide resorcinols. •Functional properties of An-CsyA differs from those of other fungal type III PKSs.

  19. Heterologous gene expression and functional analysis of a type III polyketide synthase from Aspergillus niger NRRL 328

    International Nuclear Information System (INIS)

    Kirimura, Kohtaro; Watanabe, Shotaro; Kobayashi, Keiichi

    2016-01-01

    Type III polyketide synthases (PKSs) catalyze the formation of pyrone- and resorcinol-types aromatic polyketides. The genomic analysis of the filamentous fungus Aspergillus niger NRRL 328 revealed that this strain has a putative gene (chr-8-2: 2978617–2979847) encoding a type III PKS, although its functions are unknown. In this study, for functional analysis of this putative type III PKS designated as An-CsyA, cloning and heterologous expression of the An-CsyA gene (An-csyA) in Escherichia coli were performed. Recombinant His-tagged An-CsyA was successfully expressed in E. coli BL21 (DE3), purified by Ni"2"+-affinity chromatography, and used for in vitro assay. Tests on the substrate specificity of the His-tagged An-CsyA with myriad acyl-CoAs as starter substrates and malonyl-CoA as extender substrate showed that His-tagged An-CsyA accepted fatty acyl-CoAs (C2-C14) and produced triketide pyrones (C2-C14), tetraketide pyrones (C2-C10), and pentaketide resorcinols (C10-C14). Furthermore, acetoacetyl-CoA, malonyl-CoA, isobutyryl-CoA, and benzoyl-CoA were also accepted as starter substrates, and both of triketide pyrones and tetraketide pyrones were produced. It is noteworthy that the His-tagged An-CsyA produced polyketides from malonyl-CoA as starter and extender substrates and produced tetraketide pyrones from short-chain fatty acyl-CoAs as starter substrates. Therefore, this is the first report showing the functional properties of An-CsyA different from those of other fungal type III PKSs. -- Highlights: •Type III PKS from Aspergillus niger NRRL 328, An-CsyA, was cloned and characterized. •An-CsyA produced triketide pyrones, tetraketide pyrones and pentaketide resorcinols. •Functional properties of An-CsyA differs from those of other fungal type III PKSs.

  20. Functional alterations due to amino acid changes and evolutionary comparative analysis of ARPKD and ADPKD genes

    Directory of Open Access Journals (Sweden)

    Burhan M. Edrees

    2016-12-01

    Full Text Available A targeted customized sequencing of genes implicated in autosomal recessive polycystic kidney disease (ARPKD phenotype was performed to identify candidate variants using the Ion torrent PGM next-generation sequencing. The results identified four potential pathogenic variants in PKHD1 gene [c.4870C>T, p.(Arg1624Trp, c.5725C>T, p.(Arg1909Trp, c.1736C>T, p.(Thr579Met and c.10628T>G, p.(Leu3543Trp] among 12 out of 18 samples. However, one variant c.4870C>T, p.(Arg1624Trp was common among eight patients. Some patient samples also showed few variants in autosomal dominant polycystic kidney disease (ADPKD disease causing genes PKD1 and PKD2 such as c.12433G>A, p.(Val4145Ile and c.1445T>G, p.(Phe482Cys, respectively. All causative variants were validated by capillary sequencing and confirmed the presence of a novel homozygous variant c.10628T>G, p.(Leu3543Trp in a male proband. We have recently published the results of these studies (Edrees et al., 2016. Here we report for the first time the effect of the common mutation p.(Arg1624Trp found in eight samples on the protein structure and function due to the specific amino acid changes of PKHD1 protein using molecular dynamics simulations. The computational approaches provide tool predict the phenotypic effect of variant on the structure and function of the altered protein. The structural analysis with the common mutation p.(Arg1624Trp in the native and mutant modeled protein were also studied for solvent accessibility, secondary structure and stabilizing residues to find out the stability of the protein between wild type and mutant forms. Furthermore, comparative genomics and evolutionary analyses of variants observed in PKHD1, PKD1, and PKD2 genes were also performed in some mammalian species including human to understand the complexity of genomes among closely related mammalian species. Taken together, the results revealed that the evolutionary comparative analyses and characterization of PKHD1, PKD1

  1. Conditional Loss of Hoxa5 Function Early after Birth Impacts on Expression of Genes with Synaptic Function

    Science.gov (United States)

    Lizen, Benoit; Moens, Charlotte; Mouheiche, Jinane; Sacré, Thomas; Ahn, Marie-Thérèse; Jeannotte, Lucie; Salti, Ahmad; Gofflot, Françoise

    2017-01-01

    Hoxa5 is a member of the Hox gene family that plays critical roles in successive steps of the central nervous system formation during embryonic and fetal development. In the mouse, Hoxa5 was recently shown to be expressed in the medulla oblongata and the pons from fetal stages to adulthood. In these territories, Hoxa5 transcripts are enriched in many precerebellar neurons and several nuclei involved in autonomic functions, while the HOXA5 protein is detected mainly in glutamatergic and GABAergic neurons. However, whether HOXA5 is functionally required in these neurons after birth remains unknown. As a first approach to tackle this question, we aimed at determining the molecular programs downstream of the HOXA5 transcription factor in the context of the postnatal brainstem. A comparative transcriptomic analysis was performed in combination with gene expression localization, using a conditional postnatal Hoxa5 loss-of-function mouse model. After inactivation of Hoxa5 at postnatal days (P)1–P4, we established the transcriptome of the brainstem from P21 Hoxa5 conditional mutants using RNA-Seq analysis. One major finding was the downregulation of several genes associated with synaptic function in Hoxa5 mutant specimens including different actors involved in glutamatergic synapse, calcium signaling pathway, and GABAergic synapse. Data were confirmed and extended by reverse transcription quantitative polymerase chain reaction analysis, and the expression of several HOXA5 candidate targets was shown to co-localize with Hoxa5 transcripts in precerebellar nuclei. Together, these new results revealed that HOXA5, through the regulation of key actors of the glutamatergic/GABAergic synapses and calcium signaling, might be involved in synaptogenesis, synaptic transmission, and synaptic plasticity of the cortico-ponto-cerebellar circuitry in the postnatal brainstem. PMID:29187810

  2. Conditional Loss of Hoxa5 Function Early after Birth Impacts on Expression of Genes with Synaptic Function

    Directory of Open Access Journals (Sweden)

    Benoit Lizen

    2017-11-01

    Full Text Available Hoxa5 is a member of the Hox gene family that plays critical roles in successive steps of the central nervous system formation during embryonic and fetal development. In the mouse, Hoxa5 was recently shown to be expressed in the medulla oblongata and the pons from fetal stages to adulthood. In these territories, Hoxa5 transcripts are enriched in many precerebellar neurons and several nuclei involved in autonomic functions, while the HOXA5 protein is detected mainly in glutamatergic and GABAergic neurons. However, whether HOXA5 is functionally required in these neurons after birth remains unknown. As a first approach to tackle this question, we aimed at determining the molecular programs downstream of the HOXA5 transcription factor in the context of the postnatal brainstem. A comparative transcriptomic analysis was performed in combination with gene expression localization, using a conditional postnatal Hoxa5 loss-of-function mouse model. After inactivation of Hoxa5 at postnatal days (P1–P4, we established the transcriptome of the brainstem from P21 Hoxa5 conditional mutants using RNA-Seq analysis. One major finding was the downregulation of several genes associated with synaptic function in Hoxa5 mutant specimens including different actors involved in glutamatergic synapse, calcium signaling pathway, and GABAergic synapse. Data were confirmed and extended by reverse transcription quantitative polymerase chain reaction analysis, and the expression of several HOXA5 candidate targets was shown to co-localize with Hoxa5 transcripts in precerebellar nuclei. Together, these new results revealed that HOXA5, through the regulation of key actors of the glutamatergic/GABAergic synapses and calcium signaling, might be involved in synaptogenesis, synaptic transmission, and synaptic plasticity of the cortico-ponto-cerebellar circuitry in the postnatal brainstem.

  3. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  4. Comparison of lists of genes based on functional profiles

    Directory of Open Access Journals (Sweden)

    Salicrú Miquel

    2011-10-01

    Full Text Available Abstract Background How to compare studies on the basis of their biological significance is a problem of central importance in high-throughput genomics. Many methods for performing such comparisons are based on the information in databases of functional annotation, such as those that form the Gene Ontology (GO. Typically, they consist of analyzing gene annotation frequencies in some pre-specified GO classes, in a class-by-class way, followed by p-value adjustment for multiple testing. Enrichment analysis, where a list of genes is compared against a wider universe of genes, is the most common example. Results A new global testing procedure and a method incorporating it are presented. Instead of testing separately for each GO class, a single global test for all classes under consideration is performed. The test is based on the distance between the functional profiles, defined as the joint frequencies of annotation in a given set of GO classes. These classes may be chosen at one or more GO levels. The new global test is more powerful and accurate with respect to type I errors than the usual class-by-class approach. When applied to some real datasets, the results suggest that the method may also provide useful information that complements the tests performed using a class-by-class approach if gene counts are sparse in some classes. An R library, goProfiles, implements these methods and is available from Bioconductor, http://bioconductor.org/packages/release/bioc/html/goProfiles.html. Conclusions The method provides an inferential basis for deciding whether two lists are functionally different. For global comparisons it is preferable to the global chi-square test of homogeneity. Furthermore, it may provide additional information if used in conjunction with class-by-class methods.

  5. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.

    Directory of Open Access Journals (Sweden)

    Boris P Hejblum

    2015-06-01

    Full Text Available Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial, and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package.

  6. Phylogenetic and comparative gene expression analysis of barley (Hordeum vulgare)WRKY transcription factor family reveals putatively retained functions betweenmonocots and dicots

    Energy Technology Data Exchange (ETDEWEB)

    Mangelsen, Elke; Kilian, Joachim; Berendzen, Kenneth W.; Kolukisaoglu, Uner; Harter, Klaus; Jansson, Christer; Wanke, Dierk

    2008-02-01

    WRKY proteins belong to the WRKY-GCM1 superfamily of zinc finger transcription factors that have been subject to a large plant-specific diversification. For the cereal crop barley (Hordeum vulgare), three different WRKY proteins have been characterized so far, as regulators in sucrose signaling, in pathogen defense, and in response to cold and drought, respectively. However, their phylogenetic relationship remained unresolved. In this study, we used the available sequence information to identify a minimum number of 45 barley WRKY transcription factor (HvWRKY) genes. According to their structural features the HvWRKY factors were classified into the previously defined polyphyletic WRKY subgroups 1 to 3. Furthermore, we could assign putative orthologs of the HvWRKY proteins in Arabidopsis and rice. While in most cases clades of orthologous proteins were formed within each group or subgroup, other clades were composed of paralogous proteins for the grasses and Arabidopsis only, which is indicative of specific gene radiation events. To gain insight into their putative functions, we examined expression profiles of WRKY genes from publicly available microarray data resources and found group specific expression patterns. While putative orthologs of the HvWRKY transcription factors have been inferred from phylogenetic sequence analysis, we performed a comparative expression analysis of WRKY genes in Arabidopsis and barley. Indeed, highly correlative expression profiles were found between some of the putative orthologs. HvWRKY genes have not only undergone radiation in monocot or dicot species, but exhibit evolutionary traits specific to grasses. HvWRKY proteins exhibited not only sequence similarities between orthologs with Arabidopsis, but also relatedness in their expression patterns. This correlative expression is indicative for a putative conserved function of related WRKY proteins in mono- and dicot species.

  7. A genome-wide gene function prediction resource for Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Han Yan

    2010-08-01

    Full Text Available Predicting gene functions by integrating large-scale biological data remains a challenge for systems biology. Here we present a resource for Drosophila melanogaster gene function predictions. We trained function-specific classifiers to optimize the influence of different biological datasets for each functional category. Our model predicted GO terms and KEGG pathway memberships for Drosophila melanogaster genes with high accuracy, as affirmed by cross-validation, supporting literature evidence, and large-scale RNAi screens. The resulting resource of prioritized associations between Drosophila genes and their potential functions offers a guide for experimental investigations.

  8. Comparative genomic analysis of the PKS genes in five species and expression analysis in upland cotton

    Directory of Open Access Journals (Sweden)

    Xueqiang Su

    2017-10-01

    Full Text Available Plant type III polyketide synthase (PKS can catalyse the formation of a series of secondary metabolites with different structures and different biological functions; the enzyme plays an important role in plant growth, development and resistance to stress. At present, the PKS gene has been identified and studied in a variety of plants. Here, we identified 11 PKS genes from upland cotton (Gossypium hirsutum and compared them with 41 PKS genes in Populus tremula, Vitis vinifera, Malus domestica and Arabidopsis thaliana. According to the phylogenetic tree, a total of 52 PKS genes can be divided into four subfamilies (I–IV. The analysis of gene structures and conserved motifs revealed that most of the PKS genes were composed of two exons and one intron and there are two characteristic conserved domains (Chal_sti_synt_N and Chal_sti_synt_C of the PKS gene family. In our study of the five species, gene duplication was found in addition to Arabidopsis thaliana and we determined that purifying selection has been of great significance in maintaining the function of PKS gene family. From qRT-PCR analysis and a combination of the role of the accumulation of proanthocyanidins (PAs in brown cotton fibers, we concluded that five PKS genes are candidate genes involved in brown cotton fiber pigment synthesis. These results are important for the further study of brown cotton PKS genes. It not only reveals the relationship between PKS gene family and pigment in brown cotton, but also creates conditions for improving the quality of brown cotton fiber.

  9. Discovery of genes related to insecticide resistance in Bactrocera dorsalis by functional genomic analysis of a de novo assembled transcriptome.

    Science.gov (United States)

    Hsu, Ju-Chun; Chien, Ting-Ying; Hu, Chia-Cheng; Chen, Mei-Ju May; Wu, Wen-Jer; Feng, Hai-Tung; Haymer, David S; Chen, Chien-Yu

    2012-01-01

    Insecticide resistance has recently become a critical concern for control of many insect pest species. Genome sequencing and global quantization of gene expression through analysis of the transcriptome can provide useful information relevant to this challenging problem. The oriental fruit fly, Bactrocera dorsalis, is one of the world's most destructive agricultural pests, and recently it has been used as a target for studies of genetic mechanisms related to insecticide resistance. However, prior to this study, the molecular data available for this species was largely limited to genes identified through homology. To provide a broader pool of gene sequences of potential interest with regard to insecticide resistance, this study uses whole transcriptome analysis developed through de novo assembly of short reads generated by next-generation sequencing (NGS). The transcriptome of B. dorsalis was initially constructed using Illumina's Solexa sequencing technology. Qualified reads were assembled into contigs and potential splicing variants (isotigs). A total of 29,067 isotigs have putative homologues in the non-redundant (nr) protein database from NCBI, and 11,073 of these correspond to distinct D. melanogaster proteins in the RefSeq database. Approximately 5,546 isotigs contain coding sequences that are at least 80% complete and appear to represent B. dorsalis genes. We observed a strong correlation between the completeness of the assembled sequences and the expression intensity of the transcripts. The assembled sequences were also used to identify large numbers of genes potentially belonging to families related to insecticide resistance. A total of 90 P450-, 42 GST-and 37 COE-related genes, representing three major enzyme families involved in insecticide metabolism and resistance, were identified. In addition, 36 isotigs were discovered to contain target site sequences related to four classes of resistance genes. Identified sequence motifs were also analyzed to

  10. Microarray analysis of gene expression profiles in ripening pineapple fruits.

    Science.gov (United States)

    Koia, Jonni H; Moyle, Richard L; Botella, Jose R

    2012-12-18

    Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit

  11. Assembly of inflammation-related genes for pathway-focused genetic analysis.

    Directory of Open Access Journals (Sweden)

    Matthew J Loza

    2007-10-01

    Full Text Available Recent identifications of associations between novel variants in inflammation-related genes and several common diseases emphasize the need for systematic evaluations of these genes in disease susceptibility. Considering that many genes are involved in the complex inflammation responses and many genetic variants in these genes have the potential to alter the functions and expression of these genes, we assembled a list of key inflammation-related genes to facilitate the identification of genetic associations of diseases with an inflammation-related etiology. We first reviewed various phases of inflammation responses, including the development of immune cells, sensing of danger, influx of cells to sites of insult, activation and functional responses of immune and non-immune cells, and resolution of the immune response. Assisted by the Ingenuity Pathway Analysis, we then identified 17 functional sub-pathways that are involved in one or multiple phases. This organization would greatly increase the chance of detecting gene-gene interactions by hierarchical clustering of genes with their functional closeness in a pathway. Finally, as an example application, we have developed tagging single nucleotide polymorphism (tSNP arrays for populations of European and African descent to capture all the common variants of these key inflammation-related genes. Assays of these tSNPs have been designed and assembled into two Affymetrix ParAllele customized chips, one each for European (12,011 SNPs and African (21,542 SNPs populations. These tSNPs have greater coverage for these inflammation-related genes compared to the existing genome-wide arrays, particularly in the African population. These tSNP arrays can facilitate systematic evaluation of inflammation pathways in disease susceptibility. For additional applications, other genotyping platforms could also be employed. For existing genome-wide association data, this list of key inflammation-related genes and

  12. Construction of functional linkage gene networks by data integration.

    Science.gov (United States)

    Linghu, Bolan; Franzosa, Eric A; Xia, Yu

    2013-01-01

    Networks of functional associations between genes have recently been successfully used for gene function and disease-related research. A typical approach for constructing such functional linkage gene networks (FLNs) is based on the integration of diverse high-throughput functional genomics datasets. Data integration is a nontrivial task due to the heterogeneous nature of the different data sources and their variable accuracy and completeness. The presence of correlations between data sources also adds another layer of complexity to the integration process. In this chapter we discuss an approach for constructing a human FLN from data integration and a subsequent application of the FLN to novel disease gene discovery. Similar approaches can be applied to nonhuman species and other discovery tasks.

  13. Identification of Key Pathways and Genes in Advanced Coronary Atherosclerosis Using Bioinformatics Analysis

    Directory of Open Access Journals (Sweden)

    Xiaowen Tan

    2017-01-01

    Full Text Available Background. Coronary artery atherosclerosis is a chronic inflammatory disease. This study aimed to identify the key changes of gene expression between early and advanced carotid atherosclerotic plaque in human. Methods. Gene expression dataset GSE28829 was downloaded from Gene Expression Omnibus (GEO, including 16 advanced and 13 early stage atherosclerotic plaque samples from human carotid. Differentially expressed genes (DEGs were analyzed. Results. 42,450 genes were obtained from the dataset. Top 100 up- and downregulated DEGs were listed. Functional enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG identification were performed. The result of functional and pathway enrichment analysis indicted that the immune system process played a critical role in the progression of carotid atherosclerotic plaque. Protein-protein interaction (PPI networks were performed either. Top 10 hub genes were identified from PPI network and top 6 modules were inferred. These genes were mainly involved in chemokine signaling pathway, cell cycle, B cell receptor signaling pathway, focal adhesion, and regulation of actin cytoskeleton. Conclusion. The present study indicated that analysis of DEGs would make a deeper understanding of the molecular mechanisms of atherosclerosis development and they might be used as molecular targets and diagnostic biomarkers for the treatment of atherosclerosis.

  14. Functional Analysis of Genes Comprising the Locus of Heat Resistance in Escherichia coli.

    Science.gov (United States)

    Mercer, Ryan; Nguyen, Oanh; Ou, Qixing; McMullen, Lynn; Gänzle, Michael G

    2017-10-15

    control of pathogens by current food processing and preparation techniques. The function of LHR-comprising genes and their regulation, however, remain largely unknown. This study defines a core complement of LHR-encoded proteins that are necessary for heat resistance and demonstrates that regulation of the LHR in E. coli requires a chromosomal copy of the gene encoding EvgA. This study provides insight into the function of a transmissible genomic island that allows otherwise heat-sensitive enteric bacteria, including pathogens, to lead a thermoduric lifestyle and thus contributes to the detection and control of heat-resistant enteric bacteria in food. Copyright © 2017 American Society for Microbiology.

  15. Prediction of operon-like gene clusters in the Arabidopsis thaliana genome based on co-expression analysis of neighboring genes.

    Science.gov (United States)

    Wada, Masayoshi; Takahashi, Hiroki; Altaf-Ul-Amin, Md; Nakamura, Kensuke; Hirai, Masami Y; Ohta, Daisaku; Kanaya, Shigehiko

    2012-07-15

    Operon-like arrangements of genes occur in eukaryotes ranging from yeasts and filamentous fungi to nematodes, plants, and mammals. In plants, several examples of operon-like gene clusters involved in metabolic pathways have recently been characterized, e.g. the cyclic hydroxamic acid pathways in maize, the avenacin biosynthesis gene clusters in oat, the thalianol pathway in Arabidopsis thaliana, and the diterpenoid momilactone cluster in rice. Such operon-like gene clusters are defined by their co-regulation or neighboring positions within immediate vicinity of chromosomal regions. A comprehensive analysis of the expression of neighboring genes therefore accounts a crucial step to reveal the complete set of operon-like gene clusters within a genome. Genome-wide prediction of operon-like gene clusters should contribute to functional annotation efforts and provide novel insight into evolutionary aspects acquiring certain biological functions as well. We predicted co-expressed gene clusters by comparing the Pearson correlation coefficient of neighboring genes and randomly selected gene pairs, based on a statistical method that takes false discovery rate (FDR) into consideration for 1469 microarray gene expression datasets of A. thaliana. We estimated that A. thaliana contains 100 operon-like gene clusters in total. We predicted 34 statistically significant gene clusters consisting of 3 to 22 genes each, based on a stringent FDR threshold of 0.1. Functional relationships among genes in individual clusters were estimated by sequence similarity and functional annotation of genes. Duplicated gene pairs (determined based on BLAST with a cutoff of EOperon-like clusters tend to include genes encoding bio-machinery associated with ribosomes, the ubiquitin/proteasome system, secondary metabolic pathways, lipid and fatty-acid metabolism, and the lipid transfer system. Copyright © 2012 Elsevier B.V. All rights reserved.

  16. Life cycle analysis of kidney gene expression in male F344 rats.

    Directory of Open Access Journals (Sweden)

    Joshua C Kwekel

    Full Text Available Age is a predisposing condition for susceptibility to chronic kidney disease and progression as well as acute kidney injury that may arise due to the adverse effects of some drugs. Age-related differences in kidney biology, therefore, are a key concern in understanding drug safety and disease progression. We hypothesize that the underlying suite of genes expressed in the kidney at various life cycle stages will impact susceptibility to adverse drug reactions. Therefore, establishing changes in baseline expression data between these life stages is the first and necessary step in evaluating this hypothesis. Untreated male F344 rats were sacrificed at 2, 5, 6, 8, 15, 21, 78, and 104 weeks of age. Kidneys were collected for histology and gene expression analysis. Agilent whole-genome rat microarrays were used to query global expression profiles. An ANOVA (p1.5 in relative mRNA expression, was used to identify 3,724 unique differentially expressed genes (DEGs. Principal component analyses of these DEGs revealed three major divisions in life-cycle renal gene expression. K-means cluster analysis identified several groups of genes that shared age-specific patterns of expression. Pathway analysis of these gene groups revealed age-specific gene networks and functions related to renal function and aging, including extracellular matrix turnover, immune cell response, and renal tubular injury. Large age-related changes in expression were also demonstrated for the genes that code for qualified renal injury biomarkers KIM-1, Clu, and Tff3. These results suggest specific groups of genes that may underlie age-specific susceptibilities to adverse drug reactions and disease. This analysis of the basal gene expression patterns of renal genes throughout the life cycle of the rat will improve the use of current and future renal biomarkers and inform our assessments of kidney injury and disease.

  17. Functional evolution of ADAMTS genes: Evidence from analyses of phylogeny and gene organization

    Directory of Open Access Journals (Sweden)

    Van Meir Erwin G

    2005-02-01

    Full Text Available Abstract Background The ADAMTS (A Disintegrin-like and Metalloprotease with Thrombospondin motifs proteins are a family of metalloproteases with sequence similarity to the ADAM proteases, that contain the thrombospondin type 1 sequence repeat motifs (TSRs common to extracellular matrix proteins. ADAMTS proteins have recently gained attention with the discovery of their role in a variety of diseases, including tissue and blood disorders, cancer, osteoarthritis, Alzheimer's and the genetic syndromes Weill-Marchesani syndrome (ADAMTS10, thrombotic thrombocytopenic purpura (ADAMTS13, and Ehlers-Danlos syndrome type VIIC (ADAMTS2 in humans and belted white-spotting mutation in mice (ADAMTS20. Results Phylogenetic analysis and comparison of the exon/intron organization of vertebrate (Homo, Mus, Fugu, chordate (Ciona and invertebrate (Drosophila and Caenorhabditis ADAMTS homologs has elucidated the evolutionary relationships of this important gene family, which comprises 19 members in humans. Conclusions The evolutionary history of ADAMTS genes in vertebrate genomes has been marked by rampant gene duplication, including a retrotransposition that gave rise to a distinct ADAMTS subfamily (ADAMTS1, -4, -5, -8, -15 that may have distinct aggrecanase and angiogenesis functions.

  18. When Is Hub Gene Selection Better than Standard Meta-Analysis?

    Science.gov (United States)

    Langfelder, Peter; Mischel, Paul S.; Horvath, Steve

    2013-01-01

    gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis. PMID:23613865

  19. When is hub gene selection better than standard meta-analysis?

    Directory of Open Access Journals (Sweden)

    Peter Langfelder

    applied to gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis.

  20. When is hub gene selection better than standard meta-analysis?

    Science.gov (United States)

    Langfelder, Peter; Mischel, Paul S; Horvath, Steve

    2013-01-01

    gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis.

  1. Genome-wide identification, functional and evolutionary analysis of terpene synthases in pineapple.

    Science.gov (United States)

    Chen, Xiaoe; Yang, Wei; Zhang, Liqin; Wu, Xianmiao; Cheng, Tian; Li, Guanglin

    2017-10-01

    Terpene synthases (TPSs) are vital for the biosynthesis of active terpenoids, which have important physiological, ecological and medicinal value. Although terpenoids have been reported in pineapple (Ananas comosus), genome-wide investigations of the TPS genes responsible for pineapple terpenoid synthesis are still lacking. By integrating pineapple genome and proteome data, twenty-one putative terpene synthase genes were found in pineapple and divided into five subfamilies. Tandem duplication is the cause of TPS gene family duplication. Furthermore, functional differentiation between each TPS subfamily may have occurred for several reasons. Sixty-two key amino acid sites were identified as being type-II functionally divergence between TPS-a and TPS-c subfamily. Finally, coevolution analysis indicated that multiple amino acid residues are involved in coevolutionary processes. In addition, the enzyme activity of two TPSs were tested. This genome-wide identification, functional and evolutionary analysis of pineapple TPS genes provide a new insight into understanding the roles of TPS family and lay the basis for further characterizing the function and evolution of TPS gene family. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. When natural selection gives gene function the cold shoulder.

    Science.gov (United States)

    Cutter, Asher D; Jovelin, Richard

    2015-11-01

    It is tempting to invoke organismal selection as perpetually optimizing the function of any given gene. However, natural selection can drive genic functional change without improvement of biochemical activity, even to the extinction of gene activity. Detrimental mutations can creep in owing to linkage with other selectively favored loci. Selection can promote functional degradation, irrespective of genetic drift, when adaptation occurs by loss of gene function. Even stabilizing selection on a trait can lead to divergence of the underlying molecular constituents. Selfish genetic elements can also proliferate independent of any functional benefits to the host genome. Here we review the logic and evidence for these diverse processes acting in genome evolution. This collection of distinct evolutionary phenomena - while operating through easily understandable mechanisms - all contribute to the seemingly counterintuitive notion that maintenance or improvement of a gene's biochemical function sometimes do not determine its evolutionary fate. © 2015 WILEY Periodicals, Inc.

  3. Identification and functional analysis of pheromone and receptor genes in the B3 mating locus of Pleurotus eryngii.

    Science.gov (United States)

    Kim, Kyung-Hee; Kang, Young Min; Im, Chak Han; Ali, Asjad; Kim, Sun Young; Je, Hee-Jeong; Kim, Min-Keun; Rho, Hyun Su; Lee, Hyun Sook; Kong, Won-Sik; Ryu, Jae-San

    2014-01-01

    Pleurotus eryngii has recently become a major cultivated mushroom; it uses tetrapolar heterothallism as a part of its reproductive process. Sexual development progresses only when the A and B mating types are compatible. Such mating incompatibility occasionally limits the efficiency of breeding programs in which crossing within loci-shared strains or backcrossing strategies are employed. Therefore, understanding the mating system in edible mushroom fungi will help provide a short cut in the development of new strains. We isolated and identified pheromone and receptor genes in the B3 locus of P. eryngii and performed a functional analysis of the genes in the mating process by transformation. A genomic DNA library was constructed to map the entire mating-type locus. The B3 locus was found to contain four pheromone precursor genes and four receptor genes. Remarkably, receptor PESTE3.3.1 has just 34 amino acid residues in its C-terminal cytoplasmic region; therefore, it seems likely to be a receptor-like gene. Real-time quantitative RT-PCR (real-time qRT-PCR) revealed that most pheromone and receptor genes showed significantly higher expression in monokaryotic cells than dikaryotic cells. The pheromone genes PEphb3.1 and PEphb3.3 and the receptor gene PESTE3.3.1 were transformed into P5 (A3B4). The transformants were mated with a tester strain (A4B4), and the progeny showed clamp connections and a normal fruiting body, which indicates the proposed role of these genes in mating and fruiting processes. This result also confirms that PESTE3.3.1 is a receptor gene. In this study, we identified pheromone and receptor genes in the B3 locus of P. eryngii and found that some of those genes appear to play a role in the mating and fruiting processes. These results might help elucidate the mechanism of fruiting differentiation and improve breeding efficiency.

  4. A gene network bioinformatics analysis for pemphigoid autoimmune blistering diseases.

    Science.gov (United States)

    Barone, Antonio; Toti, Paolo; Giuca, Maria Rita; Derchi, Giacomo; Covani, Ugo

    2015-07-01

    In this theoretical study, a text mining search and clustering analysis of data related to genes potentially involved in human pemphigoid autoimmune blistering diseases (PAIBD) was performed using web tools to create a gene/protein interaction network. The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database was employed to identify a final set of PAIBD-involved genes and to calculate the overall significant interactions among genes: for each gene, the weighted number of links, or WNL, was registered and a clustering procedure was performed using the WNL analysis. Genes were ranked in class (leader, B, C, D and so on, up to orphans). An ontological analysis was performed for the set of 'leader' genes. Using the above-mentioned data network, 115 genes represented the final set; leader genes numbered 7 (intercellular adhesion molecule 1 (ICAM-1), interferon gamma (IFNG), interleukin (IL)-2, IL-4, IL-6, IL-8 and tumour necrosis factor (TNF)), class B genes were 13, whereas the orphans were 24. The ontological analysis attested that the molecular action was focused on extracellular space and cell surface, whereas the activation and regulation of the immunity system was widely involved. Despite the limited knowledge of the present pathologic phenomenon, attested by the presence of 24 genes revealing no protein-protein direct or indirect interactions, the network showed significant pathways gathered in several subgroups: cellular components, molecular functions, biological processes and the pathologic phenomenon obtained from the Kyoto Encyclopaedia of Genes and Genomes (KEGG) database. The molecular basis for PAIBD was summarised and expanded, which will perhaps give researchers promising directions for the identification of new therapeutic targets.

  5. Gene prediction validation and functional analysis of redundant pathways

    DEFF Research Database (Denmark)

    Sønderkær, Mads

    2011-01-01

    have employed a large mRNA-seq data set to improve and validate ab initio predicted gene models. This direct experimental evidence also provides reliable determinations of UTR regions and polyadenylation sites, which are not easily predicted in plants. Furthermore, once an annotated genome sequence...... is available, gene expression by mRNA-Seq enables acquisition of a more complete overview of gene isoform usage in complex enzymatic pathways enabling the identification of key genes. Metabolism in potatoes This information is useful e.g. for crop improvement based on manipulation of agronomically important...

  6. Development of gene diagnosis for diabetes and cholecystitis based on gene analysis of CCK-A receptor

    International Nuclear Information System (INIS)

    Kono, Akira

    1999-01-01

    Base sequence analysis of CCKAR gene (a gene of A-type receptor for cholecystokinin) from OLETF rat, a model rat for insulin-independent diabetes was made based on the base sequence of wild CCKAR gene, which had been clarified in the previous year. From the pancreas of OLETF rat, DNA was extracted and transduced into λphage after fragmentation to construct the gene library of OLETF. Then, λphage DNA clone bound with labelled cDNA of CCKAR gene was analyzed and the gene structure was compared with that of the wild gene. It was demonstrated that CCKAR gene of OLETF had a deletion (6800 b.p.) ranging from the promoter region to the Exon 2, suggesting that CCKAR gene is not functional in OLETF rat. The whole sequence of this mutant gene was registered into Japan DNA Bank (D 50610). Then, F 2 offspring rats were obtained through crossing OLETF (female) and F344 (male) and the time course-changes in the blood glucose level after glucose loading were compared among them. The blood glucose level after glucose loading was significantly higher in the homo-mutant F 2 (CCKAR,-/-) as well as the parent OLETF rat than hetero-mutant F 2 (CCKARm-/+) or the wild rat (CCKAR,+/+). This suggests that CCKAR gene might be involved in the control of blood glucose level and an alteration of the expression level or the functions of CCKAR gene might affect the blood glucose level. (M.N.)

  7. Comparative analysis of clustering methods for gene expression time course data

    Directory of Open Access Journals (Sweden)

    Ivan G. Costa

    2004-01-01

    Full Text Available This work performs a data driven comparative study of clustering methods used in the analysis of gene expression time courses (or time series. Five clustering methods found in the literature of gene expression analysis are compared: agglomerative hierarchical clustering, CLICK, dynamical clustering, k-means and self-organizing maps. In order to evaluate the methods, a k-fold cross-validation procedure adapted to unsupervised methods is applied. The accuracy of the results is assessed by the comparison of the partitions obtained in these experiments with gene annotation, such as protein function and series classification.

  8. Cloning and Functional Analysis of the Promoter of an Ascorbate Oxidase Gene from Gossypium hirsutum.

    Directory of Open Access Journals (Sweden)

    Shan Xin

    Full Text Available Apoplastic ascorbate oxidase (AO plays significant roles in plant cell growth. However, the mechanism of underlying the transcriptional regulation of AO in Gossypium hirsutum remains unclear. Here, we obtained a 1,920-bp promoter sequence from the Gossypium hirsutum ascorbate oxidase (GhAO1 gene, and this GhAO1 promoter included a number of known cis-elements. Promoter activity analysis in overexpressing pGhAO1::GFP-GUS tobacco (Nicotiana benthamiana showed that the GhAO1 promoter exhibited high activity, driving strong reporter gene expression in tobacco trichomes, leaves and roots. Promoter 5'-deletion analysis demonstrated that truncated GhAO1 promoters with serial 5'-end deletions had different GUS activities. A 360-bp fragment was sufficient to activate GUS expression. The P-1040 region had less GUS activity than the P-720 region, suggesting that the 320-bp region from nucleotide -720 to -1040 might include a cis-element acting as a silencer. Interestingly, an auxin-responsive cis-acting element (TGA-element was uncovered in the promoter. To analyze the function of the TGA-element, tobacco leaves transformed with promoters with different 5' truncations were treated with indole-3-acetic acid (IAA. Tobacco leaves transformed with the promoter regions containing the TGA-element showed significantly increased GUS activity after IAA treatment, implying that the fragment spanning nucleotides -1760 to -1600 (which includes the TGA-element might be a key component for IAA responsiveness. Analyses of the AO promoter region and AO expression pattern in Gossypium arboreum (Ga, diploid cotton with an AA genome, Gossypium raimondii (Gr, diploid cotton with a DD genome and Gossypium hirsutum (Gh, tetraploid cotton with an AADD genome indicated that AO promoter activation and AO transcription were detected together only in D genome/sub-genome (Gr and Gh cotton. Taken together, these results suggest that the 1,920-bp GhAO1 promoter is a functional sequence

  9. Cloning and Functional Analysis of the Promoter of an Ascorbate Oxidase Gene from Gossypium hirsutum.

    Science.gov (United States)

    Xin, Shan; Tao, Chengcheng; Li, Hongbin

    2016-01-01

    Apoplastic ascorbate oxidase (AO) plays significant roles in plant cell growth. However, the mechanism of underlying the transcriptional regulation of AO in Gossypium hirsutum remains unclear. Here, we obtained a 1,920-bp promoter sequence from the Gossypium hirsutum ascorbate oxidase (GhAO1) gene, and this GhAO1 promoter included a number of known cis-elements. Promoter activity analysis in overexpressing pGhAO1::GFP-GUS tobacco (Nicotiana benthamiana) showed that the GhAO1 promoter exhibited high activity, driving strong reporter gene expression in tobacco trichomes, leaves and roots. Promoter 5'-deletion analysis demonstrated that truncated GhAO1 promoters with serial 5'-end deletions had different GUS activities. A 360-bp fragment was sufficient to activate GUS expression. The P-1040 region had less GUS activity than the P-720 region, suggesting that the 320-bp region from nucleotide -720 to -1040 might include a cis-element acting as a silencer. Interestingly, an auxin-responsive cis-acting element (TGA-element) was uncovered in the promoter. To analyze the function of the TGA-element, tobacco leaves transformed with promoters with different 5' truncations were treated with indole-3-acetic acid (IAA). Tobacco leaves transformed with the promoter regions containing the TGA-element showed significantly increased GUS activity after IAA treatment, implying that the fragment spanning nucleotides -1760 to -1600 (which includes the TGA-element) might be a key component for IAA responsiveness. Analyses of the AO promoter region and AO expression pattern in Gossypium arboreum (Ga, diploid cotton with an AA genome), Gossypium raimondii (Gr, diploid cotton with a DD genome) and Gossypium hirsutum (Gh, tetraploid cotton with an AADD genome) indicated that AO promoter activation and AO transcription were detected together only in D genome/sub-genome (Gr and Gh) cotton. Taken together, these results suggest that the 1,920-bp GhAO1 promoter is a functional sequence with a

  10. Functional analysis of duplicated Symbiosis Receptor Kinase (SymRK) genes during nodulation and mycorrhizal infection in soybean (Glycine max).

    Science.gov (United States)

    Indrasumunar, Arief; Wilde, Julia; Hayashi, Satomi; Li, Dongxue; Gresshoff, Peter M

    2015-03-15

    Association between legumes and rhizobia results in the formation of root nodules, where symbiotic nitrogen fixation occurs. The early stages of this association involve a complex of signalling events between the host and microsymbiont. Several genes dealing with early signal transduction have been cloned, and one of them encodes the leucine-rich repeat (LRR) receptor kinase (SymRK; also termed NORK). The Symbiosis Receptor Kinase gene is required by legumes to establish a root endosymbiosis with Rhizobium bacteria as well as mycorrhizal fungi. Using degenerate primer and BAC sequencing, we cloned duplicated SymRK homeologues in soybean called GmSymRKα and GmSymRKβ. These duplicated genes have high similarity of nucleotide (96%) and amino acid sequence (95%). Sequence analysis predicted a malectin-like domain within the extracellular domain of both genes. Several putative cis-acting elements were found in promoter regions of GmSymRKα and GmSymRKβ, suggesting a participation in lateral root development, cell division and peribacteroid membrane formation. The mutant of SymRK genes is not available in soybean; therefore, to know the functions of these genes, RNA interference (RNAi) of these duplicated genes was performed. For this purpose, RNAi construct of each gene was generated and introduced into the soybean genome by Agrobacterium rhizogenes-mediated hairy root transformation. RNAi of GmSymRKβ gene resulted in an increased reduction of nodulation and mycorrhizal infection than RNAi of GmSymRKα, suggesting it has the major activity of the duplicated gene pair. The results from the important crop legume soybean confirm the joint phenotypic action of GmSymRK genes in both mycorrhizal and rhizobial infection seen in model legumes. Copyright © 2015 Elsevier GmbH. All rights reserved.

  11. Using riboswitches to regulate gene expression and define gene function in mycobacteria.

    Science.gov (United States)

    Van Vlack, Erik R; Seeliger, Jessica C

    2015-01-01

    Mycobacteria include both environmental species and many pathogenic species such as Mycobacterium tuberculosis, an intracellular pathogen that is the causative agent of tuberculosis in humans. Inducible gene expression is a powerful tool for examining gene function and essentiality, both in in vitro culture and in host cell infections. The theophylline-inducible artificial riboswitch has recently emerged as an alternative to protein repressor-based systems. The riboswitch is translationally regulated and is combined with a mycobacterial promoter that provides transcriptional control. We here provide methods used by our laboratory to characterize the riboswitch response to theophylline in reporter strains, recombinant organisms containing riboswitch-regulated endogenous genes, and in host cell infections. These protocols should facilitate the application of both existing and novel artificial riboswitches to the exploration of gene function in mycobacteria. © 2015 Elsevier Inc. All rights reserved.

  12. Microarray gene expression profiling and analysis in renal cell carcinoma

    Directory of Open Access Journals (Sweden)

    Sadhukhan Provash

    2004-06-01

    Full Text Available Abstract Background Renal cell carcinoma (RCC is the most common cancer in adult kidney. The accuracy of current diagnosis and prognosis of the disease and the effectiveness of the treatment for the disease are limited by the poor understanding of the disease at the molecular level. To better understand the genetics and biology of RCC, we profiled the expression of 7,129 genes in both clear cell RCC tissue and cell lines using oligonucleotide arrays. Methods Total RNAs isolated from renal cell tumors, adjacent normal tissue and metastatic RCC cell lines were hybridized to affymatrix HuFL oligonucleotide arrays. Genes were categorized into different functional groups based on the description of the Gene Ontology Consortium and analyzed based on the gene expression levels. Gene expression profiles of the tissue and cell line samples were visualized and classified by singular value decomposition. Reverse transcription polymerase chain reaction was performed to confirm the expression alterations of selected genes in RCC. Results Selected genes were annotated based on biological processes and clustered into functional groups. The expression levels of genes in each group were also analyzed. Seventy-four commonly differentially expressed genes with more than five-fold changes in RCC tissues were identified. The expression alterations of selected genes from these seventy-four genes were further verified using reverse transcription polymerase chain reaction (RT-PCR. Detailed comparison of gene expression patterns in RCC tissue and RCC cell lines shows significant differences between the two types of samples, but many important expression patterns were preserved. Conclusions This is one of the initial studies that examine the functional ontology of a large number of genes in RCC. Extensive annotation, clustering and analysis of a large number of genes based on the gene functional ontology revealed many interesting gene expression patterns in RCC. Most

  13. Functional analysis of rice HOMEOBOX4 (Oshox4) gene reveals a negative function in gibberellin responses.

    Science.gov (United States)

    Dai, Mingqiu; Hu, Yongfeng; Ma, Qian; Zhao, Yu; Zhou, Dao-Xiu

    2008-02-01

    The homeodomain-leucine zipper (HD-Zip) putative transcription factor genes are divided into 4 families. In this work, we studied the function of a rice HD-Zip I gene, H OME O BO X4 (Oshox4). Oshox4 transcripts were detected in leaf and floral organ primordia but excluded from the shoot apical meristem and the protein was nuclear localized. Over-expression of Oshox4 in rice induced a semi-dwarf phenotype that could not be complemented by applied GA3. The over-expression plants accumulated elevated levels of bioactive GA, while the GA catabolic gene GA2ox3 was upregulated in the transgenic plants. In addition, over-expression of Oshox4 blocked GA-dependent alpha-amylase production. However, down-regulation of Oshox4 in RNAi transgenic plants induced no phenotypic alteration. Interestingly, the expression of YAB1 that is involved in the negative feedback regulation of the GA biosynthesis was upregulated in the Oshox4 over-expressing plants. One-hybrid assays showed that Oshox4 could interact with YAB1 promoter in yeast. In addition, Oshox4 expression was upregulated by GA. These data together suggest that Oshox4 may be involved in the negative regulation of GA signalling and may play a role to fine tune GA responses in rice.

  14. Cloning and Functional Analysis of the MADS-box CiMADS9 Gene from Carya illinoinensis

    Directory of Open Access Journals (Sweden)

    Zhang Jiyu

    2015-07-01

    Full Text Available A MADS-box gene, CiMADS9, was cloned from the male flowers of Carya illinoinensis by rapid amplification of cDNA ends. The gene was 1 077 bp with a 768 bp open reading frame encoding 255 amino acids. Multiple sequence comparisons revealed that CiMADS9 is a typical MIKC-type MADS-box gene with a MADS-box domain and a K semi-conserved region. Phylogenetic analysis indicated that CiMADS9 belongs to the AGL15 group of the MADS-box gene family. Quantitative reverse transcription polymerase chain reaction analysis indicated that the expression levels in reproductive organs (i.e., flowers and young fruits were considerably higher than in vegetative tissues (i.e., leaves and branches. The highest expression levels were observed in male flowers. An overexpression vector for CiMADS9 was constructed and the gene was inserted into the Arabidopsis thaliana genome. CiMADS9 expression was confirmed in all transgenic lines. Compared with wild-type plants, transgenic A. thaliana plants overexpressing CiMADS9 exhibited delayed flowering and an increased number of leaves.

  15. Functional Gene Diversity and Metabolic Potential of the Microbial Community in an Estuary-Shelf Environment

    Directory of Open Access Journals (Sweden)

    Yu Wang

    2017-06-01

    Full Text Available Microbes play crucial roles in various biogeochemical processes in the ocean, including carbon (C, nitrogen (N, and phosphorus (P cycling. Functional gene diversity and the structure of the microbial community determines its metabolic potential and therefore its ecological function in the marine ecosystem. However, little is known about the functional gene composition and metabolic potential of bacterioplankton in estuary areas. The East China Sea (ECS is a dynamic marginal ecosystem in the western Pacific Ocean that is mainly affected by input from the Changjiang River and the Kuroshio Current. Here, using a high-throughput functional gene microarray (GeoChip, we analyzed the functional gene diversity, composition, structure, and metabolic potential of microbial assemblages in different ECS water masses. Four water masses determined by temperature and salinity relationship showed different patterns of functional gene diversity and composition. Generally, functional gene diversity [Shannon–Weaner’s H and reciprocal of Simpson’s 1/(1-D] in the surface water masses was higher than that in the bottom water masses. The different presence and proportion of functional genes involved in C, N, and P cycling among the bacteria of the different water masses showed different metabolic preferences of the microbial populations in the ECS. Genes involved in starch metabolism (amyA and nplT showed higher proportion in microbial communities of the surface water masses than of the bottom water masses. In contrast, a higher proportion of genes involved in chitin degradation was observed in microorganisms of the bottom water masses. Moreover, we found a higher proportion of nitrogen fixation (nifH, transformation of hydroxylamine to nitrite (hao and ammonification (gdh genes in the microbial communities of the bottom water masses compared with those of the surface water masses. The spatial variation of microbial functional genes was significantly correlated

  16. Heterologous expression and transcript analysis of gibberellin biosynthetic genes of grasses reveals novel functionality in the GA3ox family.

    Science.gov (United States)

    Pearce, Stephen; Huttly, Alison K; Prosser, Ian M; Li, Yi-dan; Vaughan, Simon P; Gallova, Barbora; Patil, Archana; Coghill, Jane A; Dubcovsky, Jorge; Hedden, Peter; Phillips, Andrew L

    2015-06-05

    The gibberellin (GA) pathway plays a central role in the regulation of plant development, with the 2-oxoglutarate-dependent dioxygenases (2-ODDs: GA20ox, GA3ox, GA2ox) that catalyse the later steps in the biosynthetic pathway of particularly importance in regulating bioactive GA levels. Although GA has important impacts on crop yield and quality, our understanding of the regulation of GA biosynthesis during wheat and barley development remains limited. In this study we identified or assembled genes encoding the GA 2-ODDs of wheat, barley and Brachypodium distachyon and characterised the wheat genes by heterologous expression and transcript analysis. The wheat, barley and Brachypodium genomes each contain orthologous copies of the GA20ox, GA3ox and GA2ox genes identified in rice, with the exception of OsGA3ox1 and OsGA2ox5 which are absent in these species. Some additional paralogs of 2-ODD genes were identified: notably, a novel gene in the wheat B genome related to GA3ox2 was shown to encode a GA 1-oxidase, named as TaGA1ox-B1. This enzyme is likely to be responsible for the abundant 1β-hydroxylated GAs present in developing wheat grains. We also identified a related gene in barley, located in a syntenic position to TaGA1ox-B1, that encodes a GA 3,18-dihydroxylase which similarly accounts for the accumulation of unusual GAs in barley grains. Transcript analysis showed that some paralogs of the different classes of 2-ODD were expressed mainly in a single tissue or at specific developmental stages. In particular, TaGA20ox3, TaGA1ox1, TaGA3ox3 and TaGA2ox7 were predominantly expressed in developing grain. More detailed analysis of grain-specific gene expression showed that while the transcripts of biosynthetic genes were most abundant in the endosperm, genes encoding inactivation and signalling components were more highly expressed in the seed coat and pericarp. The comprehensive expression and functional characterisation of the multigene families encoding the 2-ODD

  17. Molecular cloning and functional analysis of a blue light receptor gene MdCRY2 from apple (Malus domestica).

    Science.gov (United States)

    Li, Yuan-Yuan; Mao, Ke; Zhao, Cheng; Zhao, Xian-Yan; Zhang, Rui-Fen; Zhang, Hua-Lei; Shu, Huai-Rui; Hao, Yu-Jin

    2013-04-01

    MdCRY2 was isolated from apple fruit skin, and its function was analyzed in MdCRY2 transgenic Arabidopsis. The interaction between MdCRY2 and AtCOP1 was found by yeast two-hybrid and BiFC assays. Cryptochromes are blue/ultraviolet-A (UV-A) light receptors involved in regulating various aspects of plant growth and development. Investigations of the structure and functions of cryptochromes in plants have largely focused on Arabidopsis (Arabidopsis thaliana), tomato (Solanum lycopersicum), pea (Pisum sativum), and rice (Oryza sativa). However, no data on the function of CRY2 are available in woody plants. In this study, we isolated a cryptochrome gene, MdCRY2, from apple (Malus domestica). The deduced amino acid sequences of MdCRY2 contain the conserved N-terminal photolyase-related domain and the flavin adenine dinucleotide (FAD) binding domain, as well as the C-terminal DQXVP-acidic-STAES (DAS) domain. Relationship analysis indicates that MdCRY2 shows the highest similarity to the strawberry FvCRY protein. The expression of MdCRY2 is induced by blue/UV-A light, which represents a 48-h circadian rhythm. To investigate the function of MdCRY2, we overexpressed the MdCRY2 gene in a cry2 mutant and wild type (WT) Arabidopsis, assessed the phenotypes of the resulting transgenic plants, and found that MdCRY2 functions to regulate hypocotyl elongation, root growth, flower initiation, and anthocyanin accumulation. Furthermore, we examined the interaction between MdCRY2 and AtCOP1 using a yeast two-hybrid assay and a bimolecular fluorescence complementation assay. These data provide functional evidence for a role of blue/UV-A light-induced MdCRY2 in controlling photomorphogenesis in apple.

  18. Gene-Transformation-Induced Changes in Chemical Functional Group Features and Molecular Structure Conformation in Alfalfa Plants Co-Expressing Lc-bHLH and C1-MYB Transcriptive Flavanoid Regulatory Genes: Effects of Single-Gene and Two-Gene Insertion.

    Science.gov (United States)

    Heendeniya, Ravindra G; Yu, Peiqiang

    2017-03-20

    Alfalfa ( Medicago sativa L.) genotypes transformed with Lc-bHLH and Lc transcription genes were developed with the intention of stimulating proanthocyanidin synthesis in the aerial parts of the plant. To our knowledge, there are no studies on the effect of single-gene and two-gene transformation on chemical functional groups and molecular structure changes in these plants. The objective of this study was to use advanced molecular spectroscopy with multivariate chemometrics to determine chemical functional group intensity and molecular structure changes in alfalfa plants when co-expressing Lc-bHLH and C1-MYB transcriptive flavanoid regulatory genes in comparison with non-transgenic (NT) and AC Grazeland (ACGL) genotypes. The results showed that compared to NT genotype, the presence of double genes ( Lc and C1 ) increased ratios of both the area and peak height of protein structural Amide I/II and the height ratio of α-helix to β-sheet. In carbohydrate-related spectral analysis, the double gene-transformed alfalfa genotypes exhibited lower peak heights at 1370, 1240, 1153, and 1020 cm -1 compared to the NT genotype. Furthermore, the effect of double gene transformation on carbohydrate molecular structure was clearly revealed in the principal component analysis of the spectra. In conclusion, single or double transformation of Lc and C1 genes resulted in changing functional groups and molecular structure related to proteins and carbohydrates compared to the NT alfalfa genotype. The current study provided molecular structural information on the transgenic alfalfa plants and provided an insight into the impact of transgenes on protein and carbohydrate properties and their molecular structure's changes.

  19. Cloning, Expression Profiling and Functional Analysis of CnHMGS, a Gene Encoding 3-hydroxy-3-Methylglutaryl Coenzyme A Synthase from Chamaemelum nobile

    Directory of Open Access Journals (Sweden)

    Shuiyuan Cheng

    2016-03-01

    Full Text Available Roman chamomile (Chamaemelum nobile L. is renowned for its production of essential oils, which major components are sesquiterpenoids. As the important enzyme in the sesquiterpenoid biosynthesis pathway, 3-hydroxy-3-methylglutaryl coenzyme A synthase (HMGS catalyze the crucial step in the mevalonate pathway in plants. To isolate and identify the functional genes involved in the sesquiterpene biosynthesis of C. nobile L., a HMGS gene designated as CnHMGS (GenBank Accession No. KU529969 was cloned from C. nobile. The cDNA sequence of CnHMGS contained a 1377 bp open reading frame encoding a 458-amino-acid protein. The sequence of the CnHMGS protein was highly homologous to those of HMGS proteins from other plant species. Phylogenetic tree analysis revealed that CnHMGS clustered with the HMGS of Asteraceae in the dicotyledon clade. Further functional complementation of CnHMGS in the mutant yeast strain YSC6274 lacking HMGS activity demonstrated that the cloned CnHMGS cDNA encodes a functional HMGS. Transcript profile analysis indicated that CnHMGS was preferentially expressed in flowers and roots of C. nobile. The expression of CnHMGS could be upregulated by exogenous elicitors, including methyl jasmonate and salicylic acid, suggesting that CnHMGS was elicitor-responsive. The characterization and expression analysis of CnHMGS is helpful to understand the biosynthesis of sesquiterpenoid in C. nobile at the molecular level and also provides molecular wealth for the biotechnological improvement of this important medicinal plant.

  20. Cloning, Expression Profiling and Functional Analysis of CnHMGS, a Gene Encoding 3-hydroxy-3-Methylglutaryl Coenzyme A Synthase from Chamaemelum nobile.

    Science.gov (United States)

    Cheng, Shuiyuan; Wang, Xiaohui; Xu, Feng; Chen, Qiangwen; Tao, Tingting; Lei, Jing; Zhang, Weiwei; Liao, Yongling; Chang, Jie; Li, Xingxiang

    2016-03-08

    Roman chamomile (Chamaemelum nobile L.) is renowned for its production of essential oils, which major components are sesquiterpenoids. As the important enzyme in the sesquiterpenoid biosynthesis pathway, 3-hydroxy-3-methylglutaryl coenzyme A synthase (HMGS) catalyze the crucial step in the mevalonate pathway in plants. To isolate and identify the functional genes involved in the sesquiterpene biosynthesis of C. nobile L., a HMGS gene designated as CnHMGS (GenBank Accession No. KU529969) was cloned from C. nobile. The cDNA sequence of CnHMGS contained a 1377 bp open reading frame encoding a 458-amino-acid protein. The sequence of the CnHMGS protein was highly homologous to those of HMGS proteins from other plant species. Phylogenetic tree analysis revealed that CnHMGS clustered with the HMGS of Asteraceae in the dicotyledon clade. Further functional complementation of CnHMGS in the mutant yeast strain YSC6274 lacking HMGS activity demonstrated that the cloned CnHMGS cDNA encodes a functional HMGS. Transcript profile analysis indicated that CnHMGS was preferentially expressed in flowers and roots of C. nobile. The expression of CnHMGS could be upregulated by exogenous elicitors, including methyl jasmonate and salicylic acid, suggesting that CnHMGS was elicitor-responsive. The characterization and expression analysis of CnHMGS is helpful to understand the biosynthesis of sesquiterpenoid in C. nobile at the molecular level and also provides molecular wealth for the biotechnological improvement of this important medicinal plant.

  1. Functional Analysis of the FZF1 Genes of Saccharomyces uvarum

    Directory of Open Access Journals (Sweden)

    Xiaozhen Liu

    2018-02-01

    Full Text Available Being a sister species of Saccharomyces cerevisiae, Saccharomyces uvarum shows great potential regarding the future of the wine industry. The sulfite tolerance of most S. uvarum strains is poor, however. This is a major flaw that limits its utility in the wine industry. In S. cerevisiae, FZF1 plays a positive role in the transcription of SSU1, which encodes a sulfite efflux transport protein that is critical for sulfite tolerance. Although FZF1 has previously been shown to play a role in sulfite tolerance in S. uvarum, there is little information about its action mechanism. To assess the function of FZF1, two over-expression vectors that contained different FZF1 genes, and one FZF1 silencing vector, were constructed and introduced into a sulfite-tolerant S. uvarum strain using electroporation. In addition, an FZF1-deletion strain was constructed. Both of the FZF1-over-expressing strains showed an elevated tolerance to sulfite, and the FZF1-deletion strain showed the opposite effect. Repression of FZF1 transcription failed, however, presumably due to the lack of alleles of DCR1 and AGO. The qRT-PCR analysis was used to examine changes in transcription in the strains. Surprisingly, neither over-expressing strain promoted SSU1 transcription, although MET4 and HAL4 transcripts significantly increased in both sulfite-tolerance increased strains. We conclude that FZF1 plays a different role in the sulfite tolerance of S. uvarum compared to its role in S. cerevisiae.

  2. MAGMA: generalized gene-set analysis of GWAS data.

    Science.gov (United States)

    de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

    2015-04-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.

  3. Restoration using Azolla imbricata increases nitrogen functional bacterial groups and genes in soil.

    Science.gov (United States)

    Lu, Xiao-Ming; Lu, Peng-Zhen; Yang, Ke

    2017-05-01

    Microbial groups are major factors that influence soil function. Currently, there is a lack of studies on microbial functional groups. Although soil microorganisms play an important role in the nitrogen cycle, systematic studies of the effects of environmental factors on microbial populations in relation to key metabolic processes in the nitrogen cycle are seldom reported. In this study, we conducted a systematic analysis of the changes in nitrogen functional groups in mandarin orange garden soil treated with Azolla imbricata. The structures of the major functional bacterial groups and the functional gene abundances involved in key processes of the soil nitrogen cycle were analyzed using high-throughput sequencing (HTS) and quantitative real-time PCR, respectively. The results indicated that returning A. imbricata had an important influence on the composition of soil nitrogen functional bacterial communities. Treatment with A. imbricata increased the diversity of the nitrogen functional bacteria. The abundances of nitrogen functional genes were significantly higher in the treated soil compared with the control soil. Both the diversity of the major nitrogen functional bacteria (nifH bacteria, nirK bacteria, and narG bacteria) and the abundances of nitrogen functional genes in the soil showed significant positive correlations with the soil pH, the organic carbon content, available nitrogen, available phosphorus, and NH 4 + -N and NO 3 - -N contents. Treatment with 12.5 kg fresh A. imbricata per mandarin orange tree was effective to improve the quality of the mandarin orange garden soil. This study analyzed the mechanism of the changes in functional bacterial groups and genes involved in key metabolic processes of the nitrogen cycle in soil treated by A. imbricata.

  4. Identification of the WRKY gene family and functional analysis of two genes in Caragana intermedia.

    Science.gov (United States)

    Wan, Yongqing; Mao, Mingzhu; Wan, Dongli; Yang, Qi; Yang, Feiyun; Mandlaa; Li, Guojing; Wang, Ruigang

    2018-02-09

    WRKY transcription factors, one of the largest families of transcriptional regulators in plants, play important roles in plant development and various stress responses. The WRKYs of Caragana intermedia are still not well characterized, although many WRKYs have been identified in various plant species. We identified 53 CiWRKY genes from C. intermedia transcriptome data, 28 of which exhibited complete open reading frames (ORFs). These CiWRKYs were divided into three groups via phylogenetic analysis according to their WRKY domains and zinc finger motifs. Conserved domain analysis showed that the CiWRKY proteins contain a highly conserved WRKYGQK motif and two variant motifs (WRKYGKK and WKKYEEK). The subcellular localization of CiWRKY26 and CiWRKY28-1 indicated that these two proteins localized exclusively to nuclei, supporting their role as transcription factors. The expression patterns of the 28 CiWRKYs with complete ORFs were examined through quantitative real-time PCR (qRT-PCR) in various tissues and under different abiotic stresses (drought, cold, salt, high-pH and abscisic acid (ABA)). The results showed that each CiWRKY responded to at least one stress treatment. Furthermore, overexpression of CiWRKY75-1 and CiWRKY40-4 in Arabidopsis thaliana suppressed the drought stress tolerance of the plants and delayed leaf senescence, respectively. Fifty-three CiWRKY genes from the C. intermedia transcriptome were identified and divided into three groups via phylogenetic analysis. The expression patterns of the 28 CiWRKYs under different abiotic stresses suggested that each CiWRKY responded to at least one stress treatment. Overexpression of CiWRKY75-1 and CiWRKY40-4 suppressed the drought stress tolerance of Arabidopsis and delayed leaf senescence, respectively. These results provide a basis for the molecular mechanism through which CiWRKYs mediate stress tolerance.

  5. Screening Key Genes Associated with the Development and Progression of Non-small Cell Lung Cancer Based on Gene-enrichment Analysis and Meta-analysis

    Directory of Open Access Journals (Sweden)

    Wenwu HE

    2012-07-01

    Full Text Available Background and objective Non-small cell lung cancer (NSCLC is one of the most common malignant tumors; however, its causes are still not completely understood. This study was designed to screen the key genes and pathways related to NSCLC occurrence and development and to establish the scientific foundation for the genetic mechanisms and targeted therapy of NSCLC. Methods Both gene set-enrichment analysis (GSEA and meta-analysis (meta were used to screen the critical pathways and genes that might be corretacted with the development and progression of lung cancer at the transcription level. Results Using the GSEA and meta methods, focal adhesion and regulation of actin cytoskeleton were determined to be the more prominent overlapping significant pathways. In the focal adhesion pathway, 31 genes were statistically significant (P<0.05, whereas in the regulation of actin cytoskeleton pathway, 32 genes were statistically significant (P<0.05. Conclusion The focal adhesion and the regulation of actin cytoskeleton pathways might play important roles in the occurrence and development of NSCLC. Further studies are needed to determine the biological function for the positiue genes.

  6. Identification, isolation and expression analysis of auxin response factor (ARF) genes in Solanum lycopersicum.

    Science.gov (United States)

    Wu, Jian; Wang, Feiyan; Cheng, Lin; Kong, Fuling; Peng, Zhen; Liu, Songyu; Yu, Xiaolin; Lu, Gang

    2011-11-01

    Auxin response factors (ARFs) encode transcriptional factors that bind specifically to the TGTCTC-containing auxin response elements found in the promoters of primary/early auxin response genes that regulate plant development. In this study, investigation of the tomato genome revealed 21 putative functional ARF genes (SlARFs), a number comparable to that found in Arabidopsis (23) and rice (25). The full cDNA sequences of 15 novel SlARFs were isolated and delineated by sequencing of PCR products. A comprehensive genome-wide analysis of this gene family is presented, including the gene structures, chromosome locations, phylogeny, and conserved motifs. In addition, a comparative analysis between ARF family genes in tomato and maize was performed. A phylogenetic tree generated from alignments of the full-length protein sequences of 21 OsARFs, 23 AtARFs, 31 ZmARFs, and 21 SlARFs revealed that these ARFs were clustered into four major groups. However, we could not find homologous genes in rice, maize, or tomato with AtARF12-15 and AtARF20-23. The expression patterns of tomato ARF genes were analyzed by quantitative real-time PCR. Our comparative analysis will help to define possible functions for many of these newly isolated ARF-family genes in plant development.

  7. Expression and functional assessment of candidate type 2 diabetes susceptibility genes identify four new genes contributing to human insulin secretion

    Directory of Open Access Journals (Sweden)

    Fatou K. Ndiaye

    2017-06-01

    expression of Prc1, Srr, Zfand6, and Zfand3 was found in mouse pancreatic islets with altered beta-cell function. Conclusions: This study showed the ability of post-GWAS functional studies to identify new genes and pathways involved in human pancreatic beta-cell function and in T2D pathophysiology. Keywords: EndoC-βH1, Expression analysis, Genome-wide association study, Insulin secretion, RNAi screening, Type 2 diabetes

  8. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Directory of Open Access Journals (Sweden)

    Alamar Santiago

    2009-09-01

    Full Text Available Abstract Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new

  9. A new set of ESTs and cDNA clones from full-length and normalized libraries for gene discovery and functional characterization in citrus

    Science.gov (United States)

    Marques, M Carmen; Alonso-Cantabrana, Hugo; Forment, Javier; Arribas, Raquel; Alamar, Santiago; Conejero, Vicente; Perez-Amador, Miguel A

    2009-01-01

    Background Interpretation of ever-increasing raw sequence information generated by modern genome sequencing technologies faces multiple challenges, such as gene function analysis and genome annotation. Indeed, nearly 40% of genes in plants encode proteins of unknown function. Functional characterization of these genes is one of the main challenges in modern biology. In this regard, the availability of full-length cDNA clones may fill in the gap created between sequence information and biological knowledge. Full-length cDNA clones facilitate functional analysis of the corresponding genes enabling manipulation of their expression in heterologous systems and the generation of a variety of tagged versions of the native protein. In addition, the development of full-length cDNA sequences has the power to improve the quality of genome annotation. Results We developed an integrated method to generate a new normalized EST collection enriched in full-length and rare transcripts of different citrus species from multiple tissues and developmental stages. We constructed a total of 15 cDNA libraries, from which we isolated 10,898 high-quality ESTs representing 6142 different genes. Percentages of redundancy and proportion of full-length clones range from 8 to 33, and 67 to 85, respectively, indicating good efficiency of the approach employed. The new EST collection adds 2113 new citrus ESTs, representing 1831 unigenes, to the collection of citrus genes available in the public databases. To facilitate functional analysis, cDNAs were introduced in a Gateway-based cloning vector for high-throughput functional analysis of genes in planta. Herein, we describe the technical methods used in the library construction, sequence analysis of clones and the overexpression of CitrSEP, a citrus homolog to the Arabidopsis SEP3 gene, in Arabidopsis as an example of a practical application of the engineered Gateway vector for functional analysis. Conclusion The new EST collection denotes an

  10. Methodology for the inference of gene function from phenotype data.

    Science.gov (United States)

    Ascensao, Joao A; Dolan, Mary E; Hill, David P; Blake, Judith A

    2014-12-12

    Biomedical ontologies are increasingly instrumental in the advancement of biological research primarily through their use to efficiently consolidate large amounts of data into structured, accessible sets. However, ontology development and usage can be hampered by the segregation of knowledge by domain that occurs due to independent development and use of the ontologies. The ability to infer data associated with one ontology to data associated with another ontology would prove useful in expanding information content and scope. We here focus on relating two ontologies: the Gene Ontology (GO), which encodes canonical gene function, and the Mammalian Phenotype Ontology (MP), which describes non-canonical phenotypes, using statistical methods to suggest GO functional annotations from existing MP phenotype annotations. This work is in contrast to previous studies that have focused on inferring gene function from phenotype primarily through lexical or semantic similarity measures. We have designed and tested a set of algorithms that represents a novel methodology to define rules for predicting gene function by examining the emergent structure and relationships between the gene functions and phenotypes rather than inspecting the terms semantically. The algorithms inspect relationships among multiple phenotype terms to deduce if there are cases where they all arise from a single gene function. We apply this methodology to data about genes in the laboratory mouse that are formally represented in the Mouse Genome Informatics (MGI) resource. From the data, 7444 rule instances were generated from five generalized rules, resulting in 4818 unique GO functional predictions for 1796 genes. We show that our method is capable of inferring high-quality functional annotations from curated phenotype data. As well as creating inferred annotations, our method has the potential to allow for the elucidation of unforeseen, biologically significant associations between gene function and

  11. Comparative modular analysis of gene expression in vertebrate organs

    Directory of Open Access Journals (Sweden)

    Piasecka Barbara

    2012-03-01

    Full Text Available Abstract Background The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Results Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Conclusions Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.

  12. Comparative modular analysis of gene expression in vertebrate organs.

    Science.gov (United States)

    Piasecka, Barbara; Kutalik, Zoltán; Roux, Julien; Bergmann, Sven; Robinson-Rechavi, Marc

    2012-03-29

    The degree of conservation of gene expression between homologous organs largely remains an open question. Several recent studies reported some evidence in favor of such conservation. Most studies compute organs' similarity across all orthologous genes, whereas the expression level of many genes are not informative about organ specificity. Here, we use a modularization algorithm to overcome this limitation through the identification of inter-species co-modules of organs and genes. We identify such co-modules using mouse and human microarray expression data. They are functionally coherent both in terms of genes and of organs from both organisms. We show that a large proportion of genes belonging to the same co-module are orthologous between mouse and human. Moreover, their zebrafish orthologs also tend to be expressed in the corresponding homologous organs. Notable exceptions to the general pattern of conservation are the testis and the olfactory bulb. Interestingly, some co-modules consist of single organs, while others combine several functionally related organs. For instance, amygdala, cerebral cortex, hypothalamus and spinal cord form a clearly discernible unit of expression, both in mouse and human. Our study provides a new framework for comparative analysis which will be applicable also to other sets of large-scale phenotypic data collected across different species.

  13. Polyploidization altered gene functions in cotton (Gossypium spp.).

    Science.gov (United States)

    Xu, Zhanyou; Yu, John Z; Cho, Jaemin; Yu, Jing; Kohel, Russell J; Percy, Richard G

    2010-12-16

    Cotton (Gossypium spp.) is an important crop plant that is widely grown to produce both natural textile fibers and cottonseed oil. Cotton fibers, the economically more important product of the cotton plant, are seed trichomes derived from individual cells of the epidermal layer of the seed coat. It has been known for a long time that large numbers of genes determine the development of cotton fiber, and more recently it has been determined that these genes are distributed across At and Dt subgenomes of tetraploid AD cottons. In the present study, the organization and evolution of the fiber development genes were investigated through the construction of an integrated genetic and physical map of fiber development genes whose functions have been verified and confirmed. A total of 535 cotton fiber development genes, including 103 fiber transcription factors, 259 fiber development genes, and 173 SSR-contained fiber ESTs, were analyzed at the subgenome level. A total of 499 fiber related contigs were selected and assembled. Together these contigs covered about 151 Mb in physical length, or about 6.7% of the tetraploid cotton genome. Among the 499 contigs, 397 were anchored onto individual chromosomes. Results from our studies on the distribution patterns of the fiber development genes and transcription factors between the At and Dt subgenomes showed that more transcription factors were from Dt subgenome than At, whereas more fiber development genes were from At subgenome than Dt. Combining our mapping results with previous reports that more fiber QTLs were mapped in Dt subgenome than At subgenome, the results suggested a new functional hypothesis for tetraploid cotton. After the merging of the two diploid Gossypium genomes, the At subgenome has provided most of the genes for fiber development, because it continues to function similar to its fiber producing diploid A genome ancestor. On the other hand, the Dt subgenome, with its non-fiber producing D genome ancestor

  14. On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report.

    Directory of Open Access Journals (Sweden)

    Paul D Thomas

    Full Text Available A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011 has proposed a metric for the "functional similarity" between two genes that uses only the Gene Ontology (GO annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the "ortholog conjecture" (or, more properly, the "ortholog functional conservation hypothesis". First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1 that GO annotations are often incomplete, potentially in a biased manner, and subject to an "open world assumption" (absence of an annotation does not imply absence of a function, and 2 that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the

  15. FUNCTIONAL SPECIALIZATION OF DUPLICATED FLAVONOID BIOSYNTHESIS GENES IN WHEAT

    Directory of Open Access Journals (Sweden)

    Khlestkina E.

    2012-08-01

    Full Text Available Gene duplication followed by subfunctionalization and neofunctionalization is of a great evolutionary importance. In plant genomes, duplicated genes may result from either polyploidization (homoeologous genes or segmental chromosome duplications (paralogous genes. In allohexaploid wheat Triticum aestivum L. (2n=6x=42, genome BBAADD, both homoeologous and paralogous copies were found for the regulatory gene Myc encoding MYC-like transcriptional factor in the biosynthesis of flavonoid pigments, anthocyanins, and for the structural gene F3h encoding one of the key enzymes of flavonoid biosynthesis, flavanone 3-hydroxylase. From the 5 copies (3 homoeologous and 2 paralogous of the Myc gene found in T. aestivum, only one plays a regulatory role in anthocyanin biosynthesis, interacting complementary with another transcriptional factor (MYB-like to confer purple pigmentation of grain pericarp in wheat. The role and functionality of the other 4 copies of the Myc gene remain unknown. From the 4 functional copies of the F3h gene in T. aestivum, three homoeologues have similar function. They are expressed in wheat organs colored with anthocyanins or in the endosperm, participating there in biosynthesis of uncolored flavonoid substances. The fourth copy (the B-genomic paralogue is transcribed neither in wheat organs colored with anthocyanins nor in seeds, however, it’s expression has been noticed in roots of aluminium-stressed plants, where the three homoeologous copies are not active. Functional diversification of the duplicated flavonoid biosynthesis genes in wheat may be a reason for maintenance of the duplicated copies and preventing them from pseudogenization.The study was supported by RFBR (11-04-92707. We also thank Ms. Galina Generalova for technical assistance.

  16. Comparative genomic analysis of the WRKY III gene family in populus, grape, arabidopsis and rice.

    Science.gov (United States)

    Wang, Yiyi; Feng, Lin; Zhu, Yuxin; Li, Yuan; Yan, Hanwei; Xiang, Yan

    2015-09-08

    WRKY III genes have significant functions in regulating plant development and resistance. In plant, WRKY gene family has been studied in many species, however, there still lack a comprehensive analysis of WRKY III genes in the woody plant species poplar, three representative lineages of flowering plant species are incorporated in most analyses: Arabidopsis (a model plant for annual herbaceous dicots), grape (one model plant for perennial dicots) and Oryza sativa (a model plant for monocots). In this study, we identified 10, 6, 13 and 28 WRKY III genes in the genomes of Populus trichocarpa, grape (Vitis vinifera), Arabidopsis thaliana and rice (Oryza sativa), respectively. Phylogenetic analysis revealed that the WRKY III proteins could be divided into four clades. By microsynteny analysis, we found that the duplicated regions were more conserved between poplar and grape than Arabidopsis or rice. We dated their duplications by Ks analysis of Populus WRKY III genes and demonstrated that all the blocks were formed after the divergence of monocots and dicots. Strong purifying selection has played a key role in the maintenance of WRKY III genes in Populus. Tissue expression analysis of the WRKY III genes in Populus revealed that five were most highly expressed in the xylem. We also performed quantitative real-time reverse transcription PCR analysis of WRKY III genes in Populus treated with salicylic acid, abscisic acid and polyethylene glycol to explore their stress-related expression patterns. This study highlighted the duplication and diversification of the WRKY III gene family in Populus and provided a comprehensive analysis of this gene family in the Populus genome. Our results indicated that the majority of WRKY III genes of Populus was expanded by large-scale gene duplication. The expression pattern of PtrWRKYIII gene identified that these genes play important roles in the xylem during poplar growth and development, and may play crucial role in defense to drought

  17. Defining functional distances over Gene Ontology

    Directory of Open Access Journals (Sweden)

    del Pozo Angela

    2008-01-01

    Full Text Available Abstract Background A fundamental problem when trying to define the functional relationships between proteins is the difficulty in quantifying functional similarities, even when well-structured ontologies exist regarding the activity of proteins (i.e. 'gene ontology' -GO-. However, functional metrics can overcome the problems in the comparing and evaluating functional assignments and predictions. As a reference of proximity, previous approaches to compare GO terms considered linkage in terms of ontology weighted by a probability distribution that balances the non-uniform 'richness' of different parts of the Direct Acyclic Graph. Here, we have followed a different approach to quantify functional similarities between GO terms. Results We propose a new method to derive 'functional distances' between GO terms that is based on the simultaneous occurrence of terms in the same set of Interpro entries, instead of relying on the structure of the GO. The coincidence of GO terms reveals natural biological links between the GO functions and defines a distance model Df which fulfils the properties of a Metric Space. The distances obtained in this way can be represented as a hierarchical 'Functional Tree'. Conclusion The method proposed provides a new definition of distance that enables the similarity between GO terms to be quantified. Additionally, the 'Functional Tree' defines groups with biological meaning enhancing its utility for protein function comparison and prediction. Finally, this approach could be for function-based protein searches in databases, and for analysing the gene clusters produced by DNA array experiments.

  18. Characterization and Functional Analysis of Five MADS-Box B Class Genes Related to Floral Organ Identification in Tagetes erecta.

    Directory of Open Access Journals (Sweden)

    Ye Ai

    Full Text Available According to the floral organ development ABC model, B class genes specify petal and stamen identification. In order to study the function of B class genes in flower development of Tagetes erecta, five MADS-box B class genes were identified and their expression and putative functions were studied. Sequence comparisons and phylogenetic analyses indicated that there were one PI-like gene-TePI, two euAP3-like genes-TeAP3-1 and TeAP3-2, and two TM6-like genes-TeTM6-1 and TeTM6-2 in T. erecta. Strong expression levels of these genes were detected in stamens of the disk florets, but little or no expression was detected in bracts, receptacles or vegetative organs. Yeast hybrid experiments of the B class proteins showed that TePI protein could form a homodimer and heterodimers with all the other four B class proteins TeAP3-1, TeAP3-2, TeTM6-1 and TeTM6-2. No homodimer or interaction was observed between the euAP3 and TM6 clade members. Over-expression of five B class genes of T. erecta in Nicotiana rotundifolia showed that only the transgenic plants of 35S::TePI showed altered floral morphology compared with the non-transgenic line. This study could contribute to the understanding of the function of B class genes in flower development of T. erecta, and provide a theoretical basis for further research to change floral organ structures and create new materials for plant breeding.

  19. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool.

    Science.gov (United States)

    Chen, Edward Y; Tan, Christopher M; Kou, Yan; Duan, Qiaonan; Wang, Zichen; Meirelles, Gabriela Vaz; Clark, Neil R; Ma'ayan, Avi

    2013-04-15

    System-wide profiling of genes and proteins in mammalian cells produce lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set libraries databases have been developed, there is still room for improvement. Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of up regulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interlukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Enrichr is an easy to use intuitive enrichment analysis web-based tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.

  20. Tissue-specific functional networks for prioritizing phenotype and disease genes.

    Directory of Open Access Journals (Sweden)

    Yuanfang Guan

    Full Text Available Integrated analyses of functional genomics data have enormous potential for identifying phenotype-associated genes. Tissue-specificity is an important aspect of many genetic diseases, reflecting the potentially different roles of proteins and pathways in diverse cell lineages. Accounting for tissue specificity in global integration of functional genomics data is challenging, as "functionality" and "functional relationships" are often not resolved for specific tissue types. We address this challenge by generating tissue-specific functional networks, which can effectively represent the diversity of protein function for more accurate identification of phenotype-associated genes in the laboratory mouse. Specifically, we created 107 tissue-specific functional relationship networks through integration of genomic data utilizing knowledge of tissue-specific gene expression patterns. Cross-network comparison revealed significantly changed genes enriched for functions related to specific tissue development. We then utilized these tissue-specific networks to predict genes associated with different phenotypes. Our results demonstrate that prediction performance is significantly improved through using the tissue-specific networks as compared to the global functional network. We used a testis-specific functional relationship network to predict genes associated with male fertility and spermatogenesis phenotypes, and experimentally confirmed one top prediction, Mbyl1. We then focused on a less-common genetic disease, ataxia, and identified candidates uniquely predicted by the cerebellum network, which are supported by both literature and experimental evidence. Our systems-level, tissue-specific scheme advances over traditional global integration and analyses and establishes a prototype to address the tissue-specific effects of genetic perturbations, diseases and drugs.

  1. Utility and Limitations of Using Gene Expression Data to Identify Functional Associations.

    Directory of Open Access Journals (Sweden)

    Sahra Uygun

    2016-12-01

    Full Text Available Gene co-expression has been widely used to hypothesize gene function through guilt-by association. However, it is not clear to what degree co-expression is informative, whether it can be applied to genes involved in different biological processes, and how the type of dataset impacts inferences about gene functions. Here our goal is to assess the utility and limitations of using co-expression as a criterion to recover functional associations between genes. By determining the percentage of gene pairs in a metabolic pathway with significant expression correlation, we found that many genes in the same pathway do not have similar transcript profiles and the choice of dataset, annotation quality, gene function, expression similarity measure, and clustering approach significantly impacts the ability to recover functional associations between genes using Arabidopsis thaliana as an example. Some datasets are more informative in capturing coordinated expression profiles and larger data sets are not always better. In addition, to recover the maximum number of known pathways and identify candidate genes with similar functions, it is important to explore rather exhaustively multiple dataset combinations, similarity measures, clustering algorithms and parameters. Finally, we validated the biological relevance of co-expression cluster memberships with an independent phenomics dataset and found that genes that consistently cluster with leucine degradation genes tend to have similar leucine levels in mutants. This study provides a framework for obtaining gene functional associations by maximizing the information that can be obtained from gene expression datasets.

  2. Transcriptome analysis reveals key differentially expressed genes involved in wheat grain development

    Directory of Open Access Journals (Sweden)

    Yonglong Yu

    2016-04-01

    Full Text Available Wheat seed development is an important physiological process of seed maturation and directly affects wheat yield and quality. In this study, we performed dynamic transcriptome microarray analysis of an elite Chinese bread wheat cultivar (Jimai 20 during grain development using the GeneChip Wheat Genome Array. Grain morphology and scanning electron microscope observations showed that the period of 11–15 days post-anthesis (DPA was a key stage for the synthesis and accumulation of seed starch. Genome-wide transcriptional profiling and significance analysis of microarrays revealed that the period from 11 to 15 DPA was more important than the 15–20 DPA stage for the synthesis and accumulation of nutritive reserves. Series test of cluster analysis of differential genes revealed five statistically significant gene expression profiles. Gene ontology annotation and enrichment analysis gave further information about differentially expressed genes, and MapMan analysis revealed expression changes within functional groups during seed development. Metabolic pathway network analysis showed that major and minor metabolic pathways regulate one another to ensure regular seed development and nutritive reserve accumulation. We performed gene co-expression network analysis to identify genes that play vital roles in seed development and identified several key genes involved in important metabolic pathways. The transcriptional expression of eight key genes involved in starch and protein synthesis and stress defense was further validated by qRT-PCR. Our results provide new insight into the molecular mechanisms of wheat seed development and the determinants of yield and quality.

  3. Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis

    Science.gov (United States)

    Zhang, Zhang; Liu, Jingxing; Wu, Jiayan; Yu, Jun

    2013-01-01

    The regulation of gene expression is essential for eukaryotes, as it drives the processes of cellular differentiation and morphogenesis, leading to the creation of different cell types in multicellular organisms. RNA-Sequencing (RNA-Seq) provides researchers with a powerful toolbox for characterization and quantification of transcriptome. Many different human tissue/cell transcriptome datasets coming from RNA-Seq technology are available on public data resource. The fundamental issue here is how to develop an effective analysis method to estimate expression pattern similarities between different tumor tissues and their corresponding normal tissues. We define the gene expression pattern from three directions: 1) expression breadth, which reflects gene expression on/off status, and mainly concerns ubiquitously expressed genes; 2) low/high or constant/variable expression genes, based on gene expression level and variation; and 3) the regulation of gene expression at the gene structure level. The cluster analysis indicates that gene expression pattern is higher related to physiological condition rather than tissue spatial distance. Two sets of human housekeeping (HK) genes are defined according to cell/tissue types, respectively. To characterize the gene expression pattern in gene expression level and variation, we firstly apply improved K-means algorithm and a gene expression variance model. We find that cancer-associated HK genes (a HK gene is specific in cancer group, while not in normal group) are expressed higher and more variable in cancer condition than in normal condition. Cancer-associated HK genes prefer to AT-rich genes, and they are enriched in cell cycle regulation related functions and constitute some cancer signatures. The expression of large genes is also avoided in cancer group. These studies will help us understand which cell type-specific patterns of gene expression differ among different cell types, and particularly for cancer. PMID:23382867

  4. Functional analysis of jasmonate-responsive transcription factors in Arabidopsis thaliana

    NARCIS (Netherlands)

    Zarei, Adel

    2007-01-01

    The aim of the studies described in this thesis was the functional analysis of JA-responsive transcription factors in Arabidopsis with an emphasis on the interaction with the promoters of their target genes. In short, the following new results were obtained. The promoter of the PDF1.2 gene contains

  5. Functional evolution of cis-regulatory modules at a homeotic gene in Drosophila.

    Directory of Open Access Journals (Sweden)

    Margaret C W Ho

    2009-11-01

    Full Text Available It is a long-held belief in evolutionary biology that the rate of molecular evolution for a given DNA sequence is inversely related to the level of functional constraint. This belief holds true for the protein-coding homeotic (Hox genes originally discovered in Drosophila melanogaster. Expression of the Hox genes in Drosophila embryos is essential for body patterning and is controlled by an extensive array of cis-regulatory modules (CRMs. How the regulatory modules functionally evolve in different species is not clear. A comparison of the CRMs for the Abdominal-B gene from different Drosophila species reveals relatively low levels of overall sequence conservation. However, embryonic enhancer CRMs from other Drosophila species direct transgenic reporter gene expression in the same spatial and temporal patterns during development as their D. melanogaster orthologs. Bioinformatic analysis reveals the presence of short conserved sequences within defined CRMs, representing gap and pair-rule transcription factor binding sites. One predicted binding site for the gap transcription factor KRUPPEL in the IAB5 CRM was found to be altered in Superabdominal (Sab mutations. In Sab mutant flies, the third abdominal segment is transformed into a copy of the fifth abdominal segment. A model for KRUPPEL-mediated repression at this binding site is presented. These findings challenge our current understanding of the relationship between sequence evolution at the molecular level and functional activity of a CRM. While the overall sequence conservation at Drosophila CRMs is not distinctive from neighboring genomic regions, functionally critical transcription factor binding sites within embryonic enhancer CRMs are highly conserved. These results have implications for understanding mechanisms of gene expression during embryonic development, enhancer function, and the molecular evolution of eukaryotic regulatory modules.

  6. Genome-wide identification, functional analysis and expression ...

    African Journals Online (AJOL)

    Fuyou Fu

    2013-07-24

    Jul 24, 2013 ... Key words: ABC transporter, potato, pleiotropic drug resistance (PDR), RNA-seq. INTRODUCTION ..... of relative transcript accumulation of each of 55 PDR genes as determined by RNA-seq analysis are presented as a heatmap, with ... specificities provide clues to the endogenous function of the individual ...

  7. A functional U-statistic method for association analysis of sequencing data.

    Science.gov (United States)

    Jadhav, Sneha; Tong, Xiaoran; Lu, Qing

    2017-11-01

    Although sequencing studies hold great promise for uncovering novel variants predisposing to human diseases, the high dimensionality of the sequencing data brings tremendous challenges to data analysis. Moreover, for many complex diseases (e.g., psychiatric disorders) multiple related phenotypes are collected. These phenotypes can be different measurements of an underlying disease, or measurements characterizing multiple related diseases for studying common genetic mechanism. Although jointly analyzing these phenotypes could potentially increase the power of identifying disease-associated genes, the different types of phenotypes pose challenges for association analysis. To address these challenges, we propose a nonparametric method, functional U-statistic method (FU), for multivariate analysis of sequencing data. It first constructs smooth functions from individuals' sequencing data, and then tests the association of these functions with multiple phenotypes by using a U-statistic. The method provides a general framework for analyzing various types of phenotypes (e.g., binary and continuous phenotypes) with unknown distributions. Fitting the genetic variants within a gene using a smoothing function also allows us to capture complexities of gene structure (e.g., linkage disequilibrium, LD), which could potentially increase the power of association analysis. Through simulations, we compared our method to the multivariate outcome score test (MOST), and found that our test attained better performance than MOST. In a real data application, we apply our method to the sequencing data from Minnesota Twin Study (MTS) and found potential associations of several nicotine receptor subunit (CHRN) genes, including CHRNB3, associated with nicotine dependence and/or alcohol dependence. © 2017 WILEY PERIODICALS, INC.

  8. Transcriptional Regulatory Network Analysis of MYB Transcription Factor Family Genes in Rice

    Directory of Open Access Journals (Sweden)

    Shuchi eSmita

    2015-12-01

    Full Text Available MYB transcription factor (TF is one of the largest TF families and regulates defense responses to various stresses, hormone signaling as well as many metabolic and developmental processes in plants. Understanding these regulatory hierarchies of gene expression networks in response to developmental and environmental cues is a major challenge due to the complex interactions between the genetic elements. Correlation analyses are useful to unravel co-regulated gene pairs governing biological process as well as identification of new candidate hub genes in response to these complex processes. High throughput expression profiling data are highly useful for construction of co-expression networks. In the present study, we utilized transcriptome data for comprehensive regulatory network studies of MYB TFs by top down and guide gene approaches. More than 50% of OsMYBs were strongly correlated under fifty experimental conditions with 51 hub genes via top down approach. Further, clusters were identified using Markov Clustering (MCL. To maximize the clustering performance, parameter evaluation of the MCL inflation score (I was performed in terms of enriched GO categories by measuring F-score. Comparison of co-expressed cluster and clads analyzed from phylogenetic analysis signifies their evolutionarily conserved co-regulatory role. We utilized compendium of known interaction and biological role with Gene Ontology enrichment analysis to hypothesize function of coexpressed OsMYBs. In the other part, the transcriptional regulatory network analysis by guide gene approach revealed 40 putative targets of 26 OsMYB TF hubs with high correlation value utilizing 815 microarray data. The putative targets with MYB-binding cis-elements enrichment in their promoter region, functional co-occurrence as well as nuclear localization supports our finding. Specially, enrichment of MYB binding regions involved in drought-inducibility implying their regulatory role in drought

  9. Gene expression and functional studies of the optic nerve head astrocyte transcriptome from normal African Americans and Caucasian Americans donors.

    Directory of Open Access Journals (Sweden)

    Haixi Miao

    2008-08-01

    Full Text Available To determine whether optic nerve head (ONH astrocytes, a key cellular component of glaucomatous neuropathy, exhibit differential gene expression in primary cultures of astrocytes from normal African American (AA donors compared to astrocytes from normal Caucasian American (CA donors.We used oligonucleotide Affymetrix microarray (HG U133A & HG U133A 2.0 chips to compare gene expression levels in cultured ONH astrocytes from twelve CA and twelve AA normal age matched donor eyes. Chips were normalized with Robust Microarray Analysis (RMA in R using Bioconductor. Significant differential gene expression levels were detected using mixed effects modeling and Statistical Analysis of Microarray (SAM. Functional analysis and Gene Ontology were used to classify differentially expressed genes. Differential gene expression was validated by quantitative real time RT-PCR. Protein levels were detected by Western blots and ELISA. Cell adhesion and migration assays tested physiological responses. Glutathione (GSH assay detected levels of intracellular GSH.Multiple analyses selected 87 genes differentially expressed between normal AA and CA (P<0.01. The most relevant genes expressed in AA were categorized by function, including: signal transduction, response to stress, ECM genes, migration and cell adhesion.These data show that normal astrocytes from AA and CA normal donors display distinct expression profiles that impact astrocyte functions in the ONH. Our data suggests that differences in gene expression in ONH astrocytes may be specific to the development and/or progression of glaucoma in AA.

  10. Towards precise classification of cancers based on robust gene functional expression profiles

    Directory of Open Access Journals (Sweden)

    Zhu Jing

    2005-03-01

    Full Text Available Abstract Background Development of robust and efficient methods for analyzing and interpreting high dimension gene expression profiles continues to be a focus in computational biology. The accumulated experiment evidence supports the assumption that genes express and perform their functions in modular fashions in cells. Therefore, there is an open space for development of the timely and relevant computational algorithms that use robust functional expression profiles towards precise classification of complex human diseases at the modular level. Results Inspired by the insight that genes act as a module to carry out a highly integrated cellular function, we thus define a low dimension functional expression profile for data reduction. After annotating each individual gene to functional categories defined in a proper gene function classification system such as Gene Ontology applied in this study, we identify those functional categories enriched with differentially expressed genes. For each functional category or functional module, we compute a summary measure (s for the raw expression values of the annotated genes to capture the overall activity level of the module. In this way, we can treat the gene expressions within a functional module as an integrative data point to replace the multiple values of individual genes. We compare the classification performance of decision trees based on functional expression profiles with the conventional gene expression profiles using four publicly available datasets, which indicates that precise classification of tumour types and improved interpretation can be achieved with the reduced functional expression profiles. Conclusion This modular approach is demonstrated to be a powerful alternative approach to analyzing high dimension microarray data and is robust to high measurement noise and intrinsic biological variance inherent in microarray data. Furthermore, efficient integration with current biological knowledge

  11. FGWAS: Functional genome wide association analysis.

    Science.gov (United States)

    Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu

    2017-10-01

    Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Genome-wide analysis of the ATP-binding cassette (ABC) transporter gene family in the silkworm, Bombyx mori.

    Science.gov (United States)

    Xie, Xiaodong; Cheng, Tingcai; Wang, Genhong; Duan, Jun; Niu, Weihuan; Xia, Qingyou

    2012-07-01

    The ATP-binding cassette (ABC) superfamily is a larger protein family with diverse physiological functions in all kingdoms of life. We identified 53 ABC transporters in the silkworm genome, and classified them into eight subfamilies (A-H). Comparative genome analysis revealed that the silkworm has an expanded ABCC subfamily with more members than Drosophila melanogaster, Caenorhabditis elegans, or Homo sapiens. Phylogenetic analysis showed that the ABCE and ABCF genes were highly conserved in the silkworm, indicating possible involvement in fundamental biological processes. Five multidrug resistance-related genes in the ABCB subfamily and two multidrug resistance-associated-related genes in the ABCC subfamily indicated involvement in biochemical defense. Genetic variation analysis revealed four ABC genes that might be evolving under positive selection. Moreover, the silkworm ABCC4 gene might be important for silkworm domestication. Microarray analysis showed that the silkworm ABC genes had distinct expression patterns in different tissues on day 3 of the fifth instar. These results might provide new insights for further functional studies on the ABC genes in the silkworm genome.

  13. Human microRNA target analysis and gene ontology clustering by GOmir, a novel stand-alone application.

    Science.gov (United States)

    Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia

    2009-06-16

    microRNAs (miRNAs) are single-stranded RNA molecules of about 20-23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA.

  14. EPConDB: a web resource for gene expression related to pancreatic development, beta-cell function and diabetes.

    Science.gov (United States)

    Mazzarelli, Joan M; Brestelli, John; Gorski, Regina K; Liu, Junmin; Manduchi, Elisabetta; Pinney, Deborah F; Schug, Jonathan; White, Peter; Kaestner, Klaus H; Stoeckert, Christian J

    2007-01-01

    EPConDB (http://www.cbil.upenn.edu/EPConDB) is a public web site that supports research in diabetes, pancreatic development and beta-cell function by providing information about genes expressed in cells of the pancreas. EPConDB displays expression profiles for individual genes and information about transcripts, promoter elements and transcription factor binding sites. Gene expression results are obtained from studies examining tissue expression, pancreatic development and growth, differentiation of insulin-producing cells, islet or beta-cell injury, and genetic models of impaired beta-cell function. The expression datasets are derived using different microarray platforms, including the BCBC PancChips and Affymetrix gene expression arrays. Other datasets include semi-quantitative RT-PCR and MPSS expression studies. For selected microarray studies, lists of differentially expressed genes, derived from PaGE analysis, are displayed on the site. EPConDB provides database queries and tools to examine the relationship between a gene, its transcriptional regulation, protein function and expression in pancreatic tissues.

  15. The ALMT Gene Family Performs Multiple Functions in Plants

    Directory of Open Access Journals (Sweden)

    Jie Liu

    2018-02-01

    Full Text Available The aluminium activated malate transporter (ALMT gene family is named after the first member of the family identified in wheat (Triticum aestivum L.. The product of this gene controls resistance to aluminium (Al toxicity. ALMT genes encode transmembrane proteins that function as anion channels and perform multiple functions involving the transport of organic anions (e.g., carboxylates and inorganic anions in cells. They share a PF11744 domain and are classified in the Fusaric acid resistance protein-like superfamily, CL0307. The proteins typically have five to seven transmembrane regions in the N-terminal half and a long hydrophillic C-terminal tail but predictions of secondary structure vary. Although widely spread in plants, relatively little information is available on the roles performed by other members of this family. In this review, we summarized functions of ALMT gene families, including Al resistance, stomatal function, mineral nutrition, microbe interactions, fruit acidity, light response and seed development.

  16. Community Structure Analysis of Gene Interaction Networks in Duchenne Muscular Dystrophy.

    Directory of Open Access Journals (Sweden)

    Tejaswini Narayanan

    Full Text Available Duchenne Muscular Dystrophy (DMD is an important pathology associated with the human skeletal muscle and has been studied extensively. Gene expression measurements on skeletal muscle of patients afflicted with DMD provides the opportunity to understand the underlying mechanisms that lead to the pathology. Community structure analysis is a useful computational technique for understanding and modeling genetic interaction networks. In this paper, we leverage this technique in combination with gene expression measurements from normal and DMD patient skeletal muscle tissue to study the structure of genetic interactions in the context of DMD. We define a novel framework for transforming a raw dataset of gene expression measurements into an interaction network, and subsequently apply algorithms for community structure analysis for the extraction of topological communities. The emergent communities are analyzed from a biological standpoint in terms of their constituent biological pathways, and an interpretation that draws correlations between functional and structural organization of the genetic interactions is presented. We also compare these communities and associated functions in pathology against those in normal human skeletal muscle. In particular, differential enhancements are observed in the following pathways between pathological and normal cases: Metabolic, Focal adhesion, Regulation of actin cytoskeleton and Cell adhesion, and implication of these mechanisms are supported by prior work. Furthermore, our study also includes a gene-level analysis to identify genes that are involved in the coupling between the pathways of interest. We believe that our results serve to highlight important distinguishing features in the structural/functional organization of constituent biological pathways, as it relates to normal and DMD cases, and provide the mechanistic basis for further biological investigations into specific pathways differently regulated

  17. Detailed analysis of putative genes encoding small proteins in legume genomes

    Directory of Open Access Journals (Sweden)

    Gabriel eGuillén

    2013-06-01

    Full Text Available Diverse plant genome sequencing projects coupled with powerful bioinformatics tools have facilitated massive data analysis to construct specialized databases classified according to cellular function. However, there are still a considerable number of genes encoding proteins whose function has not yet been characterized. Included in this category are small proteins (SPs, 30-150 amino acids encoded by short open reading frames (sORFs. SPs play important roles in plant physiology, growth, and development. Unfortunately, protocols focused on the genome-wide identification and characterization of sORFs are scarce or remain poorly implemented. As a result, these genes are underrepresented in many genome annotations. In this work, we exploited publicly available genome sequences of Phaseolus vulgaris, Medicago truncatula, Glycine max and Lotus japonicus to analyze the abundance of annotated SPs in plant legumes. Our strategy to uncover bona fide sORFs at the genome level was centered in bioinformatics analysis of characteristics such as evidence of expression (transcription, presence of known protein regions or domains, and identification of orthologous genes in the genomes explored. We collected 6170, 10461, 30521, and 23599 putative sORFs from P. vulgaris, G. max, M. truncatula, and L. japonicus genomes, respectively. Expressed sequence tags (ESTs available in the DFCI Gene Index database provided evidence that ~one-third of the predicted legume sORFs are expressed. Most potential SPs have a counterpart in a different plant species and counterpart regions or domains in larger proteins. Potential functional sORFs were also classified according to a reduced set of GO categories, and the expression of 13 of them during P. vulgaris nodule ontogeny was confirmed by qPCR. This analysis provides a collection of sORFs that potentially encode for meaningful SPs, and offers the possibility of their further functional evaluation.

  18. Systematic enrichment analysis of gene expression profiling studies identifies consensus pathways implicated in colorectal cancer development

    Directory of Open Access Journals (Sweden)

    Jesús Lascorz

    2011-01-01

    Full Text Available Background: A large number of gene expression profiling (GEP studies on colorectal carcinogenesis have been performed but no reliable gene signature has been identified so far due to the lack of reproducibility in the reported genes. There is growing evidence that functionally related genes, rather than individual genes, contribute to the etiology of complex traits. We used, as a novel approach, pathway enrichment tools to define functionally related genes that are consistently up- or down-regulated in colorectal carcinogenesis. Materials and Methods: We started the analysis with 242 unique annotated genes that had been reported by any of three recent meta-analyses covering GEP studies on genes differentially expressed in carcinoma vs normal mucosa. Most of these genes (218, 91.9% had been reported in at least three GEP studies. These 242 genes were submitted to bioinformatic analysis using a total of nine tools to detect enrichment of Gene Ontology (GO categories or Kyoto Encyclopedia of Genes and Genomes (KEGG pathways. As a final consistency criterion the pathway categories had to be enriched by several tools to be taken into consideration. Results: Our pathway-based enrichment analysis identified the categories of ribosomal protein constituents, extracellular matrix receptor interaction, carbonic anhydrase isozymes, and a general category related to inflammation and cellular response as significantly and consistently overrepresented entities. Conclusions: We triaged the genes covered by the published GEP literature on colorectal carcinogenesis and subjected them to multiple enrichment tools in order to identify the consistently enriched gene categories. These turned out to have known functional relationships to cancer development and thus deserve further investigation.

  19. Structure and function of the human metallothionein gene family: Final technical report

    International Nuclear Information System (INIS)

    Karin, M.

    1986-01-01

    The full nucleotide sequence of two additional human metallothionein (hMT) genes has been determined. These genes, hMT-I/sub B/ and hMT-I/sub F/, are located within the MT-I gene cluster we have described originally. The hMT-I/sub F/ gene is the first hMT-I gene whose amino acid sequence is in complete agreement with the published sequence of the human MT-I proteins. Therefore it is likely to be an active gene encoding a functional protein. However, since we have just completed the sequence analysis, we have not characterized this gene further yet. The hMT-I/sub B/ gene is closely linked to the hMT-I/sub A/ gene, and two pseudogenes, hMT-I/sub C/ and hMT-I/sub D/ separate the two. From its nucleotide sequence hMT-I/sub B/ seems to be an active gene, encoding a functional protein even though it differs in four positions from the published sequence of human MT-I proteins. This gene is expressed in a human hepatoma cell line, HepG2, and its expression is stimulated by Cd ++ . Using gene fusions to the viral thymidine-kinase gene we find that hMT-I/sub B/, like the hMT-I/sub A/ and hMT-II/sub A/ genes, contains a heavy metal responsive promoterregulatory element within its 5' flanking region. We analyzed the level of hMT-I/sub B/ mRNA in a variety of human cell lines by the S1 nuclease technique, and compared it to the expression of the hMT-II/sub A/ gene. While the hMT-II/sub A/ gene was expressed in all of the cell lines analyzed, the hMT-I/sub B/ gene was expressed in liver and kidney derived cell lines cells. This suggest that the expression of the hMT-I/sub B/ gene is controlled in a tissue specific manner. 13 refs

  20. The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

    Science.gov (United States)

    Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

    2013-10-01

    The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.

  1. Identification of novel risk genes associated with type 1 diabetes mellitus using a genome-wide gene-based association analysis.

    Science.gov (United States)

    Qiu, Ying-Hua; Deng, Fei-Yan; Li, Min-Jing; Lei, Shu-Feng

    2014-11-01

    Type 1 diabetes mellitus is a serious disorder characterized by destruction of pancreatic β-cells, culminating in absolute insulin deficiency. Genetic factors contribute to the susceptibility of type 1 diabetes mellitus. The aim of the present study was to identify more susceptibility genes of type 1 diabetes mellitus. We carried out an initial gene-based genome-wide association study in a total of 4,075 type 1 diabetes mellitus cases and 2,604 controls by using the Gene-based Association Test using Extended Simes procedure. Furthermore, we carried out replication studies, differential expression analysis and functional annotation clustering analysis to support the significance of the identified susceptibility genes. We identified 452 genes associated with type 1 diabetes mellitus, even after adapting the genome-wide threshold for significance (P diabetes mellitus, which were ignored in single-nucleotide polymorphism-based association analysis and were not previously reported. We found that 53 genes have supportive evidence from replication studies and/or differential expression studies. In particular, seven genes including four non-human leukocyte antigen (HLA) genes (RASIP1, STRN4, BCAR1 and MYL2) are replicated in at least one independent population and also differentially expressed in peripheral blood mononuclear cells or monocytes. Furthermore, the associated genes tend to enrich in immune-related pathways or Gene Ontology project terms. The present results suggest the high power of gene-based association analysis in detecting disease-susceptibility genes. Our findings provide more insights into the genetic basis of type 1 diabetes mellitus.

  2. Identification of conserved drought-adaptive genes using a cross-species meta-analysis approach.

    Science.gov (United States)

    Shaar-Moshe, Lidor; Hübner, Sariel; Peleg, Zvi

    2015-05-03

    Drought is the major environmental stress threatening crop-plant productivity worldwide. Identification of new genes and metabolic pathways involved in plant adaptation to progressive drought stress at the reproductive stage is of great interest for agricultural research. We developed a novel Cross-Species meta-Analysis of progressive Drought stress at the reproductive stage (CSA:Drought) to identify key drought adaptive genes and mechanisms and to test their evolutionary conservation. Empirically defined filtering criteria were used to facilitate a robust integration of 17 deposited microarray experiments (148 arrays) of Arabidopsis, rice, wheat and barley. By prioritizing consistency over intensity, our approach was able to identify 225 differentially expressed genes shared across studies and taxa. Gene ontology enrichment and pathway analyses classified the shared genes into functional categories involved predominantly in metabolic processes (e.g. amino acid and carbohydrate metabolism), regulatory function (e.g. protein degradation and transcription) and response to stimulus. We further investigated drought related cis-acting elements in the shared gene promoters, and the evolutionary conservation of shared genes. The universal nature of the identified drought-adaptive genes was further validated in a fifth species, Brachypodium distachyon that was not included in the meta-analysis. qPCR analysis of 27, randomly selected, shared orthologs showed similar expression pattern as was found by the CSA:Drought.In accordance, morpho-physiological characterization of progressive drought stress, in B. distachyon, highlighted the key role of osmotic adjustment as evolutionary conserved drought-adaptive mechanism. Our CSA:Drought strategy highlights major drought-adaptive genes and metabolic pathways that were only partially, if at all, reported in the original studies included in the meta-analysis. These genes include a group of unclassified genes that could be involved

  3. Construction and Analysis of Functional Networks in the Gut Microbiome of Type 2 Diabetes Patients.

    Science.gov (United States)

    Li, Lianshuo; Wang, Zicheng; He, Peng; Ma, Shining; Du, Jie; Jiang, Rui

    2016-10-01

    Although networks of microbial species have been widely used in the analysis of 16S rRNA sequencing data of a microbiome, the construction and analysis of a complete microbial gene network are in general problematic because of the large number of microbial genes in metagenomics studies. To overcome this limitation, we propose to map microbial genes to functional units, including KEGG orthologous groups and the evolutionary genealogy of genes: Non-supervised Orthologous Groups (eggNOG) orthologous groups, to enable the construction and analysis of a microbial functional network. We devised two statistical methods to infer pairwise relationships between microbial functional units based on a deep sequencing dataset of gut microbiome from type 2 diabetes (T2D) patients as well as healthy controls. Networks containing such functional units and their significant interactions were constructed subsequently. We conducted a variety of analyses of global properties, local properties, and functional modules in the resulting functional networks. Our data indicate that besides the observations consistent with the current knowledge, this study provides novel biological insights into the gut microbiome associated with T2D. Copyright © 2016. Production and hosting by Elsevier Ltd.

  4. Identification and function analysis of canine stimulator of interferon gene (STING).

    Science.gov (United States)

    Zhang, Yuxiang; Zhu, Mengyan; Li, Gairu; Liu, Jie; Zhai, Xiaofeng; Wang, Ruyi; Zhang, Junyan; Xing, Gang; Gu, Jinyan; Yan, Liping; Lei, Jing; Sun, Haifeng; Shi, Zhiyu; Liu, Fei; Hu, Boli; Su, Shuo; Zhou, Jiyong

    2017-12-01

    Stimulator of interferon gene (STING) plays an important role in the cyclic GMP-AMP synthase (cGAS)-mediated activation of type I IFN responses. In this study, we identified and cloned canine STING gene. Full-length STING encodes a 375 amino acid product that shares the highest similarity with feline STING. Highest levels of mRNA of canine STING were detected in the spleen and lungs while the lowest levels in the heart and muscle. Analysis of its cellular localization showed that STING is localizes to the endoplasmic reticulum. STING overexpression induced the IFN response via the IRF3 and NF-κB pathways and up-regulated the expression of ISG15 and viperin. However, knockdown of STING did not inhibit the IFN-β response triggered by poly(dA:dT), poly(I:C), or SeV. Finally, overexpression of STING significantly inhibited the replication of canine influenza virus H3N2. Collectively, our findings indicate that STING is involved in the regulation of the IFN-β pathway in canine. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Functional gene diversity of soil microbial communities from five oil-contaminated fields in China.

    Science.gov (United States)

    Liang, Yuting; Van Nostrand, Joy D; Deng, Ye; He, Zhili; Wu, Liyou; Zhang, Xu; Li, Guanghe; Zhou, Jizhong

    2011-03-01

    To compare microbial functional diversity in different oil-contaminated fields and to know the effects of oil contaminant and environmental factors, soil samples were taken from typical oil-contaminated fields located in five geographic regions of China. GeoChip, a high-throughput functional gene array, was used to evaluate the microbial functional genes involved in contaminant degradation and in other major biogeochemical/metabolic processes. Our results indicated that the overall microbial community structures were distinct in each oil-contaminated field, and samples were clustered by geographic locations. The organic contaminant degradation genes were most abundant in all samples and presented a similar pattern under oil contaminant stress among the five fields. In addition, alkane and aromatic hydrocarbon degradation genes such as monooxygenase and dioxygenase were detected in high abundance in the oil-contaminated fields. Canonical correspondence analysis indicated that the microbial functional patterns were highly correlated to the local environmental variables, such as oil contaminant concentration, nitrogen and phosphorus contents, salt and pH. Finally, a total of 59% of microbial community variation from GeoChip data can be explained by oil contamination, geographic location and soil geochemical parameters. This study provided insights into the in situ microbial functional structures in oil-contaminated fields and discerned the linkages between microbial communities and environmental variables, which is important to the application of bioremediation in oil-contaminated sites.

  6. Genome-wide identification, characterization and phylogenetic analysis of 50 catfish ATP-binding cassette (ABC) transporter genes.

    Science.gov (United States)

    Liu, Shikai; Li, Qi; Liu, Zhanjiang

    2013-01-01

    Although a large set of full-length transcripts was recently assembled in catfish, annotation of large gene families, especially those with duplications, is still a great challenge. Most often, complexities in annotation cause mis-identification and thereby much confusion in the scientific literature. As such, detailed phylogenetic analysis and/or orthology analysis are required for annotation of genes involved in gene families. The ATP-binding cassette (ABC) transporter gene superfamily is a large gene family that encodes membrane proteins that transport a diverse set of substrates across membranes, playing important roles in protecting organisms from diverse environment. In this work, we identified a set of 50 ABC transporters in catfish genome. Phylogenetic analysis allowed their identification and annotation into seven subfamilies, including 9 ABCA genes, 12 ABCB genes, 12 ABCC genes, 5 ABCD genes, 2 ABCE genes, 4 ABCF genes and 6 ABCG genes. Most ABC transporters are conserved among vertebrates, though cases of recent gene duplications and gene losses do exist. Gene duplications in catfish were found for ABCA1, ABCB3, ABCB6, ABCC5, ABCD3, ABCE1, ABCF2 and ABCG2. The whole set of catfish ABC transporters provide the essential genomic resources for future biochemical, toxicological and physiological studies of ABC drug efflux transporters. The establishment of orthologies should allow functional inferences with the information from model species, though the function of lineage-specific genes can be distinct because of specific living environment with different selection pressure.

  7. Transcriptomic meta-analysis identifies gene expression characteristics in various samples of HIV-infected patients with nonprogressive disease.

    Science.gov (United States)

    Zhang, Le-Le; Zhang, Zi-Ning; Wu, Xian; Jiang, Yong-Jun; Fu, Ya-Jing; Shang, Hong

    2017-09-12

    A small proportion of HIV-infected patients remain clinically and/or immunologically stable for years, including elite controllers (ECs) who have undetectable viremia (10 years). However, the mechanism of nonprogression needs to be further resolved. In this study, a transcriptome meta-analysis was performed on nonprogressor and progressor microarray data to identify differential transcriptome pathways and potential biomarkers. Using the INMEX (integrative meta-analysis of expression data) program, we performed the meta-analysis to identify consistently differentially expressed genes (DEGs) in nonprogressors and further performed functional interpretation (gene ontology analysis and pathway analysis) of the DEGs identified in the meta-analysis. Five microarray datasets (81 cases and 98 controls in total), including whole blood, CD4 + and CD8 + T cells, were collected for meta-analysis. We determined that nonprogressors have reduced expression of important interferon-stimulated genes (ISGs), CD38, lymphocyte activation gene 3 (LAG-3) in whole blood, CD4 + and CD8 + T cells. Gene ontology (GO) analysis showed a significant enrichment in DEGs that function in the type I interferon signaling pathway. Upregulated pathways, including the PI3K-Akt signaling pathway in whole blood, cytokine-cytokine receptor interaction in CD4 + T cells and the MAPK signaling pathway in CD8 + T cells, were identified in nonprogressors compared with progressors. In each metabolic functional category, the number of downregulated DEGs was more than the upregulated DEGs, and almost all genes were downregulated DEGs in the oxidative phosphorylation (OXPHOS) and tricarboxylic acid (TCA) cycle in the three types of samples. Our transcriptomic meta-analysis provides a comprehensive evaluation of the gene expression profiles in major blood types of nonprogressors, providing new insights in the understanding of HIV pathogenesis and developing strategies to delay HIV disease progression.

  8. Evolution and functional analysis of the Pif97 gene of the Pacific oyster Crassostrea gigas

    Directory of Open Access Journals (Sweden)

    Xiaotong WANG, Xiaorui SONG, Tong WANG, Qihui ZHU, Guoying MIAO, Yuanxin CHEN, Xiaodong FANG, Huayong QUE, Li LI, Guofan ZHANG

    2013-02-01

    Full Text Available Mollusc shell matrix proteins (SMPs are important functional components embedded in the shell and play a role in shell formation. A SMP (Pif177 was identified previously from the nacreous layer of the Japanese pearl oyster Pinctada fucata, and its cleavage products (named pfPif97 and pfPif80 proteins were found to bind to the chitin framework and induce aragonite crystal formation and orient the c axis. In this study, a homologue of pfPif177 was cloned from the mantle of the Pacific oyster Crassostrea gigas, containing the homologue of pfPif97 only and not pfPif80. This finding hints at the large divergence in gene structure between the two species. This homologue (cgPif97 shares characteristics with pfPif97, and suggests that the biological functions of these two proteins may be similar. The expression pattern of cgPif97 in different tissues and development stages indicates that it may play an important role in shell formation of the adult oyster. The morphology of the inner shell surface was affected by injected siRNA of cgPif97 and the calcite laths of the shell became thinner and narrower when the siRNA dose increased, suggesting that the cgPif97 gene plays an important role in calcite shell formation in C. gigas. In conclusion, we found evidence that the Pif177 gene evolved very fast but still retains a similar function among species [Current Zoology 59 (1: 109–115, 2013].

  9. Genome-wide analysis of E. coli cell-gene interactions.

    Science.gov (United States)

    Cardinale, S; Cambray, G

    2017-11-23

    The pursuit of standardization and reliability in synthetic biology has achieved, in recent years, a number of advances in the design of more predictable genetic parts for biological circuits. However, even with the development of high-throughput screening methods and whole-cell models, it is still not possible to predict reliably how a synthetic genetic construct interacts with all cellular endogenous systems. This study presents a genome-wide analysis of how the expression of synthetic genes is affected by systematic perturbations of cellular functions. We found that most perturbations modulate expression indirectly through an effect on cell size, putting forward the existence of a generic Size-Expression interaction in the model prokaryote Escherichia coli. The Size-Expression interaction was quantified by inserting a dual fluorescent reporter gene construct into each of the 3822 single-gene deletion strains comprised in the KEIO collection. Cellular size was measured for single cells via flow cytometry. Regression analyses were used to discriminate between expression-specific and gene-specific effects. Functions of the deleted genes broadly mapped onto three systems with distinct primary influence on the Size-Expression map. Perturbations in the Division and Biosynthesis (DB) system led to a large-cell and high-expression phenotype. In contrast, disruptions of the Membrane and Motility (MM) system caused small-cell and low-expression phenotypes. The Energy, Protein synthesis and Ribosome (EPR) system was predominantly associated with smaller cells and positive feedback on ribosome function. Feedback between cell growth and gene expression is widespread across cell systems. Even though most gene disruptions proximally affect one component of the Size-Expression interaction, the effect therefore ultimately propagates to both. More specifically, we describe the dual impact of growth on cell size and gene expression through cell division and ribosomal content

  10. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  11. Identification of distinct genes associated with seawater aspiration-induced acute lung injury by gene expression profile analysis

    Science.gov (United States)

    Liu, Wei; Pan, Lei; Zhang, Minlong; Bo, Liyan; Li, Congcong; Liu, Qingqing; Wang, Li; Jin, Faguang

    2016-01-01

    Seawater aspiration-induced acute lung injury (ALI) is a syndrome associated with a high mortality rate, which is characterized by severe hypoxemia, pulmonary edema and inflammation. The present study is the first, to the best of our knowledge, to analyze gene expression profiles from a rat model of seawater aspiration-induced ALI. Adult male Sprague-Dawley rats were instilled with seawater (4 ml/kg) in the seawater aspiration-induced ALI group (S group) or with distilled water (4 ml/kg) in the distilled water negative control group (D group). In the blank control group (C group) the rats' tracheae were exposed without instillation. Subsequently, lung samples were examined by histopathology; total protein concentration was detected in bronchoalveolar lavage fluid (BALF); lung wet/dry weight ratios were determined; and transcript expression was detected by gene sequencing analysis. The results demonstrated that histopathological alterations, pulmonary edema and total protein concentrations in BALF were increased in the S group compared with in the D group. Analysis of differential gene expression identified up and downregulated genes in the S group compared with in the D and C groups. A gene ontology analysis of the differential gene expression revealed enrichment of genes in the functional pathways associated with neutrophil chemotaxis, immune and defense responses, and cytokine activity. Kyoto Encyclopedia of Genes and Genomes analysis revealed that the cytokine-cytokine receptor interaction pathway was one of the most important pathways involved in seawater aspiration-induced ALI. In conclusion, activation of the cytokine-cytokine receptor interaction pathway may have an essential role in the progression of seawater aspiration-induced ALI, and the downregulation of tumor necrosis factor superfamily member 10 may enhance inflammation. Furthermore, IL-6 may be considered a biomarker in seawater aspiration-induced ALI. PMID:27509884

  12. Expanded functional diversity of shaker K(+ channels in cnidarians is driven by gene expansion.

    Directory of Open Access Journals (Sweden)

    Timothy Jegla

    Full Text Available The genome of the cnidarian Nematostella vectensis (starlet sea anemone provides a molecular genetic view into the first nervous systems, which appeared in a late common ancestor of cnidarians and bilaterians. Nematostella has a surprisingly large and diverse set of neuronal signaling genes including paralogs of most neuronal signaling molecules found in higher metazoans. Several ion channel gene families are highly expanded in the sea anemone, including three subfamilies of the Shaker K(+ channel gene family: Shaker (Kv1, Shaw (Kv3 and Shal (Kv4. In order to better understand the physiological significance of these voltage-gated K(+ channel expansions, we analyzed the function of 18 members of the 20 gene Shaker subfamily in Nematostella. Six of the Nematostella Shaker genes express functional homotetrameric K(+ channels in vitro. These include functional orthologs of bilaterian Shakers and channels with an unusually high threshold for voltage activation. We identified 11 Nematostella Shaker genes with a distinct "silent" or "regulatory" phenotype; these encode subunits that function only in heteromeric channels and serve to further diversify Nematostella Shaker channel gating properties. Subunits with the regulatory phenotype have not previously been found in the Shaker subfamily, but have evolved independently in the Shab (Kv2 family in vertebrates and the Shal family in a cnidarian. Phylogenetic analysis indicates that regulatory subunits were present in ancestral cnidarians, but have continued to diversity at a high rate after the split between anthozoans and hydrozoans. Comparison of Shaker family gene complements from diverse metazoan species reveals frequent, large scale duplication has produced highly unique sets of Shaker channels in the major metazoan lineages.

  13. Memory functions reveal structural properties of gene regulatory networks

    Science.gov (United States)

    Perez-Carrasco, Ruben

    2018-01-01

    Gene regulatory networks (GRNs) control cellular function and decision making during tissue development and homeostasis. Mathematical tools based on dynamical systems theory are often used to model these networks, but the size and complexity of these models mean that their behaviour is not always intuitive and the underlying mechanisms can be difficult to decipher. For this reason, methods that simplify and aid exploration of complex networks are necessary. To this end we develop a broadly applicable form of the Zwanzig-Mori projection. By first converting a thermodynamic state ensemble model of gene regulation into mass action reactions we derive a general method that produces a set of time evolution equations for a subset of components of a network. The influence of the rest of the network, the bulk, is captured by memory functions that describe how the subnetwork reacts to its own past state via components in the bulk. These memory functions provide probes of near-steady state dynamics, revealing information not easily accessible otherwise. We illustrate the method on a simple cross-repressive transcriptional motif to show that memory functions not only simplify the analysis of the subnetwork but also have a natural interpretation. We then apply the approach to a GRN from the vertebrate neural tube, a well characterised developmental transcriptional network composed of four interacting transcription factors. The memory functions reveal the function of specific links within the neural tube network and identify features of the regulatory structure that specifically increase the robustness of the network to initial conditions. Taken together, the study provides evidence that Zwanzig-Mori projections offer powerful and effective tools for simplifying and exploring the behaviour of GRNs. PMID:29470492

  14. The Drosophila melanogaster methuselah gene: a novel gene with ancient functions.

    Directory of Open Access Journals (Sweden)

    Ana Rita Araújo

    Full Text Available The Drosophila melanogaster G protein-coupled receptor gene, methuselah (mth, has been described as a novel gene that is less than 10 million years old. Nevertheless, it shows a highly specific expression pattern in embryos, larvae, and adults, and has been implicated in larval development, stress resistance, and in the setting of adult lifespan, among others. Although mth belongs to a gene subfamily with 16 members in D. melanogaster, there is no evidence for functional redundancy in this subfamily. Therefore, it is surprising that a novel gene influences so many traits. Here, we explore the alternative hypothesis that mth is an old gene. Under this hypothesis, in species distantly related to D. melanogaster, there should be a gene with features similar to those of mth. By performing detailed phylogenetic, synteny, protein structure, and gene expression analyses we show that the D. virilis GJ12490 gene is the orthologous of mth in species distantly related to D. melanogaster. We also show that, in D. americana (a species of the virilis group of Drosophila, a common amino acid polymorphism at the GJ12490 orthologous gene is significantly associated with developmental time, size, and lifespan differences. Our results imply that GJ12490 orthologous genes are candidates for developmental time and lifespan differences in Drosophila in general.

  15. Association between MASP-2 gene polymorphism and risk of infection diseases: A meta-analysis.

    Science.gov (United States)

    Fu, Jie; Wang, Jingqiu; Luo, Yanping; Zhang, Lifeng; Zhang, Yuan; Dong, Xinfang; Yu, Hongjuan; Cao, Mingqiang; Ma, Xingming

    2016-11-01

    The role of MASP-2 is vital in the process of complement activation by the lectin pathway. It is generally considered that the functional activation of MASP-2 contribute to the infection disease development process. To analyze the association between MASP-2 functional gene (rs72550870) polymorphism and the infection disease risk by a meta-analysis. Relevant case-control studies were identified by searching Cochrane Library, PubMed, Emabase, DOAJ, CAB Abstracts, CSA, CINAHL, EBSCO, Scopus, Global Health, Index Copernicus, CA, China National Knowledge Infrastructure (CNKI) databases up to 10th January 2016. The data were extracted and the methodological quality of studies were evaluated. The STATA 12.0 software was used to perform statistical analysis. 9 studies were included. There was no significant association between masp-2 gene (p.D120G, rs72550870) polymorphism and the risk of infection disease under the allele model (G vs. A: OR = 0.89, 95%CI = 0.66-1.21)(P = 0.445>0.05) and the recessive model (AG + GG vs.AA: OR = 0.88, 95%CI = 0.65-1.20) (P = 0.428>0.05). This is the first comprehensive meta-analysis indicates that the MASP-2 functional gene (rs72550870) polymorphism is not associated with the infection diseases, and the key functional gene polymorphism of rs72550870 did not increase susceptibility to the infection diseases. Similarly, there were no obvious difference in subgroup analysis based on geographical areas and pathogenic microorganisms. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Analysis of global gene expression in Brachypodium distachyon reveals extensive network plasticity in response to abiotic stress.

    Directory of Open Access Journals (Sweden)

    Henry D Priest

    Full Text Available Brachypodium distachyon is a close relative of many important cereal crops. Abiotic stress tolerance has a significant impact on productivity of agriculturally important food and feedstock crops. Analysis of the transcriptome of Brachypodium after chilling, high-salinity, drought, and heat stresses revealed diverse differential expression of many transcripts. Weighted Gene Co-Expression Network Analysis revealed 22 distinct gene modules with specific profiles of expression under each stress. Promoter analysis implicated short DNA sequences directly upstream of module members in the regulation of 21 of 22 modules. Functional analysis of module members revealed enrichment in functional terms for 10 of 22 network modules. Analysis of condition-specific correlations between differentially expressed gene pairs revealed extensive plasticity in the expression relationships of gene pairs. Photosynthesis, cell cycle, and cell wall expression modules were down-regulated by all abiotic stresses. Modules which were up-regulated by each abiotic stress fell into diverse and unique gene ontology GO categories. This study provides genomics resources and improves our understanding of abiotic stress responses of Brachypodium.

  17. Genome-Wide Identification, Evolutionary Analysis, and Stress Responses of the GRAS Gene Family in Castor Beans

    Directory of Open Access Journals (Sweden)

    Wei Xu

    2016-06-01

    Full Text Available Plant-specific GRAS transcription factors play important roles in regulating growth, development, and stress responses. Castor beans (Ricinus communis are important non-edible oilseed plants, cultivated worldwide for its seed oils and its adaptability to growth conditions. In this study, we identified and characterized a total of 48 GRAS genes based on the castor bean genome. Combined with phylogenetic analysis, the castor bean GRAS members were divided into 13 distinct groups. Functional divergence analysis revealed the presence of mostly Type-I functional divergence. The gene structures and conserved motifs, both within and outside the GRAS domain, were characterized. Gene expression analysis, performed in various tissues and under a range of abiotic stress conditions, uncovered the potential functions of GRAS members in regulating plant growth development and stress responses. The results obtained from this study provide valuable information toward understanding the potential molecular mechanisms of GRAS proteins in castor beans. These findings also serve as a resource for identifying the genes that allow castor beans to grow in stressful conditions and to enable further breeding and genetic improvements in agriculture.

  18. Identification of cytokinin-responsive genes using microarray meta-analysis and RNA-Seq in Arabidopsis.

    Science.gov (United States)

    Bhargava, Apurva; Clabaugh, Ivory; To, Jenn P; Maxwell, Bridey B; Chiang, Yi-Hsuan; Schaller, G Eric; Loraine, Ann; Kieber, Joseph J

    2013-05-01

    Cytokinins are N(6)-substituted adenine derivatives that play diverse roles in plant growth and development. We sought to define a robust set of genes regulated by cytokinin as well as to query the response of genes not represented on microarrays. To this end, we performed a meta-analysis of microarray data from a variety of cytokinin-treated samples and used RNA-seq to examine cytokinin-regulated gene expression in Arabidopsis (Arabidopsis thaliana). Microarray meta-analysis using 13 microarray experiments combined with empirically defined filtering criteria identified a set of 226 genes differentially regulated by cytokinin, a subset of which has previously been validated by other methods. RNA-seq validated about 73% of the up-regulated genes identified by this meta-analysis. In silico promoter analysis indicated an overrepresentation of type-B Arabidopsis response regulator binding elements, consistent with the role of type-B Arabidopsis response regulators as primary mediators of cytokinin-responsive gene expression. RNA-seq analysis identified 73 cytokinin-regulated genes that were not represented on the ATH1 microarray. Representative genes were verified using quantitative reverse transcription-polymerase chain reaction and NanoString analysis. Analysis of the genes identified reveals a substantial effect of cytokinin on genes encoding proteins involved in secondary metabolism, particularly those acting in flavonoid and phenylpropanoid biosynthesis, as well as in the regulation of redox state of the cell, particularly a set of glutaredoxin genes. Novel splicing events were found in members of some gene families that are known to play a role in cytokinin signaling or metabolism. The genes identified in this analysis represent a robust set of cytokinin-responsive genes that are useful in the analysis of cytokinin function in plants.

  19. Biochemical mechanisms determine the functional compatibility of heterologous genes

    DEFF Research Database (Denmark)

    Porse, Andreas; Schou, Thea S.; Munck, Christian

    2018-01-01

    -gene libraries have suggested that sequence composition is a strong barrier for the successful integration of heterologous genes. Here we sample 200 diverse genes, representing >80% of sequenced antibiotic resistance genes, to interrogate the factors governing genetic compatibility in new hosts. In contrast...... factors governing the functionality and fitness of antibiotic resistance genes. These findings emphasize the importance of biochemical mechanism for heterologous gene compatibility, and suggest physiological constraints as a pivotal feature orienting the evolution of antibiotic resistance....

  20. Xylella fastidiosa gene expression analysis by DNA microarrays

    Directory of Open Access Journals (Sweden)

    Regiane F. Travensolo

    2009-01-01

    Full Text Available Xylella fastidiosa genome sequencing has generated valuable data by identifying genes acting either on metabolic pathways or in associated pathogenicity and virulence. Based on available information on these genes, new strategies for studying their expression patterns, such as microarray technology, were employed. A total of 2,600 primer pairs were synthesized and then used to generate fragments using the PCR technique. The arrays were hybridized against cDNAs labeled during reverse transcription reactions and which were obtained from bacteria grown under two different conditions (liquid XDM2 and liquid BCYE. All data were statistically analyzed to verify which genes were differentially expressed. In addition to exploring conditions for X. fastidiosa genome-wide transcriptome analysis, the present work observed the differential expression of several classes of genes (energy, protein, amino acid and nucleotide metabolism, transport, degradation of substances, toxins and hypothetical proteins, among others. The understanding of expressed genes in these two different media will be useful in comprehending the metabolic characteristics of X. fastidiosa, and in evaluating how important certain genes are for the functioning and survival of these bacteria in plants.

  1. Use of linear discriminant function analysis in seed morphotype ...

    African Journals Online (AJOL)

    Use of linear discriminant function analysis in seed morphotype relationship study in 31 ... Data were collected on 100-seed weight, seed length and seed width. ... to the Mesoamerican gene pool, comprising the cultigroups Sieva-Big Lima, ...

  2. De novo synthesis and functional analysis of the phosphatase-encoding gene acI-B of uncultured Actinobacteria from Lake Stechlin (NE Germany).

    Science.gov (United States)

    Srivastava, Abhishek; McMahon, Katherine D; Stepanauskas, Ramunas; Grossart, Hans-Peter

    2015-12-01

    The National Center for Biotechnology Information [http://www.ncbi.nlm.nih.gov/guide/taxonomy/] database enlists more than 15,500 bacterial species. But this also includes a plethora of uncultured bacterial representations. Owing to their metabolism, they directly influence biogeochemical cycles, which underscores the the important status of bacteria on our planet. To study the function of a gene from an uncultured bacterium, we have undertaken a de novo gene synthesis approach. Actinobacteria of the acI-B subcluster are important but yet uncultured members of the bacterioplankton in temperate lakes of the northern hemisphere such as oligotrophic Lake Stechlin (NE Germany). This lake is relatively poor in phosphate (P) and harbors on average ~1.3 x 10 6 bacterial cells/ml, whereby Actinobacteria of the ac-I lineage can contribute to almost half of the entire bacterial community depending on seasonal variability. Single cell genome analysis of Actinobacterium SCGC AB141-P03, a member of the acI-B tribe in Lake Stechlin has revealed several phosphate-metabolizing genes. The genome of acI-B Actinobacteria indicates potential to degrade polyphosphate compound. To test for this genetic potential, we targeted the exoP-annotated gene potentially encoding polyphosphatase and synthesized it artificially to examine its biochemical role. Heterologous overexpression of the gene in Escherichia coli and protein purification revealed phosphatase activity. Comparative genome analysis suggested that homologs of this gene should be also present in other Actinobacteria of the acI lineages. This strategic retention of specialized genes in their genome provides a metabolic advantage over other members of the aquatic food web in a P-limited ecosystem. [Int Microbiol 2016; 19(1):39-47]. Copyright© by the Spanish Society for Microbiology and Institute for Catalan Studies.

  3. Functional analysis of ABC transporter genes from Botrytis cinerea identifies BcatrB as a transporter of eugenol

    NARCIS (Netherlands)

    Schoonbeek, H.; Nistelrooy, van J.G.M.; Waard, de M.A.

    2003-01-01

    The role of multiple ATP-binding cassette (ABC) and major facilitator superfamily (MFS) transporter genes from the plant pathogenic fungus Botrytis cinerea in protection against natural fungitoxic compounds was studied by expression analysis and phenotyping of gene-replacement mutants. The

  4. Stably Expressed Genes Involved in Basic Cellular Functions.

    Directory of Open Access Journals (Sweden)

    Kejian Wang

    Full Text Available Stably Expressed Genes (SEGs whose expression varies within a narrow range may be involved in core cellular processes necessary for basic functions. To identify such genes, we re-analyzed existing RNA-Seq gene expression profiles across 11 organs at 4 developmental stages (from immature to old age in both sexes of F344 rats (n = 4/group; 320 samples. Expression changes (calculated as the maximum expression / minimum expression for each gene of >19000 genes across organs, ages, and sexes ranged from 2.35 to >109-fold, with a median of 165-fold. The expression of 278 SEGs was found to vary ≤4-fold and these genes were significantly involved in protein catabolism (proteasome and ubiquitination, RNA transport, protein processing, and the spliceosome. Such stability of expression was further validated in human samples where the expression variability of the homologous human SEGs was significantly lower than that of other genes in the human genome. It was also found that the homologous human SEGs were generally less subject to non-synonymous mutation than other genes, as would be expected of stably expressed genes. We also found that knockout of SEG homologs in mouse models was more likely to cause complete preweaning lethality than non-SEG homologs, corroborating the fundamental roles played by SEGs in biological development. Such stably expressed genes and pathways across life-stages suggest that tight control of these processes is important in basic cellular functions and that perturbation by endogenous (e.g., genetics or exogenous agents (e.g., drugs, environmental factors may cause serious adverse effects.

  5. Analysis of the function of the agouti gene in obesity and diabetes

    Energy Technology Data Exchange (ETDEWEB)

    Mynatt, R.L.; Miltenberger, R.J.; Klebig, M.L. [and others

    1996-09-01

    This chapter discusses the agouti gene and dominant mutations in that gene that lead to agouti-induced obesity, and recent work with transgenic mice to elucidate the role of agouti in obesity. Agouti was cloned in 1992 by the lab of Rick Woychik at Oak Ridge National Laboratory, making it the first of many recently cloned mouse obesity genes. Sequence analysis predicted that mouse agouti is a secreted protein of 131 amino acids. The mature protein has a basic central region (lys57-arg85), a proline-rich domain (pro86-pro91) and a C-terminal region (cys 92-cys 13 1) containing 10 cysteine residues which form 5 disulfide bonds. The human homologue of agouti has also been cloned by the Woychik lab and maps to human chromosome 20q 11.2. Human agouti is 132 amino acids long and is 85% similar to the mouse agouti protein and is normally expressed in adipose tissue. The researchers have been able to recapitulate obesity, hyperinsulinemia, and hyperglycemia with the ubiquitous expression of agouti. Agouti expression in either liver and adipose tissue alone does not cause obesity, and there`s a dose-dependent effect of agouti on body weight, food efficiency, body temperature, and insulin and glucose levels.

  6. Automatic assignment of prokaryotic genes to functional categories using literature profiling.

    Directory of Open Access Journals (Sweden)

    Raul Torrieri

    Full Text Available In the last years, there was an exponential increase in the number of publicly available genomes. Once finished, most genome projects lack financial support to review annotations. A few of these gene annotations are based on a combination of bioinformatics evidence, however, in most cases, annotations are based solely on sequence similarity to a previously known gene, which was most probably annotated in the same way. As a result, a large number of predicted genes remain unassigned to any functional category despite the fact that there is enough evidence in the literature to predict their function. We developed a classifier trained with term-frequency vectors automatically disclosed from text corpora of an ensemble of genes representative of each functional category of the J. Craig Venter Institute Comprehensive Microbial Resource (JCVI-CMR ontology. The classifier achieved up to 84% precision with 68% recall (for confidence≥0.4, F-measure 0.76 (recall and precision equally weighted in an independent set of 2,220 genes, from 13 bacterial species, previously classified by JCVI-CMR into unambiguous categories of its ontology. Finally, the classifier assigned (confidence≥0.7 to functional categories a total of 5,235 out of the ∼24 thousand genes previously in categories "Unknown function" or "Unclassified" for which there is literature in MEDLINE. Two biologists reviewed the literature of 100 of these genes, randomly picket, and assigned them to the same functional categories predicted by the automatic classifier. Our results confirmed the hypothesis that it is possible to confidently assign genes of a real world repository to functional categories, based exclusively on the automatic profiling of its associated literature. The LitProf--Gene Classifier web server is accessible at: www.cebio.org/litprofGC.

  7. Analysis of gene expression profile microarray data in complex regional pain syndrome.

    Science.gov (United States)

    Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

    2017-09-01

    The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.

  8. Predictability of Genetic Interactions from Functional Gene Modules

    Directory of Open Access Journals (Sweden)

    Jonathan H. Young

    2017-02-01

    Full Text Available Characterizing genetic interactions is crucial to understanding cellular and organismal response to gene-level perturbations. Such knowledge can inform the selection of candidate disease therapy targets, yet experimentally determining whether genes interact is technically nontrivial and time-consuming. High-fidelity prediction of different classes of genetic interactions in multiple organisms would substantially alleviate this experimental burden. Under the hypothesis that functionally related genes tend to share common genetic interaction partners, we evaluate a computational approach to predict genetic interactions in Homo sapiens, Drosophila melanogaster, and Saccharomyces cerevisiae. By leveraging knowledge of functional relationships between genes, we cross-validate predictions on known genetic interactions and observe high predictive power of multiple classes of genetic interactions in all three organisms. Additionally, our method suggests high-confidence candidate interaction pairs that can be directly experimentally tested. A web application is provided for users to query genes for predicted novel genetic interaction partners. Finally, by subsampling the known yeast genetic interaction network, we found that novel genetic interactions are predictable even when knowledge of currently known interactions is minimal.

  9. Genome-wide identification, characterisation and expression analysis of the MADS-box gene family in Prunus mume.

    Science.gov (United States)

    Xu, Zongda; Zhang, Qixiang; Sun, Lidan; Du, Dongliang; Cheng, Tangren; Pan, Huitang; Yang, Weiru; Wang, Jia

    2014-10-01

    MADS-box genes encode transcription factors that play crucial roles in plant development, especially in flower and fruit development. To gain insight into this gene family in Prunus mume, an important ornamental and fruit plant in East Asia, and to elucidate their roles in flower organ determination and fruit development, we performed a genome-wide identification, characterisation and expression analysis of MADS-box genes in this Rosaceae tree. In this study, 80 MADS-box genes were identified in P. mume and categorised into MIKC, Mα, Mβ, Mγ and Mδ groups based on gene structures and phylogenetic relationships. The MIKC group could be further classified into 12 subfamilies. The FLC subfamily was absent in P. mume and the six tandemly arranged DAM genes might experience a species-specific evolution process in P. mume. The MADS-box gene family might experience an evolution process from MIKC genes to Mδ genes to Mα, Mβ and Mγ genes. The expression analysis suggests that P. mume MADS-box genes have diverse functions in P. mume development and the functions of duplicated genes diverged after the duplication events. In addition to its involvement in the development of female gametophytes, type I genes also play roles in male gametophytes development. In conclusion, this study adds to our understanding of the roles that the MADS-box genes played in flower and fruit development and lays a foundation for selecting candidate genes for functional studies in P. mume and other species. Furthermore, this study also provides a basis to study the evolution of the MADS-box family.

  10. Cross-organism learning method to discover new gene functionalities.

    Science.gov (United States)

    Domeniconi, Giacomo; Masseroli, Marco; Moro, Gianluca; Pinoli, Pietro

    2016-04-01

    Knowledge of gene and protein functions is paramount for the understanding of physiological and pathological biological processes, as well as in the development of new drugs and therapies. Analyses for biomedical knowledge discovery greatly benefit from the availability of gene and protein functional feature descriptions expressed through controlled terminologies and ontologies, i.e., of gene and protein biomedical controlled annotations. In the last years, several databases of such annotations have become available; yet, these valuable annotations are incomplete, include errors and only some of them represent highly reliable human curated information. Computational techniques able to reliably predict new gene or protein annotations with an associated likelihood value are thus paramount. Here, we propose a novel cross-organisms learning approach to reliably predict new functionalities for the genes of an organism based on the known controlled annotations of the genes of another, evolutionarily related and better studied, organism. We leverage a new representation of the annotation discovery problem and a random perturbation of the available controlled annotations to allow the application of supervised algorithms to predict with good accuracy unknown gene annotations. Taking advantage of the numerous gene annotations available for a well-studied organism, our cross-organisms learning method creates and trains better prediction models, which can then be applied to predict new gene annotations of a target organism. We tested and compared our method with the equivalent single organism approach on different gene annotation datasets of five evolutionarily related organisms (Homo sapiens, Mus musculus, Bos taurus, Gallus gallus and Dictyostelium discoideum). Results show both the usefulness of the perturbation method of available annotations for better prediction model training and a great improvement of the cross-organism models with respect to the single-organism ones

  11. Data Integration and Applications of Functional Gene Networks in Drosophila Melanogaster

    Science.gov (United States)

    Costello, James Christopher

    2009-01-01

    Understanding the function of every gene in the genome is a central goal in the biological sciences. This includes full characterization of a genes phenotypic effects, molecular interactions, the evolutionary forces that shape its function(s), and how these functions interrelate. Despite a long history and considerable effort to understand all…

  12. Functional analysis of 14 genes that constitute the purine catabolic pathway in Bacillus subtilis and evidence for a novel regulon controlled by the PucR transcription activator

    DEFF Research Database (Denmark)

    Schultz, Anna Charlotte; Nygaard, P.; Saxild, Hans Henrik

    2001-01-01

    The soil bacterium Bacillus subtilis has developed a highly controlled system for the utilization of a diverse array of low molecular-weight compounds as a nitrogen source when the preferred nitrogen sources, e.g., glutamate plus ammonia, are exhausted. We have identified such a system...... for the utilization of purines as nitrogen source in B. subtilis. Based on growth studies of strains with knockout mutations in genes, complemented with enzyme analysis, we could ascribe functions to 14 genes encoding enzymes or proteins of the purine degradation pathway. A functional xanthine dehydrogenase requires......ABCDE unit was decreased 16-fold, while expression of pucR was decreased 4-fold in the presence of allantoin. We have identified genes of the purine degradation pathway in B. subtilis and showed that their expression is subject to both general nitrogen catabolite control and pathway-specific control....

  13. A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis

    Directory of Open Access Journals (Sweden)

    Akira Ishikawa

    2017-11-01

    Full Text Available Large numbers of quantitative trait loci (QTL affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.

  14. A Strategy for Identifying Quantitative Trait Genes Using Gene Expression Analysis and Causal Analysis.

    Science.gov (United States)

    Ishikawa, Akira

    2017-11-27

    Large numbers of quantitative trait loci (QTL) affecting complex diseases and other quantitative traits have been reported in humans and model animals. However, the genetic architecture of these traits remains elusive due to the difficulty in identifying causal quantitative trait genes (QTGs) for common QTL with relatively small phenotypic effects. A traditional strategy based on techniques such as positional cloning does not always enable identification of a single candidate gene for a QTL of interest because it is difficult to narrow down a target genomic interval of the QTL to a very small interval harboring only one gene. A combination of gene expression analysis and statistical causal analysis can greatly reduce the number of candidate genes. This integrated approach provides causal evidence that one of the candidate genes is a putative QTG for the QTL. Using this approach, I have recently succeeded in identifying a single putative QTG for resistance to obesity in mice. Here, I outline the integration approach and discuss its usefulness using my studies as an example.

  15. Functional features of gene expression profiles differentiating gastrointestinal stromal tumours according to KIT mutations and expression

    International Nuclear Information System (INIS)

    Ostrowski, Jerzy; Dobosz, Anna Jerzak Vel; Jarosz, Dorota; Ruka, Wlodzimierz; Wyrwicz, Lucjan S; Polkowski, Marcin; Paziewska, Agnieszka; Skrzypczak, Magdalena; Goryca, Krzysztof; Rubel, Tymon; Kokoszyñska, Katarzyna; Rutkowski, Piotr; Nowecki, Zbigniew I

    2009-01-01

    Gastrointestinal stromal tumours (GISTs) represent a heterogeneous group of tumours of mesenchymal origin characterized by gain-of-function mutations in KIT or PDGFRA of the type III receptor tyrosine kinase family. Although mutations in either receptor are thought to drive an early oncogenic event through similar pathways, two previous studies reported the mutation-specific gene expression profiles. However, their further conclusions were rather discordant. To clarify the molecular characteristics of differentially expressed genes according to GIST receptor mutations, we combined microarray-based analysis with detailed functional annotations. Total RNA was isolated from 29 frozen gastric GISTs and processed for hybridization on GENECHIP ® HG-U133 Plus 2.0 microarrays (Affymetrix). KIT and PDGFRA were analyzed by sequencing, while related mRNA levels were analyzed by quantitative RT-PCR. Fifteen and eleven tumours possessed mutations in KIT and PDGFRA, respectively; no mutation was found in three tumours. Gene expression analysis identified no discriminative profiles associated with clinical or pathological parameters, even though expression of hundreds of genes differentiated tumour receptor mutation and expression status. Functional features of genes differentially expressed between the two groups of GISTs suggested alterations in angiogenesis and G-protein-related and calcium signalling. Our study has identified novel molecular elements likely to be involved in receptor-dependent GIST development and allowed confirmation of previously published results. These elements may be potential therapeutic targets and novel markers of KIT mutation status

  16. Gene Regulation, Modulation, and Their Applications in Gene Expression Data Analysis

    Directory of Open Access Journals (Sweden)

    Mario Flores

    2013-01-01

    Full Text Available Common microarray and next-generation sequencing data analysis concentrate on tumor subtype classification, marker detection, and transcriptional regulation discovery during biological processes by exploring the correlated gene expression patterns and their shared functions. Genetic regulatory network (GRN based approaches have been employed in many large studies in order to scrutinize for dysregulation and potential treatment controls. In addition to gene regulation and network construction, the concept of the network modulator that has significant systemic impact has been proposed, and detection algorithms have been developed in past years. Here we provide a unified mathematic description of these methods, followed with a brief survey of these modulator identification algorithms. As an early attempt to extend the concept to new RNA regulation mechanism, competitive endogenous RNA (ceRNA, into a modulator framework, we provide two applications to illustrate the network construction, modulation effect, and the preliminary finding from these networks. Those methods we surveyed and developed are used to dissect the regulated network under different modulators. Not limit to these, the concept of “modulation” can adapt to various biological mechanisms to discover the novel gene regulation mechanisms.

  17. An intronic microRNA silences genes that are functionally antagonistic to its host gene.

    Science.gov (United States)

    Barik, Sailen

    2008-09-01

    MicroRNAs (miRNAs) are short noncoding RNAs that down-regulate gene expression by silencing specific target mRNAs. While many miRNAs are transcribed from their own genes, nearly half map within introns of 'host' genes, the significance of which remains unclear. We report that transcriptional activation of apoptosis-associated tyrosine kinase (AATK), essential for neuronal differentiation, also generates miR-338 from an AATK gene intron that silences a family of mRNAs whose protein products are negative regulators of neuronal differentiation. We conclude that an intronic miRNA, transcribed together with the host gene mRNA, may serve the interest of its host gene by silencing a cohort of genes that are functionally antagonistic to the host gene itself.

  18. Fine mapping and candidate gene analysis of the virescent gene v 1 in Upland cotton (Gossypium hirsutum).

    Science.gov (United States)

    Mao, Guangzhi; Ma, Qiang; Wei, Hengling; Su, Junji; Wang, Hantao; Ma, Qifeng; Fan, Shuli; Song, Meizhen; Zhang, Xianlong; Yu, Shuxun

    2018-02-01

    The young leaves of virescent mutants are yellowish and gradually turn green as the plants reach maturity. Understanding the genetic basis of virescent mutants can aid research of the regulatory mechanisms underlying chloroplast development and chlorophyll biosynthesis, as well as contribute to the application of virescent traits in crop breeding. In this study, fine mapping was employed, and a recessive gene (v 1 ) from a virescent mutant of Upland cotton was narrowed to an 84.1-Kb region containing ten candidate genes. The GhChlI gene encodes the cotton Mg-chelatase I subunit (CHLI) and was identified as the candidate gene for the virescent mutation using gene annotation. BLAST analysis showed that the GhChlI gene has two copies, Gh_A10G0282 and Gh_D10G0283. Sequence analysis indicated that the coding region (CDS) of GhChlI is 1269 bp in length, with three predicted exons and one non-synonymous nucleotide mutation (G1082A) in the third exon of Gh_D10G0283, with an amino acid (AA) substitution of arginine (R) to lysine (K). GhChlI-silenced TM-1 plants exhibited a lower GhChlI expression level, a lower chlorophyll content, and the virescent phenotype. Analysis of upstream regulatory elements and expression levels of GhChlI showed that the expression quantity of GhChlI may be normal, and with the development of the true leaf, the increase in the Gh_A10G0282 dosage may partially make up for the deficiency of Gh_D10G0283 in the v 1 mutant. Phylogenetic analysis and sequence alignment revealed that the protein sequence encoded by the third exon of GhChlI is highly conserved across diverse plant species, in which AA substitutions among the completely conserved residues frequently result in changes in leaf color in various species. These results suggest that the mutation (G1082A) within the GhChlI gene may cause a functional defect of the GhCHLI subunit and thus the virescent phenotype in the v 1 mutant. The GhChlI mutation not only provides a tool for understanding the

  19. Gene-specific function prediction for non-synonymous mutations in monogenic diabetes genes.

    Directory of Open Access Journals (Sweden)

    Quan Li

    Full Text Available The rapid progress of genomic technologies has been providing new opportunities to address the need of maturity-onset diabetes of the young (MODY molecular diagnosis. However, whether a new mutation causes MODY can be questionable. A number of in silico methods have been developed to predict functional effects of rare human mutations. The purpose of this study is to compare the performance of different bioinformatics methods in the functional prediction of nonsynonymous mutations in each MODY gene, and provides reference matrices to assist the molecular diagnosis of MODY. Our study showed that the prediction scores by different methods of the diabetes mutations were highly correlated, but were more complimentary than replacement to each other. The available in silico methods for the prediction of diabetes mutations had varied performances across different genes. Applying gene-specific thresholds defined by this study may be able to increase the performance of in silico prediction of disease-causing mutations.

  20. Functional Potential of Bacterial Communities using Gene Context Information

    Directory of Open Access Journals (Sweden)

    Anwesha Mohapatra

    2017-12-01

    Full Text Available Estimation of the functional potential of a bacterial genome can be determined by accurate annotation of its metabolic pathways. Existing homology based methods for pathway annotation fail to account for homologous genes that participate in multiple pathways, causing overestimation of gene copy number. Mere presence of constituent genes of a candidate pathway which are dispersed on a genome often results in incorrect annotation, thereby leading to erroneous gene abundance and pathway estimation. Clusters of evolutionarily conserved coregulated genes are characteristic features in bacterial genomes and their spatial arrangement in the genome is constrained by the pathway encoded by them. Thus, in order to improve the accuracy of pathway prediction, it is important to augment homology based annotation with gene organization information. In this communication, we present a methodology considering prioritization of gene context for improved pathway annotation. Extensive literature mining was performed to confirm conserved juxtaposed arrangement of gene components of various pathways. Our method was utilized to identify and analyse the functional potential of all available completely sequenced bacterial genomes. The accuracy of the predicted gene clusters and their importance in metabolic pathways will be demonstrated using a few case studies. One of such case study corresponds to butyrate production pathways in gut bacteria where it was observed that gut pathogens and commensals possess a distinct set of pathway components. In another example, we will demonstrate how our methodology improves the prediction accuracy of carbohydrate metabolic potential in human microbial communities. Applicability of our method for estimation of functional potential in bacterial communities present in diverse environments will also be illustrated.

  1. Genome-Wide Identification and Analysis of the TIFY Gene Family in Grape

    Science.gov (United States)

    Zhang, Yucheng; Gao, Min; Singer, Stacy D.; Fei, Zhangjun; Wang, Hua; Wang, Xiping

    2012-01-01

    Background The TIFY gene family constitutes a plant-specific group of genes with a broad range of functions. This family encodes four subfamilies of proteins, including ZML, TIFY, PPD and JASMONATE ZIM-Domain (JAZ) proteins. JAZ proteins are targets of the SCFCOI1 complex, and function as negative regulators in the JA signaling pathway. Recently, it has been reported in both Arabidopsis and rice that TIFY genes, and especially JAZ genes, may be involved in plant defense against insect feeding, wounding, pathogens and abiotic stresses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant TIFY family members is limited, especially in a woody species such as grape. Methodology/Principal Findings A total of two TIFY, four ZML, two PPD and 11 JAZ genes were identified in the Vitis vinifera genome. Phylogenetic analysis of TIFY protein sequences from grape, Arabidopsis and rice indicated that the grape TIFY proteins are more closely related to those of Arabidopsis than those of rice. Both segmental and tandem duplication events have been major contributors to the expansion of the grape TIFY family. In addition, synteny analysis between grape and Arabidopsis demonstrated that homologues of several grape TIFY genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of lineages that led to grape and Arabidopsis. Analyses of microarray and quantitative real-time RT-PCR expression data revealed that grape TIFY genes are not a major player in the defense against biotrophic pathogens or viruses. However, many of these genes were responsive to JA and ABA, but not SA or ET. Conclusion The genome-wide identification, evolutionary and expression analyses of grape TIFY genes should facilitate further research of this gene family and provide new insights regarding their evolutionary history and regulatory control. PMID:22984514

  2. Network Analysis of Human Genes Influencing Susceptibility to Mycobacterial Infections

    Science.gov (United States)

    Lipner, Ettie M.; Garcia, Benjamin J.; Strong, Michael

    2016-01-01

    Tuberculosis and nontuberculous mycobacterial infections constitute a high burden of pulmonary disease in humans, resulting in over 1.5 million deaths per year. Building on the premise that genetic factors influence the instance, progression, and defense of infectious disease, we undertook a systems biology approach to investigate relationships among genetic factors that may play a role in increased susceptibility or control of mycobacterial infections. We combined literature and database mining with network analysis and pathway enrichment analysis to examine genes, pathways, and networks, involved in the human response to Mycobacterium tuberculosis and nontuberculous mycobacterial infections. This approach allowed us to examine functional relationships among reported genes, and to identify novel genes and enriched pathways that may play a role in mycobacterial susceptibility or control. Our findings suggest that the primary pathways and genes influencing mycobacterial infection control involve an interplay between innate and adaptive immune proteins and pathways. Signaling pathways involved in autoimmune disease were significantly enriched as revealed in our networks. Mycobacterial disease susceptibility networks were also examined within the context of gene-chemical relationships, in order to identify putative drugs and nutrients with potential beneficial immunomodulatory or anti-mycobacterial effects. PMID:26751573

  3. Genome-wide analysis of the homeodomain-leucine zipper (HD-ZIP) gene family in peach (Prunus persica).

    Science.gov (United States)

    Zhang, C H; Ma, R J; Shen, Z J; Sun, X; Korir, N K; Yu, M L

    2014-04-08

    In this study, 33 homeodomain-leucine zipper (HD-ZIP) genes were identified in peach using the HD-ZIP amino acid sequences of Arabidopsis thaliana as a probe. Based on the phylogenetic analysis and the individual gene or protein characteristics, the HD-ZIP gene family in peach can be classified into 4 subfamilies, HD-ZIP I, II, III, and IV, containing 14, 7, 4, and 8 members, respectively. The most closely related peach HD-ZIP members within the same subfamilies shared very similar gene structure in terms of either intron/exon numbers or lengths. Almost all members of the same subfamily shared common motif compositions, thereby implying that the HD-ZIP proteins within the same subfamily may have functional similarity. The 33 peach HD-ZIP genes were distributed across scaffolds 1 to 7. Although the primary structure varied among HD-ZIP family proteins, their tertiary structures were similar. The results from this study will be useful in selecting candidate genes from specific subfamilies for functional analysis.

  4. Analysis of Msx1 and Msx2 transactivation function in the context of the heat shock 70 (Hspa1b) gene promoter.

    Science.gov (United States)

    Zhuang, Fengfeng; Nguyen, Manuel P; Shuler, Charles; Liu, Yi-Hsin

    2009-04-03

    Previous studies have shown that Msx proteins control gene transcription predominantly through repression mechanisms. However, gene expression studies using either the gain-of-function or the loss-of-function mutants revealed many gene targets whose expression require functional Msx proteins. To date, investigations into the mechanisms of Msx-dependent transactivation have been hindered by the lack of a responsive promoter. Here, we demonstrated the usefulness of the mouse Hspa1b promoter in probing Msx-dependent mechanisms of gene activation. We showed that Msx protein activates Hspa1b promoter via its C-terminal domain. The activation absolutely depends on the HSEs and physical interactions between Msx proteins and heat shock factors may play a contributing role.

  5. Linking Advanced Visualization and MATLAB for the Analysis of 3D Gene Expression Data

    Energy Technology Data Exchange (ETDEWEB)

    Ruebel, Oliver; Keranen, Soile V.E.; Biggin, Mark; Knowles, David W.; Weber, Gunther H.; Hagen, Hans; Hamann, Bernd; Bethel, E. Wes

    2011-03-30

    Three-dimensional gene expression PointCloud data generated by the Berkeley Drosophila Transcription Network Project (BDTNP) provides quantitative information about the spatial and temporal expression of genes in early Drosophila embryos at cellular resolution. The BDTNP team visualizes and analyzes Point-Cloud data using the software application PointCloudXplore (PCX). To maximize the impact of novel, complex data sets, such as PointClouds, the data needs to be accessible to biologists and comprehensible to developers of analysis functions. We address this challenge by linking PCX and Matlab via a dedicated interface, thereby providing biologists seamless access to advanced data analysis functions and giving bioinformatics researchers the opportunity to integrate their analysis directly into the visualization application. To demonstrate the usefulness of this approach, we computationally model parts of the expression pattern of the gene even skipped using a genetic algorithm implemented in Matlab and integrated into PCX via our Matlab interface.

  6. Genome-wide Identification and Expression Analysis of the CDPK Gene Family in Grape, Vitis spp.

    Science.gov (United States)

    Zhang, Kai; Han, Yong-Tao; Zhao, Feng-Li; Hu, Yang; Gao, Yu-Rong; Ma, Yan-Fei; Zheng, Yi; Wang, Yue-Jin; Wen, Ying-Qiang

    2015-06-30

    Calcium-dependent protein kinases (CDPKs) play vital roles in plant growth and development, biotic and abiotic stress responses, and hormone signaling. Little is known about the CDPK gene family in grapevine. In this study, we performed a genome-wide analysis of the 12X grape genome (Vitis vinifera) and identified nineteen CDPK genes. Comparison of the structures of grape CDPK genes allowed us to examine their functional conservation and differentiation. Segmentally duplicated grape CDPK genes showed high structural conservation and contributed to gene family expansion. Additional comparisons between grape and Arabidopsis thaliana demonstrated that several grape CDPK genes occured in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grapevine and Arabidopsis. Phylogenetic analysis divided the grape CDPK genes into four groups. Furthermore, we examined the expression of the corresponding nineteen homologous CDPK genes in the Chinese wild grape (Vitis pseudoreticulata) under various conditions, including biotic stress, abiotic stress, and hormone treatments. The expression profiles derived from reverse transcription and quantitative PCR suggested that a large number of VpCDPKs responded to various stimuli on the transcriptional level, indicating their versatile roles in the responses to biotic and abiotic stresses. Moreover, we examined the subcellular localization of VpCDPKs by transiently expressing six VpCDPK-GFP fusion proteins in Arabidopsis mesophyll protoplasts; this revealed high variability consistent with potential functional differences. Taken as a whole, our data provide significant insights into the evolution and function of grape CDPKs and a framework for future investigation of grape CDPK genes.

  7. MAGMA: Generalized Gene-Set Analysis of GWAS Data

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  8. MAGMA: generalized gene-set analysis of GWAS data.

    NARCIS (Netherlands)

    de Leeuw, C.A.; Mooij, J.M.; Heskes, T.; Posthuma, D.

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical

  9. TARGETED ANALYSIS OF JAK-STAT-SOCS GENES IN DAIRY CATTLE

    Directory of Open Access Journals (Sweden)

    Arun Sondur Jayappa

    2015-12-01

    Full Text Available The Janus kinase and signal transducer and activator of transcription (JAK-STAT pathway genes along with suppressors of cytokine signalling (SOCS family genes play a crucial role in controlling cytokine signals in the mammary gland and thus mammary gland development. Mammary gene expression studies showed differential expression patterns for all the JAK-STAT pathway genes. Gene expression studies using qRT-PCR revealed differential expression of SOCS2, SOCS4 and SOCS5 genes across the lactation cycle in dairy cows. Using genotypes from 1,546 Australian Holstein- Friesian bulls, a statistical model based on SNPs within 500kb of JAK-STAT pathway genes, and SOCS genes alone was carried out. The analysis suggested that these genes and pathways make a significant contribution to the Australian milk production traits. Selection of 24 SNPs close to SOCS1, SOCS3, SOCS5, SOCS7 and CISH genes were significantly associated with, Australian Profit Ranking (APR, Australian Selection Index (ASI and protein yield (PY. This study supports the view that there may be some merit in choosing SNPs around functionally relevant genes for the selection and genetic improvement schemes for dairy production traits.

  10. Genetic architecture of HIV-1 genes circulating in north India & their functional implications.

    Science.gov (United States)

    Neogi, Ujjwal; Sood, Vikas; Ronsard, Larence; Singh, Jyotsna; Lata, Sneh; Ramachandran, V G; Das, S; Wanchu, Ajay; Banerjea, Akhil C

    2011-12-01

    This review presents data on genetic and functional analysis of some of the HIV-1 genes derived from HIV-1 infected individuals from north India (Delhi, Punjab and Chandigarh). We found evidence of novel B/C recombinants in HIV-1 LTR region showing relatedness to China/Myanmar with 3 copies of Nfκb sites; B/C/D mosaic genomes for HIV-1 Vpr and novel B/C Tat. We reported appearance of a complex recombinant form CRF_02AG of HIV-1 envelope sequences which is predominantly found in Central/Western Africa. Also one Indian HIV-1 envelope subtype C sequence suggested exclusive CXCR4 co-receptor usage. This extensive recombination, which is observed in about 10 per cent HIV-1 infected individuals in the Vpr genes, resulted in remarkably altered functions when compared with prototype subtype B Vpr. The Vpu C was found to be more potent in causing apoptosis when compared with Vpu B when analyzed for subG1 DNA content. The functional implications of these changes as well as in other genes of HIV-1 are discussed in detail with possible implications for subtype-specific pathogenesis highlighted.

  11. Suitable Reference Genes for Accurate Gene Expression Analysis in Parsley (Petroselinum crispum) for Abiotic Stresses and Hormone Stimuli.

    Science.gov (United States)

    Li, Meng-Yao; Song, Xiong; Wang, Feng; Xiong, Ai-Sheng

    2016-01-01

    Parsley, one of the most important vegetables in the Apiaceae family, is widely used in the food, medicinal, and cosmetic industries. Recent studies on parsley mainly focus on its chemical composition, and further research involving the analysis of the plant's gene functions and expressions is required. qPCR is a powerful method for detecting very low quantities of target transcript levels and is widely used to study gene expression. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, four software, namely geNorm, NormFinder, BestKeeper, and RefFinder were used to evaluate the expression stabilities of eight candidate reference genes of parsley ( GAPDH, ACTIN, eIF-4 α, SAND, UBC, TIP41, EF-1 α, and TUB ) under various conditions, including abiotic stresses (heat, cold, salt, and drought) and hormone stimuli treatments (GA, SA, MeJA, and ABA). Results showed that EF-1 α and TUB were the most stable genes for abiotic stresses, whereas EF-1 α, GAPDH , and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1 α and TUB were the most stable reference genes among all tested samples, and UBC was the least stable one. Expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study can guide the selection of suitable reference genes in gene expression in parsley.

  12. Suitable reference genes for accurate gene expression analysis in parsley (Petroselinum crispum for abiotic stresses and hormone stimuli

    Directory of Open Access Journals (Sweden)

    Meng-Yao Li

    2016-09-01

    Full Text Available Parsley is one of the most important vegetable in Apiaceae family and widely used in food industry, medicinal and cosmetic. The recent studies in parsley are mainly focus on chemical composition, further research involving the analysis of the gene functions and expressions will be required. qPCR is a powerful method for detecting very low quantities of target transcript levels and widely used for gene expression studies. To ensure the accuracy of results, a suitable reference gene is necessary for expression normalization. In this study, three software geNorm, NormFinder, and BestKeeper were used to evaluate the expression stabilities of eight candidate reference genes (GAPDH, ACTIN, eIF-4α, SAND, UBC, TIP41, EF-1α, and TUB under various conditions including abiotic stresses (heat, cold, salt, and drought and hormone stimuli treatments (GA, SA, MeJA, and ABA. The results showed that EF-1α and TUB were identified as the most stable genes for abiotic stresses, while EF-1α, GAPDH, and TUB were the top three choices for hormone stimuli treatments. Moreover, EF-1α and TUB were the most stable reference genes across all the tested samples, while UBC was the least stable one. The expression analysis of PcDREB1 and PcDREB2 further verified that the selected stable reference genes were suitable for gene expression normalization. This study provides a guideline for selection the suitable reference genes in gene expression in parsley.

  13. Time-Course Analysis of Gene Expression During the Saccharomyces cerevisiae Hypoxic Response

    Directory of Open Access Journals (Sweden)

    Nasrine Bendjilali

    2017-01-01

    Full Text Available Many cells experience hypoxia, or low oxygen, and respond by dramatically altering gene expression. In the yeast Saccharomyces cerevisiae, genes that respond are required for many oxygen-dependent cellular processes, such as respiration, biosynthesis, and redox regulation. To more fully characterize the global response to hypoxia, we exposed yeast to hypoxic conditions, extracted RNA at different times, and performed RNA sequencing (RNA-seq analysis. Time-course statistical analysis revealed hundreds of genes that changed expression by up to 550-fold. The genes responded with varying kinetics suggesting that multiple regulatory pathways are involved. We identified most known oxygen-regulated genes and also uncovered new regulated genes. Reverse transcription-quantitative PCR (RT-qPCR analysis confirmed that the lysine methyltransferase EFM6 and the recombinase DMC1, both conserved in humans, are indeed oxygen-responsive. Looking more broadly, oxygen-regulated genes participate in expected processes like respiration and lipid metabolism, but also in unexpected processes like amino acid and vitamin metabolism. Using principle component analysis, we discovered that the hypoxic response largely occurs during the first 2 hr and then a new steady-state expression state is achieved. Moreover, we show that the oxygen-dependent genes are not part of the previously described environmental stress response (ESR consisting of genes that respond to diverse types of stress. While hypoxia appears to cause a transient stress, the hypoxic response is mostly characterized by a transition to a new state of gene expression. In summary, our results reveal that hypoxia causes widespread and complex changes in gene expression to prepare the cell to function with little or no oxygen.

  14. Time-Course Analysis of Gene Expression During the Saccharomyces cerevisiae Hypoxic Response.

    Science.gov (United States)

    Bendjilali, Nasrine; MacLeon, Samuel; Kalra, Gurmannat; Willis, Stephen D; Hossian, A K M Nawshad; Avery, Erica; Wojtowicz, Olivia; Hickman, Mark J

    2017-01-05

    Many cells experience hypoxia, or low oxygen, and respond by dramatically altering gene expression. In the yeast Saccharomyces cerevisiae, genes that respond are required for many oxygen-dependent cellular processes, such as respiration, biosynthesis, and redox regulation. To more fully characterize the global response to hypoxia, we exposed yeast to hypoxic conditions, extracted RNA at different times, and performed RNA sequencing (RNA-seq) analysis. Time-course statistical analysis revealed hundreds of genes that changed expression by up to 550-fold. The genes responded with varying kinetics suggesting that multiple regulatory pathways are involved. We identified most known oxygen-regulated genes and also uncovered new regulated genes. Reverse transcription-quantitative PCR (RT-qPCR) analysis confirmed that the lysine methyltransferase EFM6 and the recombinase DMC1, both conserved in humans, are indeed oxygen-responsive. Looking more broadly, oxygen-regulated genes participate in expected processes like respiration and lipid metabolism, but also in unexpected processes like amino acid and vitamin metabolism. Using principle component analysis, we discovered that the hypoxic response largely occurs during the first 2 hr and then a new steady-state expression state is achieved. Moreover, we show that the oxygen-dependent genes are not part of the previously described environmental stress response (ESR) consisting of genes that respond to diverse types of stress. While hypoxia appears to cause a transient stress, the hypoxic response is mostly characterized by a transition to a new state of gene expression. In summary, our results reveal that hypoxia causes widespread and complex changes in gene expression to prepare the cell to function with little or no oxygen. Copyright © 2017 Bendjilali et al.

  15. Molecular and functional analysis of DIR1; a novel gene with a potential role in induced radioresistance

    International Nuclear Information System (INIS)

    Young, S.M.; McKeen, H.; Valentine, A.; Burke, G.; Hirst, D.; Robson, T.

    2003-01-01

    Full text: There is now little doubt about the existence of radioprotective mechanisms that are upregulated following exposure to small doses of ionizing radiation and other DNA-damaging agents. The identification of genes whose expression is altered following exposure to a low dose of ionizing radiation will be an important step in understanding these phenomena. We have identified a novel gene, DIR1, that is transiently repressed by low radiation doses (Robson et al.,1997 and 1999) and is otherwise expressed in a wide range of cell lines and tissues. The repression of this gene is in the dose range where induced radioresistance is observed in a number of cell survival studies (Joiner et al., 2001) implicating this gene in induced radioresistance. Using antisense strategies, we have demonstrated that the DIR1 gene product appears to be involved in cell survival and DNA repair in a range of cell lines following exposure to X-rays (Robson et al., 1999 and 2000). Using microchip array analysis we have been able to identify a number of genes activated as a consequence of DIR1 repression. Preliminary data implicate genes involved in repair, cell cycle and stress response and include ATM and BRCA2. We are now confirming these responses using northern and western blot analysis. Yeast two hybrid analysis has also been useful in demonstrating interacting proteins. One protein, which interacts with DIR1 is similar to murine UIP28, a RING finger protein which interacts with the ubiquitin conjugating enzyme, UbcM4. Interestingly, the ubiquitin (Ub)/proteosome pathway regulates many cellular processes including apoptosis, cell cycle progression, stress responses, development and transcriptional regulation. Further characterisation of these downstream genes and interacting proteins will allow us to:- i) dissect the cellular pathways involved in adaptation to oxidative and genotoxic stress ii) elucidate the mechanisms involved in many disease pathologies iii) identify new

  16. A Resource of Quantitative Functional Annotation for Homo sapiens Genes.

    Science.gov (United States)

    Taşan, Murat; Drabkin, Harold J; Beaver, John E; Chua, Hon Nian; Dunham, Julie; Tian, Weidong; Blake, Judith A; Roth, Frederick P

    2012-02-01

    The body of human genomic and proteomic evidence continues to grow at ever-increasing rates, while annotation efforts struggle to keep pace. A surprisingly small fraction of human genes have clear, documented associations with specific functions, and new functions continue to be found for characterized genes. Here we assembled an integrated collection of diverse genomic and proteomic data for 21,341 human genes and make quantitative associations of each to 4333 Gene Ontology terms. We combined guilt-by-profiling and guilt-by-association approaches to exploit features unique to the data types. Performance was evaluated by cross-validation, prospective validation, and by manual evaluation with the biological literature. Functional-linkage networks were also constructed, and their utility was demonstrated by identifying candidate genes related to a glioma FLN using a seed network from genome-wide association studies. Our annotations are presented-alongside existing validated annotations-in a publicly accessible and searchable web interface.

  17. Bioinformatics tools for quantitative and functional metagenome and metatranscriptome data analysis in microbes.

    Science.gov (United States)

    Niu, Sheng-Yong; Yang, Jinyu; McDermaid, Adam; Zhao, Jing; Kang, Yu; Ma, Qin

    2017-05-08

    Metagenomic and metatranscriptomic sequencing approaches are more frequently being used to link microbiota to important diseases and ecological changes. Many analyses have been used to compare the taxonomic and functional profiles of microbiota across habitats or individuals. While a large portion of metagenomic analyses focus on species-level profiling, some studies use strain-level metagenomic analyses to investigate the relationship between specific strains and certain circumstances. Metatranscriptomic analysis provides another important insight into activities of genes by examining gene expression levels of microbiota. Hence, combining metagenomic and metatranscriptomic analyses will help understand the activity or enrichment of a given gene set, such as drug-resistant genes among microbiome samples. Here, we summarize existing bioinformatics tools of metagenomic and metatranscriptomic data analysis, the purpose of which is to assist researchers in deciding the appropriate tools for their microbiome studies. Additionally, we propose an Integrated Meta-Function mapping pipeline to incorporate various reference databases and accelerate functional gene mapping procedures for both metagenomic and metatranscriptomic analyses. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. Functional analysis of a novel hydrogen peroxide resistance gene in Lactobacillus casei strain Shirota.

    Science.gov (United States)

    Serata, Masaki; Kiwaki, Mayumi; Iino, Tohru

    2016-11-01

    Lactic acid bacteria have a variety of mechanisms for tolerance to oxygen and reactive oxygen species, and these mechanisms differ among species. Lactobacillus casei strain Shirota grows well under aerobic conditions, indicating that the various systems involved in oxidative stress resistance function in this strain. To elucidate the mechanism of oxidative stress resistance in L. casei strain Shirota, we examined the transcriptome response to oxygen or hydrogen peroxide exposure. We then focused on an uncharacterized gene that was found to be up-regulated by both oxygen and hydrogen peroxide stress; we named the gene hprA1 (hydrogen peroxide resistance gene). This gene is widely distributed among lactobacilli. We investigated the involvement of this gene in oxidative stress resistance, as well as the mechanism of tolerance to hydrogen peroxide. Growth of L. casei MS105, an hprA1-disrupted mutant, was not affected by oxygen stress, whereas the survival rate of MS105 after hydrogen peroxide treatment was markedly reduced compared to that of the wild-type. However, the activity of MS105 in eliminating hydrogen peroxide was similar to that of the wild-type. We cloned hprA1 from L. caseiShirota and purified recombinant HprA1 protein from Escherichia coli. We demonstrated that the recombinant HprA1 protein bound to iron and prevented the formation of a hydroxyl radical in vitro. Thus, HprA1 protein probably contributes to hydrogen peroxide tolerance in L. casei strain Shirota by binding to iron in the cells and preventing the formation of a hydroxyl radical.

  19. Analysis of Microbial Communities in the Oil Reservoir Subjected to CO2-Flooding by Using Functional Genes as Molecular Biomarkers for Microbial CO2 Sequestration

    Directory of Open Access Journals (Sweden)

    Jin-Feng eLiu

    2015-03-01

    Full Text Available Sequestration of CO2 in oil reservoirs is considered to be one of the feasible options for mitigating atmospheric CO2 building up and also for the in situ potential bioconversion of stored CO2 to methane. However, the information on these functional microbial communities and the impact of CO2 storage on them is hardly available. In this paper a comprehensive molecular survey was performed on microbial communities in production water samples from oil reservoirs experienced CO2-flooding by analysis of functional genes involved in the process, including cbbM, cbbL, fthfs, [FeFe]-hydrogenase and mcrA. As a comparison, these functional genes in the production water samples from oil reservoir only experienced water-flooding in areas of the same oil bearing bed were also analyzed. It showed that these functional genes were all of rich diversity in these samples, and the functional microbial communities and their diversity were strongly affected by a long-term exposure to injected CO2. More interestingly, microorganisms affiliated with members of the genera Methanothemobacter, Acetobacterium and Halothiobacillus as well as hydrogen producers in CO2 injected area either increased or remained unchanged in relative abundance compared to that in water-flooded area, which implied that these microorganisms could adapt to CO2 injection and, if so, demonstrated the potential for microbial fixation and conversion of CO2 into methane in subsurface oil reservoirs.

  20. Prioritization of epilepsy associated candidate genes by convergent analysis.

    Science.gov (United States)

    Jia, Peilin; Ewers, Jeffrey M; Zhao, Zhongming

    2011-02-24

    Epilepsy is a severe neurological disorder affecting a large number of individuals, yet the underlying genetic risk factors for epilepsy remain unclear. Recent studies have revealed several recurrent copy number variations (CNVs) that are more likely to be associated with epilepsy. The responsible gene(s) within these regions have yet to be definitively linked to the disorder, and the implications of their interactions are not fully understood. Identification of these genes may contribute to a better pathological understanding of epilepsy, and serve to implicate novel therapeutic targets for further research. In this study, we examined genes within heterozygous deletion regions identified in a recent large-scale study, encompassing a diverse spectrum of epileptic syndromes. By integrating additional protein-protein interaction data, we constructed subnetworks for these CNV-region genes and also those previously studied for epilepsy. We observed 20 genes common to both networks, primarily concentrated within a small molecular network populated by GABA receptor, BDNF/MAPK signaling, and estrogen receptor genes. From among the hundreds of genes in the initial networks, these were designated by convergent evidence for their likely association with epilepsy. Importantly, the identified molecular network was found to contain complex interrelationships, providing further insight into epilepsy's underlying pathology. We further performed pathway enrichment and crosstalk analysis and revealed a functional map which indicates the significant enrichment of closely related neurological, immune, and kinase regulatory pathways. The convergent framework we proposed here provides a unique and powerful approach to screening and identifying promising disease genes out of typically hundreds to thousands of genes in disease-related CNV-regions. Our network and pathway analysis provides important implications for the underlying molecular mechanisms for epilepsy. The strategy can be

  1. Prioritization of epilepsy associated candidate genes by convergent analysis.

    Directory of Open Access Journals (Sweden)

    Peilin Jia

    2011-02-01

    Full Text Available Epilepsy is a severe neurological disorder affecting a large number of individuals, yet the underlying genetic risk factors for epilepsy remain unclear. Recent studies have revealed several recurrent copy number variations (CNVs that are more likely to be associated with epilepsy. The responsible gene(s within these regions have yet to be definitively linked to the disorder, and the implications of their interactions are not fully understood. Identification of these genes may contribute to a better pathological understanding of epilepsy, and serve to implicate novel therapeutic targets for further research.In this study, we examined genes within heterozygous deletion regions identified in a recent large-scale study, encompassing a diverse spectrum of epileptic syndromes. By integrating additional protein-protein interaction data, we constructed subnetworks for these CNV-region genes and also those previously studied for epilepsy. We observed 20 genes common to both networks, primarily concentrated within a small molecular network populated by GABA receptor, BDNF/MAPK signaling, and estrogen receptor genes. From among the hundreds of genes in the initial networks, these were designated by convergent evidence for their likely association with epilepsy. Importantly, the identified molecular network was found to contain complex interrelationships, providing further insight into epilepsy's underlying pathology. We further performed pathway enrichment and crosstalk analysis and revealed a functional map which indicates the significant enrichment of closely related neurological, immune, and kinase regulatory pathways.The convergent framework we proposed here provides a unique and powerful approach to screening and identifying promising disease genes out of typically hundreds to thousands of genes in disease-related CNV-regions. Our network and pathway analysis provides important implications for the underlying molecular mechanisms for epilepsy. The

  2. Evolution and functional insights of different ancestral orthologous clades of chitin synthase genes in the fungal tree of life

    Directory of Open Access Journals (Sweden)

    Mu eLi

    2016-02-01

    Full Text Available Chitin synthases (CHSs are key enzymes in the biosynthesis of chitin, an important structural component of fungal cell walls that can trigger innate immune responses in host plants and animals. Members of CHS gene family perform various functions in fungal cellular processes. Previous studies focused primarily on classifying diverse CHSs into different classes, regardless of their functional diversification, or on characterizing their functions in individual fungal species. A complete and systematic comparative analysis of CHS genes based on their orthologous relationships will be valuable for elucidating the evolution and functions of different CHS genes in fungi. Here, we identified and compared members of the CHS gene family across the fungal tree of life, including 18 divergent fungal lineages. Phylogenetic analysis revealed that the fungal CHS gene family is comprised of at least 10 ancestral orthologous clades, which have undergone multiple independent duplications and losses in different fungal lineages during evolution. Interestingly, one of these CHS clades (class III was expanded in plant or animal pathogenic fungi belonging to different fungal lineages. Two clades (classes VIb and VIc identified for the first time in this study occurred mainly in plant pathogenic fungi from Sordariomycetes and Dothideomycetes. Moreover, members of classes III and VIb were specifically up-regulated during plant infection, suggesting important roles in pathogenesis. In addition, CHS-associated networks conserved among plant pathogenic fungi are involved in various biological processes, including sexual reproduction and plant infection. We also identified specificity-determining sites, many of which are located at or adjacent to important structural and functional sites that are potentially responsible for functional divergence of different CHS classes. Overall, our results provide new insights into the evolution and function of members of CHS gene

  3. Analysis of the complement and molecular evolution of tRNA genes in cow

    Directory of Open Access Journals (Sweden)

    Barris Wesley C

    2009-04-01

    Full Text Available Abstract Background Detailed information regarding the number and organization of transfer RNA (tRNA genes at the genome level is becoming readily available with the increase of DNA sequencing of whole genomes. However the identification of functional tRNA genes is challenging for species that have large numbers of repetitive elements containing tRNA derived sequences, such as Bos taurus. Reliable identification and annotation of entire sets of tRNA genes allows the evolution of tRNA genes to be understood on a genomic scale. Results In this study, we explored the B. taurus genome using bioinformatics and comparative genomics approaches to catalogue and analyze cow tRNA genes. The initial analysis of the cow genome using tRNAscan-SE identified 31,868 putative tRNA genes and 189,183 pseudogenes, where 28,830 of the 31,868 predicted tRNA genes were classified as repetitive elements by the RepeatMasker program. We then used comparative genomics to further discriminate between functional tRNA genes and tRNA-derived sequences for the remaining set of 3,038 putative tRNA genes. For our analysis, we used the human, chimpanzee, mouse, rat, horse, dog, chicken and fugu genomes to predict that the number of active tRNA genes in cow lies in the vicinity of 439. Of this set, 150 tRNA genes were 100% identical in their sequences across all nine vertebrate genomes studied. Using clustering analyses, we identified a new tRNA-GlyCCC subfamily present in all analyzed mammalian genomes. We suggest that this subfamily originated from an ancestral tRNA-GlyGCC gene via a point mutation prior to the radiation of the mammalian lineages. Lastly, in a separate analysis we created phylogenetic profiles for each putative cow tRNA gene using a representative set of genomes to gain an overview of common evolutionary histories of tRNA genes. Conclusion The use of a combination of bioinformatics and comparative genomics approaches has allowed the confident identification of a

  4. A Combination of CRISPR/Cas9 and Standardized RNAi as a Versatile Platform for the Characterization of Gene Function

    Directory of Open Access Journals (Sweden)

    Sebastian Wissel

    2016-08-01

    Full Text Available Traditional loss-of-function studies in Drosophila suffer from a number of shortcomings, including off-target effects in the case of RNA interference (RNAi or the stochastic nature of mosaic clonal analysis. Here, we describe minimal in vivo GFP interference (miGFPi as a versatile strategy to characterize gene function and to conduct highly stringent, cell type-specific loss-of-function experiments in Drosophila. miGFPi combines CRISPR/Cas9-mediated tagging of genes at their endogenous locus with an immunotag and an exogenous 21 nucleotide RNAi effector sequence with the use of a single reagent, highly validated RNAi line targeting this sequence. We demonstrate the utility and time effectiveness of this method by characterizing the function of the Polymerase I (Pol I-associated transcription factor Tif-1a, and the previously uncharacterized gene MESR4, in the Drosophila female germline stem cell lineage. In addition, we show that miGFPi serves as a powerful technique to functionally characterize individual isoforms of a gene. We exemplify this aspect of miGFPi by studying isoform-specific loss-of-function phenotypes of the longitudinals lacking (lola gene in neural stem cells. Altogether, the miGFPi strategy constitutes a generalized loss-of-function approach that is amenable to the study of the function of all genes in the genome in a stringent and highly time effective manner.

  5. Identification of Personalized Chemoresistance Genes in Subtypes of Basal-Like Breast Cancer Based on Functional Differences Using Pathway Analysis.

    Directory of Open Access Journals (Sweden)

    Tong Wu

    Full Text Available Breast cancer is a highly heterogeneous disease that is clinically classified into several subtypes. Among these subtypes, basal-like breast cancer largely overlaps with triple-negative breast cancer (TNBC, and these two groups are generally studied together as a single entity. Differences in the molecular makeup of breast cancers can result in different treatment strategies and prognoses for patients with different breast cancer subtypes. Compared with other subtypes, basal-like and other ER+ breast cancer subtypes exhibit marked differences in etiologic factors, clinical characteristics and therapeutic potential. Anthracycline drugs are typically used as the first-line clinical treatment for basal-like breast cancer subtypes. However, certain patients develop drug resistance following chemotherapy, which can lead to disease relapse and death. Even among patients with basal-like breast cancer, there can be significant molecular differences, and it is difficult to identify specific drug resistance proteins in any given patient using conventional variance testing methods. Therefore, we designed a new method for identifying drug resistance genes. Subgroups, personalized biomarkers, and therapy targets were identified using cluster analysis of differentially expressed genes. We found that basal-like breast cancer could be further divided into at least four distinct subgroups, including two groups at risk for drug resistance and two groups characterized by sensitivity to pharmacotherapy. Based on functional differences among these subgroups, we identified nine biomarkers related to drug resistance: SYK, LCK, GAB2, PAWR, PPARG, MDFI, ZAP70, CIITA and ACTA1. Finally, based on the deviation scores of the examined pathways, 16 pathways were shown to exhibit varying degrees of abnormality in the various subgroups, indicating that patients with different subtypes of basal-like breast cancer can be characterized by differences in the functional status of

  6. Identification and functional analysis of the gene cluster for fructan utilization in Prevotella intermedia.

    Science.gov (United States)

    Fuse, Haruka; Fukamachi, Haruka; Inoue, Mitsuko; Igarashi, Takeshi

    2013-02-25

    Fructanase enzymes hydrolyze the β-2,6 and β-2,1 linkages of levan and inulin fructans, respectively. We analyzed the influence of fructan on the growth of Prevotella intermedia. The growth of P. intermedia was enhanced by addition of inulin, implying that P. intermedia could also use inulin. Based on this finding, we identified and analyzed the genes encoding a putative fructanase (FruA), sugar transporter (FruB), and fructokinase (FruK) in the genome of strain ATCC25611. Transcript analysis by RT-PCR showed that the fruABK genes were co-transcribed as a single mRNA and semi-quantitative analysis confirmed that the fruA gene was induced in response to fructose and inulin. Recombinant FruA and FruK were purified and characterized biochemically. FruA strongly hydrolyzed inulin, with slight degradation of levan via an exo-type mechanism, revealing that FruA is an exo-β-d-fructanase. FruK converted fructose to fructose-6-phosphate in the presence of ATP, confirming that FruK is an ATP-dependent fructokinase. These results suggest that P. intermedia can utilize fructan as a carbon source for growth, and that the fructanase, sugar transporter, and fructokinase proteins we identified are involved in this fructan utilization. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. Human intronless genes: Functional groups, associated diseases, evolution, and mRNA processing in absence of splicing

    International Nuclear Information System (INIS)

    Grzybowska, Ewa A.

    2012-01-01

    Highlights: ► Functional characteristics of intronless genes (IGs). ► Diseases associated with IGs. ► Origin and evolution of IGs. ► mRNA processing without splicing. -- Abstract: Intronless genes (IGs) constitute approximately 3% of the human genome. Human IGs are essentially different in evolution and functionality from the IGs of unicellular eukaryotes, which represent the majority in their genomes. Functional analysis of IGs has revealed a massive over-representation of signal transduction genes and genes encoding regulatory proteins important for growth, proliferation, and development. IGs also often display tissue-specific expression, usually in the nervous system and testis. These characteristics translate into IG-associated diseases, mainly neuropathies, developmental disorders, and cancer. IGs represent recent additions to the genome, created mostly by retroposition of processed mRNAs with retained functionality. Processing, nuclear export, and translation of these mRNAs should be hampered dramatically by the lack of splice factors, which normally tightly cover mature transcripts and govern their fate. However, natural IGs manage to maintain satisfactory expression levels. Different mechanisms by which IGs solve the problem of mRNA processing and nuclear export are discussed here, along with their possible impact on reporter studies.

  8. Evidence-based gene models for structural and functional annotations of the oil palm genome.

    Science.gov (United States)

    Chan, Kuang-Lim; Tatarinova, Tatiana V; Rosli, Rozana; Amiruddin, Nadzirah; Azizi, Norazah; Halim, Mohd Amin Ab; Sanusi, Nik Shazana Nik Mohd; Jayanthi, Nagappan; Ponomarenko, Petr; Triska, Martin; Solovyev, Victor; Firdaus-Raih, Mohd; Sambanthamurthi, Ravigadevi; Murphy, Denis; Low, Eng-Ti Leslie

    2017-09-08

    Oil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years) has led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools. Using two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC 3 (fraction of cytosine and guanine in the third position of a codon) with over half the GC 3 -rich genes (GC 3  ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures. We present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC 3 -rich and intronless), as well as those associated with important functions, such as FA

  9. Cartilage-selective genes identified in genome-scale analysis of non-cartilage and cartilage gene expression

    Directory of Open Access Journals (Sweden)

    Cohn Zachary A

    2007-06-01

    Full Text Available Abstract Background Cartilage plays a fundamental role in the development of the human skeleton. Early in embryogenesis, mesenchymal cells condense and differentiate into chondrocytes to shape the early skeleton. Subsequently, the cartilage anlagen differentiate to form the growth plates, which are responsible for linear bone growth, and the articular chondrocytes, which facilitate joint function. However, despite the multiplicity of roles of cartilage during human fetal life, surprisingly little is known about its transcriptome. To address this, a whole genome microarray expression profile was generated using RNA isolated from 18–22 week human distal femur fetal cartilage and compared with a database of control normal human tissues aggregated at UCLA, termed Celsius. Results 161 cartilage-selective genes were identified, defined as genes significantly expressed in cartilage with low expression and little variation across a panel of 34 non-cartilage tissues. Among these 161 genes were cartilage-specific genes such as cartilage collagen genes and 25 genes which have been associated with skeletal phenotypes in humans and/or mice. Many of the other cartilage-selective genes do not have established roles in cartilage or are novel, unannotated genes. Quantitative RT-PCR confirmed the unique pattern of gene expression observed by microarray analysis. Conclusion Defining the gene expression pattern for cartilage has identified new genes that may contribute to human skeletogenesis as well as provided further candidate genes for skeletal dysplasias. The data suggest that fetal cartilage is a complex and transcriptionally active tissue and demonstrate that the set of genes selectively expressed in the tissue has been greatly underestimated.

  10. Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.).

    Science.gov (United States)

    Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu

    2015-01-01

    The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut.

  11. Gene function analysis by artificial microRNAs in Physcomitrella patens.

    KAUST Repository

    Khraiwesh, Basel; Fattash, Isam; Arif, Muhammad Asif; Frank, Wolfgang

    2011-01-01

    MicroRNAs (miRNAs) are ~21 nt long small RNAs transcribed from endogenous MIR genes which form precursor RNAs with a characteristic hairpin structure. miRNAs control the expression of cognate target genes by binding to reverse complementary

  12. Identification of bovine leukemia virus tax function associated with host cell transcription, signaling, stress response and immune response pathway by microarray-based gene expression analysis

    Directory of Open Access Journals (Sweden)

    Arainga Mariluz

    2012-03-01

    Full Text Available Abstract Background Bovine leukemia virus (BLV is associated with enzootic bovine leukosis and is closely related to human T-cell leukemia virus type I. The Tax protein of BLV is a transcriptional activator of viral replication and a key contributor to oncogenic potential. We previously identified interesting mutant forms of Tax with elevated (TaxD247G or reduced (TaxS240P transactivation effects on BLV replication and propagation. However, the effects of these mutations on functions other than transcriptional activation are unknown. In this study, to identify genes that play a role in the cascade of signal events regulated by wild-type and mutant Tax proteins, we used a large-scale host cell gene-profiling approach. Results Using a microarray containing approximately 18,400 human mRNA transcripts, we found several alterations after the expression of Tax proteins in genes involved in many cellular functions such as transcription, signal transduction, cell growth, apoptosis, stress response, and immune response, indicating that Tax protein has multiple biological effects on various cellular environments. We also found that TaxD247G strongly regulated more genes involved in transcription, signal transduction, and cell growth functions, contrary to TaxS240P, which regulated fewer genes. In addition, the expression of genes related to stress response significantly increased in the presence of TaxS240P as compared to wild-type Tax and TaxD247G. By contrast, the largest group of downregulated genes was related to immune response, and the majority of these genes belonged to the interferon family. However, no significant difference in the expression level of downregulated genes was observed among the Tax proteins. Finally, the expression of important cellular factors obtained from the human microarray results were validated at the RNA and protein levels by real-time quantitative reverse transcription-polymerase chain reaction and western blotting

  13. Multiple genetic interaction experiments provide complementary information useful for gene function prediction.

    Directory of Open Access Journals (Sweden)

    Magali Michaut

    Full Text Available Genetic interactions help map biological processes and their functional relationships. A genetic interaction is defined as a deviation from the expected phenotype when combining multiple genetic mutations. In Saccharomyces cerevisiae, most genetic interactions are measured under a single phenotype - growth rate in standard laboratory conditions. Recently genetic interactions have been collected under different phenotypic readouts and experimental conditions. How different are these networks and what can we learn from their differences? We conducted a systematic analysis of quantitative genetic interaction networks in yeast performed under different experimental conditions. We find that networks obtained using different phenotypic readouts, in different conditions and from different laboratories overlap less than expected and provide significant unique information. To exploit this information, we develop a novel method to combine individual genetic interaction data sets and show that the resulting network improves gene function prediction performance, demonstrating that individual networks provide complementary information. Our results support the notion that using diverse phenotypic readouts and experimental conditions will substantially increase the amount of gene function information produced by genetic interaction screens.

  14. IGF-I Gene Therapy in Aging Rats Modulates Hippocampal Genes Relevant to Memory Function.

    Science.gov (United States)

    Pardo, Joaquín; Abba, Martin C; Lacunza, Ezequiel; Ogundele, Olalekan M; Paiva, Isabel; Morel, Gustavo R; Outeiro, Tiago F; Goya, Rodolfo G

    2018-03-14

    In rats, learning and memory performance decline during normal aging, which makes this rodent species a suitable model to evaluate therapeutic strategies. In aging rats, insulin-like growth factor-I (IGF-I), is known to significantly improve spatial memory accuracy as compared to control counterparts. A constellation of gene expression changes underlie the hippocampal phenotype of aging but no studies on the effects of IGF-I on the hippocampal transcriptome of old rodents have been documented. Here, we assessed the effects of IGF-I gene therapy on spatial memory performance in old female rats and compared them with changes in the hippocampal transcriptome. In the Barnes maze test, experimental rats showed a significantly higher exploratory frequency of the goal hole than controls. Hippocampal RNA-sequencing showed that 219 genes are differentially expressed in 28-month-old rats intracerebroventricularly injected with an adenovector expressing rat IGF-I as compared with placebo adenovector-injected counterparts. From the differentially expressed genes, 81 were down and 138 upregulated. From those genes, a list of functionally relevant genes, concerning hippocampal IGF-I expression, synaptic plasticity as well as neuronal function was identified. Our results provide an initial glimpse at the molecular mechanisms underlying the neuroprotective actions of IGF-I in the aging brain.

  15. Automated discovery of functional generality of human gene expression programs.

    Directory of Open Access Journals (Sweden)

    Georg K Gerber

    2007-08-01

    Full Text Available An important research problem in computational biology is the identification of expression programs, sets of co-expressed genes orchestrating normal or pathological processes, and the characterization of the functional breadth of these programs. The use of human expression data compendia for discovery of such programs presents several challenges including cellular inhomogeneity within samples, genetic and environmental variation across samples, uncertainty in the numbers of programs and sample populations, and temporal behavior. We developed GeneProgram, a new unsupervised computational framework based on Hierarchical Dirichlet Processes that addresses each of the above challenges. GeneProgram uses expression data to simultaneously organize tissues into groups and genes into overlapping programs with consistent temporal behavior, to produce maps of expression programs, which are sorted by generality scores that exploit the automatically learned groupings. Using synthetic and real gene expression data, we showed that GeneProgram outperformed several popular expression analysis methods. We applied GeneProgram to a compendium of 62 short time-series gene expression datasets exploring the responses of human cells to infectious agents and immune-modulating molecules. GeneProgram produced a map of 104 expression programs, a substantial number of which were significantly enriched for genes involved in key signaling pathways and/or bound by NF-kappaB transcription factors in genome-wide experiments. Further, GeneProgram discovered expression programs that appear to implicate surprising signaling pathways or receptor types in the response to infection, including Wnt signaling and neurotransmitter receptors. We believe the discovered map of expression programs involved in the response to infection will be useful for guiding future biological experiments; genes from programs with low generality scores might serve as new drug targets that exhibit minimal

  16. Hox gene function and interaction in the milkweed bug Oncopeltus fasciatus (Hemiptera).

    Science.gov (United States)

    Angelini, David R; Liu, Paul Z; Hughes, Cynthia L; Kaufman, Thomas C

    2005-11-15

    Studies in genetic model organisms such as Drosophila have demonstrated that the homeotic complex (Hox) genes impart segmental identity during embryogenesis. Comparative studies in a wide range of other insect taxa have shown that the Hox genes are expressed in largely conserved domains along the anterior-posterior body axis, but whether they are performing the same functions in different insects is an open question. Most of the Hox genes have been studied functionally in only a few holometabolous insects that undergo metamorphosis. Thus, it is unclear how the Hox genes are functioning in the majority of direct-developing insects and other arthropods. To address this question, we used a combination of RNAi and in situ hybridization to reveal the expression, functions, and regulatory interactions of the Hox genes in the milkweed bug Oncopeltus fasciatus. Our results reveal many similarities and some interesting differences compared to Drosophila. We find that the gene Antennapedia is required for the identity of all three thoracic segments, while Ultrabithorax, abdominal-A and Abdominal-B cooperate to pattern the abdomen. The three abdominal genes exhibit posterior prevalence like in Drosophila, but apparently via some post-transcriptional mechanism. The functions of the head genes proboscipedia, Deformed, and Sex combs reduced were shown previously, and here we find that the complex temporal expression of pb in the labium is like that of other insects, but its regulatory relationship with Scr is unique. Overall, our data reveal that the evolution of insect Hox genes has included many small changes within general conservation of expression and function, and that the milkweed bug provides a useful model for understanding the roles of Hox genes in a direct-developing insect.

  17. Genome-wide identification, evolutionary and expression analysis of the aspartic protease gene superfamily in grape

    Science.gov (United States)

    2013-01-01

    Background Aspartic proteases (APs) are a large family of proteolytic enzymes found in almost all organisms. In plants, they are involved in many biological processes, such as senescence, stress responses, programmed cell death, and reproduction. Prior to the present study, no grape AP gene(s) had been reported, and their research on woody species was very limited. Results In this study, a total of 50 AP genes (VvAP) were identified in the grape genome, among which 30 contained the complete ASP domain. Synteny analysis within grape indicated that segmental and tandem duplication events contributed to the expansion of the grape AP family. Additional analysis between grape and Arabidopsis demonstrated that several grape AP genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of grape and Arabidopsis. Phylogenetic relationships of the 30 VvAPs with the complete ASP domain and their Arabidopsis orthologs, as well as their gene and protein features were analyzed and their cellular localization was predicted. Moreover, expression profiles of VvAP genes in six different tissues were determined, and their transcript abundance under various stresses and hormone treatments were measured. Twenty-seven VvAP genes were expressed in at least one of the six tissues examined; nineteen VvAPs responded to at least one abiotic stress, 12 VvAPs responded to powdery mildew infection, and most of the VvAPs responded to SA and ABA treatments. Furthermore, integrated synteny and phylogenetic analysis identified orthologous AP genes between grape and Arabidopsis, providing a unique starting point for investigating the function of grape AP genes. Conclusions The genome-wide identification, evolutionary and expression analyses of grape AP genes provide a framework for future analysis of AP genes in defining their roles during stress response. Integrated synteny and phylogenetic analyses provide novel insight into the

  18. Combining Shigella Tn-seq data with gold-standard E. coli gene deletion data suggests rare transitions between essential and non-essential gene functionality.

    Science.gov (United States)

    Freed, Nikki E; Bumann, Dirk; Silander, Olin K

    2016-09-06

    Gene essentiality - whether or not a gene is necessary for cell growth - is a fundamental component of gene function. It is not well established how quickly gene essentiality can change, as few studies have compared empirical measures of essentiality between closely related organisms. Here we present the results of a Tn-seq experiment designed to detect essential protein coding genes in the bacterial pathogen Shigella flexneri 2a 2457T on a genome-wide scale. Superficial analysis of this data suggested that 481 protein-coding genes in this Shigella strain are critical for robust cellular growth on rich media. Comparison of this set of genes with a gold-standard data set of essential genes in the closely related Escherichia coli K12 BW25113 revealed that an excessive number of genes appeared essential in Shigella but non-essential in E. coli. Importantly, and in converse to this comparison, we found no genes that were essential in E. coli and non-essential in Shigella, implying that many genes were artefactually inferred as essential in Shigella. Controlling for such artefacts resulted in a much smaller set of discrepant genes. Among these, we identified three sets of functionally related genes, two of which have previously been implicated as critical for Shigella growth, but which are dispensable for E. coli growth. The data presented here highlight the small number of protein coding genes for which we have strong evidence that their essentiality status differs between the closely related bacterial taxa E. coli and Shigella. A set of genes involved in acetate utilization provides a canonical example. These results leave open the possibility of developing strain-specific antibiotic treatments targeting such differentially essential genes, but suggest that such opportunities may be rare in closely related bacteria.

  19. Serial analysis of gene expression (SAGE in bovine trypanotolerance: preliminary results

    Directory of Open Access Journals (Sweden)

    David Berthier

    2003-06-01

    Full Text Available Abstract In Africa, trypanosomosis is a tsetse-transmitted disease which represents the most important constraint to livestock production. Several indigenous West African taurine (Bos taurus breeds, such as the Longhorn (N'Dama cattle are well known to control trypanosome infections. This genetic ability named "trypanotolerance" results from various biological mechanisms under multigenic control. The methodologies used so far have not succeeded in identifying the complete pool of genes involved in trypanotolerance. New post genomic biotechnologies such as transcriptome analyses are efficient in characterising the pool of genes involved in the expression of specific biological functions. We used the serial analysis of gene expression (SAGE technique to construct, from Peripheral Blood Mononuclear Cells of an N'Dama cow, 2 total mRNA transcript libraries, at day 0 of a Trypanosoma congolense experimental infection and at day 10 post-infection, corresponding to the peak of parasitaemia. Bioinformatic comparisons in the bovine genomic databases allowed the identification of 187 up- and down- regulated genes, EST and unknown functional genes. Identification of the genes involved in trypanotolerance will allow to set up specific microarray sets for further metabolic and pharmacological studies and to design field marker-assisted selection by introgression programmes.

  20. Gene Network Analysis in Amygdala following Taste Aversion Learning in Rats

    Directory of Open Access Journals (Sweden)

    Siva K. Panguluri

    2013-01-01

    Full Text Available Conditioned taste aversion (CTA is an adaptive behavior that benefits survival of animals including humans and also serves as a powerful model to study the neural mechanisms of learning. Memory formation is a necessary component of CTA learning and involves neural processing and regulation of gene expression in the amygdala. Many studies have been focused on the identification of intracellular signaling cascades involved in CTA, but not late responsive genes underlying the long-lasting behavioral plasticity. In this study, we explored in silico experiments to identify persistent changes in gene expression associated with CTA in rats. We used oligonucleotide microarrays to identify 248 genes in the amygdala regulated by CTA. Pathway Studio and IPA software analyses showed that the differentially expressed genes in the amygdala fall in diverse functional categories such as behavior, psychological disorders, nervous system development and function, and cell-to-cell signaling. Conditioned taste aversion is a complex behavioral trait which involves association of visceral and taste inputs, consolidation of taste and visceral information, memory formation, retrieval of stored information, and extinction phase. In silico analysis of differentially expressed genes is therefore necessary to manipulate specific phase/stage of CTA to understand the molecular insight.

  1. ANALYSIS OF CELLULAR REACTION TO IFN-γ STIMULATION BY A SOFTWARE PACKAGE GeneExpressionAnalyser

    Directory of Open Access Journals (Sweden)

    A. V. Saetchnikov

    2014-01-01

    Full Text Available The software package GeneExpressionAnalyser for analysis of the DNA microarray experi-mental data has been developed. The algorithms of data analysis, differentially expressed genes and biological functions of the cell are described. The efficiency of the developed package is tested on the published experimental data devoted to the time-course research of the changes in the human cell un-der the influence of IFN-γ on melanoma. The developed software has a number of advantages over the existing software: it is free, has a simple and intuitive graphical interface, allows to analyze different types of DNA microarrays, contains a set of methods for complete data analysis and performs effec-tive gene annotation for a selected list of genes.

  2. CRISPR/Cas9 Promotes Functional Study of Testis Specific X-Linked Gene In Vivo.

    Directory of Open Access Journals (Sweden)

    Minyan Li

    Full Text Available Mammalian spermatogenesis is a highly regulated multistage process of sperm generation. It is hard to uncover the real function of a testis specific gene in vitro since the in vitro model is not yet mature. With the development of the CRISPR/Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR-associated 9 system, we can now rapidly generate knockout mouse models of testis specific genes to study the process of spermatogenesis in vivo. SYCP3-like X-linked 2 (SLX2 is a germ cell specific component, which contains a Cor1 domain and belongs to the XLR (X-linked, lymphocyte regulated family. Previous studies suggested that SLX2 might play an important role in mouse spermatogenesis based on its subcellular localization and interacting proteins. However, the function of SLX2 in vivo is still elusive. Here, to investigate the functions of SLX2 in spermatogenesis, we disrupted the Slx2 gene by using the CRISPR/Cas9 system. Since Slx2 is a testis specific X-linked gene, we obtained knockout male mice in the first generation and accelerated the study process. Compared with wild-type mice, Slx2 knockout mice have normal testis and epididymis. Histological observation of testes sections showed that Slx2 knockout affected none of the three main stages of spermatogenesis: mitosis, meiosis and spermiogenesis. In addition, we further confirmed that disruption of Slx2 did not affect the number of spermatogonial stem cells, meiosis progression or XY body formation by immunofluorescence analysis. As spermatogenesis was normal in Slx2 knockout mice, these mice were fertile. Taken together, we showed that Slx2 itself is not an essential gene for mouse spermatogenesis and CRISPR/Cas9 technique could speed up the functional study of testis specific X-linked gene in vivo.

  3. Global analysis of gene expression in response to L-Cysteine deprivation in the anaerobic protozoan parasite Entamoeba histolytica

    Science.gov (United States)

    2011-01-01

    Background Entamoeba histolytica, an enteric protozoan parasite, causes amebic colitis and extra intestinal abscesses in millions of inhabitants of endemic areas. E. histolytica completely lacks glutathione metabolism but possesses L-cysteine as the principle low molecular weight thiol. L-Cysteine is essential for the structure, stability, and various protein functions, including catalysis, electron transfer, redox regulation, nitrogen fixation, and sensing for regulatory processes. Recently, we demonstrated that in E. histolytica, L-cysteine regulates various metabolic pathways including energy, amino acid, and phospholipid metabolism. Results In this study, employing custom-made Affymetrix microarrays, we performed time course (3, 6, 12, 24, and 48 h) gene expression analysis upon L-cysteine deprivation. We identified that out of 9,327 genes represented on the array, 290 genes encoding proteins with functions in metabolism, signalling, DNA/RNA regulation, electron transport, stress response, membrane transport, vesicular trafficking/secretion, and cytoskeleton were differentially expressed (≥3 fold) at one or more time points upon L-cysteine deprivation. Approximately 60% of these modulated genes encoded proteins of no known function and annotated as hypothetical proteins. We also attempted further functional analysis of some of the most highly modulated genes by L-cysteine depletion. Conclusions To our surprise, L-cysteine depletion caused only limited changes in the expression of genes involved in sulfur-containing amino acid metabolism and oxidative stress defense. In contrast, we observed significant changes in the expression of several genes encoding iron sulfur flavoproteins, a major facilitator super-family transporter, regulator of nonsense transcripts, NADPH-dependent oxido-reductase, short chain dehydrogenase, acetyltransferases, and various other genes involved in diverse cellular functions. This study represents the first genome-wide analysis of

  4. Global analysis of gene expression in response to L-Cysteine deprivation in the anaerobic protozoan parasite Entamoeba histolytica

    Directory of Open Access Journals (Sweden)

    Jeelani Ghulam

    2011-05-01

    Full Text Available Abstract Background Entamoeba histolytica, an enteric protozoan parasite, causes amebic colitis and extra intestinal abscesses in millions of inhabitants of endemic areas. E. histolytica completely lacks glutathione metabolism but possesses L-cysteine as the principle low molecular weight thiol. L-Cysteine is essential for the structure, stability, and various protein functions, including catalysis, electron transfer, redox regulation, nitrogen fixation, and sensing for regulatory processes. Recently, we demonstrated that in E. histolytica, L-cysteine regulates various metabolic pathways including energy, amino acid, and phospholipid metabolism. Results In this study, employing custom-made Affymetrix microarrays, we performed time course (3, 6, 12, 24, and 48 h gene expression analysis upon L-cysteine deprivation. We identified that out of 9,327 genes represented on the array, 290 genes encoding proteins with functions in metabolism, signalling, DNA/RNA regulation, electron transport, stress response, membrane transport, vesicular trafficking/secretion, and cytoskeleton were differentially expressed (≥3 fold at one or more time points upon L-cysteine deprivation. Approximately 60% of these modulated genes encoded proteins of no known function and annotated as hypothetical proteins. We also attempted further functional analysis of some of the most highly modulated genes by L-cysteine depletion. Conclusions To our surprise, L-cysteine depletion caused only limited changes in the expression of genes involved in sulfur-containing amino acid metabolism and oxidative stress defense. In contrast, we observed significant changes in the expression of several genes encoding iron sulfur flavoproteins, a major facilitator super-family transporter, regulator of nonsense transcripts, NADPH-dependent oxido-reductase, short chain dehydrogenase, acetyltransferases, and various other genes involved in diverse cellular functions. This study represents the first

  5. AGROBEST: an efficient Agrobacterium-mediated transient expression method for versatile gene function analyses in Arabidopsis seedlings

    Science.gov (United States)

    2014-01-01

    Background Transient gene expression via Agrobacterium-mediated DNA transfer offers a simple and fast method to analyze transgene functions. Although Arabidopsis is the most-studied model plant with powerful genetic and genomic resources, achieving highly efficient and consistent transient expression for gene function analysis in Arabidopsis remains challenging. Results We developed a highly efficient and robust Agrobacterium-mediated transient expression system, named AGROBEST (Agrobacterium-mediated enhanced seedling transformation), which achieves versatile analysis of diverse gene functions in intact Arabidopsis seedlings. Using β-glucuronidase (GUS) as a reporter for Agrobacterium-mediated transformation assay, we show that the use of a specific disarmed Agrobacterium strain with vir gene pre-induction resulted in homogenous GUS staining in cotyledons of young Arabidopsis seedlings. Optimization with AB salts in plant culture medium buffered with acidic pH 5.5 during Agrobacterium infection greatly enhanced the transient expression levels, which were significantly higher than with two existing methods. Importantly, the optimized method conferred 100% infected seedlings with highly increased transient expression in shoots and also transformation events in roots of ~70% infected seedlings in both the immune receptor mutant efr-1 and wild-type Col-0 seedlings. Finally, we demonstrated the versatile applicability of the method for examining transcription factor action and circadian reporter-gene regulation as well as protein subcellular localization and protein–protein interactions in physiological contexts. Conclusions AGROBEST is a simple, fast, reliable, and robust transient expression system enabling high transient expression and transformation efficiency in Arabidopsis seedlings. Demonstration of the proof-of-concept experiments elevates the transient expression technology to the level of functional studies in Arabidopsis seedlings in addition to previous

  6. Nitrogen Cycle Evaluation (NiCE) Chip for the Simultaneous Analysis of Multiple N-Cycle Associated Genes.

    Science.gov (United States)

    Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi

    2018-02-02

    Various microorganisms play key roles in the Nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible in the N transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named as N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely Nitrogen Cycle Evaluation (NiCE) chip by using microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess diversities of the N cycle functional genes. The NiCE chip technology is applicable to analyze the temporal dynamics of the N cycle gene

  7. Structure, evolution and functional inference on the Mildew Locus O (MLO) gene family in three cultivated Cucurbitaceae spp.

    Science.gov (United States)

    Iovieno, Paolo; Andolfo, Giuseppe; Schiavulli, Adalgisa; Catalano, Domenico; Ricciardi, Luigi; Frusciante, Luigi; Ercolano, Maria Raffaella; Pavan, Stefano

    2015-12-29

    The powdery mildew disease affects thousands of plant species and arguably represents the major fungal threat for many Cucurbitaceae crops, including melon (Cucumis melo L.), watermelon (Citrullus lanatus L.) and zucchini (Cucurbita pepo L.). Several studies revealed that specific members of the Mildew Locus O (MLO) gene family act as powdery mildew susceptibility factors. Indeed, their inactivation, as the result of gene knock-out or knock-down, is associated with a peculiar form of resistance, referred to as mlo resistance. We exploited recently available genomic information to provide a comprehensive overview of the MLO gene family in Cucurbitaceae. We report the identification of 16 MLO homologs in C. melo, 14 in C. lanatus and 18 in C. pepo genomes. Bioinformatic treatment of data allowed phylogenetic inference and the prediction of several ortholog pairs and groups. Comparison with functionally characterized MLO genes and, in C. lanatus, gene expression analysis, resulted in the detection of candidate powdery mildew susceptibility factors. We identified a series of conserved amino acid residues and motifs that are likely to play a major role for the function of MLO proteins. Finally, we performed a codon-based evolutionary analysis indicating a general high level of purifying selection in the three Cucurbitaceae MLO gene families, and the occurrence of regions under diversifying selection in candidate susceptibility factors. Results of this study may help to address further biological questions concerning the evolution and function of MLO genes. Moreover, data reported here could be conveniently used by breeding research, aiming to select powdery mildew resistant cultivars in Cucurbitaceae.

  8. Functional characterization of KanP, a methyltransferase from the kanamycin biosynthetic gene cluster of Streptomyces kanamyceticus.

    Science.gov (United States)

    Nepal, Keshav Kumar; Yoo, Jin Cheol; Sohng, Jae Kyung

    2010-09-20

    KanP, a putative methyltransferase, is located in the kanamycin biosynthetic gene cluster of Streptomyces kanamyceticus ATCC12853. Amino acid sequence analysis of KanP revealed the presence of S-adenosyl-L-methionine binding motifs, which are present in other O-methyltransferases. The kanP gene was expressed in Escherichia coli BL21 (DE3) to generate the E. coli KANP recombinant strain. The conversion of external quercetin to methylated quercetin in the culture extract of E. coli KANP proved the function of kanP as S-adenosyl-L-methionine-dependent methyltransferase. This is the first report concerning the identification of an O-methyltransferase gene from the kanamycin gene cluster. The resistant activity assay and RT-PCR analysis demonstrated the leeway for obtaining methylated kanamycin derivatives from the wild-type strain of kanamycin producer. 2009 Elsevier GmbH. All rights reserved.

  9. Investigating a multigene prognostic assay based on significant pathways for Luminal A breast cancer through gene expression profile analysis.

    Science.gov (United States)

    Gao, Haiyan; Yang, Mei; Zhang, Xiaolan

    2018-04-01

    The present study aimed to investigate potential recurrence-risk biomarkers based on significant pathways for Luminal A breast cancer through gene expression profile analysis. Initially, the gene expression profiles of Luminal A breast cancer patients were downloaded from The Cancer Genome Atlas database. The differentially expressed genes (DEGs) were identified using a Limma package and the hierarchical clustering analysis was conducted for the DEGs. In addition, the functional pathways were screened using Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses and rank ratio calculation. The multigene prognostic assay was exploited based on the statistically significant pathways and its prognostic function was tested using train set and verified using the gene expression data and survival data of Luminal A breast cancer patients downloaded from the Gene Expression Omnibus. A total of 300 DEGs were identified between good and poor outcome groups, including 176 upregulated genes and 124 downregulated genes. The DEGs may be used to effectively distinguish Luminal A samples with different prognoses verified by hierarchical clustering analysis. There were 9 pathways screened as significant pathways and a total of 18 DEGs involved in these 9 pathways were identified as prognostic biomarkers. According to the survival analysis and receiver operating characteristic curve, the obtained 18-gene prognostic assay exhibited good prognostic function with high sensitivity and specificity to both the train and test samples. In conclusion the 18-gene prognostic assay including the key genes, transcription factor 7-like 2, anterior parietal cortex and lymphocyte enhancer factor-1 may provide a new method for predicting outcomes and may be conducive to the promotion of precision medicine for Luminal A breast cancer.

  10. Digital gene expression analysis in mice lung with coinfection of influenza and streptococcus pneumoniae.

    Science.gov (United States)

    Luo, Jun; Zhou, Linlin; Wang, Hongren; Qin, Zhen; Xiang, Li; Zhu, Jie; Huang, Xiaojun; Yang, Yuan; Li, Wanyi; Wang, Baoning; Li, Mingyuan

    2017-12-22

    Influenza A virus (IAV) and Streptococcus pneumoniae (SP) are two major upper respiratory tract pathogens that can also cause infection in polarized bronchial epithelial cells to exacerbate disease in coinfected individuals which may result in significant morbidity. However, the underlying molecular mechanism is poorly understood. Here, we employed BALB/c ByJ mice inflected with SP, IAV, IAV followed by SP (IAV+SP) and PBS (Control) as models to survey the global gene expression using digital gene expression (DGE) profiling. We attempt to gain insights into the underlying genetic basis of this synergy at the expression level. Gene expression profiles were obtain using the Illimina/Hisseq sequencing technique, and further analyzed by enrichment analysis of Gene Ontology (GO) and Pathway function. The hematoxylin-eosin (HE) staining revealed different tissue changes in groups during which IAV+SP group showed the most severe cell apoptosis. Compared with Control, a total of 2731, 3221 and 3946 differentially expressed genes (DEGs) were detected in SP, IAV and IAV+SP respectively. Besides, sixty-two GO terms were identified by Gene Ontology functional enrichment analysis, such as cell killing, biological regulation, response to stimulus, signaling, biological adhesion, enzyme regulator activity, receptor regulator activity and translation regulator activity. Pathway significant enrichment analysis indicated the dysregulation of multiple pathways, including apoptosis pathway. Among these, five selected genes were further verified by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). This study shows that infection with SP, IAV or IAV+SP induces apoptosis with different degrees which might provide insights into the molecular mechanisms to facilitate further research.

  11. An in silico assessment of gene function and organization of the phenylpropanoid pathway metabolic networks in Arabidopsis thaliana and limitations thereof

    Science.gov (United States)

    Costa, Michael A.; Collins, R. Eric; Anterola, Aldwin M.; Cochrane, Fiona C.; Davin, Laurence B.; Lewis, Norman G.

    2003-01-01

    The Arabidopsis genome sequencing in 2000 gave to science the first blueprint of a vascular plant. Its successful completion also prompted the US National Science Foundation to launch the Arabidopsis 2010 initiative, the goal of which is to identify the function of each gene by 2010. In this study, an exhaustive analysis of The Institute for Genomic Research (TIGR) and The Arabidopsis Information Resource (TAIR) databases, together with all currently compiled EST sequence data, was carried out in order to determine to what extent the various metabolic networks from phenylalanine ammonia lyase (PAL) to the monolignols were organized and/or could be predicted. In these databases, there are some 65 genes which have been annotated as encoding putative enzymatic steps in monolignol biosynthesis, although many of them have only very low homology to monolignol pathway genes of known function in other plant systems. Our detailed analysis revealed that presently only 13 genes (two PALs, a cinnamate-4-hydroxylase, a p-coumarate-3-hydroxylase, a ferulate-5-hydroxylase, three 4-coumarate-CoA ligases, a cinnamic acid O-methyl transferase, two cinnamoyl-CoA reductases) and two cinnamyl alcohol dehydrogenases can be classified as having a bona fide (definitive) function; the remaining 52 genes currently have undetermined physiological roles. The EST database entries for this particular set of genes also provided little new insight into how the monolignol pathway was organized in the different tissues and organs, this being perhaps a consequence of both limitations in how tissue samples were collected and in the incomplete nature of the EST collections. This analysis thus underscores the fact that even with genomic sequencing, presumed to provide the entire suite of putative genes in the monolignol-forming pathway, a very large effort needs to be conducted to establish actual catalytic roles (including enzyme versatility), as well as the physiological function(s) for each member

  12. Genome-wide analysis of the WRKY gene family in physic nut (Jatropha curcas L.).

    Science.gov (United States)

    Xiong, Wangdan; Xu, Xueqin; Zhang, Lin; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2013-07-25

    The WRKY proteins, which contain highly conserved WRKYGQK amino acid sequences and zinc-finger-like motifs, constitute a large family of transcription factors in plants. They participate in diverse physiological and developmental processes. WRKY genes have been identified and characterized in a number of plant species. We identified a total of 58 WRKY genes (JcWRKY) in the genome of the physic nut (Jatropha curcas L.). On the basis of their conserved WRKY domain sequences, all of the JcWRKY proteins could be assigned to one of the previously defined groups, I-III. Phylogenetic analysis of JcWRKY genes with Arabidopsis and rice WRKY genes, and separately with castor bean WRKY genes, revealed no evidence of recent gene duplication in JcWRKY gene family. Analysis of transcript abundance of JcWRKY gene products were tested in different tissues under normal growth condition. In addition, 47 WRKY genes responded to at least one abiotic stress (drought, salinity, phosphate starvation and nitrogen starvation) in individual tissues (leaf, root and/or shoot cortex). Our study provides a useful reference data set as the basis for cloning and functional analysis of physic nut WRKY genes. Copyright © 2013 Elsevier B.V. All rights reserved.

  13. Molecular characterization and functional analysis of ubiquitin extension genes from the potato cyst nematode Globodera rostochiensis

    Science.gov (United States)

    Ubiquitin is a highly conserved 76-amino acid protein found in every eukaryotic cell. It has been proposed that ubiquitin has many cellular functions including DNA repair, transcription regulation, regulation of cell cycle and apoptosis. We identified two ubiquitin extension genes (Gr-Ubi1 and Gr-Ub...

  14. Isolation of the functional human excision repair gene ERCC5 by intercosmid recombination

    International Nuclear Information System (INIS)

    Mudgett, J.S.; MacInnes, M.A.

    1990-01-01

    The complete human nucleotide exicision repair gene ERCC5 was isolated as a functional gene on overlapping cosmids. ERCC5 corrects the excision repair deficiency of Chinese hamster ovary cell line UV135, of complementation group 5. Cosmids that contained human sequences were obtained from a UV-resistant cell line derived from UV135 cells transformed with human genomic DNA. Individually, none of the cosmids complemented the UV135 repair defect; cosmid groups were formed to represent putative human genomic regions, and specific pairs of cosmids that effectively transformed UV135 cells to UV resistance were identified. Analysis of transformants derived from the active cosmid pairs showed that the functional 32-kbp ERCC5 gene was reconstructed by homologous intercosmid recombination. The cloned human sequences exhibited 100% concordance with the locus designated genetically as ERCC5 located on human chromosome 13q. Cosmid-transformed UV135 host cells repaired cytotoxic damage to levels about 70% of normal and repaired UV-irradiated shuttle vector DNA to levels about 82% of normal

  15. Comparative gene expression analysis of the human periodontal ligament in deciduous and permanent teeth.

    Directory of Open Access Journals (Sweden)

    Je Seon Song

    Full Text Available There are histological and functional differences between human deciduous and permanent periodontal ligament (PDL tissues. The aim of this study was to determine the differences between these two types of tissue at the molecular level by comparing their gene expression patterns. PDL samples were obtained from permanent premolars (n = 38 and anterior deciduous teeth (n = 31 extracted from 40 healthy persons. Comparative cDNA microarray analysis revealed several differences in gene expression between the deciduous and permanent PDL tissues. These findings were verified by qRT-PCR (quantitative reverse-transcription-polymerase chain reaction analysis, and the areas where genes are expressed were revealed by immunohistochemical staining. The expressions of 21 genes were up-regulated in deciduous relative to PDL tissues, and those of 30 genes were up-regulated in permanent relative to deciduous PDL tissues. The genes that were up-regulated in deciduous PDL tissues were those involved in the formation of the extracellular matrix (LAMC2, LAMB3, and COMP, tissue development (IGF2BP, MAB21L2, and PAX3, and inflammatory or immune reactions leading to tissue degradation (IL1A, CCL21, and CCL18. The up-regulated genes in permanent PDL tissues were related to tissue degradation (IL6 and ADAMTS18, myocontraction (PDE3B, CASQ2, and MYH10, and neurological responses (FOS, NCAM2, SYT1, SLC22A3, DOCK3, LRRTM1, LRRTM3, PRSS12, and ARPP21. The analysis of differential gene expressions between deciduous and permanent PDL tissues aids our understanding of histological and functional differences between them at the molecular level.

  16. Comparative gene expression analysis of the human periodontal ligament in deciduous and permanent teeth.

    Science.gov (United States)

    Song, Je Seon; Hwang, Dong Hwan; Kim, Seong-Oh; Jeon, Mijeong; Choi, Byung-Jai; Jung, Han-Sung; Moon, Seok Jun; Park, Wonse; Choi, Hyung-Jun

    2013-01-01

    There are histological and functional differences between human deciduous and permanent periodontal ligament (PDL) tissues. The aim of this study was to determine the differences between these two types of tissue at the molecular level by comparing their gene expression patterns. PDL samples were obtained from permanent premolars (n = 38) and anterior deciduous teeth (n = 31) extracted from 40 healthy persons. Comparative cDNA microarray analysis revealed several differences in gene expression between the deciduous and permanent PDL tissues. These findings were verified by qRT-PCR (quantitative reverse-transcription-polymerase chain reaction) analysis, and the areas where genes are expressed were revealed by immunohistochemical staining. The expressions of 21 genes were up-regulated in deciduous relative to PDL tissues, and those of 30 genes were up-regulated in permanent relative to deciduous PDL tissues. The genes that were up-regulated in deciduous PDL tissues were those involved in the formation of the extracellular matrix (LAMC2, LAMB3, and COMP), tissue development (IGF2BP, MAB21L2, and PAX3), and inflammatory or immune reactions leading to tissue degradation (IL1A, CCL21, and CCL18). The up-regulated genes in permanent PDL tissues were related to tissue degradation (IL6 and ADAMTS18), myocontraction (PDE3B, CASQ2, and MYH10), and neurological responses (FOS, NCAM2, SYT1, SLC22A3, DOCK3, LRRTM1, LRRTM3, PRSS12, and ARPP21). The analysis of differential gene expressions between deciduous and permanent PDL tissues aids our understanding of histological and functional differences between them at the molecular level.

  17. Expression analysis and functional characterization of a novel cold-responsive gene CbCOR15a from Capsella bursa-pastoris.

    Science.gov (United States)

    Zhou, Mingqi; Wu, Lihua; Liang, Jing; Shen, Chen; Lin, Juan

    2012-05-01

    The cold-responsive (COR) genes involved in C-repeat binding factor signaling pathway function essentially in cold acclimation of higher plants. A novel COR gene CbCOR15a from shepherd's purse (Capsella bursa-pastoris) was predicted to be a homolog of COR15 in Arabidopsis. The analysis of tissue specific expression pattern as well as characterization of the CbCOR15a promoter revealed that the expression of CbCOR15a was induced by coldness not only in leaves and stem but also in roots. Sequence analysis showed that a 909 bp promoter region of CbCOR15a contained two CRT/DRE elements, two ABRE elements, one auxin-responsive TGA-element and one MeJA-responsive CGTCA-motif. In young seedlings the expression of CbCOR15a could be apparently increased by SA, ABA, MeJA and IAA, and transiently increased by GA(3) accompanied by obvious feedback suppression. According to the altered physiological index values in tobacco under cold treatments, the overexpression of CbCOR15a significantly increased the cold tolerance of transgenic tobacco plants. It can be suggested that CbCOR15a was involved in cold response of Capsella bursa-pastoris associated with SA, ABA, MeJA, IAA and GA(3) regulation and confers enhanced cold acclimation in transgenic plants.

  18. Amiloride-enhanced gene transfection of octa-arginine functionalized calcium phosphate nanoparticles.

    Directory of Open Access Journals (Sweden)

    Juan Ramón Vanegas Sáenz

    Full Text Available Nanoparticles represent promising gene delivery systems in biomedicine to facilitate prolonged gene expression with low toxicity compared to viral vectors. Specifically, nanoparticles of calcium phosphate (nCaP, the main inorganic component of human bone, exhibit high biocompatibility and good biodegradability and have been reported to have high affinity for protein or DNA, having thus been used as gene transfer vectors. On the other hand, Octa-arginine (R8, which has a high permeability to cell membrane, has been reported to improve intracellular delivery systems. Here, we present an optimized method for nCaP-mediated gene delivery using an octa-arginine (R8-functionalized nCaP vector containing a marker or functional gene construct. nCaP particle size was between 220-580 nm in diameter and all R8-functionalized nCaPs carried a positive charge. R8 concentration significantly improved nCaP transfection efficiency with high cell compatibility in human mesenchymal stem cells (hMSC and human osteoblasts (hOB in particular, suggesting nCaPs as a good option for non-viral vector gene delivery. Furthermore, pre-treatment with different endocytosis inhibitors identified that the endocytic pathway differed among cell lines and functionalized nanoparticles, with amiloride increasing transfection efficiency of R8-functionalized nCaPs in hMSC and hOB.

  19. Cloning and Functional Characterization of the Maize (Zea mays L.) Carotenoid Epsilon Hydroxylase Gene

    Science.gov (United States)

    Sheng, Yanmin; Wang, Yingdian; Capell, Teresa; Shi, Lianxuan; Ni, Xiuzhen; Sandmann, Gerhard; Christou, Paul; Zhu, Changfu

    2015-01-01

    The assignment of functions to genes in the carotenoid biosynthesis pathway is necessary to understand how the pathway is regulated and to obtain the basic information required for metabolic engineering. Few carotenoid ε-hydroxylases have been functionally characterized in plants although this would provide insight into the hydroxylation steps in the pathway. We therefore isolated mRNA from the endosperm of maize (Zea mays L., inbred line B73) and cloned a full-length cDNA encoding CYP97C19, a putative heme-containing carotenoid ε hydroxylase and member of the cytochrome P450 family. The corresponding CYP97C19 genomic locus on chromosome 1 was found to comprise a single-copy gene with nine introns. We expressed CYP97C19 cDNA under the control of the constitutive CaMV 35S promoter in the Arabidopsis thaliana lut1 knockout mutant, which lacks a functional CYP97C1 (LUT1) gene. The analysis of carotenoid levels and composition showed that lutein accumulated to high levels in the rosette leaves of the transgenic lines but not in the untransformed lut1 mutants. These results allowed the unambiguous functional annotation of maize CYP97C19 as an enzyme with strong zeinoxanthin ε-ring hydroxylation activity. PMID:26030746

  20. Genes involved in immunity and apoptosis are associated with human presbycusis based on microarray analysis.

    Science.gov (United States)

    Dong, Yang; Li, Ming; Liu, Puzhao; Song, Haiyan; Zhao, Yuping; Shi, Jianrong

    2014-06-01

    Genes involved in immunity and apoptosis were associated with human presbycusis. CCR3 and GILZ played an important role in the pathogenesis of presbycusis, probably through regulating chemokine receptor, T-cell apoptosis, or T-cell activation pathways. To identify genes associated with human presbycusis and explore the molecular mechanism of presbycusis. Hearing function was tested by pure-tone audiometry. Microarray analysis was performed to identify presbycusis-correlated genes by Illumina Human-6 BeadChip using the peripheral blood samples of subjects. To identify biological process categories and pathways associated with presbycusis-correlated genes, bioinformatics analysis was carried out by Gene Ontology Tree Machine (GOTM) and database for annotation, visualization, and integrated discovery (DAVID). Quantitative RT-PCR (qRT-PCR) was used to validate the microarray data. Microarray analysis identified 469 up-regulated genes and 323 down-regulated genes. Both the dominant biological processes by Gene Ontology (GO) analysis and the enriched pathways by Kyoto encyclopedia of genes and genomes (KEGG) and BIOCARTA showed that genes involved in immunity and apoptosis were associated with presbycusis. In addition, CCR3, GILZ, CXCL10, and CX3CR1 genes showed consistent difference between groups for both the gene chip and qRT-PCR data. The differences of CCR3 and GILZ between presbycusis patients and controls were statistically significant (p < 0.05).

  1. Network analysis of genomic alteration profiles reveals co-altered functional modules and driver genes for glioblastoma.

    Science.gov (United States)

    Gu, Yunyan; Wang, Hongwei; Qin, Yao; Zhang, Yujing; Zhao, Wenyuan; Qi, Lishuang; Zhang, Yuannv; Wang, Chenguang; Guo, Zheng

    2013-03-01

    The heterogeneity of genetic alterations in human cancer genomes presents a major challenge to advancing our understanding of cancer mechanisms and identifying cancer driver genes. To tackle this heterogeneity problem, many approaches have been proposed to investigate genetic alterations and predict driver genes at the individual pathway level. However, most of these approaches ignore the correlation of alteration events between pathways and miss many genes with rare alterations collectively contributing to carcinogenesis. Here, we devise a network-based approach to capture the cooperative functional modules hidden in genome-wide somatic mutation and copy number alteration profiles of glioblastoma (GBM) from The Cancer Genome Atlas (TCGA), where a module is a set of altered genes with dense interactions in the protein interaction network. We identify 7 pairs of significantly co-altered modules that involve the main pathways known to be altered in GBM (TP53, RB and RTK signaling pathways) and highlight the striking co-occurring alterations among these GBM pathways. By taking into account the non-random correlation of gene alterations, the property of co-alteration could distinguish oncogenic modules that contain driver genes involved in the progression of GBM. The collaboration among cancer pathways suggests that the redundant models and aggravating models could shed new light on the potential mechanisms during carcinogenesis and provide new indications for the design of cancer therapeutic strategies.

  2. Functional and evolutionary analysis of alternatively spliced genes is consistent with an early eukaryotic origin of alternative splicing

    Directory of Open Access Journals (Sweden)

    Penny David

    2007-10-01

    Full Text Available Abstract Background Alternative splicing has been reported in various eukaryotic groups including plants, apicomplexans, diatoms, amoebae, animals and fungi. However, whether widespread alternative splicing has evolved independently in the different eukaryotic groups or was inherited from their last common ancestor, and may therefore predate multicellularity, is still unknown. To better understand the origin and evolution of alternative splicing and its usage in diverse organisms, we studied alternative splicing in 12 eukaryotic species, comparing rates of alternative splicing across genes of different functional classes, cellular locations, intron/exon structures and evolutionary origins. Results For each species, we find that genes from most functional categories are alternatively spliced. Ancient genes (shared between animals, fungi and plants show high levels of alternative splicing. Genes with products expressed in the nucleus or plasma membrane are generally more alternatively spliced while those expressed in extracellular location show less alternative splicing. We find a clear correspondence between incidence of alternative splicing and intron number per gene both within and between genomes. In general, we find several similarities in patterns of alternative splicing across these diverse eukaryotes. Conclusion Along with previous studies indicating intron-rich genes with weak intron boundary consensus and complex spliceosomes in ancestral organisms, our results suggest that at least a simple form of alternative splicing may already have been present in the unicellular ancestor of plants, fungi and animals. A role for alternative splicing in the evolution of multicellularity then would largely have arisen by co-opting the preexisting process.

  3. Radiation-induced genomic instability, and the cloning and functional analysis of its related gene

    International Nuclear Information System (INIS)

    Muto, Masahiro; Kanari, Yasuyoshi; Kubo, Eiko; Yamada, Yutaka

    2000-01-01

    cloning and functional analysis of the related genes. (author)

  4. Suppression subtractive hybridization and comparative expression analysis to identify developmentally regulated genes in filamentous fungi.

    Science.gov (United States)

    Gesing, Stefan; Schindler, Daniel; Nowrousian, Minou

    2013-09-01

    Ascomycetes differentiate four major morphological types of fruiting bodies (apothecia, perithecia, pseudothecia and cleistothecia) that are derived from an ancestral fruiting body. Thus, fruiting body differentiation is most likely controlled by a set of common core genes. One way to identify such genes is to search for genes with evolutionary conserved expression patterns. Using suppression subtractive hybridization (SSH), we selected differentially expressed transcripts in Pyronema confluens (Pezizales) by comparing two cDNA libraries specific for sexual and for vegetative development, respectively. The expression patterns of selected genes from both libraries were verified by quantitative real time PCR. Expression of several corresponding homologous genes was found to be conserved in two members of the Sordariales (Sordaria macrospora and Neurospora crassa), a derived group of ascomycetes that is only distantly related to the Pezizales. Knockout studies with N. crassa orthologues of differentially regulated genes revealed a functional role during fruiting body development for the gene NCU05079, encoding a putative MFS peptide transporter. These data indicate conserved gene expression patterns and a functional role of the corresponding genes during fruiting body development; such genes are candidates of choice for further functional analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  5. Prion protein (PrP) gene-knockout cell lines: insight into functions of the PrP

    Science.gov (United States)

    Sakudo, Akikazu; Onodera, Takashi

    2015-01-01

    Elucidation of prion protein (PrP) functions is crucial to fully understand prion diseases. A major approach to studying PrP functions is the use of PrP gene-knockout (Prnp−/−) mice. So far, six types of Prnp−/− mice have been generated, demonstrating the promiscuous functions of PrP. Recently, other PrP family members, such as Doppel and Shadoo, have been found. However, information obtained from comparative studies of structural and functional analyses of these PrP family proteins do not fully reveal PrP functions. Recently, varieties of Prnp−/− cell lines established from Prnp−/− mice have contributed to the analysis of PrP functions. In this mini-review, we focus on Prnp−/− cell lines and summarize currently available Prnp−/− cell lines and their characterizations. In addition, we introduce the recent advances in the methodology of cell line generation with knockout or knockdown of the PrP gene. We also discuss how these cell lines have provided valuable insights into PrP functions and show future perspectives. PMID:25642423

  6. Functional redundancy and/or ongoing pseudogenization among F-box protein genes expressed in Arabidopsis male gametophyte.

    Science.gov (United States)

    Ikram, Sobia; Durandet, Monique; Vesa, Simona; Pereira, Serge; Guerche, Philippe; Bonhomme, Sandrine

    2014-06-01

    F-box protein genes family is one of the largest gene families in plants, with almost 700 predicted genes in the model plant Arabidopsis. F-box proteins are key components of the ubiquitin proteasome system that allows targeted protein degradation. Transcriptome analyses indicate that half of these F-box protein genes are found expressed in microspore and/or pollen, i.e., during male gametogenesis. To assess the role of F-box protein genes during this crucial developmental step, we selected 34 F-box protein genes recorded as highly and specifically expressed in pollen and isolated corresponding insertion mutants. We checked the expression level of each selected gene by RT-PCR and confirmed pollen expression for 25 genes, but specific expression for only 10 of the 34 F-box protein genes. In addition, we tested the expression level of selected F-box protein genes in 24 mutant lines and showed that 11 of them were null mutants. Transmission analysis of the mutations to the progeny showed that none of the single mutations was gametophytic lethal. These unaffected transmission efficiencies suggested leaky mutations or functional redundancy among F-box protein genes. Cytological observation of the gametophytes in the mutants confirmed these results. Combinations of mutations in F-box protein genes from the same subfamily did not lead to transmission defect either, further highlighting functional redundancy and/or a high proportion of pseudogenes among these F-box protein genes.

  7. Pathway-based analysis of a melanoma genome-wide association study: analysis of genes related to tumour-immunosuppression.

    Directory of Open Access Journals (Sweden)

    Nils Schoof

    Full Text Available Systemic immunosuppression is a risk factor for melanoma, and sunburn-induced immunosuppression is thought to be causal. Genes in immunosuppression pathways are therefore candidate melanoma-susceptibility genes. If variants within these genes individually have a small effect on disease risk, the association may be undetected in genome-wide association (GWA studies due to low power to reach a high significance level. Pathway-based approaches have been suggested as a method of incorporating a priori knowledge into the analysis of GWA studies. In this study, the association of 1113 single nucleotide polymorphisms (SNPs in 43 genes (39 genomic regions related to immunosuppression have been analysed using a gene-set approach in 1539 melanoma cases and 3917 controls from the GenoMEL consortium GWA study. The association between melanoma susceptibility and the whole set of tumour-immunosuppression genes, and also predefined functional subgroups of genes, was considered. The analysis was based on a measure formed by summing the evidence from the most significant SNP in each gene, and significance was evaluated empirically by case-control label permutation. An association was found between melanoma and the complete set of genes (p(emp=0.002, as well as the subgroups related to the generation of tolerogenic dendritic cells (p(emp=0.006 and secretion of suppressive factors (p(emp=0.0004, thus providing preliminary evidence of involvement of tumour-immunosuppression gene polymorphisms in melanoma susceptibility. The analysis was repeated on a second phase of the GenoMEL study, which showed no evidence of an association. As one of the first attempts to replicate a pathway-level association, our results suggest that low power and heterogeneity may present challenges.

  8. Cell functional enviromics: Unravelling the function of environmental factors

    Directory of Open Access Journals (Sweden)

    Alves Paula M

    2011-06-01

    Full Text Available Abstract Background While functional genomics, focused on gene functions and gene-gene interactions, has become a very active field of research in molecular biology, equivalent methodologies embracing the environment and gene-environment interactions are relatively less developed. Understanding the function of environmental factors is, however, of paramount importance given the complex, interactive nature of environmental and genetic factors across multiple time scales. Results Here, we propose a systems biology framework, where the function of environmental factors is set at its core. We set forth a "reverse" functional analysis approach, whereby cellular functions are reconstructed from the analysis of dynamic envirome data. Our results show these data sets can be mapped to less than 20 core cellular functions in a typical mammalian cell culture, while explaining over 90% of flux data variance. A functional enviromics map can be created, which provides a template for manipulating the environmental factors to induce a desired phenotypic trait. Conclusion Our results support the feasibility of cellular function reconstruction guided by the analysis and manipulation of dynamic envirome data.

  9. Large-scale gene-centric analysis identifies novel variants for coronary artery disease

    NARCIS (Netherlands)

    Butterworth, A.S.; Braund, P.S.; Hardwick, R.J.; Saleheen, D.; Peden, J.F.; Soranzo, N.; Chambers, J.C.; Kleber, M.E.; Keating, B.; Qasim, A.; Klopp, N.; Erdmann, J.; Basart, H.; Baumert, J.H.; Bezzina, C.R.; Boehm, B.O.; Brocheton, J.; Bugert, P.; Cambien, F.; Collins, R.; Couper, D.; Jong, J.S. de; Diemert, P.; Ejebe, K.; Elbers, C.C.; Elliott, P.; Fornage, M.; Frossard, P.; Garner, S.; Hunt, S.E.; Kastelein, J.J.; Klungel, O.H.; Kluter, H.; Koch, K.; Konig, I.R.; Kooner, A.S.; Liu, K.; McPherson, R.; Musameh, M.D.; Musani, S.; Papanicolaou, G.; Peters, A.; Peters, B.J.; Potter, S.; Psaty, B.M.; Rasheed, A.; Scott, J.; Seedorf, U.; Sehmi, J.S.; Sotoodehnia, N.; Stark, K.; Stephens, J.; Schoot, C.E. van der; Schouw, Y.T. van der; Harst, P. van der; Vasan, R.S.; Wilde, A.A.; Willenborg, C.; Winkelmann, B.R.; Zaidi, M.; Zhang, W.; Ziegler, A.; Koenig, W.; Matz, W.; Trip, M.D.; Reilly, M.P.; Kathiresan, S.; Schunkert, H.; Hamsten, A.; Hall, A.S.; Kooner, J.S.; Thompson, S.G.; Thompson, J.R.; Watkins, H.; Danesh, J.; Barnes, T.; Rafelt, S.; Codd, V.; Bruinsma, N.; Dekker, L.R.; Henriques, J.P.; Koch, K.T.; Winter, R.J. de; Alings, M.; Allaart, C.F.; Gorgels, A.P.; Verheugt, F.W.A.; Mueller, M.; Meisinger, C.; DerOhannessian, S.; Mehta, N.N.; Ferguson, J.; Hakonarson, H.; Matthai, W.; Wilensky, R.; Hopewell, J.C.; Parish, S.; Linksted, P.; Notman, J.; Gonzalez, H.; Young, A.; Ostley, T.; Munday, A.; Goodwin, N.; Verdon, V.; Shah, S.; Edwards, C.; Mathews, C.; Gunter, R.; Benham, J.; Davies, C.; Cobb, M.; Cobb, L.; Crowther, J.; Richards, A.; Silver, M.; Tochlin, S.; Mozley, S.; Clark, S.; Radley, M.; Kourellias, K.; Olsson, P.; Barlera, S.; Tognoni, G.; Rust, S.; Assmann, G.; Heath, S.; Zelenika, D.; Gut, I.; Green, F.; Farrall, M.; Goel, A.; Ongen, H.; Franzosi, M.G.; Lathrop, M.; Clarke, R.; Aly, A.; Anner, K.; Bjorklund, K.; Blomgren, G.; Cederschiold, B.; Danell-Toverud, K.; Eriksson, P.; Grundstedt, U.; Heinonen, M.; Hellenius, M.L.; Hooft, F. van 't; Husman, K.; Lagercrantz, J.; Larsson, A.; Larsson, M.; Mossfeldt, M.; Malarstig, A.; Olsson, G.; Sabater-Lleal, M.; Sennblad, B.; Silveira, A.; Strawbridge, R.; Soderholm, B.; Ohrvik, J.; Zaman, K.S.; Mallick, N.H.; Azhar, M.; Samad, A.; Ishaq, M.; Shah, N.; Samuel, M.; Kathiresan, S.C.; Assimes, T.L.; Holm, H.; Preuss, M.; Stewart, A.F.; Barbalic, M.; Gieger, C.; Absher, D.; Aherrahrou, Z.; Allayee, H.; Altshuler, D.; Anand, S.; Andersen, K.; Anderson, J.L.; Ardissino, D.; Ball, S.G.; Balmforth, A.J.; Barnes, T.A.; Becker, L.C.; Becker, D.M.; Berger, K.; Bis, J.C.; Boekholdt, S.M.; Boerwinkle, E.; Brown, M.J.; Burnett, M.S.; Buysschaert, I.; Carlquist, J.F.; Chen, L.; Davies, R.W.; Dedoussis, G.; Dehghan, A.; Demissie, S.; Devaney, J.; Do, R.; Doering, A.; El Mokhtari, N.E.; Ellis, S.G.; Elosua, R.; Engert, J.C.; Epstein, S.; Faire, U. de; Fischer, M.; Folsom, A.R.; Freyer, J.; Gigante, B.; Girelli, D.; Gretarsdottir, S.; Gudnason, V.; Gulcher, J.R.; Tennstedt, S.; Halperin, E.; Hammond, N.; Hazen, S.L.; Hofman, A.; Horne, B.D.; Illig, T.; Iribarren, C.; Jones, G.T.; Jukema, J.W.; Kaiser, M.A.; Kaplan, L.M.; Khaw, K.T.; Knowles, J.W.; Kolovou, G.; Kong, A.; Laaksonen, R.; Lambrechts, D.; Leander, K.; Li, M.; Lieb, W.; Lettre, G.; Loley, C.; Lotery, A.J.; Mannucci, P.M.; Martinelli, N.; McKeown, P.P.; Meitinger, T.; Melander, O.; Merlini, P.A.; Mooser, V.; Morgan, T.; Muhleisen T.W., .; Muhlestein, J.B.; Musunuru, K.; Nahrstaedt, J.; Nothen, Markus; Olivieri, O.; Peyvandi, F.; Patel, R.S.; Patterson, C.C.; Qu, L.; Quyyumi, A.A.; Rader, D.J.; Rallidis, L.S.; Rice, C.; Roosendaal, F.R.; Rubin, D.; Salomaa, V.; Sampietro, M.L.; Sandhu, M.S.; Schadt, E.; Schafer, A.; Schillert, A.; Schreiber, S.; Schrezenmeir, J.; Schwartz, S.M.; Siscovick, D.S.; Sivananthan, M.; Sivapalaratnam, S.; Smith, A.V.; Smith, T.B.; Snoep, J.D.; Spertus, J.A.; Stefansson, K.; Stirrups, K.; Stoll, M.; Tang, W.H.; Thorgeirsson, G.; Thorleifsson, G.; Tomaszewski, M.; Uitterlinden, A.G.; Rij, A.M. van; Voight, B.F.; Wareham, N.J.; AWells, G.; Wichmann, H.E.; Witteman, J.C.; Wright, B.J.; Ye, S.; Cupples, L.A.; Quertermous, T.; Marz, W.; Blankenberg, S.; Thorsteinsdottir, U.; Roberts, R.; O'Donnell, C.J.; Onland-Moret, N.C.; Setten, J. van; Bakker, P.I. de; Verschuren, W.M.; Boer, J.M.; Wijmenga, C.; Hofker, M.H.; Maitland-van der Zee, A.H.; Boer, A. de; Grobbee, D.E.; Attwood, T.; Belz, S.; Cooper, J.; Crisp-Hihn, A.; Deloukas, P.; Foad, N.; Goodall, A.H.; Gracey, J.; Gray, E.; Gwilliams, R.; Heimerl, S.; Hengstenberg, C.; Jolley, J.; Krishnan, U.; Lloyd-Jones, H.; Lugauer, I.; Lundmark, P.; Maouche, S.; Moore, J.S.; Muir, D.; Murray, E.; Nelson, C.P.; Neudert, J.; Niblett, D.; O'Leary, K.; Ouwehand, W.H.; Pollard, H.; Rankin, A.; Rice, C.M.; Sager, H.; Samani, N.J.; Sambrook, J.; Schmitz, G.; Scholz, M.; Schroeder, L.; Syvannen, A.C.; Wallace, C.

    2011-01-01

    Coronary artery disease (CAD) has a significant genetic contribution that is incompletely characterized. To complement genome-wide association (GWA) studies, we conducted a large and systematic candidate gene study of CAD susceptibility, including analysis of many uncommon and functional variants.

  10. A functional genomics approach using metabolomics and in silico pathway analysis

    DEFF Research Database (Denmark)

    Förster, Jochen; Gombert, Andreas Karoly; Nielsen, Jens

    2002-01-01

    analysis techniques and changes in the genotype will in many cases lead to different metabolite profiles. Here, a theoretical framework that may be applied to identify the function of orphan genes is presented. The approach is based on a combination of metabolome analysis combined with in silico pathway...

  11. Characterization of the MLO gene family in Rosaceae and gene expression analysis in Malus domestica.

    Science.gov (United States)

    Pessina, Stefano; Pavan, Stefano; Catalano, Domenico; Gallotta, Alessandra; Visser, Richard G F; Bai, Yuling; Malnoy, Mickael; Schouten, Henk J

    2014-07-22

    Powdery mildew (PM) is a major fungal disease of thousands of plant species, including many cultivated Rosaceae. PM pathogenesis is associated with up-regulation of MLO genes during early stages of infection, causing down-regulation of plant defense pathways. Specific members of the MLO gene family act as PM-susceptibility genes, as their loss-of-function mutations grant durable and broad-spectrum resistance. We carried out a genome-wide characterization of the MLO gene family in apple, peach and strawberry, and we isolated apricot MLO homologs through a PCR-approach. Evolutionary relationships between MLO homologs were studied and syntenic blocks constructed. Homologs that are candidates for being PM susceptibility genes were inferred by phylogenetic relationships with functionally characterized MLO genes and, in apple, by monitoring their expression following inoculation with the PM causal pathogen Podosphaera leucotricha. Genomic tools available for Rosaceae were exploited in order to characterize the MLO gene family. Candidate MLO susceptibility genes were identified. In follow-up studies it can be investigated whether silencing or a loss-of-function mutations in one or more of these candidate genes leads to PM resistance.

  12. Functional requirements driving the gene duplication in 12 Drosophila species.

    Science.gov (United States)

    Zhong, Yan; Jia, Yanxiao; Gao, Yang; Tian, Dacheng; Yang, Sihai; Zhang, Xiaohui

    2013-08-15

    Gene duplication supplies the raw materials for novel gene functions and many gene families arisen from duplication experience adaptive evolution. Most studies of young duplicates have focused on mammals, especially humans, whereas reports describing their genome-wide evolutionary patterns across the closely related Drosophila species are rare. The sequenced 12 Drosophila genomes provide the opportunity to address this issue. In our study, 3,647 young duplicate gene families were identified across the 12 Drosophila species and three types of expansions, species-specific, lineage-specific and complex expansions, were detected in these gene families. Our data showed that the species-specific young duplicate genes predominated (86.6%) over the other two types. Interestingly, many independent species-specific expansions in the same gene family have been observed in many species, even including 11 or 12 Drosophila species. Our data also showed that the functional bias observed in these young duplicate genes was mainly related to responses to environmental stimuli and biotic stresses. This study reveals the evolutionary patterns of young duplicates across 12 Drosophila species on a genomic scale. Our results suggest that convergent evolution acts on young duplicate genes after the species differentiation and adaptive evolution may play an important role in duplicate genes for adaption to ecological factors and environmental changes in Drosophila.

  13. The Genome Sequence of Leishmania (Leishmania) amazonensis: Functional Annotation and Extended Analysis of Gene Models

    Science.gov (United States)

    Real, Fernando; Vidal, Ramon Oliveira; Carazzolle, Marcelo Falsarella; Mondego, Jorge Maurício Costa; Costa, Gustavo Gilson Lacerda; Herai, Roberto Hirochi; Würtele, Martin; de Carvalho, Lucas Miguel; e Ferreira, Renata Carmona; Mortara, Renato Arruda; Barbiéri, Clara Lucia; Mieczkowski, Piotr; da Silveira, José Franco; Briones, Marcelo Ribeiro da Silva; Pereira, Gonçalo Amarante Guimarães; Bahia, Diana

    2013-01-01

    We present the sequencing and annotation of the Leishmania (Leishmania) amazonensis genome, an etiological agent of human cutaneous leishmaniasis in the Amazon region of Brazil. L. (L.) amazonensis shares features with Leishmania (L.) mexicana but also exhibits unique characteristics regarding geographical distribution and clinical manifestations of cutaneous lesions (e.g. borderline disseminated cutaneous leishmaniasis). Predicted genes were scored for orthologous gene families and conserved domains in comparison with other human pathogenic Leishmania spp. Carboxypeptidase, aminotransferase, and 3′-nucleotidase genes and ATPase, thioredoxin, and chaperone-related domains were represented more abundantly in L. (L.) amazonensis and L. (L.) mexicana species. Phylogenetic analysis revealed that these two species share groups of amastin surface proteins unique to the genus that could be related to specific features of disease outcomes and host cell interactions. Additionally, we describe a hypothetical hybrid interactome of potentially secreted L. (L.) amazonensis proteins and host proteins under the assumption that parasite factors mimic their mammalian counterparts. The model predicts an interaction between an L. (L.) amazonensis heat-shock protein and mammalian Toll-like receptor 9, which is implicated in important immune responses such as cytokine and nitric oxide production. The analysis presented here represents valuable information for future studies of leishmaniasis pathogenicity and treatment. PMID:23857904

  14. Aspirin exposure reveals novel genes associated with platelet function and cardiovascular events.

    Science.gov (United States)

    Voora, Deepak; Cyr, Derek; Lucas, Joseph; Chi, Jen-Tsan; Dungan, Jennifer; McCaffrey, Timothy A; Katz, Richard; Newby, L Kristin; Kraus, William E; Becker, Richard C; Ortel, Thomas L; Ginsburg, Geoffrey S

    2013-10-01

    The aim of this study was to develop ribonucleic acid (RNA) profiles that could serve as novel biomarkers for the response to aspirin. Aspirin reduces death and myocardial infarction (MI), suggesting that aspirin interacts with biological pathways that may underlie these events. Aspirin was administered, followed by whole-blood RNA microarray profiling, in a discovery cohort of healthy volunteers (HV1) (n = 50) and 2 validation cohorts of healthy volunteers (HV2) (n = 53) and outpatient cardiology patients (OPC) (n = 25). Platelet function was assessed using the platelet function score (PFS) in HV1 and HV2 and the VerifyNow Aspirin Test (Accumetrics, Inc., San Diego, California) in OPC. Bayesian sparse factor analysis identified sets of coexpressed transcripts, which were examined for associations with PFS in HV1 and validated in HV2 and OPC. Proteomic analysis confirmed the association of validated transcripts in platelet proteins. Validated gene sets were tested for association with death or MI in 2 patient cohorts (n = 587 total) from RNA samples collected at cardiac catheterization. A set of 60 coexpressed genes named the "aspirin response signature" (ARS) was associated with PFS in HV1 (r = -0.31, p = 0.03), HV2 (r = -0.34, Bonferroni p = 0.03), and OPC (p = 0.046). Corresponding proteins for the 17 ARS genes were identified in the platelet proteome, of which 6 were associated with PFS. The ARS was associated with death or MI in both patient cohorts (odds ratio: 1.2 [p = 0.01]; hazard ratio: 1.5 [p = 0.001]), independent of cardiovascular risk factors. Compared with traditional risk factors, reclassification (net reclassification index = 31% to 37%, p ≤ 0.0002) was improved by including the ARS or 1 of its genes, ITGA2B. RNA profiles of platelet-specific genes are novel biomarkers for identifying patients who do not respond adequately to aspirin and who are at risk for death or MI. Copyright © 2013 American College of Cardiology Foundation. Published by

  15. Plant ion channels: gene families, physiology, and functional genomics analyses.

    Science.gov (United States)

    Ward, John M; Mäser, Pascal; Schroeder, Julian I

    2009-01-01

    Distinct potassium, anion, and calcium channels in the plasma membrane and vacuolar membrane of plant cells have been identified and characterized by patch clamping. Primarily owing to advances in Arabidopsis genetics and genomics, and yeast functional complementation, many of the corresponding genes have been identified. Recent advances in our understanding of ion channel genes that mediate signal transduction and ion transport are discussed here. Some plant ion channels, for example, ALMT and SLAC anion channel subunits, are unique. The majority of plant ion channel families exhibit homology to animal genes; such families include both hyperpolarization- and depolarization-activated Shaker-type potassium channels, CLC chloride transporters/channels, cyclic nucleotide-gated channels, and ionotropic glutamate receptor homologs. These plant ion channels offer unique opportunities to analyze the structural mechanisms and functions of ion channels. Here we review gene families of selected plant ion channel classes and discuss unique structure-function aspects and their physiological roles in plant cell signaling and transport.

  16. Comparative analysis of gene expression by microarray analysis of male and female flowers of Asparagus officinalis.

    Science.gov (United States)

    Gao, Wu-Jun; Li, Shu-Fen; Zhang, Guo-Jun; Wang, Ning-Na; Deng, Chuan-Liang; Lu, Long-Dou

    2013-01-01

    To identify rapidly a number of genes probably involved in sex determination and differentiation of the dioecious plant Asparagus officinalis, gene expression profiles in early flower development for male and female plants were investigated by microarray assay with 8,665 probes. In total, 638 male-biased and 543 female-biased genes were identified. These genes with biased-expression for male and female were involved in a variety of processes associated with molecular functions, cellular components, and biological processes, suggesting that a complex mechanism underlies the sex development of asparagus. Among the differentially expressed genes involved in the reproductive process, a number of genes associated with floral development were identified. Reverse transcription-PCR was performed for validation, and the results were largely consistent with those obtained by microarray analysis. The findings of this study might contribute to understanding of the molecular mechanisms of sex determination and differentiation in dioecious asparagus and provide a foundation for further studies of this plant.

  17. Functional importance of conserved domains in the flowering-time gene CONSTANS demonstrated by analysis of mutant alleles and transgenic plants.

    Science.gov (United States)

    Robson, F; Costa, M M; Hepworth, S R; Vizir, I; Piñeiro, M; Reeves, P H; Putterill, J; Coupland, G

    2001-12-01

    CONSTANS promotes flowering of Arabidopsis in response to long-day conditions. We show that CONSTANS is a member of an Arabidopsis gene family that comprises 16 other members. The CO-Like proteins encoded by these genes contain two segments of homology: a zinc finger containing region near their amino terminus and a CCT (CO, CO-Like, TOC1) domain near their carboxy terminus. Analysis of seven classical co mutant alleles demonstrated that the mutations all occur within either the zinc finger region or the CCT domain, confirming that the two regions of homology are important for CO function. The zinc fingers are most similar to those of B-boxes, which act as protein-protein interaction domains in several transcription factors described in animals. Segments of CO protein containing the CCT domain localize GFP to the nucleus, but one mutation that affects the CCT domain delays flowering without affecting the nuclear localization function, suggesting that this domain has additional functions. All eight co alleles, including one recovered by pollen irradiation in which DNA encoding both B-boxes is deleted, are shown to be semidominant. This dominance appears to be largely due to a reduction in CO dosage in the heterozygous plants. However, some alleles may also actively delay flowering, because overexpression from the CaMV 35S promoter of the co-3 allele, that has a mutation in the second B-box, delayed flowering of wild-type plants. The significance of these observations for the role of CO in the control of flowering time is discussed.

  18. Characterization and phylogenetic analysis of lectin gene cDNA isolated from sea cucumber ( Apostichopus japonicus) body wall

    Science.gov (United States)

    Xue, Zhuang; Li, Hui; Liu, Yang; Zhou, Wei; Sun, Jing; Wang, Xiuli

    2017-12-01

    As a `living fossil' of species origin and `rich treasure' of food and nutrition development, sea cucumber has received a lot of attentions from researchers. The cDNA library construction and EST sequencing of blood had been conducted previously in our lab. The bioinformatic analysis provided a gene fragment which is highly homologous with the genes of lectin family, named AjL ( Apostichopus japonicus lectin). To characterize and determine the phylogeny of AjL genes in early evolution, we isolated a full-length cDNA of lectin gene from the body wall of A. japonicus. The open reading frame of this gene contained 489 bp and encoded a 163 amino acids secretory protein being homologous to lectins of mammals and aquatic organisms. The deduced protein included a lectin-like domain. SDS-PAGE analysis showed that AjL migrated as a specific band (about 36.09 kDa under reducing), and agglutinated against rabbit red blood cells. AjL was similar to chain A of CEL-IV in space structure. We predicted that AjL may play the same role of CEL-IV. Our results suggested that more than one lectin gene functioned in sea cucumber and most of other species, which was fused by uncertain sequences during the evolution and encoded different proteins with diverse functions. Our findings provided the insights into the function and characteristics of lectin genes invertebrates. The results will also be helpful for the identification and structural, functional, and evolutionary analyses of lectin genes.

  19. Meta-analysis of differentiating mouse embryonic stem cell gene expression kinetics reveals early change of a small gene set.

    Directory of Open Access Journals (Sweden)

    Clive H Glover

    2006-11-01

    Full Text Available Stem cell differentiation involves critical changes in gene expression. Identification of these should provide endpoints useful for optimizing stem cell propagation as well as potential clues about mechanisms governing stem cell maintenance. Here we describe the results of a new meta-analysis methodology applied to multiple gene expression datasets from three mouse embryonic stem cell (ESC lines obtained at specific time points during the course of their differentiation into various lineages. We developed methods to identify genes with expression changes that correlated with the altered frequency of functionally defined, undifferentiated ESC in culture. In each dataset, we computed a novel statistical confidence measure for every gene which captured the certainty that a particular gene exhibited an expression pattern of interest within that dataset. This permitted a joint analysis of the datasets, despite the different experimental designs. Using a ranking scheme that favored genes exhibiting patterns of interest, we focused on the top 88 genes whose expression was consistently changed when ESC were induced to differentiate. Seven of these (103728_at, 8430410A17Rik, Klf2, Nr0b1, Sox2, Tcl1, and Zfp42 showed a rapid decrease in expression concurrent with a decrease in frequency of undifferentiated cells and remained predictive when evaluated in additional maintenance and differentiating protocols. Through a novel meta-analysis, this study identifies a small set of genes whose expression is useful for identifying changes in stem cell frequencies in cultures of mouse ESC. The methods and findings have broader applicability to understanding the regulation of self-renewal of other stem cell types.

  20. ELFN1-AS1: A Novel Primate Gene with Possible MicroRNA Function Expressed Predominantly in Human Tumors

    Directory of Open Access Journals (Sweden)

    Dmitrii E. Polev

    2014-01-01

    Full Text Available Human gene LOC100505644 uncharacterized LOC100505644 [Homo sapiens] (Entrez Gene ID 100505644 is abundantly expressed in tumors but weakly expressed in few normal tissues. Till now the function of this gene remains unknown. Here we identified the chromosomal borders of the transcribed region and the major splice form of the LOC100505644-specific transcript. We characterised the major regulatory motifs of the gene and its splice sites. Analysis of the secondary structure of the major transcript variant revealed a hairpin-like structure characteristic for precursor microRNAs. Comparative genomic analysis of the locus showed that it originated in primates de novo. Taken together, our data indicate that human gene LOC100505644 encodes some non-protein coding RNA, likely a microRNA. It was assigned a gene symbol ELFN1-AS1 (ELFN1 antisense RNA 1 (non-protein coding. This gene combines features of evolutionary novelty and predominant expression in tumors.

  1. Comparison between smaller ruptured intracranial aneurysm and larger un-ruptured intracranial aneurysm: gene expression profile analysis.

    Science.gov (United States)

    Li, Hao; Li, Haowen; Yue, Haiyan; Wang, Wen; Yu, Lanbing; ShuoWang; Cao, Yong; Zhao, Jizong

    2017-07-01

    As it grows in size, an intracranial aneurysm (IA) is prone to rupture. In this study, we compared two extreme groups of IAs, ruptured IAs (RIAs) smaller than 10 mm and un-ruptured IAs (UIAs) larger than 10 mm, to investigate the genes involved in the facilitation and prevention of IA rupture. The aneurismal walls of 6 smaller saccular RIAs (size smaller than 10 mm), 6 larger saccular UIAs (size larger than 10 mm) and 12 paired control arteries were obtained during surgery. The transcription profiles of these samples were studied by microarray analysis. RT-qPCR was used to confirm the expression of the genes of interest. In addition, functional group analysis of the differentially expressed genes was performed. Between smaller RIAs and larger UIAs, 101 genes and 179 genes were significantly over-expressed, respectively. In addition, functional group analysis demonstrated that the up-regulated genes in smaller RIAs mainly participated in the cellular response to metal ions and inorganic substances, while most of the up-regulated genes in larger UIAs were involved in inflammation and extracellular matrix (ECM) organization. Moreover, compared with control arteries, inflammation was up-regulated and muscle-related biological processes were down-regulated in both smaller RIAs and larger UIAs. The genes involved in the cellular response to metal ions and inorganic substances may facilitate the rupture of IAs. In addition, the healing process, involving inflammation and ECM organization, may protect IAs from rupture.

  2. Genome-wide analysis of WRKY gene family in the sesame genome and identification of the WRKY genes involved in responses to abiotic stresses.

    Science.gov (United States)

    Li, Donghua; Liu, Pan; Yu, Jingyin; Wang, Linhai; Dossa, Komivi; Zhang, Yanxin; Zhou, Rong; Wei, Xin; Zhang, Xiurong

    2017-09-11

    Sesame (Sesamum indicum L.) is one of the world's most important oil crops. However, it is susceptible to abiotic stresses in general, and to waterlogging and drought stresses in particular. The molecular mechanisms of abiotic stress tolerance in sesame have not yet been elucidated. The WRKY domain transcription factors play significant roles in plant growth, development, and responses to stresses. However, little is known about the number, location, structure, molecular phylogenetics, and expression of the WRKY genes in sesame. We performed a comprehensive study of the WRKY gene family in sesame and identified 71 SiWRKYs. In total, 65 of these genes were mapped to 15 linkage groups within the sesame genome. A phylogenetic analysis was performed using a related species (Arabidopsis thaliana) to investigate the evolution of the sesame WRKY genes. Tissue expression profiles of the WRKY genes demonstrated that six SiWRKY genes were highly expressed in all organs, suggesting that these genes may be important for plant growth and organ development in sesame. Analysis of the SiWRKY gene expression patterns revealed that 33 and 26 SiWRKYs respond strongly to waterlogging and drought stresses, respectively. Changes in the expression of 12 SiWRKY genes were observed at different times after the waterlogging and drought treatments had begun, demonstrating that sesame gene expression patterns vary in response to abiotic stresses. In this study, we analyzed the WRKY family of transcription factors encoded by the sesame genome. Insight was gained into the classification, evolution, and function of the SiWRKY genes, revealing their putative roles in a variety of tissues. Responses to abiotic stresses in different sesame cultivars were also investigated. The results of our study provide a better understanding of the structures and functions of sesame WRKY genes and suggest that manipulating these WRKYs could enhance resistance to waterlogging and drought.

  3. Transient transformation meets gene function discovery: the strawberry fruit case

    Directory of Open Access Journals (Sweden)

    Michela eGuidarelli

    2015-06-01

    Full Text Available Beside the well known nutritional and health benefits, strawberry (Fragaria X ananassa crop draws increasing attention as plant model system for the Rosaceae family, due to the short generation time, the rapid in vitro regeneration, and to the availability of the genome sequence of F. X ananassa and of the closely related F. vesca species. In the last years, the use of high-throughput sequence technologies provided large amounts of molecular information on the genes possibly related to several biological processes of this crop. Nevertheless, the function of most genes or gene products is still poorly understood and needs investigation. Transient transformation technology provides a powerful tool to study gene function in vivo, avoiding difficult drawbacks that typically affect the stable transformation protocols, such as transformation efficiency, transformants selection and regeneration. In this review we provide an overview of the use of transient expression in the investigation of the function of genes important for strawberry fruit development, defence and nutritional properties. The technical aspects related to an efficient use of this technique are described, and the possible impact and application in strawberry crop improvement are discussed.

  4. Analysis of pan-genome to identify the core genes and essential genes of Brucella spp.

    Science.gov (United States)

    Yang, Xiaowen; Li, Yajie; Zang, Juan; Li, Yexia; Bie, Pengfei; Lu, Yanli; Wu, Qingmin

    2016-04-01

    Brucella spp. are facultative intracellular pathogens, that cause a contagious zoonotic disease, that can result in such outcomes as abortion or sterility in susceptible animal hosts and grave, debilitating illness in humans. For deciphering the survival mechanism of Brucella spp. in vivo, 42 Brucella complete genomes from NCBI were analyzed for the pan-genome and core genome by identification of their composition and function of Brucella genomes. The results showed that the total 132,143 protein-coding genes in these genomes were divided into 5369 clusters. Among these, 1710 clusters were associated with the core genome, 1182 clusters with strain-specific genes and 2477 clusters with dispensable genomes. COG analysis indicated that 44 % of the core genes were devoted to metabolism, which were mainly responsible for energy production and conversion (COG category C), and amino acid transport and metabolism (COG category E). Meanwhile, approximately 35 % of the core genes were in positive selection. In addition, 1252 potential essential genes were predicted in the core genome by comparison with a prokaryote database of essential genes. The results suggested that the core genes in Brucella genomes are relatively conservation, and the energy and amino acid metabolism play a more important role in the process of growth and reproduction in Brucella spp. This study might help us to better understand the mechanisms of Brucella persistent infection and provide some clues for further exploring the gene modules of the intracellular survival in Brucella spp.

  5. Functional analysis

    CERN Document Server

    Kantorovich, L V

    1982-01-01

    Functional Analysis examines trends in functional analysis as a mathematical discipline and the ever-increasing role played by its techniques in applications. The theory of topological vector spaces is emphasized, along with the applications of functional analysis to applied analysis. Some topics of functional analysis connected with applications to mathematical economics and control theory are also discussed. Comprised of 18 chapters, this book begins with an introduction to the elements of the theory of topological spaces, the theory of metric spaces, and the theory of abstract measure space

  6. Functional pathway analysis of genes associated with response to treatment for chronic hepatitis C.

    Science.gov (United States)

    Birerdinc, A; Afendy, A; Stepanova, M; Younossi, I; Manyam, G; Baranova, A; Younossi, Z M

    2010-10-01

    Chronic hepatitis C (CH-C) is among the most common causes of chronic liver disease. Approximately 50% of patients with CH-C treated with pegylated interferon-α and ribavirin (PEG-IFN-α + RBV) achieve a sustained virological response (SVR). Several factors such as genotype 1, African American (AA) race, obesity and the absence of an early virological response (EVR) are associated with low SVR. This study elucidates molecular pathways deregulated in patients with CH-C with negative predictors of response to antiviral therapy. Sixty-eight patients with CH-C who underwent a full course of treatment with PEG-IFN-α + RBV were included in the study. Pretreatment blood samples were collected in PAXgene™ RNA tubes. EVR, complete EVR (cEVR), and SVR rates were 76%, 57% and 41%, respectively. Total RNA was extracted from pretreatment peripheral blood mononuclear cells, quantified and used for one-step RT-PCR to profile 154 mRNAs. The expression of mRNAs was normalized with six 'housekeeping' genes. Differentially expressed genes were separated into up and downregulated gene lists according to the presence or absence of a risk factor and subjected to KEGG Pathway Painter which allows high-throughput visualization of the pathway-specific changes in expression profiles. The genes were consolidated into the networks associated with known predictors of response. Before treatment, various genes associated with core components of the JAK/STAT pathway were activated in the cohorts least likely to achieve SVR. Genes related to focal adhesion and TGF-β pathways were activated in some patients with negative predictors of response. Pathway-centred analysis of gene expression profiles from treated patients with CH-C points to the Janus kinase-signal transducers and activators of transcription signalling cascade as the major pathogenetic component responsible for not achieving SVR. In addition, focal adhesion and TGF-β pathways are associated with some predictors of response.

  7. Transcriptome Sequencing Analysis and Functional Identification of Sex Differentiation Genes from the Mosquito Parasitic Nematode, Romanomermis wuchangensis.

    Directory of Open Access Journals (Sweden)

    Mingyue Duan

    Full Text Available Mosquito-transmitted diseases like malaria and dengue fever are global problem and an estimated 50-100 million of dengue or dengue hemorrhagic fever cases are reported worldwide every year. The mermithid nematode Romanomermis wuchangensis has been successfully used as an ecosystem-friendly biocontrol agent for mosquito prevention in laboratory studies. However, this nematode can not undergo sex differentiation in vitro culture, which has seriously affected their application of biocontrol in the field. In this study, based on transcriptome sequencing analysis of R. wuchangensis, Rwucmab-3, Rwuclaf-1 and Rwuctra-2 were cloned and used to investigate molecular regulatory function of sex differentiation. qRT-PCR results demonstrated that the expression level of Rwucmab-3 between male and female displayed obvious difference on the 3rd day of parasitic stage, which was earlier than Rwuclaf-1 and Rwuctra-2, highlighting sex differentiation process may start on the 3rd day of parasitic stage. Besides, FITC was used as a marker to test dsRNA uptake efficiency of R. wuchangensis, which fluorescence intensity increased with FITC concentration after 16 h incubation, indicating this nematode can successfully ingest soaking solution via its cuticle. RNAi results revealed the sex ratio of R. wuchangensis from RNAi treated groups soaked in dsRNA of Rwucmab-3 was significantly higher than gfp dsRNA treated groups and control groups, highlighting RNAi of Rwumab-3 may hinder the development of male nematodes. These results suggest that Rwucmab-3 mainly involves in the initiation of sex differentiation and the development of male sexual dimorphism. Rwuclaf-1 and Rwuctra-2 may play vital role in nematode reproductive and developmental system. In conclusion, transcript sequences presented in this study could provide more bioinformatics resources for future studies on gene cloning and other molecular regulatory mechanism in R. wuchangensis. Moreover, identification

  8. Meta-analysis and candidate gene mining of low-phosphorus tolerance in maize.

    Science.gov (United States)

    Zhang, Hongwei; Uddin, Mohammed Shalim; Zou, Cheng; Xie, Chuanxiao; Xu, Yunbi; Li, Wen-Xue

    2014-03-01

    Plants with tolerance to low-phosphorus (P) can grow better under low-P conditions, and understanding of genetic mechanisms of low-P tolerance can not only facilitate identifying relevant genes but also help to develop low-P tolerant cultivars. QTL meta-analysis was conducted after a comprehensive review of the reports on QTL mapping for low-P tolerance-related traits in maize. Meta-analysis produced 23 consensus QTL (cQTL), 17 of which located in similar chromosome regions to those previously reported to influence root traits. Meanwhile, candidate gene mining yielded 215 genes, 22 of which located in the cQTL regions. These 22 genes are homologous to 14 functionally characterized genes that were found to participate in plant low-P tolerance, including genes encoding miR399s, Pi transporters and purple acid phosphatases. Four cQTL loci (cQTL2-1, cQTL5-3, cQTL6-2, and cQTL10-2) may play important roles for low-P tolerance because each contains more original QTL and has better consistency across previous reports. © 2014 Institute of Botany, Chinese Academy of Sciences.

  9. Convergent functional genomics in addiction research - a translational approach to study candidate genes and gene networks.

    Science.gov (United States)

    Spanagel, Rainer

    2013-01-01

    Convergent functional genomics (CFG) is a translational methodology that integrates in a Bayesian fashion multiple lines of evidence from studies in human and animal models to get a better understanding of the genetics of a disease or pathological behavior. Here the integration of data sets that derive from forward genetics in animals and genetic association studies including genome wide association studies (GWAS) in humans is described for addictive behavior. The aim of forward genetics in animals and association studies in humans is to identify mutations (e.g. SNPs) that produce a certain phenotype; i.e. "from phenotype to genotype". Most powerful in terms of forward genetics is combined quantitative trait loci (QTL) analysis and gene expression profiling in recombinant inbreed rodent lines or genetically selected animals for a specific phenotype, e.g. high vs. low drug consumption. By Bayesian scoring genomic information from forward genetics in animals is then combined with human GWAS data on a similar addiction-relevant phenotype. This integrative approach generates a robust candidate gene list that has to be functionally validated by means of reverse genetics in animals; i.e. "from genotype to phenotype". It is proposed that studying addiction relevant phenotypes and endophenotypes by this CFG approach will allow a better determination of the genetics of addictive behavior.

  10. Cloning and bioinformatic analysis of lovastatin biosynthesis regulatory gene lovE.

    Science.gov (United States)

    Huang, Xin; Li, Hao-ming

    2009-08-05

    Lovastatin is an effective drug for treatment of hyperlipidemia. This study aimed to clone lovastatin biosynthesis regulatory gene lovE and analyze the structure and function of its encoding protein. According to the lovastatin synthase gene sequence from genebank, primers were designed to amplify and clone the lovastatin biosynthesis regulatory gene lovE from Aspergillus terrus genomic DNA. Bioinformatic analysis of lovE and its encoding animo acid sequence was performed through internet resources and software like DNAMAN. Target fragment lovE, almost 1500 bp in length, was amplified from Aspergillus terrus genomic DNA and the secondary and three-dimensional structures of LovE protein were predicted. In the lovastatin biosynthesis process lovE is a regulatory gene and LovE protein is a GAL4-like transcriptional factor.

  11. Gene expression and functional annotation of the human and mouse choroid plexus epithelium.

    Directory of Open Access Journals (Sweden)

    Sarah F Janssen

    Full Text Available BACKGROUND: The choroid plexus epithelium (CPE is a lobed neuro-epithelial structure that forms the outer blood-brain barrier. The CPE protrudes into the brain ventricles and produces the cerebrospinal fluid (CSF, which is crucial for brain homeostasis. Malfunction of the CPE is possibly implicated in disorders like Alzheimer disease, hydrocephalus or glaucoma. To study human genetic diseases and potential new therapies, mouse models are widely used. This requires a detailed knowledge of similarities and differences in gene expression and functional annotation between the species. The aim of this study is to analyze and compare gene expression and functional annotation of healthy human and mouse CPE. METHODS: We performed 44k Agilent microarray hybridizations with RNA derived from laser dissected healthy human and mouse CPE cells. We functionally annotated and compared the gene expression data of human and mouse CPE using the knowledge database Ingenuity. We searched for common and species specific gene expression patterns and function between human and mouse CPE. We also made a comparison with previously published CPE human and mouse gene expression data. RESULTS: Overall, the human and mouse CPE transcriptomes are very similar. Their major functionalities included epithelial junctions, transport, energy production, neuro-endocrine signaling, as well as immunological, neurological and hematological functions and disorders. The mouse CPE presented two additional functions not found in the human CPE: carbohydrate metabolism and a more extensive list of (neural developmental functions. We found three genes specifically expressed in the mouse CPE compared to human CPE, being ACE, PON1 and TRIM3 and no human specifically expressed CPE genes compared to mouse CPE. CONCLUSION: Human and mouse CPE transcriptomes are very similar, and display many common functionalities. Nonetheless, we also identified a few genes and pathways which suggest that the CPE

  12. A human-specific de novo protein-coding gene associated with human brain functions.

    Directory of Open Access Journals (Sweden)

    Chuan-Yun Li

    2010-03-01

    Full Text Available To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerable factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203. Cross-species analysis revealed interesting evolutionary paths of how this gene had originated from noncoding DNA sequences: insertion of repeat elements especially Alu contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706's mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. Immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expressions of FLJ33706 were detected in Alzheimer's brain samples, suggesting the role of this novel gene in human-specific pathogenesis of Alzheimer's disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.

  13. Exercise-associated DNA methylation change in skeletal muscle and the importance of imprinted genes: a bioinformatics meta-analysis.

    Science.gov (United States)

    Brown, William M

    2015-12-01

    Epigenetics is the study of processes--beyond DNA sequence alteration--producing heritable characteristics. For example, DNA methylation modifies gene expression without altering the nucleotide sequence. A well-studied DNA methylation-based phenomenon is genomic imprinting (ie, genotype-independent parent-of-origin effects). We aimed to elucidate: (1) the effect of exercise on DNA methylation and (2) the role of imprinted genes in skeletal muscle gene networks (ie, gene group functional profiling analyses). Gene ontology (ie, gene product elucidation)/meta-analysis. 26 skeletal muscle and 86 imprinted genes were subjected to g:Profiler ontology analysis. Meta-analysis assessed exercise-associated DNA methylation change. g:Profiler found four muscle gene networks with imprinted loci. Meta-analysis identified 16 articles (387 genes/1580 individuals) associated with exercise. Age, method, sample size, sex and tissue variation could elevate effect size bias. Only skeletal muscle gene networks including imprinted genes were reported. Exercise-associated effect sizes were calculated by gene. Age, method, sample size, sex and tissue variation were moderators. Six imprinted loci (RB1, MEG3, UBE3A, PLAGL1, SGCE, INS) were important for muscle gene networks, while meta-analysis uncovered five exercise-associated imprinted loci (KCNQ1, MEG3, GRB10, L3MBTL1, PLAGL1). DNA methylation decreased with exercise (60% of loci). Exercise-associated DNA methylation change was stronger among older people (ie, age accounted for 30% of the variation). Among older people, genes exhibiting DNA methylation decreases were part of a microRNA-regulated gene network functioning to suppress cancer. Imprinted genes were identified in skeletal muscle gene networks and exercise-associated DNA methylation change. Exercise-associated DNA methylation modification could rewind the 'epigenetic clock' as we age. CRD42014009800. Published by the BMJ Publishing Group Limited. For permission to use (where

  14. Principles of gene microarray data analysis.

    Science.gov (United States)

    Mocellin, Simone; Rossi, Carlo Riccardo

    2007-01-01

    The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.

  15. Comparative mapping reveals similar linkage of functional genes to ...

    Indian Academy of Sciences (India)

    genes between O. sativa and B. napus may have consistent function and control similar traits, which may be ..... acea chromosomes reveals islands of conserved organization. ... 1998 Conserved structure and function of the Arabidopsis flow-.

  16. The function and evolution of Wnt genes in arthropods.

    Science.gov (United States)

    Murat, Sophie; Hopfen, Corinna; McGregor, Alistair P

    2010-11-01

    Wnt signalling is required for a wide range of developmental processes, from cleavage to patterning and cell migration. There are 13 subfamilies of Wnt ligand genes and this diverse repertoire appeared very early in metazoan evolution. In this review, we first summarise the known Wnt gene repertoire in various arthropods. Insects appear to have lost several Wnt subfamilies, either generally, such as Wnt3, or in lineage specific patterns, for example, the loss of Wnt7 in Anopheles. In Drosophila and Acyrthosiphon, only seven and six Wnt subfamilies are represented, respectively; however, the finding of nine Wnt genes in Tribolium suggests that arthropods had a larger repertoire ancestrally. We then discuss what is currently known about the expression and developmental function of Wnt ligands in Drosophila and other insects in comparison to other arthropods, such as the spiders Achaearanea and Cupiennius. We conclude that studies of Wnt genes have given us much insight into the developmental roles of some of these ligands. However, given the frequent loss of Wnt genes in insects and the derived development of Drosophila, further studies of these important genes are required in a broader range of arthropods to fully understand their developmental function and evolution. Copyright © 2010 Elsevier Ltd. All rights reserved.

  17. NMD Microarray Analysis for Rapid Genome-Wide Screen of Mutated Genes in Cancer

    Directory of Open Access Journals (Sweden)

    Maija Wolf

    2005-01-01

    Full Text Available Gene mutations play a critical role in cancer development and progression, and their identification offers possibilities for accurate diagnostics and therapeutic targeting. Finding genes undergoing mutations is challenging and slow, even in the post-genomic era. A new approach was recently developed by Noensie and Dietz to prioritize and focus the search, making use of nonsense-mediated mRNA decay (NMD inhibition and microarray analysis (NMD microarrays in the identification of transcripts containing nonsense mutations. We combined NMD microarrays with array-based CGH (comparative genomic hybridization in order to identify inactivation of tumor suppressor genes in cancer. Such a “mutatomics” screening of prostate cancer cell lines led to the identification of inactivating mutations in the EPHB2 gene. Up to 8% of metastatic uncultured prostate cancers also showed mutations of this gene whose loss of function may confer loss of tissue architecture. NMD microarray analysis could turn out to be a powerful research method to identify novel mutated genes in cancer cell lines, providing targets that could then be further investigated for their clinical relevance and therapeutic potential.

  18. Immune function genes CD99L2, JARID2 and TPO show association with autism spectrum disorder

    Directory of Open Access Journals (Sweden)

    Ramos Paula S

    2012-06-01

    Full Text Available Abstract Background A growing number of clinical and basic research studies have implicated immunological abnormalities as being associated with and potentially responsible for the cognitive and behavioral deficits seen in autism spectrum disorder (ASD children. Here we test the hypothesis that immune-related gene loci are associated with ASD. Findings We identified 2,012 genes of known immune-function via Ingenuity Pathway Analysis. Family-based tests of association were computed on the 22,904 single nucleotide polymorphisms (SNPs from the 2,012 immune-related genes on 1,510 trios available at the Autism Genetic Resource Exchange (AGRE repository. Several SNPs in immune-related genes remained statistically significantly associated with ASD after adjusting for multiple comparisons. Specifically, we observed significant associations in the CD99 molecule-like 2 region (CD99L2, rs11796490, P = 4.01 × 10-06, OR = 0.68 (0.58-0.80, in the jumonji AT rich interactive domain 2 (JARID2 gene (rs13193457, P = 2.71 × 10-06, OR = 0.61 (0.49-0.75, and in the thyroid peroxidase gene (TPO (rs1514687, P = 5.72 × 10-06, OR = 1.46 (1.24-1.72. Conclusions This study suggests that despite the lack of a general enrichment of SNPs in immune function genes in ASD children, several novel genes with known immune functions are associated with ASD.

  19. Colony size measurement of the yeast gene deletion strains for functional genomics

    Directory of Open Access Journals (Sweden)

    Mir-Rashed Nadereh

    2007-04-01

    Full Text Available Abstract Background Numerous functional genomics approaches have been developed to study the model organism yeast, Saccharomyces cerevisiae, with the aim of systematically understanding the biology of the cell. Some of these techniques are based on yeast growth differences under different conditions, such as those generated by gene mutations, chemicals or both. Manual inspection of the yeast colonies that are grown under different conditions is often used as a method to detect such growth differences. Results Here, we developed a computerized image analysis system called Growth Detector (GD, to automatically acquire quantitative and comparative information for yeast colony growth. GD offers great convenience and accuracy over the currently used manual growth measurement method. It distinguishes true yeast colonies in a digital image and provides an accurate coordinate oriented map of the colony areas. Some post-processing calculations are also conducted. Using GD, we successfully detected a genetic linkage between the molecular activity of the plant-derived antifungal compound berberine and gene expression components, among other cellular processes. A novel association for the yeast mek1 gene with DNA damage repair was also identified by GD and confirmed by a plasmid repair assay. The results demonstrate the usefulness of GD for yeast functional genomics research. Conclusion GD offers significant improvement over the manual inspection method to detect relative yeast colony size differences. The speed and accuracy associated with GD makes it an ideal choice for large-scale functional genomics investigations.

  20. Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species.

    Science.gov (United States)

    Cao, Yunpeng; Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping

    2018-04-23

    The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice ( Oryza sativa ), maize ( Zea mays ), and Arabidopsis ( Arabidopsis thaliana ). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis , respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis , respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis . A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis , respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus , and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis .

  1. Analysis of four achaete-scute homologs in Bombyx mori reveals new viewpoints of the evolution and functions of this gene family.

    Science.gov (United States)

    Zhou, Qingxiang; Zhang, Tianyi; Xu, Weihua; Yu, Linlin; Yi, Yongzhu; Zhang, Zhifang

    2008-03-06

    achaete-scute complexe (AS-C) has been widely studied at genetic, developmental and evolutional levels. Genes of this family encode proteins containing a highly conserved bHLH domain, which take part in the regulation of the development of central nervous system and peripheral nervous system. Many AS-C homologs have been isolated from various vertebrates and invertebrates. Also, AS-C genes are duplicated during the evolution of Diptera. Functions besides neural development controlling have also been found in Drosophila AS-C genes. We cloned four achaete-scute homologs (ASH) from the lepidopteran model organism Bombyx mori, including three proneural genes and one neural precursor gene. Proteins encoded by them contained the characteristic bHLH domain and the three proneural ones were also found to have the C-terminal conserved motif. These genes regulated promoter activity through the Class A E-boxes in vitro. Though both Bm-ASH and Drosophila AS-C have four members, they are not in one by one corresponding relationships. Results of RT-PCR and real-time PCR showed that Bm-ASH genes were expressed in different larval tissues, and had well-regulated expressional profiles during the development of embryo and wing/wing disc. There are four achaete-scute homologs in Bombyx mori, the second insect having four AS-C genes so far, and these genes have multiple functions in silkworm life cycle. AS-C gene duplication in insects occurs after or parallel to, but not before the taxonomic order formation during evolution.

  2. Gene set analysis for GWAS

    DEFF Research Database (Denmark)

    Debrabant, Birgit; Soerensen, Mette

    2014-01-01

    Abstract We discuss the use of modified Kolmogorov-Smirnov (KS) statistics in the context of gene set analysis and review corresponding null and alternative hypotheses. Especially, we show that, when enhancing the impact of highly significant genes in the calculation of the test statistic, the co...

  3. Functional Analysis of OMICs Data and Small Molecule Compounds in an Integrated "Knowledge-Based" Platform.

    Science.gov (United States)

    Dubovenko, Alexey; Nikolsky, Yuri; Rakhmatulin, Eugene; Nikolskaya, Tatiana

    2017-01-01

    Analysis of NGS and other sequencing data, gene variants, gene expression, proteomics, and other high-throughput (OMICs) data is challenging because of its biological complexity and high level of technical and biological noise. One way to deal with both problems is to perform analysis with a high fidelity annotated knowledgebase of protein interactions, pathways, and functional ontologies. This knowledgebase has to be structured in a computer-readable format and must include software tools for managing experimental data, analysis, and reporting. Here, we present MetaCore™ and Key Pathway Advisor (KPA), an integrated platform for functional data analysis. On the content side, MetaCore and KPA encompass a comprehensive database of molecular interactions of different types, pathways, network models, and ten functional ontologies covering human, mouse, and rat genes. The analytical toolkit includes tools for gene/protein list enrichment analysis, statistical "interactome" tool for the identification of over- and under-connected proteins in the dataset, and a biological network analysis module made up of network generation algorithms and filters. The suite also features Advanced Search, an application for combinatorial search of the database content, as well as a Java-based tool called Pathway Map Creator for drawing and editing custom pathway maps. Applications of MetaCore and KPA include molecular mode of action of disease research, identification of potential biomarkers and drug targets, pathway hypothesis generation, analysis of biological effects for novel small molecule compounds and clinical applications (analysis of large cohorts of patients, and translational and personalized medicine).

  4. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene http:\\/\\/www.cbio.uct.ac.za\\/indygene, a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

  5. Identification and functional analysis of gene cluster involvement in biosynthesis of the cyclic lipopeptide antibiotic pelgipeptin produced by Paenibacillus elgii

    Directory of Open Access Journals (Sweden)

    Qian Chao-Dong

    2012-09-01

    Full Text Available Abstract Background Pelgipeptin, a potent antibacterial and antifungal agent, is a non-ribosomally synthesised lipopeptide antibiotic. This compound consists of a β-hydroxy fatty acid and nine amino acids. To date, there is no information about its biosynthetic pathway. Results A potential pelgipeptin synthetase gene cluster (plp was identified from Paenibacillus elgii B69 through genome analysis. The gene cluster spans 40.8 kb with eight open reading frames. Among the genes in this cluster, three large genes, plpD, plpE, and plpF, were shown to encode non-ribosomal peptide synthetases (NRPSs, with one, seven, and one module(s, respectively. Bioinformatic analysis of the substrate specificity of all nine adenylation domains indicated that the sequence of the NRPS modules is well collinear with the order of amino acids in pelgipeptin. Additional biochemical analysis of four recombinant adenylation domains (PlpD A1, PlpE A1, PlpE A3, and PlpF A1 provided further evidence that the plp gene cluster involved in pelgipeptin biosynthesis. Conclusions In this study, a gene cluster (plp responsible for the biosynthesis of pelgipeptin was identified from the genome sequence of Paenibacillus elgii B69. The identification of the plp gene cluster provides an opportunity to develop novel lipopeptide antibiotics by genetic engineering.

  6. Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling

    Science.gov (United States)

    Medina, Ignacio; Carbonell, José; Pulido, Luis; Madeira, Sara C.; Goetz, Stefan; Conesa, Ana; Tárraga, Joaquín; Pascual-Montano, Alberto; Nogales-Cadenas, Ruben; Santoyo, Javier; García, Francisco; Marbà, Martina; Montaner, David; Dopazo, Joaquín

    2010-01-01

    Babelomics is a response to the growing necessity of integrating and analyzing different types of genomic data in an environment that allows an easy functional interpretation of the results. Babelomics includes a complete suite of methods for the analysis of gene expression data that include normalization (covering most commercial platforms), pre-processing, differential gene expression (case-controls, multiclass, survival or continuous values), predictors, clustering; large-scale genotyping assays (case controls and TDTs, and allows population stratification analysis and correction). All these genomic data analysis facilities are integrated and connected to multiple options for the functional interpretation of the experiments. Different methods of functional enrichment or gene set enrichment can be used to understand the functional basis of the experiment analyzed. Many sources of biological information, which include functional (GO, KEGG, Biocarta, Reactome, etc.), regulatory (Transfac, Jaspar, ORegAnno, miRNAs, etc.), text-mining or protein–protein interaction modules can be used for this purpose. Finally a tool for the de novo functional annotation of sequences has been included in the system. This provides support for the functional analysis of non-model species. Mirrors of Babelomics or command line execution of their individual components are now possible. Babelomics is available at http://www.babelomics.org. PMID:20478823

  7. Age-Specific Gene Expression Profiles of Rhesus Monkey Ovaries Detected by Microarray Analysis

    Directory of Open Access Journals (Sweden)

    Hengxi Wei

    2015-01-01

    Full Text Available The biological function of human ovaries declines with age. To identify the potential molecular changes in ovarian aging, we performed genome-wide gene expression analysis by microarray of ovaries from young, middle-aged, and old rhesus monkeys. Microarray data was validated by quantitative real-time PCR. Results showed that a total of 503 (60 upregulated, 443 downregulated and 84 (downregulated genes were differentially expressed in old ovaries compared to young and middle-aged groups, respectively. No difference in gene expression was found between middle-aged and young groups. Differentially expressed genes were mainly enriched in cell and organelle, cellular and physiological process, binding, and catalytic activity. These genes were primarily associated with KEGG pathways of cell cycle, DNA replication and repair, oocyte meiosis and maturation, MAPK, TGF-beta, and p53 signaling pathway. Genes upregulated were involved in aging, defense response, oxidation reduction, and negative regulation of cellular process; genes downregulated have functions in reproduction, cell cycle, DNA and RNA process, macromolecular complex assembly, and positive regulation of macromolecule metabolic process. These findings show that monkey ovary undergoes substantial change in global transcription with age. Gene expression profiles are useful in understanding the mechanisms underlying ovarian aging and age-associated infertility in primates.

  8. Functional validation of candidate genes detected by genomic feature models

    DEFF Research Database (Denmark)

    Rohde, Palle Duun; Østergaard, Solveig; Kristensen, Torsten Nygaard

    2018-01-01

    Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait...... then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated...

  9. Proteomic analysis uncovers a metabolic phenotype in C. elegans after nhr-40 reduction of function

    International Nuclear Information System (INIS)

    Pohludka, Michal; Simeckova, Katerina; Vohanka, Jaroslav; Yilma, Petr; Novak, Petr; Krause, Michael W.; Kostrouchova, Marta; Kostrouch, Zdenek

    2008-01-01

    Caenorhabditis elegans has an unexpectedly large number (284) of genes encoding nuclear hormone receptors, most of which are nematode-specific and are of unknown function. We have exploited comparative two-dimensional chromatography of synchronized cultures of wild type C. elegans larvae and a mutant in nhr-40 to determine if proteomic approaches will provide additional insight into gene function. Chromatofocusing, followed by reversed-phase chromatography and mass spectrometry, identified altered chromatographic patterns for a set of proteins, many of which function in muscle and metabolism. Prompted by the proteomic analysis, we find that the penetrance of the developmental phenotypes in the mutant is enhanced at low temperatures and by food restriction. The combination of our phenotypic and proteomic analysis strongly suggests that NHR-40 provides a link between metabolism and muscle development. Our results highlight the utility of comparative two-dimensional chromatography to provide a relatively rapid method to gain insight into gene function

  10. Genome-wide analysis of autophagy-associated genes in foxtail millet (Setaria italica L.) and characterization of the function of SiATG8a in conferring tolerance to nitrogen starvation in rice.

    Science.gov (United States)

    Li, Weiwei; Chen, Ming; Wang, Erhui; Hu, Liqin; Hawkesford, Malcolm J; Zhong, Li; Chen, Zhu; Xu, Zhaoshi; Li, Liancheng; Zhou, Yongbin; Guo, Changhong; Ma, Youzhi

    2016-10-12

    Autophagy is a cellular degradation process that is highly evolutionarily-conserved in yeast, plants, and animals. In plants, autophagy plays important roles in regulating intracellular degradation and recycling of amino acids in response to nutrient starvation, senescence, and other environmental stresses. Foxtail millet (Setaria italica) has strong resistance to stresses and has been proposed as an ideal material for use in the study of the physiological mechanisms of abiotic stress tolerance in plants. Although the genome sequence of foxtail millet (Setaria italica) is available, the characteristics and functions of abiotic stress-related genes remain largely unknown for this species. A total of 37 putative ATG (autophagy-associated genes) genes in the foxtail millet genome were identified. Gene duplication analysis revealed that both segmental and tandem duplication events have played significant roles in the expansion of the ATG gene family in foxtail millet. Comparative synteny mapping between the genomes of foxtail millet and rice suggested that the ATG genes in both species have common ancestors, as their ATG genes were primarily located in similar syntenic regions. Gene expression analysis revealed the induced expression of 31 SiATG genes by one or more phytohormone treatments, 26 SiATG genes by drought, salt and cold, 24 SiATG genes by darkness and 25 SiATG genes by nitrogen starvation. Results of qRT-PCR showing that among 37 SiATG genes, the expression level of SiATG8a was the highest after nitrogen starvation treatment 24 h, suggesting its potential role in tolerance to nutrient starvation. Moreover, the heterologous expression of SiATG8a in rice improved nitrogen starvation tolerance. Compared to wild type rice, the transgenic rice performed better and had higher aboveground total nitrogen content when the plants were grown under nitrogen starvation conditions. Our results deepen understanding about the characteristics and functions of ATG genes in

  11. Significant Microsynteny with New Evolutionary Highlights Is Detected through Comparative Genomic Sequence Analysis of Maize CCCH IX Gene Subfamily

    Directory of Open Access Journals (Sweden)

    Wei-Jun Chen

    2015-01-01

    Full Text Available CCCH zinc finger proteins, which are characterized by the presence of three cysteine residues and one histidine residue, play important roles in RNA processing in plants. Subfamily IX CCCH proteins were recently shown to function in stress tolerances. In this study, we analyzed CCCH IX genes in Zea mays, Oryza sativa, and Sorghum bicolor. These genes, which are almost intronless, were divided into four groups based on phylogenetic analysis. Microsynteny analysis revealed microsynteny in regions of some gene pairs, indicating that segmental duplication has played an important role in the expansion of this gene family. In addition, we calculated the dates of duplication by Ks analysis, finding that all microsynteny blocks were formed after the monocot-eudicot divergence. We found that deletions, multiplications, and inversions were shown to have occurred over the course of evolution. Moreover, the Ka/Ks ratios indicated that the genes in these three grass species are under strong purifying selection. Finally, we investigated the evolutionary patterns of some gene pairs conferring tolerance to abiotic stress, laying the foundation for future functional studies of these transcription factors.

  12. Comparative genomics of Geobacter chemotaxis genes reveals diverse signaling function

    Directory of Open Access Journals (Sweden)

    Antommattei Frances M

    2008-10-01

    Full Text Available Abstract Background Geobacter species are δ-Proteobacteria and are often the predominant species in a variety of sedimentary environments where Fe(III reduction is important. Their ability to remediate contaminated environments and produce electricity makes them attractive for further study. Cell motility, biofilm formation, and type IV pili all appear important for the growth of Geobacter in changing environments and for electricity production. Recent studies in other bacteria have demonstrated that signaling pathways homologous to the paradigm established for Escherichia coli chemotaxis can regulate type IV pili-dependent motility, the synthesis of flagella and type IV pili, the production of extracellular matrix material, and biofilm formation. The classification of these pathways by comparative genomics improves the ability to understand how Geobacter thrives in natural environments and better their use in microbial fuel cells. Results The genomes of G. sulfurreducens, G. metallireducens, and G. uraniireducens contain multiple (~70 homologs of chemotaxis genes arranged in several major clusters (six, seven, and seven, respectively. Unlike the single gene cluster of E. coli, the Geobacter clusters are not all located near the flagellar genes. The probable functions of some Geobacter clusters are assignable by homology to known pathways; others appear to be unique to the Geobacter sp. and contain genes of unknown function. We identified large numbers of methyl-accepting chemotaxis protein (MCP homologs that have diverse sensing domain architectures and generate a potential for sensing a great variety of environmental signals. We discuss mechanisms for class-specific segregation of the MCPs in the cell membrane, which serve to maintain pathway specificity and diminish crosstalk. Finally, the regulation of gene expression in Geobacter differs from E. coli. The sequences of predicted promoter elements suggest that the alternative sigma factors

  13. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation.

    Energy Technology Data Exchange (ETDEWEB)

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Laine, Eric; Davin, Laurence B.; Cort, John R.; Lewis, Norman G.; Hano, Christophe

    2018-04-30

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved

  14. A genome-wide analysis of the flax (Linum usitatissimum L.) dirigent protein family: from gene identification and evolution to differential regulation.

    Science.gov (United States)

    Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Lainé, Éric; Davin, Laurence B; Cort, John R; Lewis, Norman G; Hano, Christophe

    2018-05-01

    Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in

  15. Identification and Evolutionary Analysis of Potential Candidate Genes in a Human Eating Disorder

    Directory of Open Access Journals (Sweden)

    Ubadah Sabbagh

    2016-01-01

    Full Text Available The purpose of this study was to find genes linked with eating disorders and associated with both metabolic and neural systems. Our operating hypothesis was that there are genetic factors underlying some eating disorders resting in both those pathways. Specifically, we are interested in disorders that may rest in both sleep and metabolic function, generally called Night Eating Syndrome (NES. A meta-analysis of the Gene Expression Omnibus targeting the mammalian nervous system, sleep, and obesity studies was performed, yielding numerous genes of interest. Through a text-based analysis of the results, a number of potential candidate genes were identified. VGF, in particular, appeared to be relevant both to obesity and, broadly, to brain or neural development. VGF is a highly connected protein that interacts with numerous targets via proteolytically digested peptides. We examined VGF from an evolutionary perspective to determine whether other available evidence supported a role for the gene in human disease. We conclude that some of the already identified variants in VGF from human polymorphism studies may contribute to eating disorders and obesity. Our data suggest that there is enough evidence to warrant eGWAS and GWAS analysis of these genes in NES patients in a case-control study.

  16. Identification and Evolutionary Analysis of Potential Candidate Genes in a Human Eating Disorder.

    Science.gov (United States)

    Sabbagh, Ubadah; Mullegama, Saman; Wyckoff, Gerald J

    2016-01-01

    The purpose of this study was to find genes linked with eating disorders and associated with both metabolic and neural systems. Our operating hypothesis was that there are genetic factors underlying some eating disorders resting in both those pathways. Specifically, we are interested in disorders that may rest in both sleep and metabolic function, generally called Night Eating Syndrome (NES). A meta-analysis of the Gene Expression Omnibus targeting the mammalian nervous system, sleep, and obesity studies was performed, yielding numerous genes of interest. Through a text-based analysis of the results, a number of potential candidate genes were identified. VGF, in particular, appeared to be relevant both to obesity and, broadly, to brain or neural development. VGF is a highly connected protein that interacts with numerous targets via proteolytically digested peptides. We examined VGF from an evolutionary perspective to determine whether other available evidence supported a role for the gene in human disease. We conclude that some of the already identified variants in VGF from human polymorphism studies may contribute to eating disorders and obesity. Our data suggest that there is enough evidence to warrant eGWAS and GWAS analysis of these genes in NES patients in a case-control study.

  17. Generalized functional linear models for gene-based case-control association studies.

    Science.gov (United States)

    Fan, Ruzong; Wang, Yifan; Mills, James L; Carter, Tonia C; Lobach, Iryna; Wilson, Alexander F; Bailey-Wilson, Joan E; Weeks, Daniel E; Xiong, Momiao

    2014-11-01

    By using functional data analysis techniques, we developed generalized functional linear models for testing association between a dichotomous trait and multiple genetic variants in a genetic region while adjusting for covariates. Both fixed and mixed effect models are developed and compared. Extensive simulations show that Rao's efficient score tests of the fixed effect models are very conservative since they generate lower type I errors than nominal levels, and global tests of the mixed effect models generate accurate type I errors. Furthermore, we found that the Rao's efficient score test statistics of the fixed effect models have higher power than the sequence kernel association test (SKAT) and its optimal unified version (SKAT-O) in most cases when the causal variants are both rare and common. When the causal variants are all rare (i.e., minor allele frequencies less than 0.03), the Rao's efficient score test statistics and the global tests have similar or slightly lower power than SKAT and SKAT-O. In practice, it is not known whether rare variants or common variants in a gene region are disease related. All we can assume is that a combination of rare and common variants influences disease susceptibility. Thus, the improved performance of our models when the causal variants are both rare and common shows that the proposed models can be very useful in dissecting complex traits. We compare the performance of our methods with SKAT and SKAT-O on real neural tube defects and Hirschsprung's disease datasets. The Rao's efficient score test statistics and the global tests are more sensitive than SKAT and SKAT-O in the real data analysis. Our methods can be used in either gene-disease genome-wide/exome-wide association studies or candidate gene analyses. © 2014 WILEY PERIODICALS, INC.

  18. Microarray analysis of differentially expressed genes and their functions in omental visceral adipose tissues of pregnant women with vs. without gestational diabetes mellitus

    Science.gov (United States)

    Qian, Yuan; Sun, Hao; Xiao, Hongli; Ma, Meirun; Xiao, Xue; Qu, Qinzai

    2017-01-01

    Increasing evidence has shown that insulin resistance in omental visceral adipose tissue (OVAT) is a characteristic of gestational diabetes mellitus (GDM). The present study aimed to identify differentially expressed genes (DEGs) and their associated functions and pathways involved in the pathogenesis of GDM by comparing the expression profiles of OVATs obtained from pregnant Chinese women with and without GDM during caesarian section. A total of 935 DEGs were identified, including 450 downregulated and 485 upregulated genes. In the gene ontology category cellular components, the DEGs were predominantly associated with functions of the extracellular region, while receptor binding was predominant in the molecular function category and biological process terms included antigen processing and presentation, extracellular matrix organization, positive regulation of cell-substrate adhesion, response to nutrients and response to dietary excess. Functional enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were performed and a functional interaction network was constructed. Functions of downregulated genes included antigen processing and presentation as well as cell adhesion molecules, while those of upregulated genes included transforming growth factor (TGF)-β-signaling, focal adhesion, phosphoinositide-3 kinase-Akt-signaling, P53 signaling, extracellular matrix-receptor interaction and regulation of actin cytoskeleton pathway. The five main pathways associated with GDM were antigen processing and presentation, cell adhesion molecules, Type 1 diabetes mellitus, natural killer cell-mediated cytotoxicity and TGF-β signaling. These pathways were included in the KEGG pathway categories of ‘signaling molecules and interaction’, ‘immune system’ and ‘inflammatory response’, suggesting that these processes are involved in GDM. The results of the present study enhanced the present understanding of the mechanisms associated with insulin

  19. Mapped clone and functional analysis of leaf-color gene Ygl7 in a rice hybrid (Oryza sativa L. ssp. indica).

    Science.gov (United States)

    Deng, Xiao-juan; Zhang, Hai-qing; Wang, Yue; He, Feng; Liu, Jin-ling; Xiao, Xiao; Shu, Zhi-feng; Li, Wei; Wang, Guo-huai; Wang, Guo-liang

    2014-01-01

    Leaf-color is an effective marker to identify the hybridization of rice. Leaf-color related genes function in chloroplast development and the photosynthetic pigment biosynthesis of higher plants. The ygl7 (yellow-green leaf 7) is a mutant with spontaneous yellow-green leaf phenotype across the whole lifespan but with no change to its yield traits. We cloned gene Ygl7 (Os03g59640) which encodes a magnesium-chelatase ChlD protein. Expression of ygl7 turns green-leaves to yellow, whereas RNAi-mediated silence of Ygl7 causes a lethal phenotype of the transgenic plants. This indicates the importance of the gene for rice plant. On the other hand, it corroborates that ygl7 is a non-null mutants. The content of photosynthetic pigment is lower in Ygl7 than the wild type, but its light efficiency was comparatively high. All these results indicated that the mutational YGL7 protein does not cause a complete loss of original function but instead acts as a new protein performing a new function. This new function partially includes its preceding function and possesses an additional feature to promote photosynthesis. Chl1, Ygl98, and Ygl3 are three alleles of the OsChlD gene that have been documented previously. However, mutational sites of OsChlD mutant gene and their encoded protein products were different in the three mutants. The three mutants have suppressed grain output. In our experiment, plant materials of three mutants (ygl7, chl1, and ygl98) all exhibited mutational leaf-color during the whole growth period. This result was somewhat different from previous studies. We used ygl7 as female crossed with chl1 and ygl98, respectively. Both the F1 and F2 generation display yellow-green leaf phenotype with their chlorophyll and carotenoid content falling between the values of their parents. Moreover, we noted an important phenomenon: ygl7-NIL's leaf-color is yellow, not yellowy-green, and this is also true of all back-crossed offspring with ygl7.

  20. Mapped clone and functional analysis of leaf-color gene Ygl7 in a rice hybrid (Oryza sativa L. ssp. indica.

    Directory of Open Access Journals (Sweden)

    Xiao-juan Deng

    Full Text Available Leaf-color is an effective marker to identify the hybridization of rice. Leaf-color related genes function in chloroplast development and the photosynthetic pigment biosynthesis of higher plants. The ygl7 (yellow-green leaf 7 is a mutant with spontaneous yellow-green leaf phenotype across the whole lifespan but with no change to its yield traits. We cloned gene Ygl7 (Os03g59640 which encodes a magnesium-chelatase ChlD protein. Expression of ygl7 turns green-leaves to yellow, whereas RNAi-mediated silence of Ygl7 causes a lethal phenotype of the transgenic plants. This indicates the importance of the gene for rice plant. On the other hand, it corroborates that ygl7 is a non-null mutants. The content of photosynthetic pigment is lower in Ygl7 than the wild type, but its light efficiency was comparatively high. All these results indicated that the mutational YGL7 protein does not cause a complete loss of original function but instead acts as a new protein performing a new function. This new function partially includes its preceding function and possesses an additional feature to promote photosynthesis. Chl1, Ygl98, and Ygl3 are three alleles of the OsChlD gene that have been documented previously. However, mutational sites of OsChlD mutant gene and their encoded protein products were different in the three mutants. The three mutants have suppressed grain output. In our experiment, plant materials of three mutants (ygl7, chl1, and ygl98 all exhibited mutational leaf-color during the whole growth period. This result was somewhat different from previous studies. We used ygl7 as female crossed with chl1 and ygl98, respectively. Both the F1 and F2 generation display yellow-green leaf phenotype with their chlorophyll and carotenoid content falling between the values of their parents. Moreover, we noted an important phenomenon: ygl7-NIL's leaf-color is yellow, not yellowy-green, and this is also true of all back-crossed offspring with ygl7.

  1. Identification and functional analysis of a novel mutation in the SOX10 gene associated with Waardenburg syndrome type IV.

    Science.gov (United States)

    Wang, Hong-Han; Chen, Hong-Sheng; Li, Hai-Bo; Zhang, Hua; Mei, Ling-Yun; He, Chu-Feng; Wang, Xing-Wei; Men, Mei-Chao; Jiang, Lu; Liao, Xin-Bin; Wu, Hong; Feng, Yong

    2014-03-15

    Waardenburg syndrome type IV (WS4) is a rare genetic disorder, characterized by auditory-pigmentary abnormalities and Hirschsprung disease. Mutations of the EDNRB gene, EDN3 gene, or SOX10 gene are responsible for WS4. In the present study, we reported a case of a Chinese patient with clinical features of WS4. In addition, the three genes mentioned above were sequenced in order to identify whether mutations are responsible for the case. We revealed a novel nonsense mutation, c.1063C>T (p.Q355*), in the last coding exon of SOX10. The same mutation was not found in three unaffected family members or 100 unrelated controls. Then, the function and mechanism of the mutation were investigated in vitro. We found both wild-type (WT) and mutant SOX10 p.Q355* were detected at the expected size and their expression levels are equivalent. The mutant protein also localized in the nucleus and retained the DNA-binding activity as WT counterpart; however, it lost its transactivation capability on the MITF promoter and acted as a dominant-negative repressor impairing function of the WT SOX10. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Genome-wide identification and expression analysis of SBP-like transcription factor genes in Moso Bamboo (Phyllostachys edulis).

    Science.gov (United States)

    Pan, Feng; Wang, Yue; Liu, Huanglong; Wu, Min; Chu, Wenyuan; Chen, Danmei; Xiang, Yan

    2017-06-27

    The SQUAMOSA promoter binding protein-like (SPL) proteins are plant-specific transcription factors (TFs) that function in a variety of developmental processes including growth, flower development, and signal transduction. SPL proteins are encoded by a gene family, and these genes have been characterized in two model grass species, Zea mays and Oryza sativa. The SPL gene family has not been well studied in moso bamboo (Phyllostachys edulis), a woody grass species. We identified 32 putative PeSPL genes in the P. edulis genome. Phylogenetic analysis arranged the PeSPL protein sequences in eight groups. Similarly, phylogenetic analysis of the SBP-like and SBP proteins from rice and maize clustered them into eight groups analogous to those from P. edulis. Furthermore, the deduced PeSPL proteins in each group contained very similar conserved sequence motifs. Our analyses indicate that the PeSPL genes experienced a large-scale duplication event ~15 million years ago (MYA), and that divergence between the PeSPL and OsSPL genes occurred 34 MYA. The stress-response expression profiles and tissue-specificity of the putative PeSPL gene promoter regions showed that SPL genes in moso bamboo have potential biological functions in stress resistance as well as in growth and development. We therefore examined PeSPL gene expression in response to different plant hormone and drought (polyethylene glycol-6000; PEG) treatments to mimic biotic and abiotic stresses. Expression of three (PeSPL10, -12, -17), six (PeSPL1, -10, -12, -17, -20, -31), and nine (PeSPL5, -8, -9, -14, -15, -19, -20, -31, -32) genes remained relatively stable after treating with salicylic acid (SA), gibberellic acid (GA), and PEG, respectively, while the expression patterns of other genes changed. In addition, analysis of tissue-specific expression of the moso bamboo SPL genes during development showed differences in their spatiotemporal expression patterns, and many were expressed at high levels in flowers and

  3. [Key effect genes responding to nerve injury identified by gene ontology and computer pattern recognition].

    Science.gov (United States)

    Pan, Qian; Peng, Jin; Zhou, Xue; Yang, Hao; Zhang, Wei

    2012-07-01

    In order to screen out important genes from large gene data of gene microarray after nerve injury, we combine gene ontology (GO) method and computer pattern recognition technology to find key genes responding to nerve injury, and then verify one of these screened-out genes. Data mining and gene ontology analysis of gene chip data GSE26350 was carried out through MATLAB software. Cd44 was selected from screened-out key gene molecular spectrum by comparing genes' different GO terms and positions on score map of principal component. Function interferences were employed to influence the normal binding of Cd44 and one of its ligands, chondroitin sulfate C (CSC), to observe neurite extension. Gene ontology analysis showed that the first genes on score map (marked by red *) mainly distributed in molecular transducer activity, receptor activity, protein binding et al molecular function GO terms. Cd44 is one of six effector protein genes, and attracted us with its function diversity. After adding different reagents into the medium to interfere the normal binding of CSC and Cd44, varying-degree remissions of CSC's inhibition on neurite extension were observed. CSC can inhibit neurite extension through binding Cd44 on the neuron membrane. This verifies that important genes in given physiological processes can be identified by gene ontology analysis of gene chip data.

  4. Comparative analysis of taxonomic, functional, and metabolic patterns of microbiomes from 14 full-scale biogas reactors by metagenomic sequencing and radioisotopic analysis.

    Science.gov (United States)

    Luo, Gang; Fotidis, Ioannis A; Angelidaki, Irini

    2016-01-01

    Biogas production is a very complex process due to the high complexity in diversity and interactions of the microorganisms mediating it, and only limited and diffuse knowledge exists about the variation of taxonomic and functional patterns of microbiomes across different biogas reactors, and their relationships with the metabolic patterns. The present study used metagenomic sequencing and radioisotopic analysis to assess the taxonomic, functional, and metabolic patterns of microbiomes from 14 full-scale biogas reactors operated under various conditions treating either sludge or manure. The results from metagenomic analysis showed that the dominant methanogenic pathway revealed by radioisotopic analysis was not always correlated with the taxonomic and functional compositions. It was found by radioisotopic experiments that the aceticlastic methanogenic pathway was dominant, while metagenomics analysis showed higher relative abundance of hydrogenotrophic methanogens. Principal coordinates analysis showed the sludge-based samples were clearly distinct from the manure-based samples for both taxonomic and functional patterns, and canonical correspondence analysis showed that the both temperature and free ammonia were crucial environmental variables shaping the taxonomic and functional patterns. The study further the overall patterns of functional genes were strongly correlated with overall patterns of taxonomic composition across different biogas reactors. The discrepancy between the metabolic patterns determined by metagenomic analysis and metabolic pathways determined by radioisotopic analysis was found. Besides, a clear correlation between taxonomic and functional patterns was demonstrated for biogas reactors, and also the environmental factors that shaping both taxonomic and functional genes patterns were identified.

  5. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    Science.gov (United States)

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  6. Microarray analysis of the gene expression profile in triethylene glycol dimethacrylate-treated human dental pulp cells.

    Science.gov (United States)

    Torun, D; Torun, Z Ö; Demirkaya, K; Sarper, M; Elçi, M P; Avcu, F

    2017-11-01

    Triethylene glycol dimethacrylate (TEGDMA) is an important resin monomer commonly used in the structure of dental restorative materials. Recent studies have shown that unpolymerized resin monomers may be released into the oral environment and cause harmful biological effects. We investigated changes in the gene expression profiles of TEGDMA-treated human dental pulp cells (hDPCs) following short- (1-day) and long-term (7-days) exposure. HDPCs were exposed to a noncytotoxic concentration of TEGDMA, and gene expression profiles were evaluated by microarray analysis. The results were confirmed by quantitative reverse-transcriptase PCR (qRT PCR). In total, 1282 and 1319 genes (up- or down-regulated) were differentially expressed compared with control group after the 1- and 7-day incubation periods, respectively. Biological ontology-based analyses revealed that metabolic, cellular, and developmental processes constituted the largest groups of biological functional processes. qRT-PCR analysis on bone morphogenetic protein-2 (BMP-2), BMP-4, secreted protein, acidic, cysteine-rich, collagen type I alpha 1, oxidative stress-induced growth inhibitor 1, MMP3, interleukin-6, and heme oxygenase-1 genes confirmed the changes in expression observed in the microarray analysis. Our results suggest that TEGDMA can change the many functions of hDPCs through large changes in gene expression levels and complex interactions with different signaling pathways.

  7. Bioinformatics analysis of the phytoene synthase gene in cabbage (Brassica oleracea var. capitata)

    Science.gov (United States)

    Sun, Bo; Jiang, Min; Xue, Shengling; Zheng, Aihong; Zhang, Fen; Tang, Haoru

    2018-04-01

    Phytoene Synthase (PSY) is an important enzyme in carotenoid biosynthesis. Here, the Brassica oleracea var. capitata PSY (BocPSY) gene sequences were obtained from Brassica database (BRAD), and preformed for bioinformatics analysis. The BocPSY1, BocPSY2 and BocPSY3 genes mapped to chromosomes 2,3 and 9, and contains an open reading frame of 1,248 bp, 1,266 bp and 1,275 bp that encodes a 415, 421, 424 amino acid protein, respectively. Subcellular localization predicted all BocPSY genes were in the chloroplast. The conserved domain of the BocPSY protein is PLN02632. Homology analysis indicates that the levels of identity among BocPSYs were all more than 85%, and the PSY protein is apparently conserved during plant evolution. The findings of the present study provide a molecular basis for the elucidation of PSY gene function in cabbage.

  8. GxGrare: gene-gene interaction analysis method for rare variants from high-throughput sequencing data.

    Science.gov (United States)

    Kwon, Minseok; Leem, Sangseob; Yoon, Joon; Park, Taesung

    2018-03-19

    With the rapid advancement of array-based genotyping techniques, genome-wide association studies (GWAS) have successfully identified common genetic variants associated with common complex diseases. However, it has been shown that only a small proportion of the genetic etiology of complex diseases could be explained by the genetic factors identified from GWAS. This missing heritability could possibly be explained by gene-gene interaction (epistasis) and rare variants. There has been an exponential growth of gene-gene interaction analysis for common variants in terms of methodological developments and practical applications. Also, the recent advancement of high-throughput sequencing technologies makes it possible to conduct rare variant analysis. However, little progress has been made in gene-gene interaction analysis for rare variants. Here, we propose GxGrare which is a new gene-gene interaction method for the rare variants in the framework of the multifactor dimensionality reduction (MDR) analysis. The proposed method consists of three steps; 1) collapsing the rare variants, 2) MDR analysis for the collapsed rare variants, and 3) detect top candidate interaction pairs. GxGrare can be used for the detection of not only gene-gene interactions, but also interactions within a single gene. The proposed method is illustrated with 1080 whole exome sequencing data of the Korean population in order to identify causal gene-gene interaction for rare variants for type 2 diabetes. The proposed GxGrare performs well for gene-gene interaction detection with collapsing of rare variants. GxGrare is available at http://bibs.snu.ac.kr/software/gxgrare which contains simulation data and documentation. Supported operating systems include Linux and OS X.

  9. Progress and challenges in the computational prediction of gene function using networks [v1; ref status: indexed, http://f1000r.es/SqmJUM

    Directory of Open Access Journals (Sweden)

    Paul Pavlidis

    2012-09-01

    Full Text Available In this opinion piece, we attempt to unify recent arguments we have made that serious confounds affect the use of network data to predict and characterize gene function. The development of computational approaches to determine gene function is a major strand of computational genomics research. However, progress beyond using BLAST to transfer annotations has been surprisingly slow. We have previously argued that a large part of the reported success in using "guilt by association" in network data is due to the tendency of methods to simply assign new functions to already well-annotated genes. While such predictions will tend to be correct, they are generic; it is true, but not very helpful, that a gene with many functions is more likely to have any function. We have also presented evidence that much of the remaining performance in cross-validation cannot be usefully generalized to new predictions, making progressive improvement in analysis difficult to engineer. Here we summarize our findings about how these problems will affect network analysis, discuss some ongoing responses within the field to these issues, and consolidate some recommendations and speculation, which we hope will modestly increase the reliability and specificity of gene function prediction.

  10. Evolution of the snake body form reveals homoplasy in amniote Hox gene function.

    Science.gov (United States)

    Head, Jason J; Polly, P David

    2015-04-02

    Hox genes regulate regionalization of the axial skeleton in vertebrates, and changes in their expression have been proposed to be a fundamental mechanism driving the evolution of new body forms. The origin of the snake-like body form, with its deregionalized pre-cloacal axial skeleton, has been explained as either homogenization of Hox gene expression domains, or retention of standard vertebrate Hox domains with alteration of downstream expression that suppresses development of distinct regions. Both models assume a highly regionalized ancestor, but the extent of deregionalization of the primaxial domain (vertebrae, dorsal ribs) of the skeleton in snake-like body forms has never been analysed. Here we combine geometric morphometrics and maximum-likelihood analysis to show that the pre-cloacal primaxial domain of elongate, limb-reduced lizards and snakes is not deregionalized compared with limbed taxa, and that the phylogenetic structure of primaxial morphology in reptiles does not support a loss of regionalization in the evolution of snakes. We demonstrate that morphometric regional boundaries correspond to mapped gene expression domains in snakes, suggesting that their primaxial domain is patterned by a normally functional Hox code. Comparison of primaxial osteology in fossil and modern amniotes with Hox gene distributions within Amniota indicates that a functional, sequentially expressed Hox code patterned a subtle morphological gradient along the anterior-posterior axis in stem members of amniote clades and extant lizards, including snakes. The highly regionalized skeletons of extant archosaurs and mammals result from independent evolution in the Hox code and do not represent ancestral conditions for clades with snake-like body forms. The developmental origin of snakes is best explained by decoupling of the primaxial and abaxial domains and by increases in somite number, not by changes in the function of primaxial Hox genes.

  11. Model-based gene set analysis for Bioconductor.

    Science.gov (United States)

    Bauer, Sebastian; Robinson, Peter N; Gagneur, Julien

    2011-07-01

    Gene Ontology and other forms of gene-category analysis play a major role in the evaluation of high-throughput experiments in molecular biology. Single-category enrichment analysis procedures such as Fisher's exact test tend to flag large numbers of redundant categories as significant, which can complicate interpretation. We have recently developed an approach called model-based gene set analysis (MGSA), that substantially reduces the number of redundant categories returned by the gene-category analysis. In this work, we present the Bioconductor package mgsa, which makes the MGSA algorithm available to users of the R language. Our package provides a simple and flexible application programming interface for applying the approach. The mgsa package has been made available as part of Bioconductor 2.8. It is released under the conditions of the Artistic license 2.0. peter.robinson@charite.de; julien.gagneur@embl.de.

  12. Functional conservation of the Drosophila gooseberry gene and its evolutionary alleles.

    Directory of Open Access Journals (Sweden)

    Wei Liu

    Full Text Available The Drosophila Pax gene gooseberry (gsb is required for development of the larval cuticle and CNS, survival to adulthood, and male fertility. These functions can be rescued in gsb mutants by two gsb evolutionary alleles, gsb-Prd and gsb-Pax3, which express the Drosophila Paired and mouse Pax3 proteins under the control of gooseberry cis-regulatory region. Therefore, both Paired and Pax3 proteins have conserved all the Gsb functions that are required for survival of embryos to fertile adults, despite the divergent primary sequences in their C-terminal halves. As gsb-Prd and gsb-Pax3 uncover a gsb function involved in male fertility, construction of evolutionary alleles may provide a powerful strategy to dissect hitherto unknown gene functions. Our results provide further evidence for the essential role of cis-regulatory regions in the functional diversification of duplicated genes during evolution.

  13. Comparative Genomic Analysis of Soybean Flowering Genes

    Science.gov (United States)

    Jung, Chol-Hee; Wong, Chui E.; Singh, Mohan B.; Bhalla, Prem L.

    2012-01-01

    Flowering is an important agronomic trait that determines crop yield. Soybean is a major oilseed legume crop used for human and animal feed. Legumes have unique vegetative and floral complexities. Our understanding of the molecular basis of flower initiation and development in legumes is limited. Here, we address this by using a computational approach to examine flowering regulatory genes in the soybean genome in comparison to the most studied model plant, Arabidopsis. For this comparison, a genome-wide analysis of orthologue groups was performed, followed by an in silico gene expression analysis of the identified soybean flowering genes. Phylogenetic analyses of the gene families highlighted the evolutionary relationships among these candidates. Our study identified key flowering genes in soybean and indicates that the vernalisation and the ambient-temperature pathways seem to be the most variant in soybean. A comparison of the orthologue groups containing flowering genes indicated that, on average, each Arabidopsis flowering gene has 2-3 orthologous copies in soybean. Our analysis highlighted that the CDF3, VRN1, SVP, AP3 and PIF3 genes are paralogue-rich genes in soybean. Furthermore, the genome mapping of the soybean flowering genes showed that these genes are scattered randomly across the genome. A paralogue comparison indicated that the soybean genes comprising the largest orthologue group are clustered in a 1.4 Mb region on chromosome 16 of soybean. Furthermore, a comparison with the undomesticated soybean (Glycine soja) revealed that there are hundreds of SNPs that are associated with putative soybean flowering genes and that there are structural variants that may affect the genes of the light-signalling and ambient-temperature pathways in soybean. Our study provides a framework for the soybean flowering pathway and insights into the relationship and evolution of flowering genes between a short-day soybean and the long-day plant, Arabidopsis. PMID:22679494

  14. Mutational analysis of the HGO gene in Finnish alkaptonuria patients

    Science.gov (United States)

    de Bernabe, D. B.-V.; Peterson, P.; Luopajarvi, K.; Matintalo, P.; Alho, A.; Konttinen, Y.; Krohn, K.; de Cordoba, S. R.; Ranki, A.

    1999-01-01

    Alkaptonuria (AKU), the prototypic inborn error of metabolism, has recently been shown to be caused by loss of function mutations in the homogentisate-1,2-dioxygenase gene (HGO). So far 17 mutations have been characterised in AKU patients of different ethnic origin. We describe three novel mutations (R58fs, R330S, and H371R) and one common AKU mutation (M368V), detected by mutational and polymorphism analysis of the HGO gene in five Finnish AKU pedigrees. The three novel AKU mutations are most likely specific for the Finnish population and have originated recently.


Keywords: alkaptonuria; homogentisate-1,2-dioxygenase; Finland PMID:10594001

  15. psygenet2r: a R/Bioconductor package for the analysis of psychiatric disease genes.

    Science.gov (United States)

    Gutiérrez-Sacristán, Alba; Hernández-Ferrer, Carles; González, Juan R; Furlong, Laura I

    2017-12-15

    Psychiatric disorders have a great impact on morbidity and mortality. Genotype-phenotype resources for psychiatric diseases are key to enable the translation of research findings to a better care of patients. PsyGeNET is a knowledge resource on psychiatric diseases and their genes, developed by text mining and curated by domain experts. We present psygenet2r, an R package that contains a variety of functions for leveraging PsyGeNET database and facilitating its analysis and interpretation. The package offers different types of queries to the database along with variety of analysis and visualization tools, including the study of the anatomical structures in which the genes are expressed and gaining insight of gene's molecular function. Psygenet2r is especially suited for network medicine analysis of psychiatric disorders. The package is implemented in R and is available under MIT license from Bioconductor (http://bioconductor.org/packages/release/bioc/html/psygenet2r.html). juanr.gonzalez@isglobal.org or laura.furlong@upf.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  16. A functional analysis of the formyl-coenzyme A (frc) gene from Lactobacillus reuteri 100-23C.

    Science.gov (United States)

    Kullin, B; Tannock, G W; Loach, D M; Kimura, K; Abratt, V R; Reid, S J

    2014-06-01

    To examine the role of the Lactobacillus reuteri 100-23C frc gene product in oxalate metabolism, host colonization and the acid stress response. Genes encoding putative formyl-CoA transferase (frc) and oxalyl-CoA decarboxylase (oxc) enzymes are present in the genome sequences of Lact. reuteri strains. Two strains isolated from humans harboured an IS200 insertion sequence in the frc ORF and a group 2 intron-associated transposase downstream of the frc gene, both of which were lacking in two strains of animal origin, which contained intact frc and oxc genes. An frc(-) insertional mutant of Lact. reuteri 100-23C was compared with the parent strain with respect to oxalate degradation, colonization of an RLF-mouse host model and growth in the presence of acids. Neither parent nor mutant degraded oxalate in vitro or in vivo. However, the parent outcompeted the frc(-) mutant in the mouse intestine during co-colonization and the frc(-) mutant showed a reduced growth rate in the presence of hydrochloric acid. Intact oxc and frc genes do not ensure oxalate degradation under the conditions tested. The frc gene product is important during host colonization and survival of acid stress by Lact. reuteri 100-23C. Oxalate metabolism by oxalate-degrading intestinal bacterial strains may be important in preventing urolithiasis and might lead to the derivation of probiotic products. To produce safe and efficacious probiotics, however, an understanding of the genetic characteristics of potential oxalate degraders must be obtained, together with knowledge of their functional ramifications. © 2014 The Society for Applied Microbiology.

  17. Lynx web services for annotations and systems analysis of multi-gene disorders.

    Science.gov (United States)

    Sulakhe, Dinanath; Taylor, Andrew; Balasubramanian, Sandhya; Feng, Bo; Xie, Bingqing; Börnigen, Daniela; Dave, Utpal J; Foster, Ian T; Gilliam, T Conrad; Maltsev, Natalia

    2014-07-01

    Lynx is a web-based integrated systems biology platform that supports annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Lynx has integrated multiple classes of biomedical data (genomic, proteomic, pathways, phenotypic, toxicogenomic, contextual and others) from various public databases as well as manually curated data from our group and collaborators (LynxKB). Lynx provides tools for gene list enrichment analysis using multiple functional annotations and network-based gene prioritization. Lynx provides access to the integrated database and the analytical tools via REST based Web Services (http://lynx.ci.uchicago.edu/webservices.html). This comprises data retrieval services for specific functional annotations, services to search across the complete LynxKB (powered by Lucene), and services to access the analytical tools built within the Lynx platform. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Functional analysis of PGRP-LA in Drosophila immunity.

    Directory of Open Access Journals (Sweden)

    Mathilde Gendrin

    Full Text Available PeptidoGlycan Recognition Proteins (PGRPs are key regulators of the insect innate antibacterial response. Even if they have been intensively studied, some of them have yet unknown functions. Here, we present a functional analysis of PGRP-LA, an as yet uncharacterized Drosophila PGRP. The PGRP-LA gene is located in cluster with PGRP-LC and PGRP-LF, which encode a receptor and a negative regulator of the Imd pathway, respectively. Structure predictions indicate that PGRP-LA would not bind to peptidoglycan, pointing to a regulatory role of this PGRP. PGRP-LA expression was enriched in barrier epithelia, but low in the fat body. Use of a newly generated PGRP-LA deficient mutant indicates that PGRP-LA is not required for the production of antimicrobial peptides by the fat body in response to a systemic infection. Focusing on the respiratory tract, where PGRP-LA is strongly expressed, we conducted a genome-wide microarray analysis of the tracheal immune response of wild-type, Relish, and PGRP-LA mutant larvae. Comparing our data to previous microarray studies, we report that a majority of genes regulated in the trachea upon infection differ from those induced in the gut or the fat body. Importantly, antimicrobial peptide gene expression was reduced in the tracheae of larvae and in the adult gut of PGRP-LA-deficient Drosophila upon oral bacterial infection. Together, our results suggest that PGRP-LA positively regulates the Imd pathway in barrier epithelia.

  19. Genome-wide analysis of the GRAS gene family in physic nut (Jatropha curcas L.).

    Science.gov (United States)

    Wu, Z Y; Wu, P Z; Chen, Y P; Li, M R; Wu, G J; Jiang, H W

    2015-12-29

    GRAS proteins play vital roles in plant growth and development. Physic nut (Jatropha curcas L.) was found to have a total of 48 GRAS family members (JcGRAS), 15 more than those found in Arabidopsis. The JcGRAS genes were divided into 12 subfamilies or 15 ancient monophyletic lineages based on the phylogenetic analysis of GRAS proteins from both flowering and lower plants. The functions of GRAS genes in 9 subfamilies have been reported previously for several plants, while the genes in the remaining 3 subfamilies were of unknown function; we named the latter families U1 to U3. No member of U3 subfamily is present in Arabidopsis and Poaceae species according to public genome sequence data. In comparison with the number of GRAS genes in Arabidopsis, more were detected in physic nut, resulting from the retention of many ancient GRAS subfamilies and the formation of tandem repeats during evolution. No evidence of recent duplication among JcGRAS genes was observed in physic nut. Based on digital gene expression data, 21 of the 48 genes exhibited differential expression in four tissues analyzed. Two members of subfamily U3 were expressed only in buds and flowers, implying that they may play specific roles. Our results provide valuable resources for future studies on the functions of GRAS proteins in physic nut.

  20. A functional alternative splicing mutation in AIRE gene causes autoimmune polyendocrine syndrome type 1.

    Directory of Open Access Journals (Sweden)

    Junyu Zhang

    Full Text Available Autoimmune polyendocrine syndrome type 1 (APS-1 is a rare autosomal recessive disease defined by the presence of two of the three conditions: mucocutaneous candidiasis, hypoparathyroidism, and Addison's disease. Loss-of-function mutations of the autoimmune regulator (AIRE gene have been linked to APS-1. Here we report mutational analysis and functional characterization of an AIRE mutation in a consanguineous Chinese family with APS-1. All exons of the AIRE gene and adjacent exon-intron sequences were amplified by PCR and subsequently sequenced. We identified a homozygous missense AIRE mutation c.463G>A (p.Gly155Ser in two siblings with different clinical features of APS-1. In silico splice-site prediction and minigene analysis were carried out to study the potential pathological consequence. Minigene splicing analysis and subsequent cDNA sequencing revealed that the AIRE mutation potentially compromised the recognition of the splice donor of intron 3, causing alternative pre-mRNA splicing by intron 3 retention. Furthermore, the aberrant AIRE transcript was identified in a heterozygous carrier of the c.463G>A mutation. The aberrant intron 3-retaining transcript generated a truncated protein (p.G155fsX203 containing the first 154 AIRE amino acids and followed by 48 aberrant amino acids. Therefore, our study represents the first functional characterization of the alternatively spliced AIRE mutation that may explain the pathogenetic role in APS-1.