WorldWideScience

Sample records for gene ontolgy tool

  1. The Arabidopsis co-expression tool (act): a WWW-based tool and database for microarray-based gene expression analysis

    DEFF Research Database (Denmark)

    Jen, C. H.; Manfield, I. W.; Michalopoulos, D. W.

    2006-01-01

    be examined using the novel clique finder tool to determine the sets of genes most likely to be regulated in a similar manner. In combination, these tools offer three levels of analysis: creation of correlation lists of co-expressed genes, refinement of these lists using two-dimensional scatter plots......We present a new WWW-based tool for plant gene analysis, the Arabidopsis Co-Expression Tool (act) , based on a large Arabidopsis thaliana microarray data set obtained from the Nottingham Arabidopsis Stock Centre. The co-expression analysis tool allows users to identify genes whose expression...

  2. Computational Tools and Algorithms for Designing Customized Synthetic Genes

    Energy Technology Data Exchange (ETDEWEB)

    Gould, Nathan [Department of Computer Science, The College of New Jersey, Ewing, NJ (United States); Hendy, Oliver [Department of Biology, The College of New Jersey, Ewing, NJ (United States); Papamichail, Dimitris, E-mail: papamicd@tcnj.edu [Department of Computer Science, The College of New Jersey, Ewing, NJ (United States)

    2014-10-06

    Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations.

  3. Computational Tools and Algorithms for Designing Customized Synthetic Genes

    International Nuclear Information System (INIS)

    Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris

    2014-01-01

    Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations.

  4. Computational Tools and Algorithms for Designing Customized Synthetic Genes

    Directory of Open Access Journals (Sweden)

    Nathan eGould

    2014-10-01

    Full Text Available Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de-novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations.

  5. Gene Ontology-Based Analysis of Zebrafish Omics Data Using the Web Tool Comparative Gene Ontology.

    Science.gov (United States)

    Ebrahimie, Esmaeil; Fruzangohar, Mario; Moussavi Nik, Seyyed Hani; Newman, Morgan

    2017-10-01

    Gene Ontology (GO) analysis is a powerful tool in systems biology, which uses a defined nomenclature to annotate genes/proteins within three categories: "Molecular Function," "Biological Process," and "Cellular Component." GO analysis can assist in revealing functional mechanisms underlying observed patterns in transcriptomic, genomic, and proteomic data. The already extensive and increasing use of zebrafish for modeling genetic and other diseases highlights the need to develop a GO analytical tool for this organism. The web tool Comparative GO was originally developed for GO analysis of bacterial data in 2013 ( www.comparativego.com ). We have now upgraded and elaborated this web tool for analysis of zebrafish genetic data using GOs and annotations from the Gene Ontology Consortium.

  6. GO(vis), a gene ontology visualization tool based on multi-dimensional values.

    Science.gov (United States)

    Ning, Zi; Jiang, Zhenran

    2010-05-01

    Most of gene product similarity measurements concentrate on the information content of Gene Ontology (GO) terms or use a path-based similarity between GO terms, which may ignore other important information contained in the structure of the ontology. In our study, we integrate different GO similarity measure approaches to analyze the functional relationship of genes and gene products with a new triangle-based visualization tool called GO(Vis). The purpose of this tool is to demonstrate the effect of three important information factors when measuring the similarity between gene products. One advantage of this tool is that its important ratio can be adjusted to meet different measuring requirements according to the biological knowledge of each factor. The experimental results demonstrate that GO(Vis) can display diagrams of the functional relationship for gene products effectively.

  7. GeneAnalytics: An Integrative Gene Set Analysis Tool for Next Generation Sequencing, RNAseq and Microarray Data.

    Science.gov (United States)

    Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit

    2016-03-01

    Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics

  8. New tools in regenerative medicine: gene therapy.

    Science.gov (United States)

    Muñoz Ruiz, Miguel; Regueiro, José R

    2012-01-01

    Gene therapy aims to transfer genetic material into cells to provide them with new functions. A gene transfer agent has to be safe, capable of expressing the desired gene for a sustained period of time in a sufficiently large population of cells to produce a biological effect. Identifying a gene transfer tool that meets all of these criteria has proven to be a difficult objective. Viral and nonviral vectors, in vivo, ex vivo and in situ strategies co-exist at present, although ex vivo lenti-or retroviral vectors are presently the most popular.Natural stem cells (from embryonic, hematopoietic, mesenchymal, or adult tissues) or induced progenitor stem (iPS) cells can be modified by gene therapy for use in regenerative medicine. Among them, hematopoietic stem cells have shown clear clinical benefit, but iPS cells hold humongous potential with no ethical concerns.

  9. A literature search tool for intelligent extraction of disease-associated genes.

    Science.gov (United States)

    Jung, Jae-Yoon; DeLuca, Todd F; Nelson, Tristan H; Wall, Dennis P

    2014-01-01

    To extract disorder-associated genes from the scientific literature in PubMed with greater sensitivity for literature-based support than existing methods. We developed a PubMed query to retrieve disorder-related, original research articles. Then we applied a rule-based text-mining algorithm with keyword matching to extract target disorders, genes with significant results, and the type of study described by the article. We compared our resulting candidate disorder genes and supporting references with existing databases. We demonstrated that our candidate gene set covers nearly all genes in manually curated databases, and that the references supporting the disorder-gene link are more extensive and accurate than other general purpose gene-to-disorder association databases. We implemented a novel publication search tool to find target articles, specifically focused on links between disorders and genotypes. Through comparison against gold-standard manually updated gene-disorder databases and comparison with automated databases of similar functionality we show that our tool can search through the entirety of PubMed to extract the main gene findings for human diseases rapidly and accurately.

  10. Mutated genes as research tool

    International Nuclear Information System (INIS)

    1981-01-01

    mutations, it was pointed out that analogous genetical structures exist in all living organisms, the more closely related, the more similar. This is reflected in strikingly similar biochemical pathways, leading from the primary gene message to the ultimate compound or trait. Induced mutations are a unique tool for analysing these gene-controlled pathways, thus leading also to a better understanding of natural evolution

  11. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    Science.gov (United States)

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  12. FunGeneNet: a web tool to estimate enrichment of functional interactions in experimental gene sets.

    Science.gov (United States)

    Tiys, Evgeny S; Ivanisenko, Timofey V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2018-02-09

    Estimation of functional connectivity in gene sets derived from genome-wide or other biological experiments is one of the essential tasks of bioinformatics. A promising approach for solving this problem is to compare gene networks built using experimental gene sets with random networks. One of the resources that make such an analysis possible is CrossTalkZ, which uses the FunCoup database. However, existing methods, including CrossTalkZ, do not take into account individual types of interactions, such as protein/protein interactions, expression regulation, transport regulation, catalytic reactions, etc., but rather work with generalized types characterizing the existence of any connection between network members. We developed the online tool FunGeneNet, which utilizes the ANDSystem and STRING to reconstruct gene networks using experimental gene sets and to estimate their difference from random networks. To compare the reconstructed networks with random ones, the node permutation algorithm implemented in CrossTalkZ was taken as a basis. To study the FunGeneNet applicability, the functional connectivity analysis of networks constructed for gene sets involved in the Gene Ontology biological processes was conducted. We showed that the method sensitivity exceeds 0.8 at a specificity of 0.95. We found that the significance level of the difference between gene networks of biological processes and random networks is determined by the type of connections considered between objects. At the same time, the highest reliability is achieved for the generalized form of connections that takes into account all the individual types of connections. By taking examples of the thyroid cancer networks and the apoptosis network, it is demonstrated that key participants in these processes are involved in the interactions of those types by which these networks differ from random ones. FunGeneNet is a web tool aimed at proving the functionality of networks in a wide range of sizes of

  13. ReliefSeq: a gene-wise adaptive-K nearest-neighbor feature selection tool for finding gene-gene interactions and main effects in mRNA-Seq gene expression data.

    Directory of Open Access Journals (Sweden)

    Brett A McKinney

    Full Text Available Relief-F is a nonparametric, nearest-neighbor machine learning method that has been successfully used to identify relevant variables that may interact in complex multivariate models to explain phenotypic variation. While several tools have been developed for assessing differential expression in sequence-based transcriptomics, the detection of statistical interactions between transcripts has received less attention in the area of RNA-seq analysis. We describe a new extension and assessment of Relief-F for feature selection in RNA-seq data. The ReliefSeq implementation adapts the number of nearest neighbors (k for each gene to optimize the Relief-F test statistics (importance scores for finding both main effects and interactions. We compare this gene-wise adaptive-k (gwak Relief-F method with standard RNA-seq feature selection tools, such as DESeq and edgeR, and with the popular machine learning method Random Forests. We demonstrate performance on a panel of simulated data that have a range of distributional properties reflected in real mRNA-seq data including multiple transcripts with varying sizes of main effects and interaction effects. For simulated main effects, gwak-Relief-F feature selection performs comparably to standard tools DESeq and edgeR for ranking relevant transcripts. For gene-gene interactions, gwak-Relief-F outperforms all comparison methods at ranking relevant genes in all but the highest fold change/highest signal situations where it performs similarly. The gwak-Relief-F algorithm outperforms Random Forests for detecting relevant genes in all simulation experiments. In addition, Relief-F is comparable to the other methods based on computational time. We also apply ReliefSeq to an RNA-Seq study of smallpox vaccine to identify gene expression changes between vaccinia virus-stimulated and unstimulated samples. ReliefSeq is an attractive tool for inclusion in the suite of tools used for analysis of mRNA-Seq data; it has power to

  14. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool.

    Science.gov (United States)

    Chen, Edward Y; Tan, Christopher M; Kou, Yan; Duan, Qiaonan; Wang, Zichen; Meirelles, Gabriela Vaz; Clark, Neil R; Ma'ayan, Avi

    2013-04-15

    System-wide profiling of genes and proteins in mammalian cells produce lists of differentially expressed genes/proteins that need to be further analyzed for their collective functions in order to extract new knowledge. Once unbiased lists of genes or proteins are generated from such experiments, these lists are used as input for computing enrichment with existing lists created from prior knowledge organized into gene-set libraries. While many enrichment analysis tools and gene-set libraries databases have been developed, there is still room for improvement. Here, we present Enrichr, an integrative web-based and mobile software application that includes new gene-set libraries, an alternative approach to rank enriched terms, and various interactive visualization approaches to display enrichment results using the JavaScript library, Data Driven Documents (D3). The software can also be embedded into any tool that performs gene list analysis. We applied Enrichr to analyze nine cancer cell lines by comparing their enrichment signatures to the enrichment signatures of matched normal tissues. We observed a common pattern of up regulation of the polycomb group PRC2 and enrichment for the histone mark H3K27me3 in many cancer cell lines, as well as alterations in Toll-like receptor and interlukin signaling in K562 cells when compared with normal myeloid CD33+ cells. Such analyses provide global visualization of critical differences between normal tissues and cancer cell lines but can be applied to many other scenarios. Enrichr is an easy to use intuitive enrichment analysis web-based tool providing various types of visualization summaries of collective functions of gene lists. Enrichr is open source and freely available online at: http://amp.pharm.mssm.edu/Enrichr.

  15. ACE-it: a tool for genome-wide integration of gene dosage and RNA expression data

    NARCIS (Netherlands)

    van Wieringen, W.N.; Belien, J.A.M.; Vosse, S.; Achame, E.M.; Ylstra, B.

    2006-01-01

    Summary: We describe a tool, called ACE-it (Array CGH Expression integration tool). ACE-it links the chromosomal position of the gene dosage measured by array CGH to the genes measured by the expression array. ACE-it uses this link to statistically test whether gene dosage affects RNA expression. ©

  16. FastGCN: a GPU accelerated tool for fast gene co-expression networks.

    Directory of Open Access Journals (Sweden)

    Meimei Liang

    Full Text Available Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.

  17. Synthetic RNAs for Gene Regulation: Design Principles and Computational Tools

    International Nuclear Information System (INIS)

    Laganà, Alessandro; Shasha, Dennis; Croce, Carlo Maria

    2014-01-01

    The use of synthetic non-coding RNAs for post-transcriptional regulation of gene expression has not only become a standard laboratory tool for gene functional studies but it has also opened up new perspectives in the design of new and potentially promising therapeutic strategies. Bioinformatics has provided researchers with a variety of tools for the design, the analysis, and the evaluation of RNAi agents such as small-interfering RNA (siRNA), short-hairpin RNA (shRNA), artificial microRNA (a-miR), and microRNA sponges. More recently, a new system for genome engineering based on the bacterial CRISPR-Cas9 system (Clustered Regularly Interspaced Short Palindromic Repeats), was shown to have the potential to also regulate gene expression at both transcriptional and post-transcriptional level in a more specific way. In this mini review, we present RNAi and CRISPRi design principles and discuss the advantages and limitations of the current design approaches.

  18. Synthetic RNAs for Gene Regulation: Design Principles and Computational Tools

    Energy Technology Data Exchange (ETDEWEB)

    Laganà, Alessandro [Department of Molecular Virology, Immunology and Medical Genetics, Comprehensive Cancer Center, The Ohio State University, Columbus, OH (United States); Shasha, Dennis [Courant Institute of Mathematical Sciences, New York University, New York, NY (United States); Croce, Carlo Maria [Department of Molecular Virology, Immunology and Medical Genetics, Comprehensive Cancer Center, The Ohio State University, Columbus, OH (United States)

    2014-12-11

    The use of synthetic non-coding RNAs for post-transcriptional regulation of gene expression has not only become a standard laboratory tool for gene functional studies but it has also opened up new perspectives in the design of new and potentially promising therapeutic strategies. Bioinformatics has provided researchers with a variety of tools for the design, the analysis, and the evaluation of RNAi agents such as small-interfering RNA (siRNA), short-hairpin RNA (shRNA), artificial microRNA (a-miR), and microRNA sponges. More recently, a new system for genome engineering based on the bacterial CRISPR-Cas9 system (Clustered Regularly Interspaced Short Palindromic Repeats), was shown to have the potential to also regulate gene expression at both transcriptional and post-transcriptional level in a more specific way. In this mini review, we present RNAi and CRISPRi design principles and discuss the advantages and limitations of the current design approaches.

  19. GeneRecon—A coalescent based tool for fine-scale association mapping

    DEFF Research Database (Denmark)

    Mailund, Thomas; Schierup, Mikkel Heide; Pedersen, Christian Nørgaard Storm

    2006-01-01

    GeneRecon is a tool for fine-scale association mapping using a coalescence model. GeneRecon takes as input case-control data from phased or unphased SNP and micro-satellite genotypes. The posterior distribution of disease locus position is obtained by Metropolis Hastings sampling in the state space...

  20. Semantic integration of gene expression analysis tools and data sources using software connectors

    Science.gov (United States)

    2013-01-01

    Background The study and analysis of gene expression measurements is the primary focus of functional genomics. Once expression data is available, biologists are faced with the task of extracting (new) knowledge associated to the underlying biological phenomenon. Most often, in order to perform this task, biologists execute a number of analysis activities on the available gene expression dataset rather than a single analysis activity. The integration of heteregeneous tools and data sources to create an integrated analysis environment represents a challenging and error-prone task. Semantic integration enables the assignment of unambiguous meanings to data shared among different applications in an integrated environment, allowing the exchange of data in a semantically consistent and meaningful way. This work aims at developing an ontology-based methodology for the semantic integration of gene expression analysis tools and data sources. The proposed methodology relies on software connectors to support not only the access to heterogeneous data sources but also the definition of transformation rules on exchanged data. Results We have studied the different challenges involved in the integration of computer systems and the role software connectors play in this task. We have also studied a number of gene expression technologies, analysis tools and related ontologies in order to devise basic integration scenarios and propose a reference ontology for the gene expression domain. Then, we have defined a number of activities and associated guidelines to prescribe how the development of connectors should be carried out. Finally, we have applied the proposed methodology in the construction of three different integration scenarios involving the use of different tools for the analysis of different types of gene expression data. Conclusions The proposed methodology facilitates the development of connectors capable of semantically integrating different gene expression analysis tools

  1. G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Directory of Open Access Journals (Sweden)

    Lemay Danielle G

    2012-09-01

    Full Text Available Abstract Background In previous studies, gene neighborhoods—spatial clusters of co-expressed genes in the genome—have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously. Results Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods. Conclusions Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The

  2. PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements.

    Science.gov (United States)

    Mi, Huaiyu; Huang, Xiaosong; Muruganujan, Anushya; Tang, Haiming; Mills, Caitlin; Kang, Diane; Thomas, Paul D

    2017-01-04

    The PANTHER database (Protein ANalysis THrough Evolutionary Relationships, http://pantherdb.org) contains comprehensive information on the evolution and function of protein-coding genes from 104 completely sequenced genomes. PANTHER software tools allow users to classify new protein sequences, and to analyze gene lists obtained from large-scale genomics experiments. In the past year, major improvements include a large expansion of classification information available in PANTHER, as well as significant enhancements to the analysis tools. Protein subfamily functional classifications have more than doubled due to progress of the Gene Ontology Phylogenetic Annotation Project. For human genes (as well as a few other organisms), PANTHER now also supports enrichment analysis using pathway classifications from the Reactome resource. The gene list enrichment tools include a new 'hierarchical view' of results, enabling users to leverage the structure of the classifications/ontologies; the tools also allow users to upload genetic variant data directly, rather than requiring prior conversion to a gene list. The updated coding single-nucleotide polymorphisms (SNP) scoring tool uses an improved algorithm. The hidden Markov model (HMM) search tools now use HMMER3, dramatically reducing search times and improving accuracy of E-value statistics. Finally, the PANTHER Tree-Attribute Viewer has been implemented in JavaScript, with new views for exploring protein sequence evolution. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Degrees of separation as a statistical tool for evaluating candidate genes.

    Science.gov (United States)

    Nelson, Ronald M; Pettersson, Mats E

    2014-12-01

    Selection of candidate genes is an important step in the exploration of complex genetic architecture. The number of gene networks available is increasing and these can provide information to help with candidate gene selection. It is currently common to use the degree of connectedness in gene networks as validation in Genome Wide Association (GWA) and Quantitative Trait Locus (QTL) mapping studies. However, it can cause misleading results if not validated properly. Here we present a method and tool for validating the gene pairs from GWA studies given the context of the network they co-occur in. It ensures that proposed interactions and gene associations are not statistical artefacts inherent to the specific gene network architecture. The CandidateBacon package provides an easy and efficient method to calculate the average degree of separation (DoS) between pairs of genes to currently available gene networks. We show how these empirical estimates of average connectedness are used to validate candidate gene pairs. Validation of interacting genes by comparing their connectedness with the average connectedness in the gene network will provide support for said interactions by utilising the growing amount of gene network information available. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. Gene therapy as a potential tool for treating neuroblastoma-a focused review.

    Science.gov (United States)

    Kumar, M D; Dravid, A; Kumar, A; Sen, D

    2016-05-01

    Neuroblastoma, a solid tumor caused by rapid division of undifferentiated neuroblasts, is the most common childhood malignancy affecting children aged genes is restored to normalcy. Gene therapy is a powerful tool with the potential to inhibit the deleterious effects of oncogenes by inserting corrected/normal genes into the genome. Both viral and non-viral vector-based gene therapies have been developed and adopted to deliver the target genes into neuroblastoma cells. These attempts have given hope to bringing in a new regime of treatment against neuroblastoma. A few gene-therapy-based treatment strategies have been tested in limited clinical trials yielding some positive results. This mini review is an attempt to provide an overview of the available options of gene therapy to treat neuroblastoma.

  5. CRISPR/Cas9-loxP-Mediated Gene Editing as a Novel Site-Specific Genetic Manipulation Tool.

    Science.gov (United States)

    Yang, Fayu; Liu, Changbao; Chen, Ding; Tu, Mengjun; Xie, Haihua; Sun, Huihui; Ge, Xianglian; Tang, Lianchao; Li, Jin; Zheng, Jiayong; Song, Zongming; Qu, Jia; Gu, Feng

    2017-06-16

    Cre-loxP, as one of the site-specific genetic manipulation tools, offers a method to study the spatial and temporal regulation of gene expression/inactivation in order to decipher gene function. CRISPR/Cas9-mediated targeted genome engineering technologies are sparking a new revolution in biological research. Whether the traditional site-specific genetic manipulation tool and CRISPR/Cas9 could be combined to create a novel genetic tool for highly specific gene editing is not clear. Here, we successfully generated a CRISPR/Cas9-loxP system to perform gene editing in human cells, providing the proof of principle that these two technologies can be used together for the first time. We also showed that distinct non-homologous end-joining (NHEJ) patterns from CRISPR/Cas9-mediated gene editing of the targeting sequence locates at the level of plasmids (episomal) and chromosomes. Specially, the CRISPR/Cas9-mediated NHEJ pattern in the nuclear genome favors deletions (64%-68% at the human AAVS1 locus versus 4%-28% plasmid DNA). CRISPR/Cas9-loxP, a novel site-specific genetic manipulation tool, offers a platform for the dissection of gene function and molecular insights into DNA-repair pathways. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  6. DaGO-Fun: tool for Gene Ontology-based functional analysis using term information content measures.

    Science.gov (United States)

    Mazandu, Gaston K; Mulder, Nicola J

    2013-09-25

    The use of Gene Ontology (GO) data in protein analyses have largely contributed to the improved outcomes of these analyses. Several GO semantic similarity measures have been proposed in recent years and provide tools that allow the integration of biological knowledge embedded in the GO structure into different biological analyses. There is a need for a unified tool that provides the scientific community with the opportunity to explore these different GO similarity measure approaches and their biological applications. We have developed DaGO-Fun, an online tool available at http://web.cbio.uct.ac.za/ITGOM, which incorporates many different GO similarity measures for exploring, analyzing and comparing GO terms and proteins within the context of GO. It uses GO data and UniProt proteins with their GO annotations as provided by the Gene Ontology Annotation (GOA) project to precompute GO term information content (IC), enabling rapid response to user queries. The DaGO-Fun online tool presents the advantage of integrating all the relevant IC-based GO similarity measures, including topology- and annotation-based approaches to facilitate effective exploration of these measures, thus enabling users to choose the most relevant approach for their application. Furthermore, this tool includes several biological applications related to GO semantic similarity scores, including the retrieval of genes based on their GO annotations, the clustering of functionally related genes within a set, and term enrichment analysis.

  7. New tools for Mendelian disease gene identification: PhenoDB variant analysis module; and GeneMatcher, a web-based tool for linking investigators with an interest in the same gene.

    Science.gov (United States)

    Sobreira, Nara; Schiettecatte, François; Boehm, Corinne; Valle, David; Hamosh, Ada

    2015-04-01

    Identifying the causative variant from among the thousands identified by whole-exome sequencing or whole-genome sequencing is a formidable challenge. To make this process as efficient and flexible as possible, we have developed a Variant Analysis Module coupled to our previously described Web-based phenotype intake tool, PhenoDB (http://researchphenodb.net and http://phenodb.org). When a small number of candidate-causative variants have been identified in a study of a particular patient or family, a second, more difficult challenge becomes proof of causality for any given variant. One approach to this problem is to find other cases with a similar phenotype and mutations in the same candidate gene. Alternatively, it may be possible to develop biological evidence for causality, an approach that is assisted by making connections to basic scientists studying the gene of interest, often in the setting of a model organism. Both of these strategies benefit from an open access, online site where individual clinicians and investigators could post genes of interest. To this end, we developed GeneMatcher (http://genematcher.org), a freely accessible Website that enables connections between clinicians and researchers across the world who share an interest in the same gene(s). © 2015 WILEY PERIODICALS, INC.

  8. GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists

    Directory of Open Access Journals (Sweden)

    Steinfeld Israel

    2009-02-01

    Full Text Available Abstract Background Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results. Results GOrilla is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression. GOrilla employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the top of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, GOrilla computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms. Conclusion GOrilla is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. GOrilla's unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. GOrilla is publicly available at: http://cbl-gorilla.cs.technion.ac.il

  9. Zebrafish Expression Ontology of Gene Sets (ZEOGS): A Tool to Analyze Enrichment of Zebrafish Anatomical Terms in Large Gene Sets

    Science.gov (United States)

    Marsico, Annalisa

    2013-01-01

    Abstract The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene

  10. Zebrafish Expression Ontology of Gene Sets (ZEOGS): a tool to analyze enrichment of zebrafish anatomical terms in large gene sets.

    Science.gov (United States)

    Prykhozhij, Sergey V; Marsico, Annalisa; Meijsing, Sebastiaan H

    2013-09-01

    The zebrafish (Danio rerio) is an established model organism for developmental and biomedical research. It is frequently used for high-throughput functional genomics experiments, such as genome-wide gene expression measurements, to systematically analyze molecular mechanisms. However, the use of whole embryos or larvae in such experiments leads to a loss of the spatial information. To address this problem, we have developed a tool called Zebrafish Expression Ontology of Gene Sets (ZEOGS) to assess the enrichment of anatomical terms in large gene sets. ZEOGS uses gene expression pattern data from several sources: first, in situ hybridization experiments from the Zebrafish Model Organism Database (ZFIN); second, it uses the Zebrafish Anatomical Ontology, a controlled vocabulary that describes connected anatomical structures; and third, the available connections between expression patterns and anatomical terms contained in ZFIN. Upon input of a gene set, ZEOGS determines which anatomical structures are overrepresented in the input gene set. ZEOGS allows one for the first time to look at groups of genes and to describe them in terms of shared anatomical structures. To establish ZEOGS, we first tested it on random gene selections and on two public microarray datasets with known tissue-specific gene expression changes. These tests showed that ZEOGS could reliably identify the tissues affected, whereas only very few enriched terms to none were found in the random gene sets. Next we applied ZEOGS to microarray datasets of 24 and 72 h postfertilization zebrafish embryos treated with beclomethasone, a potent glucocorticoid. This analysis resulted in the identification of several anatomical terms related to glucocorticoid-responsive tissues, some of which were stage-specific. Our studies highlight the ability of ZEOGS to extract spatial information from datasets derived from whole embryos, indicating that ZEOGS could be a useful tool to automatically analyze gene expression

  11. Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool.

    Science.gov (United States)

    Auerbach, Raymond K; Chen, Bin; Butte, Atul J

    2013-08-01

    Biological analysis has shifted from identifying genes and transcripts to mapping these genes and transcripts to biological functions. The ENCODE Project has generated hundreds of ChIP-Seq experiments spanning multiple transcription factors and cell lines for public use, but tools for a biomedical scientist to analyze these data are either non-existent or tailored to narrow biological questions. We present the ENCODE ChIP-Seq Significance Tool, a flexible web application leveraging public ENCODE data to identify enriched transcription factors in a gene or transcript list for comparative analyses. The ENCODE ChIP-Seq Significance Tool is written in JavaScript on the client side and has been tested on Google Chrome, Apple Safari and Mozilla Firefox browsers. Server-side scripts are written in PHP and leverage R and a MySQL database. The tool is available at http://encodeqt.stanford.edu. abutte@stanford.edu Supplementary material is available at Bioinformatics online.

  12. A dual selection based, targeted gene replacement tool for Magnaporthe grisea and Fusarium oxysporum.

    Science.gov (United States)

    Khang, Chang Hyun; Park, Sook-Young; Lee, Yong-Hwan; Kang, Seogchan

    2005-06-01

    Rapid progress in fungal genome sequencing presents many new opportunities for functional genomic analysis of fungal biology through the systematic mutagenesis of the genes identified through sequencing. However, the lack of efficient tools for targeted gene replacement is a limiting factor for fungal functional genomics, as it often necessitates the screening of a large number of transformants to identify the desired mutant. We developed an efficient method of gene replacement and evaluated factors affecting the efficiency of this method using two plant pathogenic fungi, Magnaporthe grisea and Fusarium oxysporum. This method is based on Agrobacterium tumefaciens-mediated transformation with a mutant allele of the target gene flanked by the herpes simplex virus thymidine kinase (HSVtk) gene as a conditional negative selection marker against ectopic transformants. The HSVtk gene product converts 5-fluoro-2'-deoxyuridine to a compound toxic to diverse fungi. Because ectopic transformants express HSVtk, while gene replacement mutants lack HSVtk, growing transformants on a medium amended with 5-fluoro-2'-deoxyuridine facilitates the identification of targeted mutants by counter-selecting against ectopic transformants. In addition to M. grisea and F. oxysporum, the method and associated vectors are likely to be applicable to manipulating genes in a broad spectrum of fungi, thus potentially serving as an efficient, universal functional genomic tool for harnessing the growing body of fungal genome sequence data to study fungal biology.

  13. fcGENE: a versatile tool for processing and transforming SNP datasets.

    Directory of Open Access Journals (Sweden)

    Nab Raj Roshyara

    Full Text Available Modern analysis of high-dimensional SNP data requires a number of biometrical and statistical methods such as pre-processing, analysis of population structure, association analysis and genotype imputation. Software used for these purposes often rely on specific and incompatible input and output data formats. Therefore extensive data management including multiple format conversions is necessary during analyses.In order to support fast and efficient management and bio-statistical quality control of high-dimensional SNP data, we developed the publically available software fcGENE using C++ object-oriented programming language. This software simplifies and automates the use of different existing analysis packages, especially during the workflow of genotype imputations and corresponding analyses.fcGENE transforms SNP data and imputation results into different formats required for a large variety of analysis packages such as PLINK, SNPTEST, HAPLOVIEW, EIGENSOFT, GenABEL and tools used for genotype imputation such as MaCH, IMPUTE, BEAGLE and others. Data Management tasks like merging, splitting, extracting SNP and pedigree information can be performed. fcGENE also supports a number of bio-statistical quality control processes and quality based filtering processes at SNP- and sample-wise level. The tool also generates templates of commands required to run specific software packages, especially those required for genotype imputation. We demonstrate the functionality of fcGENE by example workflows of SNP data analyses and provide a comprehensive manual of commands, options and applications.We have developed a user-friendly open-source software fcGENE, which comprehensively supports SNP data management, quality control and analysis workflows. Download statistics and corresponding feedbacks indicate that software is highly recognised and extensively applied by the scientific community.

  14. Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

    Science.gov (United States)

    Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

    2018-01-01

    We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation.  Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases.  We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes.  Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code is available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.

  15. GeneWiz browser: An Interactive Tool for Visualizing Sequenced Chromosomes

    DEFF Research Database (Denmark)

    Hallin, Peter Fischer; Stærfeldt, Hans Henrik; Rotenberg, Eva

    2009-01-01

    , standard atlases are pre-generated for all prokaryotic genomes available in GenBank, providing a fast overview of all available genomes, including recently deposited genome sequences. The tool is available online from http://www.cbs.dtu.dk/services/gwBrowser. [Supplemental material including interactive...... atlases is available online at http://www.cbs.dtu.dk/services/gwBrowser/suppl/]....... readability and increased functionality compared to other browsers. The tool allows the user to select the display of various genomic features, color setting and data ranges. Custom numerical data can be added to the plot, allowing for example visualization of gene expression and regulation data. Further...

  16. The GATO gene annotation tool for research laboratories

    Directory of Open Access Journals (Sweden)

    A. Fujita

    2005-11-01

    Full Text Available Large-scale genome projects have generated a rapidly increasing number of DNA sequences. Therefore, development of computational methods to rapidly analyze these sequences is essential for progress in genomic research. Here we present an automatic annotation system for preliminary analysis of DNA sequences. The gene annotation tool (GATO is a Bioinformatics pipeline designed to facilitate routine functional annotation and easy access to annotated genes. It was designed in view of the frequent need of genomic researchers to access data pertaining to a common set of genes. In the GATO system, annotation is generated by querying some of the Web-accessible resources and the information is stored in a local database, which keeps a record of all previous annotation results. GATO may be accessed from everywhere through the internet or may be run locally if a large number of sequences are going to be annotated. It is implemented in PHP and Perl and may be run on any suitable Web server. Usually, installation and application of annotation systems require experience and are time consuming, but GATO is simple and practical, allowing anyone with basic skills in informatics to access it without any special training. GATO can be downloaded at [http://mariwork.iq.usp.br/gato/]. Minimum computer free space required is 2 MB.

  17. Allen Brain Atlas-Driven Visualizations: a web-based gene expression energy visualization tool.

    Science.gov (United States)

    Zaldivar, Andrew; Krichmar, Jeffrey L

    2014-01-01

    The Allen Brain Atlas-Driven Visualizations (ABADV) is a publicly accessible web-based tool created to retrieve and visualize expression energy data from the Allen Brain Atlas (ABA) across multiple genes and brain structures. Though the ABA offers their own search engine and software for researchers to view their growing collection of online public data sets, including extensive gene expression and neuroanatomical data from human and mouse brain, many of their tools limit the amount of genes and brain structures researchers can view at once. To complement their work, ABADV generates multiple pie charts, bar charts and heat maps of expression energy values for any given set of genes and brain structures. Such a suite of free and easy-to-understand visualizations allows for easy comparison of gene expression across multiple brain areas. In addition, each visualization links back to the ABA so researchers may view a summary of the experimental detail. ABADV is currently supported on modern web browsers and is compatible with expression energy data from the Allen Mouse Brain Atlas in situ hybridization data. By creating this web application, researchers can immediately obtain and survey numerous amounts of expression energy data from the ABA, which they can then use to supplement their work or perform meta-analysis. In the future, we hope to enable ABADV across multiple data resources.

  18. Allen Brain Atlas-Driven Visualizations: A Web-Based Gene Expression Energy Visualization Tool

    Directory of Open Access Journals (Sweden)

    Andrew eZaldivar

    2014-05-01

    Full Text Available The Allen Brain Atlas-Driven Visualizations (ABADV is a publicly accessible web-based tool created to retrieve and visualize expression energy data from the Allen Brain Atlas (ABA across multiple genes and brain structures. Though the ABA offers their own search engine and software for researchers to view their growing collection of online public data sets, including extensive gene expression and neuroanatomical data from human and mouse brain, many of their tools limit the amount of genes and brain structures researchers can view at once. To complement their work, ABADV generates multiple pie charts, bar charts and heat maps of expression energy values for any given set of genes and brain structures. Such a suite of free and easy-to-understand visualizations allows for easy comparison of gene expression across multiple brain areas. In addition, each visualization links back to the ABA so researchers may view a summary of the experimental detail. ABADV is currently supported on modern web browsers and is compatible with expression energy data from the Allen Mouse Brain Atlas in situ hybridization data. By creating this web application, researchers can immediately obtain and survey numerous amounts of expression energy data from the ABA, which they can then use to supplement their work or perform meta-analysis. In the future, we hope to enable ABADV across multiple data resources.

  19. GENECODIS-Grid: An online grid-based tool to predict functional information in gene lists

    International Nuclear Information System (INIS)

    Nogales, R.; Mejia, E.; Vicente, C.; Montes, E.; Delgado, A.; Perez Griffo, F. J.; Tirado, F.; Pascual-Montano, A.

    2007-01-01

    In this work we introduce GeneCodis-Grid, a grid-based alternative to a bioinformatics tool named Genecodis that integrates different sources of biological information to search for biological features (annotations) that frequently co-occur in a set of genes and rank them by statistical significance. GeneCodis-Grid is a web-based application that takes advantage of two independent grid networks and a computer cluster managed by a meta-scheduler and a web server that host the application. The mining of concurrent biological annotations provides significant information for the functional analysis of gene list obtained by high throughput experiments in biology. Due to the large popularity of this tool, that has registered more than 13000 visits since its publication in January 2007, there is a strong need to facilitate users from different sites to access the system simultaneously. In addition, the complexity of some of the statistical tests used in this approach has made this technique a good candidate for its implementation in a Grid opportunistic environment. (Author)

  20. AnGeLi: A Tool for the Analysis of Gene Lists from Fission Yeast

    Directory of Open Access Journals (Sweden)

    Danny A Bitton

    2015-11-01

    Full Text Available Genome-wide assays and screens typically result in large lists of genes or proteins. Enrichments of functional or other biological properties within such lists can provide valuable insights and testable hypotheses. To systematically detect these enrichments can be challenging and time-consuming, because relevant data to compare against query gene lists are spread over many different sources. We have developed AnGeLi (Analysis of Gene Lists, an intuitive, integrated web-tool for comprehensive and customized interrogation of gene lists from the fission yeast, Schizosaccharomyces pombe. AnGeLi searches for significant enrichments among multiple qualitative and quantitative information sources, including gene and phenotype ontologies, genetic and protein interactions, numerous features of genes, transcripts, translation, and proteins such as copy numbers, chromosomal positions, genetic diversity, RNA polymerase II and ribosome occupancy, localization, conservation, half-lives, domains and molecular weight among others, as well as diverse sets of genes that are co-regulated or lead to the same phenotypes when mutated. AnGeLi uses robust statistics which can be tailored to specific needs. It also provides the option to upload user-defined gene sets to compare against the query list. Through an integrated data submission form, AnGeLi encourages the community to contribute additional curated gene lists to further increase the usefulness of this resource and to get the most from the ever increasing large-scale experiments. AnGeLi offers a rigorous yet flexible statistical analysis platform for rich insights into functional enrichments and biological context for query gene lists, thus providing a powerful exploratory tool through which S. pombe researchers can uncover fresh perspectives and unexpected connections from genomic data. AnGeLi is freely available at: www.bahlerlab.info/AnGeLi

  1. An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets.

    Science.gov (United States)

    Hosseini, Parsa; Tremblay, Arianne; Matthews, Benjamin F; Alkharouf, Nadim W

    2010-07-02

    The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data

  2. Calibration of Multiple In Silico Tools for Predicting Pathogenicity of Mismatch Repair Gene Missense Substitutions

    Science.gov (United States)

    Thompson, Bryony A.; Greenblatt, Marc S.; Vallee, Maxime P.; Herkert, Johanna C.; Tessereau, Chloe; Young, Erin L.; Adzhubey, Ivan A.; Li, Biao; Bell, Russell; Feng, Bingjian; Mooney, Sean D.; Radivojac, Predrag; Sunyaev, Shamil R.; Frebourg, Thierry; Hofstra, Robert M.W.; Sijmons, Rolf H.; Boucher, Ken; Thomas, Alun; Goldgar, David E.; Spurdle, Amanda B.; Tavtigian, Sean V.

    2015-01-01

    Classification of rare missense substitutions observed during genetic testing for patient management is a considerable problem in clinical genetics. The Bayesian integrated evaluation of unclassified variants is a solution originally developed for BRCA1/2. Here, we take a step toward an analogous system for the mismatch repair (MMR) genes (MLH1, MSH2, MSH6, and PMS2) that confer colon cancer susceptibility in Lynch syndrome by calibrating in silico tools to estimate prior probabilities of pathogenicity for MMR gene missense substitutions. A qualitative five-class classification system was developed and applied to 143 MMR missense variants. This identified 74 missense substitutions suitable for calibration. These substitutions were scored using six different in silico tools (Align-Grantham Variation Grantham Deviation, multivariate analysis of protein polymorphisms [MAPP], Mut-Pred, PolyPhen-2.1, Sorting Intolerant From Tolerant, and Xvar), using curated MMR multiple sequence alignments where possible. The output from each tool was calibrated by regression against the classifications of the 74 missense substitutions; these calibrated outputs are interpretable as prior probabilities of pathogenicity. MAPP was the most accurate tool and MAPP + PolyPhen-2.1 provided the best-combined model (R2 = 0.62 and area under receiver operating characteristic = 0.93). The MAPP + PolyPhen-2.1 output is sufficiently predictive to feed as a continuous variable into the quantitative Bayesian integrated evaluation for clinical classification of MMR gene missense substitutions. PMID:22949387

  3. Gene Therapy in Fanconi Anemia: A Matter of Time, Safety and Gene Transfer Tool Efficiency.

    Science.gov (United States)

    Verhoeyen, Els; Roman-Rodriguez, Francisco Jose; Cosset, Francois-Loic; Levy, Camille; Rio, Paula

    2017-01-01

    Fanconi anemia (FA) is a rare genetic syndrome characterized by progressive marrow failure. Gene therapy by infusion of FA-corrected autologous hematopoietic stem cells (HSCs) may offer a potential cure since it is a monogenetic disease with mutations in the FANC genes, coding for DNA repair enzymes [1]. However, the collection of hCD34+-cells in FA patients implies particular challenges because of the reduced numbers of progenitor cells present in their bone marrow (BM) [2] or mobilized peripheral blood [3-5]. In addition, the FA genetic defect fragilizes the HSCs [6]. These particular features might explain why the first clinical trials using murine leukemia virus derived retroviral vectors conducted for FA failed to show engraftment of corrected cells. The gene therapy field is now moving towards the use of lentiviral vectors (LVs) evidenced by recent succesful clinical trials for the treatment of patients suffering from adrenoleukodystrophy (ALD) [7], β-thalassemia [8], metachromatic leukodystrophy [9] and Wiskott-Aldrich syndrome [10]. LV trials for X-linked severe combined immunodificiency and Fanconi anemia (FA) defects were recently initiated [11, 12]. Fifteen years of preclinical studies using different FA mouse models and in vitro research allowed us to find the weak points in the in vitro culture and transduction conditions, which most probably led to the initial failure of FA HSC gene therapy. In this review, we will focus on the different obstacles, unique to FA gene therapy, and how they have been overcome through the development of optimized protocols for FA HSC culture and transduction and the engineering of new gene transfer tools for FA HSCs. These combined advances in the field hopefully will allow the correction of the FA hematological defect in the near future. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  4. Helping Students Understand Gene Regulation with Online Tools: A Review of MEME and Melina II, Motif Discovery Tools for Active Learning in Biology

    Directory of Open Access Journals (Sweden)

    David Treves

    2012-08-01

    Full Text Available Review of: MEME and Melina II, which are two free and easy-to-use online motif discovery tools that can be employed to actively engage students in learning about gene regulatory elements.

  5. Evaluation of tools for highly variable gene discovery from single-cell RNA-seq data.

    Science.gov (United States)

    Yip, Shun H; Sham, Pak Chung; Wang, Junwen

    2018-02-21

    Traditional RNA sequencing (RNA-seq) allows the detection of gene expression variations between two or more cell populations through differentially expressed gene (DEG) analysis. However, genes that contribute to cell-to-cell differences are not discoverable with RNA-seq because RNA-seq samples are obtained from a mixture of cells. Single-cell RNA-seq (scRNA-seq) allows the detection of gene expression in each cell. With scRNA-seq, highly variable gene (HVG) discovery allows the detection of genes that contribute strongly to cell-to-cell variation within a homogeneous cell population, such as a population of embryonic stem cells. This analysis is implemented in many software packages. In this study, we compare seven HVG methods from six software packages, including BASiCS, Brennecke, scLVM, scran, scVEGs and Seurat. Our results demonstrate that reproducibility in HVG analysis requires a larger sample size than DEG analysis. Discrepancies between methods and potential issues in these tools are discussed and recommendations are made.

  6. Flexible tools for gene expression and silencing in tomato.

    Science.gov (United States)

    Fernandez, Ana I; Viron, Nicolas; Alhagdow, Moftah; Karimi, Mansour; Jones, Matthew; Amsellem, Ziva; Sicard, Adrien; Czerednik, Anna; Angenent, Gerco; Grierson, Donald; May, Sean; Seymour, Graham; Eshed, Yuval; Lemaire-Chamley, Martine; Rothan, Christophe; Hilson, Pierre

    2009-12-01

    As a genetic platform, tomato (Solanum lycopersicum) benefits from rich germplasm collections and ease of cultivation and transformation that enable the analysis of biological processes impossible to investigate in other model species. To facilitate the assembly of an open genetic toolbox designed to study Solanaceae, we initiated a joint collection of publicly available gene manipulation tools. We focused on the characterization of promoters expressed at defined time windows during fruit development, for the regulated expression or silencing of genes of interest. Five promoter sequences were captured as entry clones compatible with the versatile MultiSite Gateway format: PPC2, PG, TPRP, and IMA from tomato and CRC from Arabidopsis (Arabidopsis thaliana). Corresponding transcriptional fusions were made with the GUS gene, a nuclear-localized GUS-GFP reporter, and the chimeric LhG4 transcription factor. The activity of the promoters during fruit development and in fruit tissues was confirmed in transgenic tomato lines. Novel Gateway destination vectors were generated for the transcription of artificial microRNA (amiRNA) precursors and hairpin RNAs under the control of these promoters, with schemes only involving Gateway BP and LR Clonase reactions. Efficient silencing of the endogenous phytoene desaturase gene was demonstrated in transgenic tomato lines producing a matching amiRNA under the cauliflower mosaic virus 35S or PPC2 promoter. Lastly, taking advantage of the pOP/LhG4 two-component system, we found that well-characterized flower-specific Arabidopsis promoters drive the expression of reporters in patterns generally compatible with heterologous expression. Tomato lines and plasmids will be distributed through a new Nottingham Arabidopsis Stock Centre service unit dedicated to Solanaceae resources.

  7. Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data

    Directory of Open Access Journals (Sweden)

    Merchant Sabeeha S

    2011-07-01

    Full Text Available Abstract Background Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. Description The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of

  8. DR-Integrator: a new analytic tool for integrating DNA copy number and gene expression data.

    Science.gov (United States)

    Salari, Keyan; Tibshirani, Robert; Pollack, Jonathan R

    2010-02-01

    DNA copy number alterations (CNA) frequently underlie gene expression changes by increasing or decreasing gene dosage. However, only a subset of genes with altered dosage exhibit concordant changes in gene expression. This subset is likely to be enriched for oncogenes and tumor suppressor genes, and can be identified by integrating these two layers of genome-scale data. We introduce DNA/RNA-Integrator (DR-Integrator), a statistical software tool to perform integrative analyses on paired DNA copy number and gene expression data. DR-Integrator identifies genes with significant correlations between DNA copy number and gene expression, and implements a supervised analysis that captures genes with significant alterations in both DNA copy number and gene expression between two sample classes. DR-Integrator is freely available for non-commercial use from the Pollack Lab at http://pollacklab.stanford.edu/ and can be downloaded as a plug-in application to Microsoft Excel and as a package for the R statistical computing environment. The R package is available under the name 'DRI' at http://cran.r-project.org/. An example analysis using DR-Integrator is included as supplemental material. Supplementary data are available at Bioinformatics online.

  9. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets.

    Science.gov (United States)

    Khan, Aziz; Mathelier, Anthony

    2017-05-31

    A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. To address this gap, we have developed the Intervene tool, which provides an easy and automated interface for the effective intersection and visualization of genomic region or list sets, thus facilitating their analysis and interpretation. Intervene contains three modules: venn to generate Venn diagrams of up to six sets, upset to generate UpSet plots of multiple sets, and pairwise to compute and visualize intersections of multiple sets as clustered heat maps. Intervene, and its interactive web ShinyApp companion, generate publication-quality figures for the interpretation of genomic region and list sets. Intervene and its web application companion provide an easy command line and an interactive web interface to compute intersections of multiple genomic and list sets. They have the capacity to plot intersections using easy-to-interpret visual approaches. Intervene is developed and designed to meet the needs of both computer scientists and biologists. The source code is freely available at https://bitbucket.org/CBGR/intervene , with the web application available at https://asntech.shinyapps.io/intervene .

  10. AthaMap web tools for the analysis of transcriptional and posttranscriptional regulation of gene expression in Arabidopsis thaliana.

    Science.gov (United States)

    Hehl, Reinhard; Bülow, Lorenz

    2014-01-01

    The AthaMap database provides a map of verified and predicted transcription factor (TF) and small RNA-binding sites for the A. thaliana genome. The database can be used for bioinformatic predictions of putative regulatory sites. Several online web tools are available that address specific questions. Starting with the identification of transcription factor-binding sites (TFBS) in any gene of interest, colocalizing TFBS can be identified as well as common TFBS in a set of user-provided genes. Furthermore, genes can be identified that are potentially targeted by specific transcription factors or small inhibitory RNAs. This chapter provides detailed information on how each AthaMap web tool can be used online. Examples on how this database is used to address questions in circadian and diurnal regulation are given. Furthermore, complementary databases and databases that go beyond questions addressed with AthaMap are discussed.

  11. Online Analytical Processing (OLAP: A Fast and Effective Data Mining Tool for Gene Expression Databases

    Directory of Open Access Journals (Sweden)

    Alkharouf Nadim W.

    2005-01-01

    Full Text Available Gene expression databases contain a wealth of information, but current data mining tools are limited in their speed and effectiveness in extracting meaningful biological knowledge from them. Online analytical processing (OLAP can be used as a supplement to cluster analysis for fast and effective data mining of gene expression databases. We used Analysis Services 2000, a product that ships with SQLServer2000, to construct an OLAP cube that was used to mine a time series experiment designed to identify genes associated with resistance of soybean to the soybean cyst nematode, a devastating pest of soybean. The data for these experiments is stored in the soybean genomics and microarray database (SGMD. A number of candidate resistance genes and pathways were found. Compared to traditional cluster analysis of gene expression data, OLAP was more effective and faster in finding biologically meaningful information. OLAP is available from a number of vendors and can work with any relational database management system through OLE DB.

  12. Online analytical processing (OLAP): a fast and effective data mining tool for gene expression databases.

    Science.gov (United States)

    Alkharouf, Nadim W; Jamison, D Curtis; Matthews, Benjamin F

    2005-06-30

    Gene expression databases contain a wealth of information, but current data mining tools are limited in their speed and effectiveness in extracting meaningful biological knowledge from them. Online analytical processing (OLAP) can be used as a supplement to cluster analysis for fast and effective data mining of gene expression databases. We used Analysis Services 2000, a product that ships with SQLServer2000, to construct an OLAP cube that was used to mine a time series experiment designed to identify genes associated with resistance of soybean to the soybean cyst nematode, a devastating pest of soybean. The data for these experiments is stored in the soybean genomics and microarray database (SGMD). A number of candidate resistance genes and pathways were found. Compared to traditional cluster analysis of gene expression data, OLAP was more effective and faster in finding biologically meaningful information. OLAP is available from a number of vendors and can work with any relational database management system through OLE DB.

  13. GOssTo: a stand-alone application and a web tool for calculating semantic similarities on the Gene Ontology

    OpenAIRE

    Caniza, Horacio; Romero, Alfonso E.; Heron, Samuel; Yang, Haixuan; Devoto, Alessandra; Frasca, Marco; Mesiti, Marco; Valentini, Giorgio; Paccanaro, Alberto

    2014-01-01

    Summary: We present GOssTo, the Gene Ontology semantic similarity Tool, a user-friendly software system for calculating semantic similarities between gene products according to the Gene Ontology. GOssTo is bundled with six semantic similarity measures, including both term- and graph-based measures, and has extension capabilities to allow the user to add new similarities. Importantly, for any measure, GOssTo can also calculate the Random Walk Contribution that has been shown to greatly improve...

  14. The Multi-Purpose Tool of Tumor Immunotherapy: Gene-Engineered T Cells.

    Science.gov (United States)

    Mo, Zeming; Du, Peixin; Wang, Guoping; Wang, Yongsheng

    2017-01-01

    A detailed summary of the published clinical trials of chimeric antigen receptor T cells (CAR-T) and TCR-transduced T cells (TCR-T) was constructed to understand the development trend of adoptive T cell therapy (ACT). In contrast to TCR-T, the number of CAR-T clinical trials has increased dramatically in China in the last three years. The ACT seems to be very prosperous. But, the multidimensional interaction of tumor, tumor associated antigen (TAA) and normal tissue exacerbates the uncontrolled outcome of T cells gene therapy. It reminds us the importance that optimizing treatment security to prevent the fatal serious adverse events. How to balance the safety and effectiveness of the ACT? At least six measures can potentially optimize the safety of ACT. At the same time, with the application of gene editing techniques, more endogenous receptors are disrupted while more exogenous receptors are expressed on T cells. As a multi-purpose tool of tumor immunotherapy, gene-engineered T cells (GE-T) have been given different functional weapons. A network which is likely to link radiation therapy, tumor vaccines, CAR-T and TCR-T is being built. Moreover, more and more evidences indicated that the combination of the ACT and other therapies would further enhance the anti-tumor capacity of the GE-T.

  15. Avirulence (AVR) Gene-Based Diagnosis Complements Existing Pathogen Surveillance Tools for Effective Deployment of Resistance (R) Genes Against Rice Blast Disease.

    Science.gov (United States)

    Selisana, S M; Yanoria, M J; Quime, B; Chaipanya, C; Lu, G; Opulencia, R; Wang, G-L; Mitchell, T; Correll, J; Talbot, N J; Leung, H; Zhou, B

    2017-06-01

    Avirulence (AVR) genes in Magnaporthe oryzae, the fungal pathogen that causes the devastating rice blast disease, have been documented to be major targets subject to mutations to avoid recognition by resistance (R) genes. In this study, an AVR-gene-based diagnosis tool for determining the virulence spectrum of a rice blast pathogen population was developed and validated. A set of 77 single-spore field isolates was subjected to pathotype analysis using differential lines, each containing a single R gene, and classified into 20 virulent pathotypes, except for 4 isolates that lost pathogenicity. In all, 10 differential lines showed low frequency (95%), inferring the effectiveness of R genes present in the respective differential lines. In addition, the haplotypes of seven AVR genes were determined by polymerase chain reaction amplification and sequencing, if applicable. The calculated frequency of different AVR genes displayed significant variations in the population. AVRPiz-t and AVR-Pii were detected in 100 and 84.9% of the isolates, respectively. Five AVR genes such as AVR-Pik-D (20.5%) and AVR-Pik-E (1.4%), AVRPiz-t (2.7%), AVR-Pita (0%), AVR-Pia (0%), and AVR1-CO39 (0%) displayed low or even zero frequency. The frequency of AVR genes correlated almost perfectly with the resistance frequency of the cognate R genes in differential lines, except for International Rice Research Institute-bred blast-resistant lines IRBLzt-T, IRBLta-K1, and IRBLkp-K60. Both genetic analysis and molecular marker validation revealed an additional R gene, most likely Pi19 or its allele, in these three differential lines. This can explain the spuriously higher resistance frequency of each target R gene based on conventional pathotyping. This study demonstrates that AVR-gene-based diagnosis provides a precise, R-gene-specific, and differential line-free assessment method that can be used for determining the virulence spectrum of a rice blast pathogen population and for predicting the

  16. GOPET: A tool for automated predictions of Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Glatting Karl-Heinz

    2006-03-01

    Full Text Available Abstract Background Vast progress in sequencing projects has called for annotation on a large scale. A Number of methods have been developed to address this challenging task. These methods, however, either apply to specific subsets, or their predictions are not formalised, or they do not provide precise confidence values for their predictions. Description We recently established a learning system for automated annotation, trained with a broad variety of different organisms to predict the standardised annotation terms from Gene Ontology (GO. Now, this method has been made available to the public via our web-service GOPET (Gene Ontology term Prediction and Evaluation Tool. It supplies annotation for sequences of any organism. For each predicted term an appropriate confidence value is provided. The basic method had been developed for predicting molecular function GO-terms. It is now expanded to predict biological process terms. This web service is available via http://genius.embnet.dkfz-heidelberg.de/menu/biounit/open-husar Conclusion Our web service gives experimental researchers as well as the bioinformatics community a valuable sequence annotation device. Additionally, GOPET also provides less significant annotation data which may serve as an extended discovery platform for the user.

  17. Tools to covisualize and coanalyze proteomic data with genomes and transcriptomes: validation of genes and alternative mRNA splicing.

    Science.gov (United States)

    Pang, Chi Nam Ignatius; Tay, Aidan P; Aya, Carlos; Twine, Natalie A; Harkness, Linda; Hart-Smith, Gene; Chia, Samantha Z; Chen, Zhiliang; Deshpande, Nandan P; Kaakoush, Nadeem O; Mitchell, Hazel M; Kassem, Moustapha; Wilkins, Marc R

    2014-01-03

    Direct links between proteomic and genomic/transcriptomic data are not frequently made, partly because of lack of appropriate bioinformatics tools. To help address this, we have developed the PG Nexus pipeline. The PG Nexus allows users to covisualize peptides in the context of genomes or genomic contigs, along with RNA-seq reads. This is done in the Integrated Genome Viewer (IGV). A Results Analyzer reports the precise base position where LC-MS/MS-derived peptides cover genes or gene isoforms, on the chromosomes or contigs where this occurs. In prokaryotes, the PG Nexus pipeline facilitates the validation of genes, where annotation or gene prediction is available, or the discovery of genes using a "virtual protein"-based unbiased approach. We illustrate this with a comprehensive proteogenomics analysis of two strains of Campylobacter concisus . For higher eukaryotes, the PG Nexus facilitates gene validation and supports the identification of mRNA splice junction boundaries and splice variants that are protein-coding. This is illustrated with an analysis of splice junctions covered by human phosphopeptides, and other examples of relevance to the Chromosome-Centric Human Proteome Project. The PG Nexus is open-source and available from https://github.com/IntersectAustralia/ap11_Samifier. It has been integrated into Galaxy and made available in the Galaxy tool shed.

  18. FunGene: the functional gene pipeline and repository.

    Science.gov (United States)

    Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

    2013-01-01

    Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.

  19. FunGene: the Functional Gene Pipeline and Repository

    Directory of Open Access Journals (Sweden)

    Jordan A. Fish

    2013-10-01

    Full Text Available Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer.While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/ offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.

  20. Assessment of the predictive accuracy of five in silico prediction tools, alone or in combination, and two metaservers to classify long QT syndrome gene mutations.

    Science.gov (United States)

    Leong, Ivone U S; Stuckey, Alexander; Lai, Daniel; Skinner, Jonathan R; Love, Donald R

    2015-05-13

    Long QT syndrome (LQTS) is an autosomal dominant condition predisposing to sudden death from malignant arrhythmia. Genetic testing identifies many missense single nucleotide variants of uncertain pathogenicity. Establishing genetic pathogenicity is an essential prerequisite to family cascade screening. Many laboratories use in silico prediction tools, either alone or in combination, or metaservers, in order to predict pathogenicity; however, their accuracy in the context of LQTS is unknown. We evaluated the accuracy of five in silico programs and two metaservers in the analysis of LQTS 1-3 gene variants. The in silico tools SIFT, PolyPhen-2, PROVEAN, SNPs&GO and SNAP, either alone or in all possible combinations, and the metaservers Meta-SNP and PredictSNP, were tested on 312 KCNQ1, KCNH2 and SCN5A gene variants that have previously been characterised by either in vitro or co-segregation studies as either "pathogenic" (283) or "benign" (29). The accuracy, sensitivity, specificity and Matthews Correlation Coefficient (MCC) were calculated to determine the best combination of in silico tools for each LQTS gene, and when all genes are combined. The best combination of in silico tools for KCNQ1 is PROVEAN, SNPs&GO and SIFT (accuracy 92.7%, sensitivity 93.1%, specificity 100% and MCC 0.70). The best combination of in silico tools for KCNH2 is SIFT and PROVEAN or PROVEAN, SNPs&GO and SIFT. Both combinations have the same scores for accuracy (91.1%), sensitivity (91.5%), specificity (87.5%) and MCC (0.62). In the case of SCN5A, SNAP and PROVEAN provided the best combination (accuracy 81.4%, sensitivity 86.9%, specificity 50.0%, and MCC 0.32). When all three LQT genes are combined, SIFT, PROVEAN and SNAP is the combination with the best performance (accuracy 82.7%, sensitivity 83.0%, specificity 80.0%, and MCC 0.44). Both metaservers performed better than the single in silico tools; however, they did not perform better than the best performing combination of in silico

  1. Analysis of cassava (Manihot esculenta) ESTs: A tool for the discovery of genes

    International Nuclear Information System (INIS)

    Zapata, Andres; Neme, Rafik; Sanabria, Carolina; Lopez, Camilo

    2011-01-01

    Cassava (Manihot esculenta) is the main source of calories for more than 1,000 millions of people around the world and has been consolidated as the fourth most important crop after rice, corn and wheat. Cassava is considered tolerant to abiotic and biotic stress conditions; nevertheless these characteristics are mainly present in non-commercial varieties. Genetic breeding strategies represent an alternative to introduce the desirable characteristics into commercial varieties. A fundamental step for accelerating the genetic breeding process in cassava requires the identification of genes associated to these characteristics. One rapid strategy for the identification of genes is the possibility to have a large collection of ESTs (expressed sequence tag). In this study, a complete analysis of cassava ESTs was done. The cassava ESTs represent 80,459 sequences which were assembled in a set of 29,231 unique genes (unigen), comprising 10,945 contigs and 18,286 singletones. These 29,231 unique genes represent about 80% of the genes of the cassava's genome. Between 5% and 10% of the unigenes of cassava not show similarity to any sequences present in the NCBI database and could be consider as cassava specific genes. a functional category was assigned to a group of sequences of the unigen set (29%) following the Gene Ontology Vocabulary. the molecular function component was the best represented with 43% of the sequences, followed by the biological process component (38%) and finally the cellular component with 19%. in the cassava ESTs collection, 3,709 microsatellites were identified and they could be used as molecular markers. this study represents an important contribution to the knowledge of the functional genomic structure of cassava and constitutes an important tool for the identification of genes associated to agricultural characteristics of interest that could be employed in cassava breeding programs.

  2. SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.

    Science.gov (United States)

    Merelli, Ivan; Calabria, Andrea; Cozzi, Paolo; Viti, Federica; Mosca, Ettore; Milanesi, Luciano

    2013-01-01

    The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects. We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http://www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores. Different

  3. An online survival analysis tool to rapidly assess the effect of 22,277 genes on breast cancer prognosis using microarray data of 1,809 patients

    DEFF Research Database (Denmark)

    Györffy, B; Lanczky, A; Eklund, Aron Charles

    2010-01-01

    Validating prognostic or predictive candidate genes in appropriately powered breast cancer cohorts are of utmost interest. Our aim was to develop an online tool to draw survival plots, which can be used to assess the relevance of the expression levels of various genes on the clinical outcome both...... this integrative data analysis tool to confirm the prognostic power of the proliferation-related genes TOP2A and TOP2B, MKI67, CCND2, CCND3, CCNDE2, as well as CDKN1A, and TK2. We also validated the capability of microarrays to determine estrogen receptor status in 1,231 patients. The tool is highly valuable...

  4. MutaNET: a tool for automated analysis of genomic mutations in gene regulatory networks.

    Science.gov (United States)

    Hollander, Markus; Hamed, Mohamed; Helms, Volkhard; Neininger, Kerstin

    2018-03-01

    Mutations in genomic key elements can influence gene expression and function in various ways, and hence greatly contribute to the phenotype. We developed MutaNET to score the impact of individual mutations on gene regulation and function of a given genome. MutaNET performs statistical analyses of mutations in different genomic regions. The tool also incorporates the mutations in a provided gene regulatory network to estimate their global impact. The integration of a next-generation sequencing pipeline enables calling mutations prior to the analyses. As application example, we used MutaNET to analyze the impact of mutations in antibiotic resistance (AR) genes and their potential effect on AR of bacterial strains. MutaNET is freely available at https://sourceforge.net/projects/mutanet/. It is implemented in Python and supported on Mac OS X, Linux and MS Windows. Step-by-step instructions are available at http://service.bioinformatik.uni-saarland.de/mutanet/. volkhard.helms@bioinformatik.uni-saarland.de. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  5. WebScipio: An online tool for the determination of gene structures using protein sequences

    Directory of Open Access Journals (Sweden)

    Waack Stephan

    2008-09-01

    Full Text Available Abstract Background Obtaining the gene structure for a given protein encoding gene is an important step in many analyses. A software suited for this task should be readily accessible, accurate, easy to handle and should provide the user with a coherent representation of the most probable gene structure. It should be rigorous enough to optimise features on the level of single bases and at the same time flexible enough to allow for cross-species searches. Results WebScipio, a web interface to the Scipio software, allows a user to obtain the corresponding coding sequence structure of a here given a query protein sequence that belongs to an already assembled eukaryotic genome. The resulting gene structure is presented in various human readable formats like a schematic representation, and a detailed alignment of the query and the target sequence highlighting any discrepancies. WebScipio can also be used to identify and characterise the gene structures of homologs in related organisms. In addition, it offers a web service for integration with other programs. Conclusion WebScipio is a tool that allows users to get a high-quality gene structure prediction from a protein query. It offers more than 250 eukaryotic genomes that can be searched and produces predictions that are close to what can be achieved by manual annotation, for in-species and cross-species searches alike. WebScipio is freely accessible at http://www.webscipio.org.

  6. VennPainter: A Tool for the Comparison and Identification of Candidate Genes Based on Venn Diagrams.

    Directory of Open Access Journals (Sweden)

    Guoliang Lin

    Full Text Available VennPainter is a program for depicting unique and shared sets of genes lists and generating Venn diagrams, by using the Qt C++ framework. The software produces Classic Venn, Edwards' Venn and Nested Venn diagrams and allows for eight sets in a graph mode and 31 sets in data processing mode only. In comparison, previous programs produce Classic Venn and Edwards' Venn diagrams and allow for a maximum of six sets. The software incorporates user-friendly features and works in Windows, Linux and Mac OS. Its graphical interface does not require a user to have programing skills. Users can modify diagram content for up to eight datasets because of the Scalable Vector Graphics output. VennPainter can provide output results in vertical, horizontal and matrix formats, which facilitates sharing datasets as required for further identification of candidate genes. Users can obtain gene lists from shared sets by clicking the numbers on the diagram. Thus, VennPainter is an easy-to-use, highly efficient, cross-platform and powerful program that provides a more comprehensive tool for identifying candidate genes and visualizing the relationships among genes or gene families in comparative analysis.

  7. CRISPR/Cas9 as tool for functional study of genes involved in preimplantation embryo development.

    Directory of Open Access Journals (Sweden)

    Jeongwoo Kwon

    Full Text Available The CRISPR/Cas9 system has proven to be an efficient gene-editing tool for genome modification of cells and organisms. However, the applicability and efficiency of this system in pig embryos have not been studied in depth. Here, we aimed to remove porcine OCT4 function as a model case using the CRISPR/Cas9 system. Injection of Cas9 and single-guide RNA (sgRNA against OCT4 decreased the percentages of OCT4-positive embryos to 37-50% of total embryos, while ~100% of control embryos exhibited clear OCT4 immunostaining. We assessed the mutation status near the guide sequence using polymerase chain reaction (PCR and DNA sequencing, and a portion of blastocysts (20% in exon 2 and 50% in exon 5 had insertions/deletions near protospacer-adjacent motifs (PAMs. Different target sites had frequent deletions, but different concentrations of sgRNA made no impact. OCT4 mRNA levels dramatically decreased at the 8-cell stage, and they were barely detectable in blastocysts, while mRNA levels of other genes, including NANOG, and CDX2 were not affected. In addition, the combination of two sgRNAs led to large-scale deletion (about 1.8 kb in the same chromosome. Next, we injected an enhanced green fluorescent protein (eGFP vector targeting the OCT4 exon with Cas9 and sgRNA to create a knockin. We confirmed eGFP fluorescence in blastocysts in the inner cell mass, and also checked the mutation status using PCR and DNA sequencing. A significant portion of blastocysts had eGFP sequence insertions near PAM sites. The CRISPR/CAS9 system provides a good tool for gene functional studies by deleting target genes in the pig.

  8. A Novel Prokaryotic Green Fluorescent Protein Expression System for Testing Gene Editing Tools Activity Like Zinc Finger Nuclease.

    Science.gov (United States)

    Sabzehei, Faezeh; Kouhpayeh, Shirin; Dastjerdeh, Mansoureh Shahbazi; Khanahmad, Hossein; Salehi, Rasoul; Naderi, Shamsi; Taghizadeh, Razieh; Rabiei, Parisa; Hejazi, Zahra; Shariati, Laleh

    2017-01-01

    Gene editing technology has created a revolution in the field of genome editing. The three of the most famous tools in gene editing technology are zinc finger nucleases (ZFNs), transcription activator-like effector nucleases, clustered regularly interspaced short palindromic repeats (CRISPR), and CRISPR-associated systems. As their predictable nature, it is necessary to assess their efficiency. There are some methods for this purpose, but most of them are time labor and complicated. Here, we introduce a new prokaryotic reporter system, which makes it possible to evaluate the efficiency of gene editing tools faster, cheaper, and simpler than previous methods. At first, the target sites of a custom ZFN, which is designed against a segment of ampicillin resistance gene, were cloned on both sides of green fluorescent protein (GFP) gene to construct pPRO-GFP. Then pPRO-GFP was transformed into Escherichia coli TOP10F' that contains pZFN (contains expression cassette of a ZFN against ampicillin resistant gene), or p15A-KanaR as a negative control. The transformed bacteria were cultured on three separate media that contained ampicillin, kanamycin, and ampicillin + kanamycin; then the resulted colonies were assessed by flow cytometry. The results of flow cytometry showed a significant difference between the case (bacteria contain pZFN) and control (bacteria contain p15A, KanaR) in MFI (Mean Fluorescence Intensity) ( P < 0.0001). According to ZFN efficiency, it can bind and cut the target sites, the bilateral cutting can affect the intensity of GFP fluorescence. Our flow cytometry results showed that this ZFN could reduce the intensity of GFP color and colony count of bacteria in media containing amp + kana versus control sample.

  9. Application of DETECTER, an evolutionary genomic tool to analyze genetic variation, to the cystic fibrosis gene family

    Directory of Open Access Journals (Sweden)

    De Kee Danny W

    2006-03-01

    Full Text Available Abstract Background The medical community requires computational tools that distinguish missense genetic differences having phenotypic impact within the vast number of sense mutations that do not. Tools that do this will become increasingly important for those seeking to use human genome sequence data to predict disease, make prognoses, and customize therapy to individual patients. Results An approach, termed DETECTER, is proposed to identify sites in a protein sequence where amino acid replacements are likely to have a significant effect on phenotype, including causing genetic disease. This approach uses a model-dependent tool to estimate the normalized replacement rate at individual sites in a protein sequence, based on a history of those sites extracted from an evolutionary analysis of the corresponding protein family. This tool identifies sites that have higher-than-average, average, or lower-than-average rates of change in the lineage leading to the sequence in the population of interest. The rates are then combined with sequence data to determine the likelihoods that particular amino acids were present at individual sites in the evolutionary history of the gene family. These likelihoods are used to predict whether any specific amino acid replacements, if introduced at the site in a modern human population, would have a significant impact on fitness. The DETECTER tool is used to analyze the cystic fibrosis transmembrane conductance regulator (CFTR gene family. Conclusion In this system, DETECTER retrodicts amino acid replacements associated with the cystic fibrosis disease with greater accuracy than alternative approaches. While this result validates this approach for this particular family of proteins only, the approach may be applicable to the analysis of polymorphisms generally, including SNPs in a human population.

  10. Suppression subtractive hybridization as a tool to identify anthocyanin metabolism-related genes in apple skin.

    Science.gov (United States)

    Ban, Yusuke; Moriguchi, Takaya

    2010-01-01

    The pigmentation of anthocyanins is one of the important determinants for consumer preference and marketability in horticultural crops such as fruits and flowers. To elucidate the mechanisms underlying the physiological process leading to the pigmentation of anthocyanins, identification of the genes differentially expressed in response to anthocyanin accumulation is a useful strategy. Currently, microarrays have been widely used to isolate differentially expressed genes. However, the use of microarrays is limited by its high cost of special apparatus and materials. Therefore, availability of microarrays is limited and does not come into common use at present. Suppression subtractive hybridization (SSH) is an alternative tool that has been widely used to identify differentially expressed genes due to its easy handling and relatively low cost. This chapter describes the procedures for SSH, including RNA extraction from polysaccharides and polyphenol-rich samples, poly(A)+ RNA purification, evaluation of subtraction efficiency, and differential screening using reverse northern in apple skin.

  11. MediPlEx - a tool to combine in silico & experimental gene expression profiles of the model legume Medicago truncatula

    Directory of Open Access Journals (Sweden)

    Stutz Leonhard J

    2010-10-01

    Full Text Available Abstract Background Expressed Sequence Tags (ESTs are in general used to gain a first insight into gene activities from a species of interest. Subsequently, and typically based on a combination of EST and genome sequences, microarray-based expression analyses are performed for a variety of conditions. In some cases, a multitude of EST and microarray experiments are conducted for one species, covering different tissues, cell states, and cell types. Under these circumstances, the challenge arises to combine results derived from the different expression profiling strategies, with the goal to uncover novel information on the basis of the integrated datasets. Findings Using our new analysis tool, MediPlEx (MEDIcago truncatula multiPLe EXpression analysis, expression data from EST experiments, oligonucleotide microarrays and Affymetrix GeneChips® can be combined and analyzed, leading to a novel approach to integrated transcriptome analysis. We have validated our tool via the identification of a set of well-characterized AM-specific and AM-induced marker genes, identified by MediPlEx on the basis of in silico and experimental gene expression profiles from roots colonized with AM fungi. Conclusions MediPlEx offers an integrated analysis pipeline for different sets of expression data generated for the model legume Medicago truncatula. As expected, in silico and experimental gene expression data that cover the same biological condition correlate well. The collection of differentially expressed genes identified via MediPlEx provides a starting point for functional studies in plant mutants. MediPlEx can freely be used at http://www.cebitec.uni-bielefeld.de/mediplex.

  12. Paralogous Genes as a Tool to Study the Regulation of Gene Expression

    DEFF Research Database (Denmark)

    Hoffmann, Robert D

    The genomes of plants are marked by reoccurring events of whole-genome duplication. These events are major contributors to speciation and provide the genetic material for organisms to evolve ever greater complexity. Duplicated genes, referred to as paralogs, may be retained because they acquired...... regions. These results suggest that a concurrent purifying selection acts on coding and non-coding sequences of paralogous genes in A. thaliana. Mutational analyses of the promoters from a paralogous gene pair were performed in transgenic A. thaliana plants. The results revealed a 170-bp long DNA sequence...... that forms a bifunctional cis-regulatory module; it represses gene expression in the sporophyte while activating it in pollen. This finding is important for many aspects of gene regulation and the transcriptional changes underlying gametophyte development. In conclusion, the presented thesis suggests that...

  13. Integration of molecular biology tools for identifying promoters and genes abundantly expressed in flowers of Oncidium Gower Ramsey

    Directory of Open Access Journals (Sweden)

    Tung Shu-Yun

    2011-04-01

    Full Text Available Abstract Background Orchids comprise one of the largest families of flowering plants and generate commercially important flowers. However, model plants, such as Arabidopsis thaliana do not contain all plant genes, and agronomic and horticulturally important genera and species must be individually studied. Results Several molecular biology tools were used to isolate flower-specific gene promoters from Oncidium 'Gower Ramsey' (Onc. GR. A cDNA library of reproductive tissues was used to construct a microarray in order to compare gene expression in flowers and leaves. Five genes were highly expressed in flower tissues, and the subcellular locations of the corresponding proteins were identified using lip transient transformation with fluorescent protein-fusion constructs. BAC clones of the 5 genes, together with 7 previously published flower- and reproductive growth-specific genes in Onc. GR, were identified for cloning of their promoter regions. Interestingly, 3 of the 5 novel flower-abundant genes were putative trypsin inhibitor (TI genes (OnTI1, OnTI2 and OnTI3, which were tandemly duplicated in the same BAC clone. Their promoters were identified using transient GUS reporter gene transformation and stable A. thaliana transformation analyses. Conclusions By combining cDNA microarray, BAC library, and bombardment assay techniques, we successfully identified flower-directed orchid genes and promoters.

  14. PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for Arabidopsis

    Directory of Open Access Journals (Sweden)

    Ma Ligeng

    2003-11-01

    Full Text Available Abstract Background To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. Results We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i upload and populate microarray data into a database; (ii integrate gene expression with enzymes of the pathways; (iii generate pathway diagrams without building image files manually; (iv visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. Conclusion PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i automatic generation of pathways associated with gene expression and (ii statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s.

  15. CGUG: in silico proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to ca. 1.9 Mb

    Directory of Open Access Journals (Sweden)

    Mahadevan Padmanabhan

    2009-08-01

    Full Text Available Abstract Background Viruses and small-genome bacteria (~2 megabases and smaller comprise a considerable population in the biosphere and are of interest to many researchers. These genomes are now sequenced at an unprecedented rate and require complementary computational tools to analyze. "CoreGenesUniqueGenes" (CGUG is an in silico genome data mining tool that determines a "core" set of genes from two to five organisms with genomes in this size range. Core and unique genes may reflect similar niches and needs, and may be used in classifying organisms. Findings CGUG is available at http://binf.gmu.edu/geneorder.html as a web-based on-the-fly tool that performs iterative BLASTP analyses using a reference genome and up to four query genomes to provide a table of genes common to these genomes. The result is an in silico display of genomes and their proteomes, allowing for further analysis. CGUG can be used for "genome annotation by homology", as demonstrated with Chlamydophila and Francisella genomes. Conclusion CGUG is used to reanalyze the ICTV-based classifications of bacteriophages, to reconfirm long-standing relationships and to explore new classifications. These genomes have been problematic in the past, due largely to horizontal gene transfers. CGUG is validated as a tool for reannotating small genome bacteria using more up-to-date annotations by similarity or homology. These serve as an entry point for wet-bench experiments to confirm the functions of these "hypothetical" and "unknown" proteins.

  16. Dcode.org anthology of comparative genomic tools.

    Science.gov (United States)

    Loots, Gabriela G; Ovcharenko, Ivan

    2005-07-01

    Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the non-coding encryption of gene regulation across genomes. To facilitate the practical application of comparative sequence analysis to genetics and genomics, we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools, zPicture and Mulan; a phylogenetic shadowing tool, eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools, rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, Creme 2.0; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here, we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ website.

  17. Construction of new synthetic biology tools for the control of gene expression in the cyanobacterium Synechococcus sp. strain PCC 7002.

    Science.gov (United States)

    Zess, Erin K; Begemann, Matthew B; Pfleger, Brian F

    2016-02-01

    Predictive control of gene expression is an essential tool for developing synthetic biological systems. The current toolbox for controlling gene expression in cyanobacteria is a barrier to more in-depth genetic analysis and manipulation. Towards relieving this bottleneck, this work describes the use of synthetic biology to construct an anhydrotetracycline-based induction system and adapt a trans-acting small RNA (sRNA) system for use in the cyanobacterium Synechococcus sp. strain PCC 7002. An anhydrotetracycline-inducible promoter was developed to maximize intrinsic strength and dynamic range. The resulting construct, PEZtet , exhibited tight repression and a maximum 32-fold induction upon addition of anhydrotetracycline. Additionally, a sRNA system based on the Escherichia coli IS10 RNA-IN/OUT regulator was adapted for use in Synechococcus sp. strain PCC 7002. This system exhibited 70% attenuation of target gene expression, providing a demonstration of the use of sRNAs for differential gene expression in cyanobacteria. These systems were combined to produce an inducible sRNA system, which demonstrated 59% attenuation of target gene expression. Lastly, the role of Hfq, a critical component of sRNA systems in E. coli, was investigated. Genetic studies showed that the Hfq homolog in Synechococcus sp. strain PCC 7002 did not impact repression by the engineered sRNA system. In summary, this work describes new synthetic biology tools that can be applied to physiological studies, metabolic engineering, or sRNA platforms in Synechococcus sp. strain PCC 7002. © 2015 Wiley Periodicals, Inc.

  18. G-NEST: A gene neighborhood scoring tool to identify co-conserved, co-expressed genes

    Science.gov (United States)

    In previous studies, gene neighborhoods--spatial clusters of co-expressed genes in the genome--have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Sc...

  19. Genes2FANs: connecting genes through functional association networks

    Science.gov (United States)

    2012-01-01

    Background Protein-protein, cell signaling, metabolic, and transcriptional interaction networks are useful for identifying connections between lists of experimentally identified genes/proteins. However, besides physical or co-expression interactions there are many ways in which pairs of genes, or their protein products, can be associated. By systematically incorporating knowledge on shared properties of genes from diverse sources to build functional association networks (FANs), researchers may be able to identify additional functional interactions between groups of genes that are not readily apparent. Results Genes2FANs is a web based tool and a database that utilizes 14 carefully constructed FANs and a large-scale protein-protein interaction (PPI) network to build subnetworks that connect lists of human and mouse genes. The FANs are created from mammalian gene set libraries where mouse genes are converted to their human orthologs. The tool takes as input a list of human or mouse Entrez gene symbols to produce a subnetwork and a ranked list of intermediate genes that are used to connect the query input list. In addition, users can enter any PubMed search term and then the system automatically converts the returned results to gene lists using GeneRIF. This gene list is then used as input to generate a subnetwork from the user’s PubMed query. As a case study, we applied Genes2FANs to connect disease genes from 90 well-studied disorders. We find an inverse correlation between the counts of links connecting disease genes through PPI and links connecting diseases genes through FANs, separating diseases into two categories. Conclusions Genes2FANs is a useful tool for interpreting the relationships between gene/protein lists in the context of their various functions and networks. Combining functional association interactions with physical PPIs can be useful for revealing new biology and help form hypotheses for further experimentation. Our finding that disease genes in

  20. Meta-analysis diagnostic accuracy of SNP-based pathogenicity detection tools: a case of UTG1A1 gene mutations.

    Science.gov (United States)

    Galehdari, Hamid; Saki, Najmaldin; Mohammadi-Asl, Javad; Rahim, Fakher

    2013-01-01

    Crigler-Najjar syndrome (CNS) type I and type II are usually inherited as autosomal recessive conditions that result from mutations in the UGT1A1 gene. The main objective of the present review is to summarize results of all available evidence on the accuracy of SNP-based pathogenicity detection tools compared to published clinical result for the prediction of in nsSNPs that leads to disease using prediction performance method. A comprehensive search was performed to find all mutations related to CNS. Database searches included dbSNP, SNPdbe, HGMD, Swissvar, ensemble, and OMIM. All the mutation related to CNS was extracted. The pathogenicity prediction was done using SNP-based pathogenicity detection tools include SIFT, PHD-SNP, PolyPhen2, fathmm, Provean, and Mutpred. Overall, 59 different SNPs related to missense mutations in the UGT1A1 gene, were reviewed. Comparing the diagnostic OR, PolyPhen2 and Mutpred have the highest detection 4.983 (95% CI: 1.24 - 20.02) in both, following by SIFT (diagnostic OR: 3.25, 95% CI: 1.07 - 9.83). The highest MCC of SNP-based pathogenicity detection tools, was belong to SIFT (34.19%) followed by Provean, PolyPhen2, and Mutpred (29.99%, 29.89%, and 29.89%, respectively). Hence the highest SNP-based pathogenicity detection tools ACC, was fit to SIFT (62.71%) followed by PolyPhen2, and Mutpred (61.02%, in both). Our results suggest that some of the well-established SNP-based pathogenicity detection tools can appropriately reflect the role of a disease-associated SNP in both local and global structures.

  1. PhenoLink - a web-tool for linking phenotype to ~omics data for bacteria: application to gene-trait matching for Lactobacillus plantarum strains

    Directory of Open Access Journals (Sweden)

    Bayjanov Jumamurat R

    2012-05-01

    Full Text Available Abstract Background Linking phenotypes to high-throughput molecular biology information generated by ~omics technologies allows revealing cellular mechanisms underlying an organism's phenotype. ~Omics datasets are often very large and noisy with many features (e.g., genes, metabolite abundances. Thus, associating phenotypes to ~omics data requires an approach that is robust to noise and can handle large and diverse data sets. Results We developed a web-tool PhenoLink (http://bamics2.cmbi.ru.nl/websoftware/phenolink/ that links phenotype to ~omics data sets using well-established as well new techniques. PhenoLink imputes missing values and preprocesses input data (i to decrease inherent noise in the data and (ii to counterbalance pitfalls of the Random Forest algorithm, on which feature (e.g., gene selection is based. Preprocessed data is used in feature (e.g., gene selection to identify relations to phenotypes. We applied PhenoLink to identify gene-phenotype relations based on the presence/absence of 2847 genes in 42 Lactobacillus plantarum strains and phenotypic measurements of these strains in several experimental conditions, including growth on sugars and nitrogen-dioxide production. Genes were ranked based on their importance (predictive value to correctly predict the phenotype of a given strain. In addition to known gene to phenotype relations we also found novel relations. Conclusions PhenoLink is an easily accessible web-tool to facilitate identifying relations from large and often noisy phenotype and ~omics datasets. Visualization of links to phenotypes offered in PhenoLink allows prioritizing links, finding relations between features, finding relations between phenotypes, and identifying outliers in phenotype data. PhenoLink can be used to uncover phenotype links to a multitude of ~omics data, e.g., gene presence/absence (determined by e.g.: CGH or next-generation sequencing, gene expression (determined by e.g.: microarrays or RNA

  2. GeneBins: a database for classifying gene expression data, with application to plant genome arrays

    Directory of Open Access Journals (Sweden)

    Weiller Georg

    2007-03-01

    Full Text Available Abstract Background To interpret microarray experiments, several ontological analysis tools have been developed. However, current tools are limited to specific organisms. Results We developed a bioinformatics system to assign the probe set sequences of any organism to a hierarchical functional classification modelled on KEGG ontology. The GeneBins database currently supports the functional classification of expression data from four Affymetrix arrays; Arabidopsis thaliana, Oryza sativa, Glycine max and Medicago truncatula. An online analysis tool to identify relevant functions is also provided. Conclusion GeneBins provides resources to interpret gene expression results from microarray experiments. It is available at http://bioinfoserver.rsbs.anu.edu.au/utils/GeneBins/

  3. geneCBR: a translational tool for multiple-microarray analysis and integrative information retrieval for aiding diagnosis in cancer research

    Directory of Open Access Journals (Sweden)

    Fdez-Riverola Florentino

    2009-06-01

    Full Text Available Abstract Background Bioinformatics and medical informatics are two research fields that serve the needs of different but related communities. Both domains share the common goal of providing new algorithms, methods and technological solutions to biomedical research, and contributing to the treatment and cure of diseases. Although different microarray techniques have been successfully used to investigate useful information for cancer diagnosis at the gene expression level, the true integration of existing methods into day-to-day clinical practice is still a long way off. Within this context, case-based reasoning emerges as a suitable paradigm specially intended for the development of biomedical informatics applications and decision support systems, given the support and collaboration involved in such a translational development. With the goals of removing barriers against multi-disciplinary collaboration and facilitating the dissemination and transfer of knowledge to real practice, case-based reasoning systems have the potential to be applied to translational research mainly because their computational reasoning paradigm is similar to the way clinicians gather, analyze and process information in their own practice of clinical medicine. Results In addressing the issue of bridging the existing gap between biomedical researchers and clinicians who work in the domain of cancer diagnosis, prognosis and treatment, we have developed and made accessible a common interactive framework. Our geneCBR system implements a freely available software tool that allows the use of combined techniques that can be applied to gene selection, clustering, knowledge extraction and prediction for aiding diagnosis in cancer research. For biomedical researches, geneCBR expert mode offers a core workbench for designing and testing new techniques and experiments. For pathologists or oncologists, geneCBR diagnostic mode implements an effective and reliable system that can

  4. A large-scale benchmark of gene prioritization methods.

    Science.gov (United States)

    Guala, Dimitri; Sonnhammer, Erik L L

    2017-04-21

    In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods.

  5. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks.

    Science.gov (United States)

    Lepoivre, Cyrille; Bergon, Aurélie; Lopez, Fabrice; Perumal, Narayanan B; Nguyen, Catherine; Imbert, Jean; Puthier, Denis

    2012-01-31

    Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users drawing new hypotheses without being overwhelmed by the density of the subsequent graph. We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i) predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices), (ii) potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii) regulatory interactions curated from the literature, (iv) predicted post-transcriptional regulation by micro-RNA, (v) protein kinase-substrate interactions and (vi) physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed In-teractomeBrowser, a graph-based knowledge browser that comes as a plug-in for Transcriptome-Browser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration of heterogeneous biological information

  6. Gene therapy in dentistry: tool of genetic engineering. Revisited.

    Science.gov (United States)

    Gupta, Khushboo; Singh, Saurabh; Garg, Kavita Nitish

    2015-03-01

    Advances in biotechnology have brought gene therapy to the forefront of medical research. The concept of transferring genes to tissues for clinical applications has been discussed nearly half a century, but the ability to manipulate genetic material via recombinant DNA technology has brought this goal to reality. The feasibility of gene transfer was first demonstrated using tumour viruses. This led to development of viral and nonviral methods for the genetic modification of somatic cells. Applications of gene therapy to dental and oral problems illustrate the potential impact of this technology on dentistry. Preclinical trial results regarding the same have been very promising. In this review we will discuss methods, vectors involved, clinical implication in dentistry and scientific issues associated with gene therapy. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Not proper ROC curves as new tool for the analysis of differentially expressed genes in microarray experiments

    Directory of Open Access Journals (Sweden)

    Pistoia Vito

    2008-10-01

    Full Text Available Abstract Background Most microarray experiments are carried out with the purpose of identifying genes whose expression varies in relation with specific conditions or in response to environmental stimuli. In such studies, genes showing similar mean expression values between two or more groups are considered as not differentially expressed, even if hidden subclasses with different expression values may exist. In this paper we propose a new method for identifying differentially expressed genes, based on the area between the ROC curve and the rising diagonal (ABCR. ABCR represents a more general approach than the standard area under the ROC curve (AUC, because it can identify both proper (i.e., concave and not proper ROC curves (NPRC. In particular, NPRC may correspond to those genes that tend to escape standard selection methods. Results We assessed the performance of our method using data from a publicly available database of 4026 genes, including 14 normal B cell samples (NBC and 20 heterogeneous lymphomas (namely: 9 follicular lymphomas and 11 chronic lymphocytic leukemias. Moreover, NBC also included two sub-classes, i.e., 6 heavily stimulated and 8 slightly or not stimulated samples. We identified 1607 differentially expressed genes with an estimated False Discovery Rate of 15%. Among them, 16 corresponded to NPRC and all escaped standard selection procedures based on AUC and t statistics. Moreover, a simple inspection to the shape of such plots allowed to identify the two subclasses in either one class in 13 cases (81%. Conclusion NPRC represent a new useful tool for the analysis of microarray data.

  8. GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature

    Directory of Open Access Journals (Sweden)

    Ning Ye

    2015-01-01

    Full Text Available The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.

  9. SNPexp - A web tool for calculating and visualizing correlation between HapMap genotypes and gene expression levels

    Directory of Open Access Journals (Sweden)

    Franke Andre

    2010-12-01

    Full Text Available Abstract Background Expression levels for 47294 transcripts in lymphoblastoid cell lines from all 270 HapMap phase II individuals, and genotypes (both HapMap phase II and III of 3.96 million single nucleotide polymorphisms (SNPs in the same individuals are publicly available. We aimed to generate a user-friendly web based tool for visualization of the correlation between SNP genotypes within a specified genomic region and a gene of interest, which is also well-known as an expression quantitative trait locus (eQTL analysis. Results SNPexp is implemented as a server-side script, and publicly available on this website: http://tinyurl.com/snpexp. Correlation between genotype and transcript expression levels are calculated by performing linear regression and the Wald test as implemented in PLINK and visualized using the UCSC Genome Browser. Validation of SNPexp using previously published eQTLs yielded comparable results. Conclusions SNPexp provides a convenient and platform-independent way to calculate and visualize the correlation between HapMap genotypes within a specified genetic region anywhere in the genome and gene expression levels. This allows for investigation of both cis and trans effects. The web interface and utilization of publicly available and widely used software resources makes it an attractive supplement to more advanced bioinformatic tools. For the advanced user the program can be used on a local computer on custom datasets.

  10. Argot2: a large scale function prediction tool relying on semantic similarity of weighted Gene Ontology terms.

    Science.gov (United States)

    Falda, Marco; Toppo, Stefano; Pescarolo, Alessandro; Lavezzo, Enrico; Di Camillo, Barbara; Facchinetti, Andrea; Cilia, Elisa; Velasco, Riccardo; Fontana, Paolo

    2012-03-28

    Predicting protein function has become increasingly demanding in the era of next generation sequencing technology. The task to assign a curator-reviewed function to every single sequence is impracticable. Bioinformatics tools, easy to use and able to provide automatic and reliable annotations at a genomic scale, are necessary and urgent. In this scenario, the Gene Ontology has provided the means to standardize the annotation classification with a structured vocabulary which can be easily exploited by computational methods. Argot2 is a web-based function prediction tool able to annotate nucleic or protein sequences from small datasets up to entire genomes. It accepts as input a list of sequences in FASTA format, which are processed using BLAST and HMMER searches vs UniProKB and Pfam databases respectively; these sequences are then annotated with GO terms retrieved from the UniProtKB-GOA database and the terms are weighted using the e-values from BLAST and HMMER. The weighted GO terms are processed according to both their semantic similarity relations described by the Gene Ontology and their associated score. The algorithm is based on the original idea developed in a previous tool called Argot. The entire engine has been completely rewritten to improve both accuracy and computational efficiency, thus allowing for the annotation of complete genomes. The revised algorithm has been already employed and successfully tested during in-house genome projects of grape and apple, and has proven to have a high precision and recall in all our benchmark conditions. It has also been successfully compared with Blast2GO, one of the methods most commonly employed for sequence annotation. The server is freely accessible at http://www.medcomp.medicina.unipd.it/Argot2.

  11. TranscriptomeBrowser 3.0: introducing a new compendium of molecular interactions and a new visualization tool for the study of gene regulatory networks

    Directory of Open Access Journals (Sweden)

    Lepoivre Cyrille

    2012-01-01

    Full Text Available Abstract Background Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users drawing new hypotheses without being overwhelmed by the density of the subsequent graph. Results We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices, (ii potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii regulatory interactions curated from the literature, (iv predicted post-transcriptional regulation by micro-RNA, (v protein kinase-substrate interactions and (vi physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed In-teractomeBrowser, a graph-based knowledge browser that comes as a plug-in for Transcriptome-Browser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration

  12. pGenN, a Gene Normalization Tool for Plant Genes and Proteins in Scientific Literature

    Science.gov (United States)

    Ding, Ruoyao; Arighi, Cecilia N.; Lee, Jung-Youn; Wu, Cathy H.; Vijay-Shanker, K.

    2015-01-01

    Background Automatically detecting gene/protein names in the literature and connecting them to databases records, also known as gene normalization, provides a means to structure the information buried in free-text literature. Gene normalization is critical for improving the coverage of annotation in the databases, and is an essential component of many text mining systems and database curation pipelines. Methods In this manuscript, we describe a gene normalization system specifically tailored for plant species, called pGenN (pivot-based Gene Normalization). The system consists of three steps: dictionary-based gene mention detection, species assignment, and intra species normalization. We have developed new heuristics to improve each of these phases. Results We evaluated the performance of pGenN on an in-house expertly annotated corpus consisting of 104 plant relevant abstracts. Our system achieved an F-value of 88.9% (Precision 90.9% and Recall 87.2%) on this corpus, outperforming state-of-art systems presented in BioCreative III. We have processed over 440,000 plant-related Medline abstracts using pGenN. The gene normalization results are stored in a local database for direct query from the pGenN web interface (proteininformationresource.org/pgenn/). The annotated literature corpus is also publicly available through the PIR text mining portal (proteininformationresource.org/iprolink/). PMID:26258475

  13. GOssTo: a stand-alone application and a web tool for calculating semantic similarities on the Gene Ontology.

    Science.gov (United States)

    Caniza, Horacio; Romero, Alfonso E; Heron, Samuel; Yang, Haixuan; Devoto, Alessandra; Frasca, Marco; Mesiti, Marco; Valentini, Giorgio; Paccanaro, Alberto

    2014-08-01

    We present GOssTo, the Gene Ontology semantic similarity Tool, a user-friendly software system for calculating semantic similarities between gene products according to the Gene Ontology. GOssTo is bundled with six semantic similarity measures, including both term- and graph-based measures, and has extension capabilities to allow the user to add new similarities. Importantly, for any measure, GOssTo can also calculate the Random Walk Contribution that has been shown to greatly improve the accuracy of similarity measures. GOssTo is very fast, easy to use, and it allows the calculation of similarities on a genomic scale in a few minutes on a regular desktop machine. alberto@cs.rhul.ac.uk GOssTo is available both as a stand-alone application running on GNU/Linux, Windows and MacOS from www.paccanarolab.org/gossto and as a web application from www.paccanarolab.org/gosstoweb. The stand-alone application features a simple and concise command line interface for easy integration into high-throughput data processing pipelines. © The Author 2014. Published by Oxford University Press.

  14. Escape Excel: A tool for preventing gene symbol and accession conversion errors.

    Science.gov (United States)

    Welsh, Eric A; Stewart, Paul A; Kuenzi, Brent M; Eschrich, James A

    2017-01-01

    Microsoft Excel automatically converts certain gene symbols, database accessions, and other alphanumeric text into dates, scientific notation, and other numerical representations. These conversions lead to subsequent, irreversible, corruption of the imported text. A recent survey of popular genomic literature estimates that one-fifth of all papers with supplementary gene lists suffer from this issue. Here, we present an open-source tool, Escape Excel, which prevents these erroneous conversions by generating an escaped text file that can be safely imported into Excel. Escape Excel is implemented in a variety of formats (http://www.github.com/pstew/escape_excel), including a command line based Perl script, a Windows-only Excel Add-In, an OS X drag-and-drop application, a simple web-server, and as a Galaxy web environment interface. Test server implementations are accessible as a Galaxy interface (http://apostl.moffitt.org) and simple non-Galaxy web server (http://apostl.moffitt.org:8000/). Escape Excel detects and escapes a wide variety of problematic text strings so that they are not erroneously converted into other representations upon importation into Excel. Examples of problematic strings include date-like strings, time-like strings, leading zeroes in front of numbers, and long numeric and alphanumeric identifiers that should not be automatically converted into scientific notation. It is hoped that greater awareness of these potential data corruption issues, together with diligent escaping of text files prior to importation into Excel, will help to reduce the amount of Excel-corrupted data in scientific analyses and publications.

  15. Powerful tools for genetic modification: Advances in gene editing.

    Science.gov (United States)

    Roesch, Erica A; Drumm, Mitchell L

    2017-11-01

    Recent discoveries and technical advances in genetic engineering, methods called gene or genome editing, provide hope for repairing genes that cause diseases like cystic fibrosis (CF) or otherwise altering a gene for therapeutic benefit. There are both hopes and hurdles with these technologies, with new ideas emerging almost daily. Initial studies using intestinal organoid cultures carrying the common, F508del mutation have shown that gene editing by CRISPR/Cas9 can convert cells lacking CFTR function to cells with normal channel function, providing a precedent that this technology can be harnessed for CF. While this is an important precedent, the challenges that remain are not trivial. A logistical issue for this and many other genetic diseases is genetic heterogeneity. Approximately, 2000 mutations associated with CF have been found in CFTR, the gene responsible for CF, and thus a feasible strategy that would encompass all individuals affected by the disease is particularly difficult to envision. However, single strategies that would be applicable to all subjects affected by CF have been conceived and are being investigated. With all of these approaches, efficiency (the proportion of cells edited), accuracy (how often other sites in the genome are affected), and delivery of the gene editing components to the desired cells are perhaps the most significant, impending hurdles. Our understanding of each of these areas is increasing rapidly, and while it is impossible to predict when a successful strategy will reach the clinic, there is every reason to believe it is a question of "when" and not "if." © 2017 Wiley Periodicals, Inc.

  16. Disease gene characterization through large-scale co-expression analysis.

    Directory of Open Access Journals (Sweden)

    Allen Day

    2009-12-01

    Full Text Available In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET.Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2 and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders.UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.

  17. Getting Started with GeneRecon — An Introduction to the Association Mapping Tool GeneRecon

    DEFF Research Database (Denmark)

    Mailund, T; Schauser, Leif

    2006-01-01

    GeneRecon is a software package for linkage disequilibrium mapping using coalescent theory. It is based on Bayesian Markov-chain Monte Carlo (MCMC) method for fine-scale linkage-disequilibrium gene mapping using high-density marker maps. GeneRecon explicitly models the genealogy of a sample...... of the case chromosomes in the vicinity of a disease locus. Given case and control data in the form of genotype or haplotype information, it estimates a number of parameters, most importantly, the disease position....

  18. Antisense gene silencing

    DEFF Research Database (Denmark)

    Nielsen, Troels T; Nielsen, Jørgen E

    2013-01-01

    Since the first reports that double-stranded RNAs can efficiently silence gene expression in C. elegans, the technology of RNA interference (RNAi) has been intensively exploited as an experimental tool to study gene function. With the subsequent discovery that RNAi could also be applied...

  19. GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies.

    Science.gov (United States)

    Yung, Ling Sing; Yang, Can; Wan, Xiang; Yu, Weichuan

    2011-05-01

    Collecting millions of genetic variations is feasible with the advanced genotyping technology. With a huge amount of genetic variations data in hand, developing efficient algorithms to carry out the gene-gene interaction analysis in a timely manner has become one of the key problems in genome-wide association studies (GWAS). Boolean operation-based screening and testing (BOOST), a recent work in GWAS, completes gene-gene interaction analysis in 2.5 days on a desktop computer. Compared with central processing units (CPUs), graphic processing units (GPUs) are highly parallel hardware and provide massive computing resources. We are, therefore, motivated to use GPUs to further speed up the analysis of gene-gene interactions. We implement the BOOST method based on a GPU framework and name it GBOOST. GBOOST achieves a 40-fold speedup compared with BOOST. It completes the analysis of Wellcome Trust Case Control Consortium Type 2 Diabetes (WTCCC T2D) genome data within 1.34 h on a desktop computer equipped with Nvidia GeForce GTX 285 display card. GBOOST code is available at http://bioinformatics.ust.hk/BOOST.html#GBOOST.

  20. GOseek: a gene ontology search engine using enhanced keywords.

    Science.gov (United States)

    Taha, Kamal

    2013-01-01

    We propose in this paper a biological search engine called GOseek, which overcomes the limitation of current gene similarity tools. Given a set of genes, GOseek returns the most significant genes that are semantically related to the given genes. These returned genes are usually annotated to one of the Lowest Common Ancestors (LCA) of the Gene Ontology (GO) terms annotating the given genes. Most genes have several annotation GO terms. Therefore, there may be more than one LCA for the GO terms annotating the given genes. The LCA annotating the genes that are most semantically related to the given gene is the one that receives the most aggregate semantic contribution from the GO terms annotating the given genes. To identify this LCA, GOseek quantifies the contribution of the GO terms annotating the given genes to the semantics of their LCAs. That is, it encodes the semantic contribution into a numeric format. GOseek uses microarray experiment data to rank result genes based on their significance. We evaluated GOseek experimentally and compared it with a comparable gene prediction tool. Results showed marked improvement over the tool.

  1. Novel gene expression tools for rice biotechnology

    Science.gov (United States)

    Biotechnology is an effective and important method of improving both quality and agronomic traits in rice. We are developing novel molecular tools for genetic engineering, with a focus on developing novel transgene expression control elements (i.e. promoters) for rice. A suite of monocot grass promo...

  2. Database for High Throughput Screening Hits (dHITS): a simple tool to retrieve gene specific phenotypes from systematic screens done in yeast.

    Science.gov (United States)

    Chuartzman, Silvia G; Schuldiner, Maya

    2018-03-25

    In the last decade several collections of Saccharomyces cerevisiae yeast strains have been created. In these collections every gene is modified in a similar manner such as by a deletion or the addition of a protein tag. Such libraries have enabled a diversity of systematic screens, giving rise to large amounts of information regarding gene functions. However, often papers describing such screens focus on a single gene or a small set of genes and all other loci affecting the phenotype of choice ('hits') are only mentioned in tables that are provided as supplementary material and are often hard to retrieve or search. To help unify and make such data accessible, we have created a Database of High Throughput Screening Hits (dHITS). The dHITS database enables information to be obtained about screens in which genes of interest were found as well as the other genes that came up in that screen - all in a readily accessible and downloadable format. The ability to query large lists of genes at the same time provides a platform to easily analyse hits obtained from transcriptional analyses or other screens. We hope that this platform will serve as a tool to facilitate investigation of protein functions to the yeast community. © 2018 The Authors Yeast Published by John Wiley & Sons Ltd.

  3. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. MAGMA: generalized gene-set analysis of GWAS data.

    Science.gov (United States)

    de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

    2015-04-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.

  5. HTP-OligoDesigner: An Online Primer Design Tool for High-Throughput Gene Cloning and Site-Directed Mutagenesis.

    Science.gov (United States)

    Camilo, Cesar M; Lima, Gustavo M A; Maluf, Fernando V; Guido, Rafael V C; Polikarpov, Igor

    2016-01-01

    Following burgeoning genomic and transcriptomic sequencing data, biochemical and molecular biology groups worldwide are implementing high-throughput cloning and mutagenesis facilities in order to obtain a large number of soluble proteins for structural and functional characterization. Since manual primer design can be a time-consuming and error-generating step, particularly when working with hundreds of targets, the automation of primer design process becomes highly desirable. HTP-OligoDesigner was created to provide the scientific community with a simple and intuitive online primer design tool for both laboratory-scale and high-throughput projects of sequence-independent gene cloning and site-directed mutagenesis and a Tm calculator for quick queries.

  6. Recode-2: new design, new search tools, and many more genes.

    LENUS (Irish Health Repository)

    Bekaert, Michaël

    2010-01-01

    \\'Recoding\\' is a term used to describe non-standard read-out of the genetic code, and encompasses such phenomena as programmed ribosomal frameshifting, stop codon readthrough, selenocysteine insertion and translational bypassing. Although only a small proportion of genes utilize recoding in protein synthesis, accurate annotation of \\'recoded\\' genes lags far behind annotation of \\'standard\\' genes. In order to address this issue, provide a service to researchers in the field, and offer training data for developers of gene-annotation software, we have gathered together known cases of recoding within the Recode database. Recode-2 is an improved and updated version of the database. It provides access to detailed information on genes known to utilize translational recoding and allows complex search queries, browsing of recoding data and enhanced visualization of annotated sequence elements. At present, the Recode-2 database stores information on approximately 1500 genes that are known to utilize recoding in their expression--a factor of approximately three increase over the previous version of the database. Recode-2 is available at http:\\/\\/recode.ucc.ie.

  7. Constructive Technology Assessment (CTA) as a tool in coverage with evidence development: the case of the 70-gene prognosis signature for breast cancer diagnostics

    NARCIS (Netherlands)

    Retel, Valesca; Retèl, Valesca P.; Bueno-de-Mesquita, Jolien M.; Hummel, J. Marjan; van de Vijver, Marc J.; Douma, Kirsten F.L.; Karsenberg, Kim; van Dam, Frits S.A.M.; van Krimpen, Cees; Bellot, Frank E.; Roumen, Rudi M.H.; Linn, Sabine C.; van Harten, Willem H.

    2009-01-01

    Objectives: Constructive Technology Assessment (CTA) is a means to guide early implementation of new developments in society, and can be used as an evaluation tool for Coverage with Evidence Development (CED). We used CTA for the introduction of a new diagnostic test in the Netherlands, the 70-gene

  8. Constructive Technology Assessment (CTA) as a tool in Coverage with Evidence Development: The case of the 70-gene prognosis signature for breast cancer diagnostics

    NARCIS (Netherlands)

    Retèl, Valesca P.; Bueno-de-Mesquita, Jolien M.; Hummel, Marjan J. M.; van de Vijver, Marc J.; Douma, Kirsten F. L.; Karsenberg, Kim; van Dam, Frits S. A. M.; van Krimpen, Cees; Bellot, Frank E.; Roumen, Rudi M. H.; Linn, Sabine C.; van Harten, Wim H.

    2009-01-01

    Objectives: Constructive Technology Assessment (CTA) is a means to guide early implementation of new developments in society, and can be used as an evaluation tool for Coverage with Evidence Development (CED). We used CTA for the introduction of a new diagnostic test in the Netherlands, the 70-gene

  9. Transfer of a repair gene from E. coli as a tool in studies on the action of alkylating mutagens in tobacco

    Energy Technology Data Exchange (ETDEWEB)

    Veleminsky, J; Briza, J; Angelis, K; Satava, J [Institute of Experimental Botany, Czechoslovakian Academy of Sciences, Prague (Czech Republic); Margison, G [Institute of Experimental Botany, Czechoslovakian Academy of Sciences, Prague (Czech Republic); [Paterson Institute for Cancer Research, CRC, Manchester (United Kingdom)

    1990-01-01

    Full text: Alkylating agents (AA) belong to the most potent mutagens. Nevertheless, the role of individual DNA lesions in the toxic and mutagenic effects of AA in plants are poorly understood. A new tool to study this topic is the transfer of a gene with a specified repair function for a specific DNA lesion. Differences in the responses to AA can be assumed to be caused by changes in the amount of DNA lesion(s) repaired by the introduced gene. Methyl-nitroso urea (MNU) produces 06-methylG and other DNA lesions methylated at O-sites. Taurine-chloroethyl-nitrosourea (TCNH) causes DNA-DNA crosslinks, the formation of which starts with the chloroethylation of G at 06. Both 06-methylG, 04-methylT, O-methylphosphotriesters produced by MNH and 06-chloroethylG produced by TCNH are known to be repaired with AT coded by E. coli ada gene. Transfer of this gene and its expression in tobacco appeared to increase the resistance of the transformed cell to both AA tested. It seems, therefore, likely that the DNA lesions mentioned above are at least partly involved in the production of toxic effects of AA in tobacco. (author)

  10. AffyMiner: mining differentially expressed genes and biological knowledge in GeneChip microarray data

    Directory of Open Access Journals (Sweden)

    Xia Yuannan

    2006-12-01

    Full Text Available Abstract Background DNA microarrays are a powerful tool for monitoring the expression of tens of thousands of genes simultaneously. With the advance of microarray technology, the challenge issue becomes how to analyze a large amount of microarray data and make biological sense of them. Affymetrix GeneChips are widely used microarrays, where a variety of statistical algorithms have been explored and used for detecting significant genes in the experiment. These methods rely solely on the quantitative data, i.e., signal intensity; however, qualitative data are also important parameters in detecting differentially expressed genes. Results AffyMiner is a tool developed for detecting differentially expressed genes in Affymetrix GeneChip microarray data and for associating gene annotation and gene ontology information with the genes detected. AffyMiner consists of the functional modules, GeneFinder for detecting significant genes in a treatment versus control experiment and GOTree for mapping genes of interest onto the Gene Ontology (GO space; and interfaces to run Cluster, a program for clustering analysis, and GenMAPP, a program for pathway analysis. AffyMiner has been used for analyzing the GeneChip data and the results were presented in several publications. Conclusion AffyMiner fills an important gap in finding differentially expressed genes in Affymetrix GeneChip microarray data. AffyMiner effectively deals with multiple replicates in the experiment and takes into account both quantitative and qualitative data in identifying significant genes. AffyMiner reduces the time and effort needed to compare data from multiple arrays and to interpret the possible biological implications associated with significant changes in a gene's expression.

  11. Three gene expression vector sets for concurrently expressing multiple genes in Saccharomyces cerevisiae.

    Science.gov (United States)

    Ishii, Jun; Kondo, Takashi; Makino, Harumi; Ogura, Akira; Matsuda, Fumio; Kondo, Akihiko

    2014-05-01

    Yeast has the potential to be used in bulk-scale fermentative production of fuels and chemicals due to its tolerance for low pH and robustness for autolysis. However, expression of multiple external genes in one host yeast strain is considerably labor-intensive due to the lack of polycistronic transcription. To promote the metabolic engineering of yeast, we generated systematic and convenient genetic engineering tools to express multiple genes in Saccharomyces cerevisiae. We constructed a series of multi-copy and integration vector sets for concurrently expressing two or three genes in S. cerevisiae by embedding three classical promoters. The comparative expression capabilities of the constructed vectors were monitored with green fluorescent protein, and the concurrent expression of genes was monitored with three different fluorescent proteins. Our multiple gene expression tool will be helpful to the advanced construction of genetically engineered yeast strains in a variety of research fields other than metabolic engineering. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.

  12. EcoGene 3.0.

    Science.gov (United States)

    Zhou, Jindan; Rudd, Kenneth E

    2013-01-01

    EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection.

  13. EcoGene 3.0

    Science.gov (United States)

    Zhou, Jindan; Rudd, Kenneth E.

    2013-01-01

    EcoGene (http://ecogene.org) is a database and website devoted to continuously improving the structural and functional annotation of Escherichia coli K-12, one of the most well understood model organisms, represented by the MG1655(Seq) genome sequence and annotations. Major improvements to EcoGene in the past decade include (i) graphic presentations of genome map features; (ii) ability to design Boolean queries and Venn diagrams from EcoArray, EcoTopics or user-provided GeneSets; (iii) the genome-wide clone and deletion primer design tool, PrimerPairs; (iv) sequence searches using a customized EcoBLAST; (v) a Cross Reference table of synonymous gene and protein identifiers; (vi) proteome-wide indexing with GO terms; (vii) EcoTools access to >2000 complete bacterial genomes in EcoGene-RefSeq; (viii) establishment of a MySql relational database; and (ix) use of web content management systems. The biomedical literature is surveyed daily to provide citation and gene function updates. As of September 2012, the review of 37 397 abstracts and articles led to creation of 98 425 PubMed-Gene links and 5415 PubMed-Topic links. Annotation updates to Genbank U00096 are transmitted from EcoGene to NCBI. Experimental verifications include confirmation of a CTG start codon, pseudogene restoration and quality assurance of the Keio strain collection. PMID:23197660

  14. RGmatch: matching genomic regions to proximal genes in omics data integration

    Directory of Open Access Journals (Sweden)

    Pedro Furió-Tarí

    2016-11-01

    Full Text Available Abstract Background The integrative analysis of multiple genomics data often requires that genome coordinates-based signals have to be associated with proximal genes. The relative location of a genomic region with respect to the gene (gene area is important for functional data interpretation; hence algorithms that match regions to genes should be able to deliver insight into this information. Results In this work we review the tools that are publicly available for making region-to-gene associations. We also present a novel method, RGmatch, a flexible and easy-to-use Python tool that computes associations either at the gene, transcript, or exon level, applying a set of rules to annotate each region-gene association with the region location within the gene. RGmatch can be applied to any organism as long as genome annotation is available. Furthermore, we qualitatively and quantitatively compare RGmatch to other tools. Conclusions RGmatch simplifies the association of a genomic region with its closest gene. At the same time, it is a powerful tool because the rules used to annotate these associations are very easy to modify according to the researcher’s specific interests. Some important differences between RGmatch and other similar tools already in existence are RGmatch’s flexibility, its wide range of user options, compatibility with any annotatable organism, and its comprehensive and user-friendly output.

  15. LimTox: a web tool for applied text mining of adverse event and toxicity associations of compounds, drugs and genes.

    Science.gov (United States)

    Cañada, Andres; Capella-Gutierrez, Salvador; Rabal, Obdulia; Oyarzabal, Julen; Valencia, Alfonso; Krallinger, Martin

    2017-07-03

    A considerable effort has been devoted to retrieve systematically information for genes and proteins as well as relationships between them. Despite the importance of chemical compounds and drugs as a central bio-entity in pharmacological and biological research, only a limited number of freely available chemical text-mining/search engine technologies are currently accessible. Here we present LimTox (Literature Mining for Toxicology), a web-based online biomedical search tool with special focus on adverse hepatobiliary reactions. It integrates a range of text mining, named entity recognition and information extraction components. LimTox relies on machine-learning, rule-based, pattern-based and term lookup strategies. This system processes scientific abstracts, a set of full text articles and medical agency assessment reports. Although the main focus of LimTox is on adverse liver events, it enables also basic searches for other organ level toxicity associations (nephrotoxicity, cardiotoxicity, thyrotoxicity and phospholipidosis). This tool supports specialized search queries for: chemical compounds/drugs, genes (with additional emphasis on key enzymes in drug metabolism, namely P450 cytochromes-CYPs) and biochemical liver markers. The LimTox website is free and open to all users and there is no login requirement. LimTox can be accessed at: http://limtox.bioinfo.cnio.es. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Using RNA-Seq Data to Evaluate Reference Genes Suitable for Gene Expression Studies in Soybean.

    Directory of Open Access Journals (Sweden)

    Aldrin Kay-Yuen Yim

    Full Text Available Differential gene expression profiles often provide important clues for gene functions. While reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR is an important tool, the validity of the results depends heavily on the choice of proper reference genes. In this study, we employed new and published RNA-sequencing (RNA-Seq datasets (26 sequencing libraries in total to evaluate reference genes reported in previous soybean studies. In silico PCR showed that 13 out of 37 previously reported primer sets have multiple targets, and 4 of them have amplicons with different sizes. Using a probabilistic approach, we identified new and improved candidate reference genes. We further performed 2 validation tests (with 26 RNA samples on 8 commonly used reference genes and 7 newly identified candidates, using RT-qPCR. In general, the new candidate reference genes exhibited more stable expression levels under the tested experimental conditions. The three newly identified candidate reference genes Bic-C2, F-box protein2, and VPS-like gave the best overall performance, together with the commonly used ELF1b. It is expected that the proposed probabilistic model could serve as an important tool to identify stable reference genes when more soybean RNA-Seq data from different growth stages and treatments are used.

  17. Common features of microRNA target prediction tools

    Directory of Open Access Journals (Sweden)

    Sarah M. Peterson

    2014-02-01

    Full Text Available The human genome encodes for over 1800 microRNAs, which are short noncoding RNA molecules that function to regulate gene expression post-transcriptionally. Due to the potential for one microRNA to target multiple gene transcripts, microRNAs are recognized as a major mechanism to regulate gene expression and mRNA translation. Computational prediction of microRNA targets is a critical initial step in identifying microRNA:mRNA target interactions for experimental validation. The available tools for microRNA target prediction encompass a range of different computational approaches, from the modeling of physical interactions to the incorporation of machine learning. This review provides an overview of the major computational approaches to microRNA target prediction. Our discussion highlights three tools for their ease of use, reliance on relatively updated versions of miRBase, and range of capabilities, and these are DIANA-microT-CDS, miRanda-mirSVR, and TargetScan. In comparison across all microRNA target prediction tools, four main aspects of the microRNA:mRNA target interaction emerge as common features on which most target prediction is based: seed match, conservation, free energy, and site accessibility. This review explains these features and identifies how they are incorporated into currently available target prediction tools. MicroRNA target prediction is a dynamic field with increasing attention on development of new analysis tools. This review attempts to provide a comprehensive assessment of these tools in a manner that is accessible across disciplines. Understanding the basis of these prediction methodologies will aid in user selection of the appropriate tools and interpretation of the tool output.

  18. Genes on B chromosomes: old questions revisited with new tools.

    Science.gov (United States)

    Banaei-Moghaddam, Ali M; Martis, Mihaela M; Macas, Jiří; Gundlach, Heidrun; Himmelbach, Axel; Altschmied, Lothar; Mayer, Klaus F X; Houben, Andreas

    2015-01-01

    B chromosomes are supernumerary dispensable parts of the karyotype which appear in some individuals of some populations in some species. Often, they have been considered as 'junk DNA' or genomic parasites without functional genes. Due to recent advances in sequencing technologies, it became possible to investigate their DNA composition, transcriptional activity and effects on the host transcriptome profile in detail. Here, we review the most recent findings regarding the gene content of B chromosomes and their transcriptional activities and discuss these findings in the context of comparable biological phenomena, like sex chromosomes, aneuploidy and pseudogenes. Recent data suggest that B chromosomes carry transcriptionally active genic sequences which could affect the transcriptome profile of their host genome. These findings are gradually changing our view that B chromosomes are solely genetically inert selfish elements without any functional genes. This at one side could partly explain the deleterious effects which are associated with their presence. On the other hand it makes B chromosome a nice model for studying regulatory mechanisms of duplicated genes and their evolutionary consequences. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Development of functional genomic tools in trematodes: RNA interference and luciferase reporter gene activity in Fasciola hepatica.

    Directory of Open Access Journals (Sweden)

    Gabriel Rinaldi

    2008-07-01

    Full Text Available The growing availability of sequence information from diverse parasites through genomic and transcriptomic projects offer new opportunities for the identification of key mediators in the parasite-host interaction. Functional genomics approaches and methods for the manipulation of genes are essential tools for deciphering the roles of genes and to identify new intervention targets in parasites. Exciting advances in functional genomics for parasitic helminths are starting to occur, with transgene expression and RNA interference (RNAi reported in several species of nematodes, but the area is still in its infancy in flatworms, with reports in just three species. While advancing in model organisms, there is a need to rapidly extend these technologies to other parasites responsible for several chronic diseases of humans and cattle. In order to extend these approaches to less well studied parasitic worms, we developed a test method for the presence of a viable RNAi pathway by silencing the exogenous reporter gene, firefly luciferase (fLUC. We established the method in the human blood fluke Schistosoma mansoni and then confirmed its utility in the liver fluke Fasciola hepatica. We transformed newly excysted juveniles of F. hepatica by electroporation with mRNA of fLUC and three hours later were able to detect luciferase enzyme activity, concentrated mainly in the digestive ceca. Subsequently, we tested the presence of an active RNAi pathway in F. hepatica by knocking down the exogenous luciferase activity by introduction into the transformed parasites of double-stranded RNA (dsRNA specific for fLUC. In addition, we tested the RNAi pathway targeting an endogenous F. hepatica gene encoding leucine aminopeptidase (FhLAP, and observed a significant reduction in specific mRNA levels. In summary, these studies demonstrated the utility of RNAi targeting reporter fLUC as a reporter gene assay to establish the presence of an intact RNAi pathway in helminth

  20. In-silico human genomics with GeneCards

    Directory of Open Access Journals (Sweden)

    Stelzer Gil

    2011-10-01

    Full Text Available Abstract Since 1998, the bioinformatics, systems biology, genomics and medical communities have enjoyed a synergistic relationship with the GeneCards database of human genes (http://www.genecards.org. This human gene compendium was created to help to introduce order into the increasing chaos of information flow. As a consequence of viewing details and deep links related to specific genes, users have often requested enhanced capabilities, such that, over time, GeneCards has blossomed into a suite of tools (including GeneDecks, GeneALaCart, GeneLoc, GeneNote and GeneAnnot for a variety of analyses of both single human genes and sets thereof. In this paper, we focus on inhouse and external research activities which have been enabled, enhanced, complemented and, in some cases, motivated by GeneCards. In turn, such interactions have often inspired and propelled improvements in GeneCards. We describe here the evolution and architecture of this project, including examples of synergistic applications in diverse areas such as synthetic lethality in cancer, the annotation of genetic variations in disease, omics integration in a systems biology approach to kidney disease, and bioinformatics tools.

  1. Inferring gene expression dynamics via functional regression analysis

    Directory of Open Access Journals (Sweden)

    Leng Xiaoyan

    2008-01-01

    Full Text Available Abstract Background Temporal gene expression profiles characterize the time-dynamics of expression of specific genes and are increasingly collected in current gene expression experiments. In the analysis of experiments where gene expression is obtained over the life cycle, it is of interest to relate temporal patterns of gene expression associated with different developmental stages to each other to study patterns of long-term developmental gene regulation. We use tools from functional data analysis to study dynamic changes by relating temporal gene expression profiles of different developmental stages to each other. Results We demonstrate that functional regression methodology can pinpoint relationships that exist between temporary gene expression profiles for different life cycle phases and incorporates dimension reduction as needed for these high-dimensional data. By applying these tools, gene expression profiles for pupa and adult phases are found to be strongly related to the profiles of the same genes obtained during the embryo phase. Moreover, one can distinguish between gene groups that exhibit relationships with positive and others with negative associations between later life and embryonal expression profiles. Specifically, we find a positive relationship in expression for muscle development related genes, and a negative relationship for strictly maternal genes for Drosophila, using temporal gene expression profiles. Conclusion Our findings point to specific reactivation patterns of gene expression during the Drosophila life cycle which differ in characteristic ways between various gene groups. Functional regression emerges as a useful tool for relating gene expression patterns from different developmental stages, and avoids the problems with large numbers of parameters and multiple testing that affect alternative approaches.

  2. Text mining in cancer gene and pathway prioritization.

    Science.gov (United States)

    Luo, Yuan; Riedlinger, Gregory; Szolovits, Peter

    2014-01-01

    Prioritization of cancer implicated genes has received growing attention as an effective way to reduce wet lab cost by computational analysis that ranks candidate genes according to the likelihood that experimental verifications will succeed. A multitude of gene prioritization tools have been developed, each integrating different data sources covering gene sequences, differential expressions, function annotations, gene regulations, protein domains, protein interactions, and pathways. This review places existing gene prioritization tools against the backdrop of an integrative Omic hierarchy view toward cancer and focuses on the analysis of their text mining components. We explain the relatively slow progress of text mining in gene prioritization, identify several challenges to current text mining methods, and highlight a few directions where more effective text mining algorithms may improve the overall prioritization task and where prioritizing the pathways may be more desirable than prioritizing only genes.

  3. rAAV Vectors as Safe and Efficient Tools for the Stable Delivery of Genes to Primary Human Chondrosarcoma Cells In Vitro and In Situ

    Directory of Open Access Journals (Sweden)

    Henning Madry

    2012-01-01

    Full Text Available Treatment of chondrosarcoma remains a major challenge in orthopaedic oncology. Gene transfer strategies based on recombinant adenoassociated viral (rAAV vectors may provide powerful tools to develop new, efficient therapeutic options against these tumors. In the present study, we tested the hypothesis that rAAV is adapted for a stable and safe delivery of foreign sequences in human chondrosarcoma tissue by transducing primary human chondrosarcoma cells in vitro and in situ with different reporter genes (E. coli lacZ, firefly luc, Discosoma sp. RFP. The effects of rAAV administration upon cell survival and metabolic activities were also evaluated to monitor possibly detrimental effects of the gene transfer method. Remarkably, we provide evidence that efficient and prolonged expression of transgene sequences via rAAV can be safely achieved in all the systems investigated, demonstrating the potential of the approach of direct application of therapeutic gene vectors as a means to treat chondrosarcoma.

  4. G2D: a tool for mining genes associated with disease

    OpenAIRE

    Perez-Iratxeta, Carolina; Wjst, Matthias; Bork, Peer; Andrade, Miguel A

    2005-01-01

    Abstract Background Human inherited diseases can be associated by genetic linkage with one or more genomic regions. The availability of the complete sequence of the human genome allows examining those locations for an associated gene. We previously developed an algorithm to prioritize genes on a chromosomal region according to their possible relation to an inherited disease using a combination of data mining on biomedical databases and gene sequence analysis. Results We have implemented this ...

  5. A segment of rbcL gene as a potential tool for forensic discrimination of Cannabis sativa seized at Rio de Janeiro, Brazil.

    Science.gov (United States)

    Mello, I C T; Ribeiro, A S D; Dias, V H G; Silva, R; Sabino, B D; Garrido, R G; Seldin, L; de Moura Neto, Rodrigo Soares

    2016-03-01

    Cannabis sativa, known by the common name marijuana, is the psychoactive drug most widely distributed in the world. Identification of Cannabis cultivars may be useful for association to illegal crops, which may reveal trafficking routes and related criminal groups. This study provides evidence for the performance of a segment of the rbcL gene, through genetic signature, as a tool for identification for C. sativa samples apprehended by the Rio de Janeiro Police, Brazil. The PCR amplified and further sequenced the fragment of approximately 561 bp of 24 samples of C. sativa rbcL gene and showed the same nucleotide sequences, suggesting a possible genetic similarity or identical varieties. Comparing with other Cannabaceae family sequences, we have found 99% of similarity between the Rio de Janeiro sequence and three other C. sativa rbcL genes. These findings suggest that the fragment utilized at this study is efficient in identifying C. sativa samples, therefore, useful in genetic discrimination of samples seized in forensic cases.

  6. Affinity-based biosensors as promising tools for gene doping detection.

    Science.gov (United States)

    Minunni, Maria; Scarano, Simona; Mascini, Marco

    2008-05-01

    Innovative bioanalytical approaches can be foreseen as interesting means for solving relevant emerging problems in anti-doping control. Sport authorities fear that the newer form of doping, so-called gene doping, based on a misuse of gene therapy, will be undetectable and thus much less preventable. The World Anti-Doping Agency has already asked scientists to assist in finding ways to prevent and detect this newest kind of doping. In this Opinion article we discuss the main aspects of gene doping, from the putative target analytes to suitable sampling strategies. Moreover, we discuss the potential application of affinity sensing in this field, which so far has been successfully applied to a variety of analytical problems, from clinical diagnostics to food and environmental analysis.

  7. SWPhylo - A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees.

    Science.gov (United States)

    Yu, Xiaoyu; Reva, Oleg N

    2018-01-01

    Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA.

  8. GREAT: a web portal for Genome Regulatory Architecture Tools.

    Science.gov (United States)

    Bouyioukos, Costas; Bucchini, François; Elati, Mohamed; Képès, François

    2016-07-08

    GREAT (Genome REgulatory Architecture Tools) is a novel web portal for tools designed to generate user-friendly and biologically useful analysis of genome architecture and regulation. The online tools of GREAT are freely accessible and compatible with essentially any operating system which runs a modern browser. GREAT is based on the analysis of genome layout -defined as the respective positioning of co-functional genes- and its relation with chromosome architecture and gene expression. GREAT tools allow users to systematically detect regular patterns along co-functional genomic features in an automatic way consisting of three individual steps and respective interactive visualizations. In addition to the complete analysis of regularities, GREAT tools enable the use of periodicity and position information for improving the prediction of transcription factor binding sites using a multi-view machine learning approach. The outcome of this integrative approach features a multivariate analysis of the interplay between the location of a gene and its regulatory sequence. GREAT results are plotted in web interactive graphs and are available for download either as individual plots, self-contained interactive pages or as machine readable tables for downstream analysis. The GREAT portal can be reached at the following URL https://absynth.issb.genopole.fr/GREAT and each individual GREAT tool is available for downloading. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Down-Regulation of Gene Expression by RNA-Induced Gene Silencing

    Science.gov (United States)

    Travella, Silvia; Keller, Beat

    Down-regulation of endogenous genes via post-transcriptional gene silencing (PTGS) is a key to the characterization of gene function in plants. Many RNA-based silencing mechanisms such as post-transcriptional gene silencing, co-suppression, quelling, and RNA interference (RNAi) have been discovered among species of different kingdoms (plants, fungi, and animals). One of the most interesting discoveries was RNAi, a sequence-specific gene-silencing mechanism initiated by the introduction of double-stranded RNA (dsRNA), homologous in sequence to the silenced gene, which triggers degradation of mRNA. Infection of plants with modified viruses can also induce RNA silencing and is referred to as virus-induced gene silencing (VIGS). In contrast to insertional mutagenesis, these emerging new reverse genetic approaches represent a powerful tool for exploring gene function and for manipulating gene expression experimentally in cereal species such as barley and wheat. We examined how RNAi and VIGS have been used to assess gene function in barley and wheat, including molecular mechanisms involved in the process and available methodological elements, such as vectors, inoculation procedures, and analysis of silenced phenotypes.

  10. ncRNA-class Web Tool: Non-coding RNA feature extraction and pre-miRNA classification web tool

    KAUST Repository

    Kleftogiannis, Dimitrios A.; Theofilatos, Konstantinos A.; Papadimitriou, Stergios; Tsakalidis, Athanasios K.; Likothanassis, Spiridon D.; Mavroudi, Seferina P.

    2012-01-01

    Until recently, it was commonly accepted that most genetic information is transacted by proteins. Recent evidence suggests that the majority of the genomes of mammals and other complex organisms are in fact transcribed into non-coding RNAs (ncRNAs), many of which are alternatively spliced and/or processed into smaller products. Non coding RNA genes analysis requires the calculation of several sequential, thermodynamical and structural features. Many independent tools have already been developed for the efficient calculation of such features but to the best of our knowledge there does not exist any integrative approach for this task. The most significant amount of existing work is related to the miRNA class of non-coding RNAs. MicroRNAs (miRNAs) are small non-coding RNAs that play a significant role in gene regulation and their prediction is a challenging bioinformatics problem. Non-coding RNA feature extraction and pre-miRNA classification Web Tool (ncRNA-class Web Tool) is a publicly available web tool ( http://150.140.142.24:82/Default.aspx ) which provides a user friendly and efficient environment for the effective calculation of a set of 58 sequential, thermodynamical and structural features of non-coding RNAs, plus a tool for the accurate prediction of miRNAs. © 2012 IFIP International Federation for Information Processing.

  11. Pgas, a Low-pH-Induced Promoter, as a Tool for Dynamic Control of Gene Expression for Metabolic Engineering of Aspergillus niger.

    Science.gov (United States)

    Yin, Xian; Shin, Hyun-Dong; Li, Jianghua; Du, Guocheng; Liu, Long; Chen, Jian

    2017-03-15

    The dynamic control of gene expression is important for adjusting fluxes in order to obtain desired products and achieve appropriate cell growth, particularly when the synthesis of a desired product drains metabolites required for cell growth. For dynamic gene expression, a promoter responsive to a particular environmental stressor is vital. Here, we report a low-pH-inducible promoter, P gas , which promotes minimal gene expression at pH values above 5.0 but functions efficiently at low pHs, such as pH 2.0. First, we performed a transcriptional analysis of Aspergillus niger , an excellent platform for the production of organic acids, and we found that the promoter P gas may act efficiently at low pH. Then, a gene for synthetic green fluorescent protein ( sGFP ) was successfully expressed by P gas at pH 2.0, verifying the results of the transcriptional analysis. Next, P gas was used to express the cis -aconitate decarboxylase ( cad ) gene of Aspergillus terreus in A. niger , allowing the production of itaconic acid at a titer of 4.92 g/liter. Finally, we found that P gas strength was independent of acid type and acid ion concentration, showing dependence on pH only. IMPORTANCE The promoter P gas can be used for the dynamic control of gene expression in A. niger for metabolic engineering to produce organic acids. This promoter may also be a candidate tool for genetic engineering. Copyright © 2017 American Society for Microbiology.

  12. Integrative Functional Genomics for Systems Genetics in GeneWeaver.org.

    Science.gov (United States)

    Bubier, Jason A; Langston, Michael A; Baker, Erich J; Chesler, Elissa J

    2017-01-01

    The abundance of existing functional genomics studies permits an integrative approach to interpreting and resolving the results of diverse systems genetics studies. However, a major challenge lies in assembling and harmonizing heterogeneous data sets across species for facile comparison to the positional candidate genes and coexpression networks that come from systems genetic studies. GeneWeaver is an online database and suite of tools at www.geneweaver.org that allows for fast aggregation and analysis of gene set-centric data. GeneWeaver contains curated experimental data together with resource-level data such as GO annotations, MP annotations, and KEGG pathways, along with persistent stores of user entered data sets. These can be entered directly into GeneWeaver or transferred from widely used resources such as GeneNetwork.org. Data are analyzed using statistical tools and advanced graph algorithms to discover new relations, prioritize candidate genes, and generate function hypotheses. Here we use GeneWeaver to find genes common to multiple gene sets, prioritize candidate genes from a quantitative trait locus, and characterize a set of differentially expressed genes. Coupling a large multispecies repository curated and empirical functional genomics data to fast computational tools allows for the rapid integrative analysis of heterogeneous data for interpreting and extrapolating systems genetics results.

  13. GeneCAT--novel webtools that combine BLAST and co-expression analyses

    DEFF Research Database (Denmark)

    Mutwil, Marek; Obro, Jens; Willats, William G T

    2008-01-01

    The gene co-expression analysis toolbox (GeneCAT) introduces several novel microarray data analyzing tools. First, the multigene co-expression analysis, combined with co-expressed gene networks, provides a more powerful data mining technique than standard, single-gene co-expression analysis. Second...... orthologs in the plant model organisms Arabidopsis thaliana and Hordeum vulgare (Barley). GeneCAT is equipped with expression data for the model plant A. thaliana, and first to introduce co-expression mining tools for the monocot Barley. GeneCAT is available at http://genecat.mpg.de....

  14. a positive control plasmid for reporter gene assay

    African Journals Online (AJOL)

    STORAGESEVER

    2008-07-04

    Jul 4, 2008 ... qualification as a positive control for luciferase reporter gene assays. Key words: Reporter gene plasmid, luciferase assay, cytomegalovirus promoter/enhancer, human melanoma cell line. INTRODUCTION. Reporter genes, often called reporters, have become a precious tool in studies of gene expression ...

  15. A database and tool, IM Browser, for exploring and integrating emerging gene and protein interaction data for Drosophila

    Directory of Open Access Journals (Sweden)

    Parrish Jodi R

    2006-04-01

    Full Text Available Abstract Background Biological processes are mediated by networks of interacting genes and proteins. Efforts to map and understand these networks are resulting in the proliferation of interaction data derived from both experimental and computational techniques for a number of organisms. The volume of this data combined with the variety of specific forms it can take has created a need for comprehensive databases that include all of the available data sets, and for exploration tools to facilitate data integration and analysis. One powerful paradigm for the navigation and analysis of interaction data is an interaction graph or map that represents proteins or genes as nodes linked by interactions. Several programs have been developed for graphical representation and analysis of interaction data, yet there remains a need for alternative programs that can provide casual users with rapid easy access to many existing and emerging data sets. Description Here we describe a comprehensive database of Drosophila gene and protein interactions collected from a variety of sources, including low and high throughput screens, genetic interactions, and computational predictions. We also present a program for exploring multiple interaction data sets and for combining data from different sources. The program, referred to as the Interaction Map (IM Browser, is a web-based application for searching and visualizing interaction data stored in a relational database system. Use of the application requires no downloads and minimal user configuration or training, thereby enabling rapid initial access to interaction data. IM Browser was designed to readily accommodate and integrate new types of interaction data as it becomes available. Moreover, all information associated with interaction measurements or predictions and the genes or proteins involved are accessible to the user. This allows combined searches and analyses based on either common or technique-specific attributes

  16. FARO server: Meta-analysis of gene expression by matching gene expression signatures to a compendium of public gene expression data

    DEFF Research Database (Denmark)

    Manijak, Mieszko P.; Nielsen, Henrik Bjørn

    2011-01-01

    circumvented by instead matching gene expression signatures to signatures of other experiments. FINDINGS: To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700...... Arabidopsis microarray experiments. CONCLUSIONS: Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/....

  17. GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data.

    Science.gov (United States)

    Carvalho, Paulo C; Fischer, Juliana Sg; Chen, Emily I; Domont, Gilberto B; Carvalho, Maria Gc; Degrave, Wim M; Yates, John R; Barbosa, Valmir C

    2009-02-24

    Spectral counting is a shotgun proteomics approach comprising the identification and relative quantitation of thousands of proteins in complex mixtures. However, this strategy generates bewildering amounts of data whose biological interpretation is a challenge. Here we present a new algorithm, termed GO Explorer (GOEx), that leverages the gene ontology (GO) to aid in the interpretation of proteomic data. GOEx stands out because it combines data from protein fold changes with GO over-representation statistics to help draw conclusions. Moreover, it is tightly integrated within the PatternLab for Proteomics project and, thus, lies within a complete computational environment that provides parsers and pattern recognition tools designed for spectral counting. GOEx offers three independent methods to query data: an interactive directed acyclic graph, a specialist mode where key words can be searched, and an automatic search. Its usefulness is demonstrated by applying it to help interpret the effects of perillyl alcohol, a natural chemotherapeutic agent, on glioblastoma multiform cell lines (A172). We used a new multi-surfactant shotgun proteomic strategy and identified more than 2600 proteins; GOEx pinpointed key sets of differentially expressed proteins related to cell cycle, alcohol catabolism, the Ras pathway, apoptosis, and stress response, to name a few. GOEx facilitates organism-specific studies by leveraging GO and providing a rich graphical user interface. It is a simple to use tool, specialized for biologists who wish to analyze spectral counting data from shotgun proteomics. GOEx is available at http://pcarvalho.com/patternlab.

  18. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

    Directory of Open Access Journals (Sweden)

    Walchli John

    2009-04-01

    Full Text Available Abstract Background With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. Results In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38α, viral polymerase (HCV NS5B, and bacterial structural protein (FtsZ were expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. Conclusion The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.

  19. A Review of Pathway-Based Analysis Tools That Visualize Genetic Variants

    Directory of Open Access Journals (Sweden)

    Elisa Cirillo

    2017-11-01

    Full Text Available Pathway analysis is a powerful method for data analysis in genomics, most often applied to gene expression analysis. It is also promising for single-nucleotide polymorphism (SNP data analysis, such as genome-wide association study data, because it allows the interpretation of variants with respect to the biological processes in which the affected genes and proteins are involved. Such analyses support an interactive evaluation of the possible effects of variations on function, regulation or interaction of gene products. Current pathway analysis software often does not support data visualization of variants in pathways as an alternate method to interpret genetic association results, and specific statistical methods for pathway analysis of SNP data are not combined with these visualization features. In this review, we first describe the visualization options of the tools that were identified by a literature review, in order to provide insight for improvements in this developing field. Tool evaluation was performed using a computational epistatic dataset of gene–gene interactions for obesity risk. Next, we report the necessity to include in these tools statistical methods designed for the pathway-based analysis with SNP data, expressly aiming to define features for more comprehensive pathway-based analysis tools. We conclude by recognizing that pathway analysis of genetic variations data requires a sophisticated combination of the most useful and informative visual aspects of the various tools evaluated.

  20. SWPhylo – A Novel Tool for Phylogenomic Inferences by Comparison of Oligonucleotide Patterns and Integration of Genome-Based and Gene-Based Phylogenetic Trees

    Science.gov (United States)

    Yu, Xiaoyu; Reva, Oleg N

    2018-01-01

    Modern phylogenetic studies may benefit from the analysis of complete genome sequences of various microorganisms. Evolutionary inferences based on genome-scale analysis are believed to be more accurate than the gene-based alternative. However, the computational complexity of current phylogenomic procedures, inappropriateness of standard phylogenetic tools to process genome-wide data, and lack of reliable substitution models which correlates with alignment-free phylogenomic approaches deter microbiologists from using these opportunities. For example, the super-matrix and super-tree approaches of phylogenomics use multiple integrated genomic loci or individual gene-based trees to infer an overall consensus tree. However, these approaches potentially multiply errors of gene annotation and sequence alignment not mentioning the computational complexity and laboriousness of the methods. In this article, we demonstrate that the annotation- and alignment-free comparison of genome-wide tetranucleotide frequencies, termed oligonucleotide usage patterns (OUPs), allowed a fast and reliable inference of phylogenetic trees. These were congruent to the corresponding whole genome super-matrix trees in terms of tree topology when compared with other known approaches including 16S ribosomal RNA and GyrA protein sequence comparison, complete genome-based MAUVE, and CVTree methods. A Web-based program to perform the alignment-free OUP-based phylogenomic inferences was implemented at http://swphylo.bi.up.ac.za/. Applicability of the tool was tested on different taxa from subspecies to intergeneric levels. Distinguishing between closely related taxonomic units may be enforced by providing the program with alignments of marker protein sequences, eg, GyrA. PMID:29511354

  1. Transgenesis: An efficient tool in mulberry breeding | Wani | African ...

    African Journals Online (AJOL)

    Genetic engineering is the most potent biotechnological approach dealing with transfer of specially constructed gene assemblies through various transformation techniques. Tools of recombinant DNA technology facilitated development of transgenic plants. The plants obtained through genetic engineering contain a gene or ...

  2. HMM-Based Gene Annotation Methods

    Energy Technology Data Exchange (ETDEWEB)

    Haussler, David; Hughey, Richard; Karplus, Keven

    1999-09-20

    Development of new statistical methods and computational tools to identify genes in human genomic DNA, and to provide clues to their functions by identifying features such as transcription factor binding sites, tissue, specific expression and splicing patterns, and remove homologies at the protein level with genes of known function.

  3. Constructive Technology Assessment (CTA) as a tool in coverage with evidence development: the case of the 70-gene prognosis signature for breast cancer diagnostics.

    Science.gov (United States)

    Retèl, Valesca P; Bueno-de-Mesquita, Jolien M; Hummel, Marjan J M; van de Vijver, Marc J; Douma, Kirsten F L; Karsenberg, Kim; van Dam, Frits S A M; van Krimpen, Cees; Bellot, Frank E; Roumen, Rudi M H; Linn, Sabine C; van Harten, Wim H

    2009-01-01

    Constructive Technology Assessment (CTA) is a means to guide early implementation of new developments in society, and can be used as an evaluation tool for Coverage with Evidence Development (CED). We used CTA for the introduction of a new diagnostic test in the Netherlands, the 70-gene prognosis signature (MammaPrint) for node-negative breast cancer patients. Studied aspects were (organizational) efficiency, patient-centeredness and diffusion scenarios. Pre-post structured surveys were conducted in fifteen community hospitals concerning changes in logistics and teamwork as a consequence of the introduction of the 70-gene signature. Patient-centeredness was measured by questionnaires and interviews regarding knowledge and psychological impact of the test. Diffusion scenarios, which are commonly applied in industry to anticipate on future development and diffusion of their products, have been applied in this study. Median implementation-time of the 70-gene signature was 1.2 months. Most changes were seen in pathology processes and adjuvant treatment decisions. Physicians valued the addition of the 70-gene signature information as beneficial for patient management. Patient-centeredness (n = 77, response 78 percent): patients receiving a concordant high-risk and discordant clinical low/high risk-signature showed significantly more negative emotions with respect to receiving both test-results compared with concordant low-risk and discordant clinical high/low risk-signature patients. The first scenario was written in 2004 before the introduction of the 70-gene signature and identified hypothetical developments that could influence diffusion; especially the "what-if" deviation describing a discussion on validity among physicians proved to be realistic. Differences in speed of implementation and influenced treatment decisions were seen. Impact on patients seems especially related to discordance and its successive communication. In the future, scenario drafting will lead

  4. Human gene therapy: novel approaches to improve the current gene delivery systems.

    Science.gov (United States)

    Cucchiarini, Magali

    2016-06-01

    Even though gene therapy made its way through the clinics to treat a number of human pathologies since the early years of experimental research and despite the recent approval of the first gene-based product (Glybera) in Europe, the safe and effective use of gene transfer vectors remains a challenge in human gene therapy due to the existence of barriers in the host organism. While work is under active investigation to improve the gene transfer systems themselves, the use of controlled release approaches may offer alternative, convenient tools of vector delivery to achieve a performant gene transfer in vivo while overcoming the various physiological barriers that preclude its wide use in patients. This article provides an overview of the most significant contributions showing how the principles of controlled release strategies may be adapted for human gene therapy.

  5. Radionuclide reporter gene imaging

    Energy Technology Data Exchange (ETDEWEB)

    Min, Jung Joon [School of Medicine, Chonnam National Univ., Gwangju (Korea, Republic of)

    2004-04-01

    Recent progress in the development of non-invasive imaging technologies continues to strengthen the role of molecular imaging biological research. These tools have been validated recently in variety of research models, and have been shown to provide continuous quantitative monitoring of the location(s), magnitude, and time-variation of gene expression. This article reviews the principles, characteristics, categories and the use of radionuclide reporter gene imaging technologies as they have been used in imaging cell trafficking, imaging gene therapy, imaging endogenous gene expression and imaging molecular interactions. The studies published to date demonstrate that reporter gene imaging technologies will help to accelerate model validation as well as allow for clinical monitoring of human diseases.

  6. Radionuclide reporter gene imaging

    International Nuclear Information System (INIS)

    Min, Jung Joon

    2004-01-01

    Recent progress in the development of non-invasive imaging technologies continues to strengthen the role of molecular imaging biological research. These tools have been validated recently in variety of research models, and have been shown to provide continuous quantitative monitoring of the location(s), magnitude, and time-variation of gene expression. This article reviews the principles, characteristics, categories and the use of radionuclide reporter gene imaging technologies as they have been used in imaging cell trafficking, imaging gene therapy, imaging endogenous gene expression and imaging molecular interactions. The studies published to date demonstrate that reporter gene imaging technologies will help to accelerate model validation as well as allow for clinical monitoring of human diseases

  7. PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

    Directory of Open Access Journals (Sweden)

    Wasnick Michael

    2008-03-01

    Full Text Available Abstract Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any

  8. Actin gene identification from selected medicinal plants for their use as internal controls for gene expression studies

    International Nuclear Information System (INIS)

    Mufti, F.U.D.; Banaras, S.

    2015-01-01

    Internal control genes are the constitutive genes which maintain the basic cellular functions and regularly express in both normal and stressed conditions in living organisms. They are used in normalization of gene expression studies in comparative analysis of target genes, as their expression remains comparatively unchanged in all varied conditions. Among internal control genes, actin is considered as a candidate gene for expression studies due to its vital role in shaping cytoskeleton and plant physiology. Unfortunately most of such knowledge is limited to only model plants or crops, not much is known about important medicinal plants. Therefore, we selected seven important medicinal wild plants for molecular identification of actin gene. We used gene specific primers designed from the conserved regions of several known orthologues or homologues of actin genes from other plants. The amplified products of 370-380 bp were sequenced and submitted to GeneBank after their confirmation using different bioinformatics tools. All the novel partial sequences of putative actin genes were submitted to GeneBank (Parthenium hysterophorus (KJ774023), Fagonia indica (KJ774024), Rhazya stricta (KJ774025), Whithania coagulans (KJ774026), Capparis decidua (KJ774027), Verbena officinalis (KJ774028) and Aerva javanica (KJ774029)). The comparisons of these partial sequences by Basic Local Alignment Search Tool (BLAST) and phylogenetic trees demonstrated high similarity with known actin genes of other plants. Our findings illustrated highly conserved nature of actin gene among these selected plants. These novel partial fragments of actin genes from these wild medicinal plants can be used as internal controls for future gene expression studies of these important plants after precise validations of their stable expression in such plants. (author)

  9. The Human Tyrosyl-DNA Phosphodiesterase 1 (hTdp1) Inhibitor NSC120686 as an Exploratory Tool to Investigate Plant Tdp1 Genes.

    Science.gov (United States)

    Macovei, Anca; Pagano, Andrea; Sabatini, Maria Elisa; Grandi, Sofia; Balestrazzi, Alma

    2018-03-28

    The hTdp1 (human tyrosyl-DNA phosphodiesterase 1) inhibitor NSC120686 has been used, along with topoisomerase inhibitors, as a pharmacophoric model to restrain the Tdp1 activity as part of a synergistic treatment for cancer. While this compound has an end-point application in medical research, in plants, its application has not been considered so far. The originality of our study consists in the use of hTdp1 inhibitor in Medicago truncatula cells, which, unlike human cells, contain two Tdp1 genes. Hence, the purpose of this study was to test the hTdp1 inhibitor NSC120686 as an exploratory tool to investigate the plant Tdp1 genes, since their characterization is still in incipient phases. To do so, M. truncatula calli were exposed to increasing (75, 150, 300 μM) concentrations of NSC120686. The levels of cell mortality and DNA damage, measured via diffusion assay and comet assay, respectively, were significantly increased when the highest doses were used, indicative of a cytotoxic and genotoxic threshold. In addition, the NSC120686-treated calli and untreated MtTdp1α -depleted calli shared a similar response in terms of programmed cell death (PCD)/necrosis and DNA damage. Interestingly, the expression profiles of MtTdp1α and MtTdp1β genes were differently affected by the NSC120686 treatment, as MtTdp1α was upregulated while MtTdp1β was downregulated. The NSC120686 treatment affected not only the MtTdp1 genes but also other genes with roles in alternative DNA repair pathways. Since the expression patterns of these genes were different than what was observed in the MtTdp1α -depleted plants, it could be hypothesized that the NSC120686 treatment exerts a different influence compared to that resulting from the lack of the MtTdp1α gene function.

  10. GO Explorer: A gene-ontology tool to aid in the interpretation of shotgun proteomics data

    Directory of Open Access Journals (Sweden)

    Domont Gilberto B

    2009-02-01

    Full Text Available Abstract Background Spectral counting is a shotgun proteomics approach comprising the identification and relative quantitation of thousands of proteins in complex mixtures. However, this strategy generates bewildering amounts of data whose biological interpretation is a challenge. Results Here we present a new algorithm, termed GO Explorer (GOEx, that leverages the gene ontology (GO to aid in the interpretation of proteomic data. GOEx stands out because it combines data from protein fold changes with GO over-representation statistics to help draw conclusions. Moreover, it is tightly integrated within the PatternLab for Proteomics project and, thus, lies within a complete computational environment that provides parsers and pattern recognition tools designed for spectral counting. GOEx offers three independent methods to query data: an interactive directed acyclic graph, a specialist mode where key words can be searched, and an automatic search. Its usefulness is demonstrated by applying it to help interpret the effects of perillyl alcohol, a natural chemotherapeutic agent, on glioblastoma multiform cell lines (A172. We used a new multi-surfactant shotgun proteomic strategy and identified more than 2600 proteins; GOEx pinpointed key sets of differentially expressed proteins related to cell cycle, alcohol catabolism, the Ras pathway, apoptosis, and stress response, to name a few. Conclusion GOEx facilitates organism-specific studies by leveraging GO and providing a rich graphical user interface. It is a simple to use tool, specialized for biologists who wish to analyze spectral counting data from shotgun proteomics. GOEx is available at http://pcarvalho.com/patternlab.

  11. Web tools for the prioritization of candidate disease genes.

    NARCIS (Netherlands)

    Oti, M.O.; Ballouz, S.; Wouters, M.A.

    2011-01-01

    Despite increasing sequencing capacity, genetic disease investigation still frequently results in the identification of loci containing multiple candidate disease genes that need to be tested for involvement in the disease. This process can be expedited by prioritizing the candidates prior to

  12. Uses of antimicrobial genes from microbial genome

    Science.gov (United States)

    Sorek, Rotem; Rubin, Edward M.

    2013-08-20

    We describe a method for mining microbial genomes to discover antimicrobial genes and proteins having broad spectrum of activity. Also described are antimicrobial genes and their expression products from various microbial genomes that were found using this method. The products of such genes can be used as antimicrobial agents or as tools for molecular biology.

  13. FGF: A web tool for Fishing Gene Family in a whole genome database

    DEFF Research Database (Denmark)

    Zheng, Hongkun; Shi, Junjie; Fang, Xiaodong

    2007-01-01

    to efficiently search for and identify gene families. The FGF output displays the results as visual phylogenetic trees including information on gene structure, chromosome position, duplication fate and selective pressure. It is particularly useful to identify pseudogenes and detect changes in gene structure. FGF...

  14. Electrotransfer parameters as a tool for controlled and targeted gene expression in skin

    Directory of Open Access Journals (Sweden)

    Spela Kos

    2016-01-01

    Full Text Available Skin is an attractive target for gene electrotransfer. It consists of different cell types that can be transfected, leading to various responses to gene electrotransfer. We demonstrate that these responses could be controlled by selecting the appropriate electrotransfer parameters. Specifically, the application of low or high electric pulses, applied by multi-electrode array, provided the possibility to control the depth of the transfection in the skin, the duration and the level of gene expression, as well as the local or systemic distribution of the transgene. The influence of electric pulse type was first studied using a plasmid encoding a reporter gene (DsRed. Then, plasmids encoding therapeutic genes (IL-12, shRNA against endoglin, shRNA against melanoma cell adhesion molecule were used, and their effects on wound healing and cutaneous B16F10 melanoma tumors were investigated. The high-voltage pulses resulted in gene expression that was restricted to superficial skin layers and induced a local response. In contrast, the low-voltage electric pulses promoted transfection into the deeper skin layers, resulting in prolonged gene expression and higher transgene production, possibly with systemic distribution. Therefore, in the translation into the clinics, it will be of the utmost importance to adjust the electrotransfer parameters for different therapeutic approaches and specific mode of action of the therapeutic gene.

  15. Gene Expression Omnibus (GEO)

    Data.gov (United States)

    U.S. Department of Health & Human Services — Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided...

  16. Endocrine aspects of cancer gene therapy.

    Science.gov (United States)

    Barzon, Luisa; Boscaro, Marco; Palù, Giorgio

    2004-02-01

    The field of cancer gene therapy is in continuous expansion, and technology is quickly moving ahead as far as gene targeting and regulation of gene expression are concerned. This review focuses on the endocrine aspects of gene therapy, including the possibility to exploit hormone and hormone receptor functions for regulating therapeutic gene expression, the use of endocrine-specific genes as new therapeutic tools, the effects of viral vector delivery and transgene expression on the endocrine system, and the endocrine response to viral vector delivery. Present ethical concerns of gene therapy and the risk of germ cell transduction are also discussed, along with potential lines of innovation to improve cell and gene targeting.

  17. MaGnET: Malaria Genome Exploration Tool.

    Science.gov (United States)

    Sharman, Joanna L; Gerloff, Dietlind L

    2013-09-15

    The Malaria Genome Exploration Tool (MaGnET) is a software tool enabling intuitive 'exploration-style' visualization of functional genomics data relating to the malaria parasite, Plasmodium falciparum. MaGnET provides innovative integrated graphic displays for different datasets, including genomic location of genes, mRNA expression data, protein-protein interactions and more. Any selection of genes to explore made by the user is easily carried over between the different viewers for different datasets, and can be changed interactively at any point (without returning to a search). Free online use (Java Web Start) or download (Java application archive and MySQL database; requires local MySQL installation) at http://malariagenomeexplorer.org joanna.sharman@ed.ac.uk or dgerloff@ffame.org Supplementary data are available at Bioinformatics online.

  18. The development and application of a multiple gene co-silencing system using endogenous URA3 as a reporter gene in Ganoderma lucidum.

    Directory of Open Access Journals (Sweden)

    Dashuai Mu

    Full Text Available Ganoderma lucidum is one of the most important medicinal mushrooms; however, molecular genetics research on this species has been limited due to a lack of reliable reverse genetic tools. In this study, the endogenous orotidine 5'-monophosphate decarboxylase gene (URA3 was cloned as a silencing reporter, and four gene-silencing methods using hairpin, sense, antisense, and dual promoter constructs, were introduced into G. lucidum through a simple electroporation procedure. A comparison and evaluation of silencing efficiency demonstrated that all of the four methods differentially suppressed the expression of URA3. Our data unequivocally indicate that the dual promoter silencing vector yields the highest rate of URA3 silencing compared with other vectors (up to 81.9%. To highlight the advantages of the dual promoter system, we constructed a co-silencing system based on the dual promoter method and succeeded in co-silencing URA3 and laccase in G. lucidum. The reduction of the mRNA levels of the two genes were correlated. Thus, the screening efficiency for RNAi knockdown of multiple genes may be improved by the co-silencing of an endogenous reporter gene. The molecular tools developed in this study should facilitate the isolation of genes and the characterization of the functions of multiple genes in this pharmaceutically important species, and these tools should be highly useful for the study of other basidiomycetes.

  19. Tools to Minimize Inter-Laboratory Variability in Vitellogenin Gene Expression Monitoring Programs

    Data.gov (United States)

    U.S. Environmental Protection Agency — All data files are in excel format. Files with names CSU are different mesocosms qPCR data results for vitellogen gene and 18s a house keeping gene. Data files...

  20. GeneYenta: a phenotype-based rare disease case matching tool based on online dating algorithms for the acceleration of exome interpretation.

    Science.gov (United States)

    Gottlieb, Michael M; Arenillas, David J; Maithripala, Savanie; Maurer, Zachary D; Tarailo Graovac, Maja; Armstrong, Linlea; Patel, Millan; van Karnebeek, Clara; Wasserman, Wyeth W

    2015-04-01

    Advances in next-generation sequencing (NGS) technologies have helped reveal causal variants for genetic diseases. In order to establish causality, it is often necessary to compare genomes of unrelated individuals with similar disease phenotypes to identify common disrupted genes. When working with cases of rare genetic disorders, finding similar individuals can be extremely difficult. We introduce a web tool, GeneYenta, which facilitates the matchmaking process, allowing clinicians to coordinate detailed comparisons for phenotypically similar cases. Importantly, the system is focused on phenotype annotation, with explicit limitations on highly confidential data that create barriers to participation. The procedure for matching of patient phenotypes, inspired by online dating services, uses an ontology-based semantic case matching algorithm with attribute weighting. We evaluate the capacity of the system using a curated reference data set and 19 clinician entered cases comparing four matching algorithms. We find that the inclusion of clinician weights can augment phenotype matching. © 2015 WILEY PERIODICALS, INC.

  1. Cry-Bt identifier: a biological database for PCR detection of Cry genes present in transgenic plants.

    Science.gov (United States)

    Singh, Vinay Kumar; Ambwani, Sonu; Marla, Soma; Kumar, Anil

    2009-10-23

    We describe the development of a user friendly tool that would assist in the retrieval of information relating to Cry genes in transgenic crops. The tool also helps in detection of transformed Cry genes from Bacillus thuringiensis present in transgenic plants by providing suitable designed primers for PCR identification of these genes. The tool designed based on relational database model enables easy retrieval of information from the database with simple user queries. The tool also enables users to access related information about Cry genes present in various databases by interacting with different sources (nucleotide sequences, protein sequence, sequence comparison tools, published literature, conserved domains, evolutionary and structural data). http://insilicogenomics.in/Cry-btIdentifier/welcome.html.

  2. HRGFish: A database of hypoxia responsive genes in fishes

    Science.gov (United States)

    Rashid, Iliyas; Nagpure, Naresh Sahebrao; Srivastava, Prachi; Kumar, Ravindra; Pathak, Ajey Kumar; Singh, Mahender; Kushwaha, Basdeo

    2017-02-01

    Several studies have highlighted the changes in the gene expression due to the hypoxia response in fishes, but the systematic organization of the information and the analytical platform for such genes are lacking. In the present study, an attempt was made to develop a database of hypoxia responsive genes in fishes (HRGFish), integrated with analytical tools, using LAMPP technology. Genes reported in hypoxia response for fishes were compiled through literature survey and the database presently covers 818 gene sequences and 35 gene types from 38 fishes. The upstream fragments (3,000 bp), covered in this database, enables to compute CG dinucleotides frequencies, motif finding of the hypoxia response element, identification of CpG island and mapping with the reference promoter of zebrafish. The database also includes functional annotation of genes and provides tools for analyzing sequences and designing primers for selected gene fragments. This may be the first database on the hypoxia response genes in fishes that provides a workbench to the scientific community involved in studying the evolution and ecological adaptation of the fish species in relation to hypoxia.

  3. Bayesian nonparametric variable selection as an exploratory tool for discovering differentially expressed genes.

    Science.gov (United States)

    Shahbaba, Babak; Johnson, Wesley O

    2013-05-30

    High-throughput scientific studies involving no clear a priori hypothesis are common. For example, a large-scale genomic study of a disease may examine thousands of genes without hypothesizing that any specific gene is responsible for the disease. In these studies, the objective is to explore a large number of possible factors (e.g., genes) in order to identify a small number that will be considered in follow-up studies that tend to be more thorough and on smaller scales. A simple, hierarchical, linear regression model with random coefficients is assumed for case-control data that correspond to each gene. The specific model used will be seen to be related to a standard Bayesian variable selection model. Relatively large regression coefficients correspond to potential differences in responses for cases versus controls and thus to genes that might 'matter'. For large-scale studies, and using a Dirichlet process mixture model for the regression coefficients, we are able to find clusters of regression effects of genes with increasing potential effect or 'relevance', in relation to the outcome of interest. One cluster will always correspond to genes whose coefficients are in a neighborhood that is relatively close to zero and will be deemed least relevant. Other clusters will correspond to increasing magnitudes of the random/latent regression coefficients. Using simulated data, we demonstrate that our approach could be quite effective in finding relevant genes compared with several alternative methods. We apply our model to two large-scale studies. The first study involves transcriptome analysis of infection by human cytomegalovirus. The second study's objective is to identify differentially expressed genes between two types of leukemia. Copyright © 2012 John Wiley & Sons, Ltd.

  4. Monitoring bioremediation of atrazine in soil microcosms using molecular tools

    International Nuclear Information System (INIS)

    Sagarkar, Sneha; Mukherjee, Shinjini; Nousiainen, Aura; Björklöf, Katarina; Purohit, Hemant J.; Jørgensen, Kirsten S.; Kapley, Atya

    2013-01-01

    Molecular tools in microbial community analysis give access to information on catabolic potential and diversity of microbes. Applied in bioremediation, they could provide a new dimension to improve pollution control. This concept has been demonstrated in the study using atrazine as model pollutant. Bioremediation of the herbicide, atrazine, was analyzed in microcosm studies by bioaugmentation, biostimulation and natural attenuation. Genes from the atrazine degrading pathway atzA/B/C/D/E/F, trzN, and trzD were monitored during the course of treatment and results demonstrated variation in atzC, trzD and trzN genes with time. Change in copy number of trzN gene under different treatment processes was demonstrated by real-time PCR. The amplified trzN gene was cloned and sequence data showed homology to genes reported in Arthrobacter and Nocardioides. Results demonstrate that specific target genes can be monitored, quantified and correlated to degradation analysis which would help in predicting the outcome of any bioremediation strategy. - Highlights: ► Degradation of herbicide, atrazine. ► Comparison of bioremediation via bioaugmentation, biostimulation and natural attenuation. ► Gene profile analysis in all treatments. ► Variation in trzN gene numbers correlated to degradation efficiency. ► Cloning and sequence analysis of trzN gene demonstrates very high homology to reported gene. - This study demonstrates the use of molecular tools in bioremediation to monitor and track target genes; correlates the results with degradation and thereby predicts the efficiency of treatment.

  5. Serial analysis of gene expression (SAGE)

    NARCIS (Netherlands)

    van Ruissen, Fred; Baas, Frank

    2007-01-01

    In 1995, serial analysis of gene expression (SAGE) was developed as a versatile tool for gene expression studies. SAGE technology does not require pre-existing knowledge of the genome that is being examined and therefore SAGE can be applied to many different model systems. In this chapter, the SAGE

  6. Analysis tools for the interplay between genome layout and regulation.

    Science.gov (United States)

    Bouyioukos, Costas; Elati, Mohamed; Képès, François

    2016-06-06

    Genome layout and gene regulation appear to be interdependent. Understanding this interdependence is key to exploring the dynamic nature of chromosome conformation and to engineering functional genomes. Evidence for non-random genome layout, defined as the relative positioning of either co-functional or co-regulated genes, stems from two main approaches. Firstly, the analysis of contiguous genome segments across species, has highlighted the conservation of gene arrangement (synteny) along chromosomal regions. Secondly, the study of long-range interactions along a chromosome has emphasised regularities in the positioning of microbial genes that are co-regulated, co-expressed or evolutionarily correlated. While one-dimensional pattern analysis is a mature field, it is often powerless on biological datasets which tend to be incomplete, and partly incorrect. Moreover, there is a lack of comprehensive, user-friendly tools to systematically analyse, visualise, integrate and exploit regularities along genomes. Here we present the Genome REgulatory and Architecture Tools SCAN (GREAT:SCAN) software for the systematic study of the interplay between genome layout and gene expression regulation. SCAN is a collection of related and interconnected applications currently able to perform systematic analyses of genome regularities as well as to improve transcription factor binding sites (TFBS) and gene regulatory network predictions based on gene positional information. We demonstrate the capabilities of these tools by studying on one hand the regular patterns of genome layout in the major regulons of the bacterium Escherichia coli. On the other hand, we demonstrate the capabilities to improve TFBS prediction in microbes. Finally, we highlight, by visualisation of multivariate techniques, the interplay between position and sequence information for effective transcription regulation.

  7. Gene expression profiles reveal key genes for early diagnosis and treatment of adamantinomatous craniopharyngioma.

    Science.gov (United States)

    Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing

    2018-04-23

    Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis

  8. FGF: A web tool for Fishing Gene Family in a whole genome database

    DEFF Research Database (Denmark)

    Zheng, Hongkun; Shi, Junjie; Fang, Xiaodong

    2007-01-01

    Gene duplication is an important process in evolution. The availability of genome sequences of a number of organisms has made it possible to conduct comprehensive searches for duplicated genes enabling informative studies of their evolution. We have established the FGF (Fishing Gene Family) progr...... is freely available on a web server at http://fgf.genomics.org.cn/...

  9. Imaging after vascular gene therapy

    International Nuclear Information System (INIS)

    Manninen, Hannu I.; Yang, Xiaoming

    2005-01-01

    Targets for cardiovascular gene therapy currently include limiting restenosis after balloon angioplasty and stent placement, inhibiting vein bypass graft intimal hyperplasia/stenosis, therapeutic angiogenesis for cardiac and lower-limb ischemia, and prevention of thrombus formation. While catheter angiography is still standard method to follow-up vascular gene transfer, other modern imaging techniques, especially intravascular ultrasound (IVUS), magnetic resonance (MR), and positron emission tomography (PET) imaging provide complementary information about the therapeutic effect of vascular gene transfer in humans. Although molecular imaging of therapeutic gene expression in the vasculatures is still in its technical development phase, it has already offered basic medical science an extremely useful in vivo evaluation tool for non- or minimally invasive imaging of vascular gene therapy

  10. Dereplication, Aggregation and Scoring Tool (DAS Tool) v1.0

    Energy Technology Data Exchange (ETDEWEB)

    2017-03-01

    Communities of uncultivated microbes are critical to ecosystem function and microorganism health, and a key objective of metagenomic studies is to analyze organism-specific metabolic pathways and reconstruct community interaction networks. This requires accurate assignment of genes to genomes, yet existing binning methods often fail to predict a reasonable number of genomes and report many bins of low quality and completeness. Furthermore, the performance of existing algorithms varies between samples and biotypes. Here, we present a dereplication, aggregation and scoring strategy, DAS Tool, that combines the strengths of a flexible set of established binning algorithms. DAS Tools applied to a constructed community generated more accurate bins than any automated method. Further, when applied to samples of different complexity, including soil, natural oil seeps, and the human gut, DAS Tool recovered substantially more near-complete genomes than any single binning method alone. Included were three genomes from a novel lineage . The ability to reconstruct many near-complete genomes from metagenomics data will greatly advance genome-centric analyses of ecosystems.

  11. Application of genomic tools in plant breeding.

    Science.gov (United States)

    Pérez-de-Castro, A M; Vilanova, S; Cañizares, J; Pascual, L; Blanca, J M; Díez, M J; Prohens, J; Picó, B

    2012-05-01

    Plant breeding has been very successful in developing improved varieties using conventional tools and methodologies. Nowadays, the availability of genomic tools and resources is leading to a new revolution of plant breeding, as they facilitate the study of the genotype and its relationship with the phenotype, in particular for complex traits. Next Generation Sequencing (NGS) technologies are allowing the mass sequencing of genomes and transcriptomes, which is producing a vast array of genomic information. The analysis of NGS data by means of bioinformatics developments allows discovering new genes and regulatory sequences and their positions, and makes available large collections of molecular markers. Genome-wide expression studies provide breeders with an understanding of the molecular basis of complex traits. Genomic approaches include TILLING and EcoTILLING, which make possible to screen mutant and germplasm collections for allelic variants in target genes. Re-sequencing of genomes is very useful for the genome-wide discovery of markers amenable for high-throughput genotyping platforms, like SSRs and SNPs, or the construction of high density genetic maps. All these tools and resources facilitate studying the genetic diversity, which is important for germplasm management, enhancement and use. Also, they allow the identification of markers linked to genes and QTLs, using a diversity of techniques like bulked segregant analysis (BSA), fine genetic mapping, or association mapping. These new markers are used for marker assisted selection, including marker assisted backcross selection, 'breeding by design', or new strategies, like genomic selection. In conclusion, advances in genomics are providing breeders with new tools and methodologies that allow a great leap forward in plant breeding, including the 'superdomestication' of crops and the genetic dissection and breeding for complex traits.

  12. Genetic architecture of gene expression in the chicken

    Directory of Open Access Journals (Sweden)

    Stanley Dragana

    2013-01-01

    Full Text Available Abstract Background The annotation of many genomes is limited, with a large proportion of identified genes lacking functional assignments. The construction of gene co-expression networks is a powerful approach that presents a way of integrating information from diverse gene expression datasets into a unified analysis which allows inferences to be drawn about the role of previously uncharacterised genes. Using this approach, we generated a condition-free gene co-expression network for the chicken using data from 1,043 publically available Affymetrix GeneChip Chicken Genome Arrays. This data was generated from a diverse range of experiments, including different tissues and experimental conditions. Our aim was to identify gene co-expression modules and generate a tool to facilitate exploration of the functional chicken genome. Results Fifteen modules, containing between 24 and 473 genes, were identified in the condition-free network. Most of the modules showed strong functional enrichment for particular Gene Ontology categories. However, a few showed no enrichment. Transcription factor binding site enrichment was also noted. Conclusions We have demonstrated that this chicken gene co-expression network is a useful tool in gene function prediction and the identification of putative novel transcription factors and binding sites. This work highlights the relevance of this methodology for functional prediction in poorly annotated genomes such as the chicken.

  13. Reconstruction of Ancestral Genomes in Presence of Gene Gain and Loss.

    Science.gov (United States)

    Avdeyev, Pavel; Jiang, Shuai; Aganezov, Sergey; Hu, Fei; Alekseyev, Max A

    2016-03-01

    Since most dramatic genomic changes are caused by genome rearrangements as well as gene duplications and gain/loss events, it becomes crucial to understand their mechanisms and reconstruct ancestral genomes of the given genomes. This problem was shown to be NP-complete even in the "simplest" case of three genomes, thus calling for heuristic rather than exact algorithmic solutions. At the same time, a larger number of input genomes may actually simplify the problem in practice as it was earlier illustrated with MGRA, a state-of-the-art software tool for reconstruction of ancestral genomes of multiple genomes. One of the key obstacles for MGRA and other similar tools is presence of breakpoint reuses when the same breakpoint region is broken by several different genome rearrangements in the course of evolution. Furthermore, such tools are often limited to genomes composed of the same genes with each gene present in a single copy in every genome. This limitation makes these tools inapplicable for many biological datasets and degrades the resolution of ancestral reconstructions in diverse datasets. We address these deficiencies by extending the MGRA algorithm to genomes with unequal gene contents. The developed next-generation tool MGRA2 can handle gene gain/loss events and shares the ability of MGRA to reconstruct ancestral genomes uniquely in the case of limited breakpoint reuse. Furthermore, MGRA2 employs a number of novel heuristics to cope with higher breakpoint reuse and process datasets inaccessible for MGRA. In practical experiments, MGRA2 shows superior performance for simulated and real genomes as compared to other ancestral genome reconstruction tools.

  14. Patenting human genes: Chinese academic articles' portrayal of gene patents.

    Science.gov (United States)

    Du, Li

    2018-04-24

    The patenting of human genes has been the subject of debate for decades. While China has gradually come to play an important role in the global genomics-based testing and treatment market, little is known about Chinese scholars' perspectives on patent protection for human genes. A content analysis of academic literature was conducted to identify Chinese scholars' concerns regarding gene patents, including benefits and risks of patenting human genes, attitudes that researchers hold towards gene patenting, and any legal and policy recommendations offered for the gene patent regime in China. 57.2% of articles were written by law professors, but scholars from health sciences, liberal arts, and ethics also participated in discussions on gene patent issues. While discussions of benefits and risks were relatively balanced in the articles, 63.5% of the articles favored gene patenting in general and, of the articles (n = 41) that explored gene patents in the Chinese context, 90.2% supported patent protections for human genes in China. The patentability of human genes was discussed in 33 articles, and 75.8% of these articles reached the conclusion that human genes are patentable. Chinese scholars view the patent regime as an important legal tool to protect the interests of inventors and inventions as well as the genetic resources of China. As such, many scholars support a gene patent system in China. These attitudes towards gene patents remain unchanged following the court ruling in the Myriad case in 2013, but arguments have been raised about the scope of gene patents, in particular that the increasing numbers of gene patents may negatively impact public health in China.

  15. COGNATE: comparative gene annotation characterizer.

    Science.gov (United States)

    Wilbrandt, Jeanne; Misof, Bernhard; Niehuis, Oliver

    2017-07-17

    The comparison of gene and genome structures across species has the potential to reveal major trends of genome evolution. However, such a comparative approach is currently hampered by a lack of standardization (e.g., Elliott TA, Gregory TR, Philos Trans Royal Soc B: Biol Sci 370:20140331, 2015). For example, testing the hypothesis that the total amount of coding sequences is a reliable measure of potential proteome diversity (Wang M, Kurland CG, Caetano-Anollés G, PNAS 108:11954, 2011) requires the application of standardized definitions of coding sequence and genes to create both comparable and comprehensive data sets and corresponding summary statistics. However, such standard definitions either do not exist or are not consistently applied. These circumstances call for a standard at the descriptive level using a minimum of parameters as well as an undeviating use of standardized terms, and for software that infers the required data under these strict definitions. The acquisition of a comprehensive, descriptive, and standardized set of parameters and summary statistics for genome publications and further analyses can thus greatly benefit from the availability of an easy to use standard tool. We developed a new open-source command-line tool, COGNATE (Comparative Gene Annotation Characterizer), which uses a given genome assembly and its annotation of protein-coding genes for a detailed description of the respective gene and genome structure parameters. Additionally, we revised the standard definitions of gene and genome structures and provide the definitions used by COGNATE as a working draft suggestion for further reference. Complete parameter lists and summary statistics are inferred using this set of definitions to allow down-stream analyses and to provide an overview of the genome and gene repertoire characteristics. COGNATE is written in Perl and freely available at the ZFMK homepage ( https://www.zfmk.de/en/COGNATE ) and on github ( https

  16. Visually Relating Gene Expression and in vivo DNA Binding Data

    Energy Technology Data Exchange (ETDEWEB)

    Huang, Min-Yu; Mackey, Lester; Ker?,; nen, Soile V. E.; Weber, Gunther H.; Jordan, Michael I.; Knowles, David W.; Biggin, Mark D.; Hamann, Bernd

    2011-09-20

    Gene expression and in vivo DNA binding data provide important information for understanding gene regulatory networks: in vivo DNA binding data indicate genomic regions where transcription factors are bound, and expression data show the output resulting from this binding. Thus, there must be functional relationships between these two types of data. While visualization and data analysis tools exist for each data type alone, there is a lack of tools that can easily explore the relationship between them. We propose an approach that uses the average expression driven by multiple of ciscontrol regions to visually relate gene expression and in vivo DNA binding data. We demonstrate the utility of this tool with examples from the network controlling early Drosophila development. The results obtained support the idea that the level of occupancy of a transcription factor on DNA strongly determines the degree to which the factor regulates a target gene, and in some cases also controls whether the regulation is positive or negative.

  17. SIGNATURE: A workbench for gene expression signature analysis

    Directory of Open Access Journals (Sweden)

    Chang Jeffrey T

    2011-11-01

    Full Text Available Abstract Background The biological phenotype of a cell, such as a characteristic visual image or behavior, reflects activities derived from the expression of collections of genes. As such, an ability to measure the expression of these genes provides an opportunity to develop more precise and varied sets of phenotypes. However, to use this approach requires computational methods that are difficult to implement and apply, and thus there is a critical need for intelligent software tools that can reduce the technical burden of the analysis. Tools for gene expression analyses are unusually difficult to implement in a user-friendly way because their application requires a combination of biological data curation, statistical computational methods, and database expertise. Results We have developed SIGNATURE, a web-based resource that simplifies gene expression signature analysis by providing software, data, and protocols to perform the analysis successfully. This resource uses Bayesian methods for processing gene expression data coupled with a curated database of gene expression signatures, all carried out within a GenePattern web interface for easy use and access. Conclusions SIGNATURE is available for public use at http://genepattern.genome.duke.edu/signature/.

  18. Mutation based treatment recommendations from next generation sequencing data: a comparison of web tools.

    Science.gov (United States)

    Patel, Jaymin M; Knopf, Joshua; Reiner, Eric; Bossuyt, Veerle; Epstein, Lianne; DiGiovanna, Michael; Chung, Gina; Silber, Andrea; Sanft, Tara; Hofstatter, Erin; Mougalian, Sarah; Abu-Khalaf, Maysa; Platt, James; Shi, Weiwei; Gershkovich, Peter; Hatzis, Christos; Pusztai, Lajos

    2016-04-19

    Interpretation of complex cancer genome data, generated by tumor target profiling platforms, is key for the success of personalized cancer therapy. How to draw therapeutic conclusions from tumor profiling results is not standardized and may vary among commercial and academically-affiliated recommendation tools. We performed targeted sequencing of 315 genes from 75 metastatic breast cancer biopsies using the FoundationOne assay. Results were run through 4 different web tools including the Drug-Gene Interaction Database (DGidb), My Cancer Genome (MCG), Personalized Cancer Therapy (PCT), and cBioPortal, for drug and clinical trial recommendations. These recommendations were compared amongst each other and to those provided by FoundationOne. The identification of a gene as targetable varied across the different recommendation sources. Only 33% of cases had 4 or more sources recommend the same drug for at least one of the usually several altered genes found in tumor biopsies. These results indicate further development and standardization of broadly applicable software tools that assist in our therapeutic interpretation of genomic data is needed. Existing algorithms for data acquisition, integration and interpretation will likely need to incorporate artificial intelligence tools to improve both content and real-time status.

  19. Transgene Expression in Microalgae-From Tools to Applications.

    Science.gov (United States)

    Doron, Lior; Segal, Na'ama; Shapira, Michal

    2016-01-01

    Microalgae comprise a biodiverse group of photosynthetic organisms that reside in water sources and sediments. The green microalgae Chlamydomonas reinhardtii was adopted as a useful model organism for studying various physiological systems. Its ability to grow under both photosynthetic and heterotrophic conditions allows efficient growth of non-photosynthetic mutants, making Chlamydomonas a useful genetic tool to study photosynthesis. In addition, this green alga can grow as haploid or diploid cells, similar to yeast, providing a powerful genetic system. As a result, easy and efficient transformation systems have been developed for Chlamydomonas, targeting both the chloroplast and nuclear genomes. Since microalgae comprise a rich repertoire of species that offer variable advantages for biotech and biomed industries, gene transfer technologies were further developed for many microalgae to allow for the expression of foreign proteins of interest. Expressing foreign genes in the chloroplast enables the targeting of foreign DNA to specific sites by homologous recombination. Chloroplast transformation also allows for the introduction of genes encoding several enzymes from a complex pathway, possibly as an operon. Expressing foreign proteins in the chloroplast can also be achieved by introducing the target gene into the nuclear genome, with the protein product bearing a targeting signal that directs import of the transgene-product into the chloroplast, like other endogenous chloroplast proteins. Integration of foreign genes into the nuclear genome is mostly random, resulting in large variability between different clones, such that extensive screening is required. The use of different selection modalities is also described, with special emphasis on the use of herbicides and metabolic markers which are considered to be friendly to the environment, as compared to drug-resistance genes that are commonly used. Finally, despite the development of a wide range of transformation

  20. Gene Circuit Analysis of the Terminal Gap Gene huckebein

    Science.gov (United States)

    Ashyraliyev, Maksat; Siggens, Ken; Janssens, Hilde; Blom, Joke; Akam, Michael; Jaeger, Johannes

    2009-01-01

    The early embryo of Drosophila melanogaster provides a powerful model system to study the role of genes in pattern formation. The gap gene network constitutes the first zygotic regulatory tier in the hierarchy of the segmentation genes involved in specifying the position of body segments. Here, we use an integrative, systems-level approach to investigate the regulatory effect of the terminal gap gene huckebein (hkb) on gap gene expression. We present quantitative expression data for the Hkb protein, which enable us to include hkb in gap gene circuit models. Gap gene circuits are mathematical models of gene networks used as computational tools to extract regulatory information from spatial expression data. This is achieved by fitting the model to gap gene expression patterns, in order to obtain estimates for regulatory parameters which predict a specific network topology. We show how considering variability in the data combined with analysis of parameter determinability significantly improves the biological relevance and consistency of the approach. Our models are in agreement with earlier results, which they extend in two important respects: First, we show that Hkb is involved in the regulation of the posterior hunchback (hb) domain, but does not have any other essential function. Specifically, Hkb is required for the anterior shift in the posterior border of this domain, which is now reproduced correctly in our models. Second, gap gene circuits presented here are able to reproduce mutants of terminal gap genes, while previously published models were unable to reproduce any null mutants correctly. As a consequence, our models now capture the expression dynamics of all posterior gap genes and some variational properties of the system correctly. This is an important step towards a better, quantitative understanding of the developmental and evolutionary dynamics of the gap gene network. PMID:19876378

  1. Identifying Cancer Driver Genes Using Replication-Incompetent Retroviral Vectors

    Directory of Open Access Journals (Sweden)

    Victor M. Bii

    2016-10-01

    Full Text Available Identifying novel genes that drive tumor metastasis and drug resistance has significant potential to improve patient outcomes. High-throughput sequencing approaches have identified cancer genes, but distinguishing driver genes from passengers remains challenging. Insertional mutagenesis screens using replication-incompetent retroviral vectors have emerged as a powerful tool to identify cancer genes. Unlike replicating retroviruses and transposons, replication-incompetent retroviral vectors lack additional mutagenesis events that can complicate the identification of driver mutations from passenger mutations. They can also be used for almost any human cancer due to the broad tropism of the vectors. Replication-incompetent retroviral vectors have the ability to dysregulate nearby cancer genes via several mechanisms including enhancer-mediated activation of gene promoters. The integrated provirus acts as a unique molecular tag for nearby candidate driver genes which can be rapidly identified using well established methods that utilize next generation sequencing and bioinformatics programs. Recently, retroviral vector screens have been used to efficiently identify candidate driver genes in prostate, breast, liver and pancreatic cancers. Validated driver genes can be potential therapeutic targets and biomarkers. In this review, we describe the emergence of retroviral insertional mutagenesis screens using replication-incompetent retroviral vectors as a novel tool to identify cancer driver genes in different cancer types.

  2. CRDB: database of chemosensory receptor gene families in vertebrate.

    Directory of Open Access Journals (Sweden)

    Dong Dong

    Full Text Available Chemosensory receptors (CR are crucial for animals to sense the environmental changes and survive on earth. The emergence of whole-genome sequences provides us an opportunity to identify the entire CR gene repertoires. To completely gain more insight into the evolution of CR genes in vertebrates, we identified the nearly all CR genes in 25 vertebrates using homology-based approaches. Among these CR gene repertoires, nearly half of them were identified for the first time in those previously uncharacterized species, such as the guinea pig, giant panda and elephant, etc. Consistent with previous findings, we found that the numbers of CR genes vary extensively among different species, suggesting an extreme form of 'birth-and-death' evolution. For the purpose of facilitating CR gene analysis, we constructed a database with the goals to provide a resource for CR genes annotation and a web tool for exploring their evolutionary patterns. Besides a search engine for the gene extraction from a specific chromosome region, an easy-to-use phylogenetic analysis tool was also provided to facilitate online phylogeny study of CR genes. Our work can provide a rigorous platform for further study on the evolution of CR genes in vertebrates.

  3. Studying the Complex Expression Dependences between Sets of Coexpressed Genes

    Directory of Open Access Journals (Sweden)

    Mario Huerta

    2014-01-01

    Full Text Available Organisms simplify the orchestration of gene expression by coregulating genes whose products function together in the cell. The use of clustering methods to obtain sets of coexpressed genes from expression arrays is very common; nevertheless there are no appropriate tools to study the expression networks among these sets of coexpressed genes. The aim of the developed tools is to allow studying the complex expression dependences that exist between sets of coexpressed genes. For this purpose, we start detecting the nonlinear expression relationships between pairs of genes, plus the coexpressed genes. Next, we form networks among sets of coexpressed genes that maintain nonlinear expression dependences between all of them. The expression relationship between the sets of coexpressed genes is defined by the expression relationship between the skeletons of these sets, where this skeleton represents the coexpressed genes with a well-defined nonlinear expression relationship with the skeleton of the other sets. As a result, we can study the nonlinear expression relationships between a target gene and other sets of coexpressed genes, or start the study from the skeleton of the sets, to study the complex relationships of activation and deactivation between the sets of coexpressed genes that carry out the different cellular processes present in the expression experiments.

  4. Gastric Cancer Associated Genes Identified by an Integrative Analysis of Gene Expression Data

    Directory of Open Access Journals (Sweden)

    Bing Jiang

    2017-01-01

    Full Text Available Gastric cancer is one of the most severe complex diseases with high morbidity and mortality in the world. The molecular mechanisms and risk factors for this disease are still not clear since the cancer heterogeneity caused by different genetic and environmental factors. With more and more expression data accumulated nowadays, we can perform integrative analysis for these data to understand the complexity of gastric cancer and to identify consensus players for the heterogeneous cancer. In the present work, we screened the published gene expression data and analyzed them with integrative tool, combined with pathway and gene ontology enrichment investigation. We identified several consensus differentially expressed genes and these genes were further confirmed with literature mining; at last, two genes, that is, immunoglobulin J chain and C-X-C motif chemokine ligand 17, were screened as novel gastric cancer associated genes. Experimental validation is proposed to further confirm this finding.

  5. The 'PUCE CAFE' Project: the first 15K coffee microarray, a new tool for discovering candidate genes correlated to agronomic and quality traits.

    Science.gov (United States)

    Privat, Isabelle; Bardil, Amélie; Gomez, Aureliano Bombarely; Severac, Dany; Dantec, Christelle; Fuentes, Ivanna; Mueller, Lukas; Joët, Thierry; Pot, David; Foucrier, Séverine; Dussert, Stéphane; Leroy, Thierry; Journot, Laurent; de Kochko, Alexandre; Campa, Claudine; Combes, Marie-Christine; Lashermes, Philippe; Bertrand, Benoit

    2011-01-05

    Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research.

  6. The 'PUCE CAFE' Project: the First 15K Coffee Microarray, a New Tool for Discovering Candidate Genes correlated to Agronomic and Quality Traits

    Directory of Open Access Journals (Sweden)

    Leroy Thierry

    2011-01-01

    Full Text Available Abstract Background Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in Coffee, opening the possibility to create new tools for functional genomics. Results The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea Canephora (Robusta. Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenoides and C. arabica. Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. Conclusion We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics. This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid, drastically enlarging its impact for high-throughput gene expression in the community of coffee research.

  7. Homology-dependent Gene Silencing in Paramecium

    Science.gov (United States)

    Ruiz, Françoise; Vayssié, Laurence; Klotz, Catherine; Sperling, Linda; Madeddu, Luisa

    1998-01-01

    Microinjection at high copy number of plasmids containing only the coding region of a gene into the Paramecium somatic macronucleus led to a marked reduction in the expression of the corresponding endogenous gene(s). The silencing effect, which is stably maintained throughout vegetative growth, has been observed for all Paramecium genes examined so far: a single-copy gene (ND7), as well as members of multigene families (centrin genes and trichocyst matrix protein genes) in which all closely related paralogous genes appeared to be affected. This phenomenon may be related to posttranscriptional gene silencing in transgenic plants and quelling in Neurospora and allows the efficient creation of specific mutant phenotypes thus providing a potentially powerful tool to study gene function in Paramecium. For the two multigene families that encode proteins that coassemble to build up complex subcellular structures the analysis presented herein provides the first experimental evidence that the members of these gene families are not functionally redundant. PMID:9529389

  8. Markerless gene knockout and integration to express heterologous biosynthetic gene clusters in Pseudomonas putida

    DEFF Research Database (Denmark)

    Choi, Kyeong Rok; Cho, Jae Sung; Cho, In Jin

    2018-01-01

    Pseudomonas putida has gained much interest among metabolic engineers as a workhorse for producing valuable natural products. While a few gene knockout tools for P. putida have been reported, integration of heterologous genes into the chromosome of P. putida, an essential strategy to develop stable...... plasmid curing systems, generating final strains free of antibiotic markers and plasmids. This markerless recombineering system for efficient gene knockout and integration will expedite metabolic engineering of P. putida, a bacterial host strain of increasing academic and industrial interest....

  9. Gene Drive for Mosquito Control: Where Did It Come from and Where Are We Headed?

    Science.gov (United States)

    Macias, Vanessa M; Ohm, Johanna R; Rasgon, Jason L

    2017-09-02

    Mosquito-borne pathogens place an enormous burden on human health. The existing toolkit is insufficient to support ongoing vector-control efforts towards meeting disease elimination and eradication goals. The perspective that genetic approaches can potentially add a significant set of tools toward mosquito control is not new, but the recent improvements in site-specific gene editing with CRISPR/Cas9 systems have enhanced our ability to both study mosquito biology using reverse genetics and produce genetics-based tools. Cas9-mediated gene-editing is an efficient and adaptable platform for gene drive strategies, which have advantages over innundative release strategies for introgressing desirable suppression and pathogen-blocking genotypes into wild mosquito populations; until recently, an effective gene drive has been largely out of reach. Many considerations will inform the effective use of new genetic tools, including gene drives. Here we review the lengthy history of genetic advances in mosquito biology and discuss both the impact of efficient site-specific gene editing on vector biology and the resulting potential to deploy new genetic tools for the abatement of mosquito-borne disease.

  10. Improved molecular tools for sugar cane biotechnology.

    Science.gov (United States)

    Kinkema, Mark; Geijskes, Jason; Delucca, Paulo; Palupe, Anthony; Shand, Kylie; Coleman, Heather D; Brinin, Anthony; Williams, Brett; Sainz, Manuel; Dale, James L

    2014-03-01

    Sugar cane is a major source of food and fuel worldwide. Biotechnology has the potential to improve economically-important traits in sugar cane as well as diversify sugar cane beyond traditional applications such as sucrose production. High levels of transgene expression are key to the success of improving crops through biotechnology. Here we describe new molecular tools that both expand and improve gene expression capabilities in sugar cane. We have identified promoters that can be used to drive high levels of gene expression in the leaf and stem of transgenic sugar cane. One of these promoters, derived from the Cestrum yellow leaf curling virus, drives levels of constitutive transgene expression that are significantly higher than those achieved by the historical benchmark maize polyubiquitin-1 (Zm-Ubi1) promoter. A second promoter, the maize phosphonenolpyruvate carboxylate promoter, was found to be a strong, leaf-preferred promoter that enables levels of expression comparable to Zm-Ubi1 in this organ. Transgene expression was increased approximately 50-fold by gene modification, which included optimising the codon usage of the coding sequence to better suit sugar cane. We also describe a novel dual transcriptional enhancer that increased gene expression from different promoters, boosting expression from Zm-Ubi1 over eightfold. These molecular tools will be extremely valuable for the improvement of sugar cane through biotechnology.

  11. GeneRecon Users' Manual — A coalescent based tool for fine-scale association mapping

    DEFF Research Database (Denmark)

    Mailund, T

    2006-01-01

    GeneRecon is a software package for linkage disequilibrium mapping using coalescent theory. It is based on Bayesian Markov-chain Monte Carlo (MCMC) method for fine-scale linkage-disequilibrium gene mapping using high-density marker maps. GeneRecon explicitly models the genealogy of a sample of th...

  12. The genomic view of genes responsive to the antagonistic phytohormones, abscisic acid, and gibberellin.

    Science.gov (United States)

    Yazaki, Junshi; Kikuchi, Shoshi

    2005-01-01

    We now have the various genomics tools for monocot (Oryza sativa) and a dicot (Arabidopsis thaliana) plant. Plant is not only a very important agricultural resource but also a model organism for biological research. It is important that the interaction between ABA and GA is investigated for controlling the transition from embryogenesis to germination in seeds using genomics tools. These studies have investigated the relationship between dormancy and germination using genomics tools. Genomics tools identified genes that had never before been annotated as ABA- or GA-responsive genes in plant, detected new interactions between genes responsive to the two hormones, comprehensively characterized cis-elements of hormone-responsive genes, and characterized cis-elements of rice and Arabidopsis. In these research, ABA- and GA-regulated genes have been classified as functional proteins (proteins that probably function in stress or PR tolerance) and regulatory proteins (protein factors involved in further regulation of signal transduction). Comparison between ABA and/or GA-responsive genes in rice and those in Arabidopsis has shown that the cis-element has specificity in each species. cis-Elements for the dehydration-stress response have been specified in Arabidopsis but not in rice. cis-Elements for protein storage are remarkably richer in the upstream regions of the rice gene than in those of Arabidopsis.

  13. Using riboswitches to regulate gene expression and define gene function in mycobacteria.

    Science.gov (United States)

    Van Vlack, Erik R; Seeliger, Jessica C

    2015-01-01

    Mycobacteria include both environmental species and many pathogenic species such as Mycobacterium tuberculosis, an intracellular pathogen that is the causative agent of tuberculosis in humans. Inducible gene expression is a powerful tool for examining gene function and essentiality, both in in vitro culture and in host cell infections. The theophylline-inducible artificial riboswitch has recently emerged as an alternative to protein repressor-based systems. The riboswitch is translationally regulated and is combined with a mycobacterial promoter that provides transcriptional control. We here provide methods used by our laboratory to characterize the riboswitch response to theophylline in reporter strains, recombinant organisms containing riboswitch-regulated endogenous genes, and in host cell infections. These protocols should facilitate the application of both existing and novel artificial riboswitches to the exploration of gene function in mycobacteria. © 2015 Elsevier Inc. All rights reserved.

  14. Protoplast isolation, transient transformation of leaf mesophyll protoplasts and improved Agrobacterium-mediated leaf disc infiltration of Phaseolus vulgaris: tools for rapid gene expression analysis.

    Science.gov (United States)

    Nanjareddy, Kalpana; Arthikala, Manoj-Kumar; Blanco, Lourdes; Arellano, Elizabeth S; Lara, Miguel

    2016-06-24

    . vulgaris tissues. We also provide a high-efficiency and amenable method for leaf mesophyll transformation for rapid gene functional characterization studies. Furthermore, a modified SAAT leaf disc infiltration approach aids in validating genes and their functions. Together, these methods help to rapidly unravel novel gene functions and are promising tools for P. vulgaris research.

  15. Brains, Genes and Primates

    Science.gov (United States)

    Belmonte, Juan Carlos Izpisua; Callaway, Edward M.; Churchland, Patricia; Caddick, Sarah J.; Feng, Guoping; Homanics, Gregg E.; Lee, Kuo-Fen; Leopold, David A.; Miller, Cory T.; Mitchell, Jude F.; Mitalipov, Shoukhrat; Moutri, Alysson R.; Movshon, J. Anthony; Okano, Hideyuki; Reynolds, John H.; Ringach, Dario; Sejnowski, Terrence J.; Silva, Afonso C.; Strick, Peter L.; Wu, Jun; Zhang, Feng

    2015-01-01

    One of the great strengths of the mouse model is the wide array of genetic tools that have been developed. Striking examples include methods for directed modification of the genome, and for regulated expression or inactivation of genes. Within neuroscience, it is now routine to express reporter genes, neuronal activity indicators and opsins in specific neuronal types in the mouse. However, there are considerable anatomical, physiological, cognitive and behavioral differences between the mouse and the human that, in some areas of inquiry, limit the degree to which insights derived from the mouse can be applied to understanding human neurobiology. Several recent advances have now brought into reach the goal of applying these tools to understanding the primate brain. Here we describe these advances, consider their potential to advance our understanding of the human brain and brain disorders, discuss bioethical considerations, and describe what will be needed to move forward. PMID:25950631

  16. The gene trap resource: a treasure trove for hemopoiesis research.

    Science.gov (United States)

    Forrai, Ariel; Robb, Lorraine

    2005-08-01

    The laboratory mouse is an invaluable tool for functional gene discovery because of its genetic malleability and a biological similarity to human systems that facilitates identification of human models of disease. A number of mutagenic technologies are being used to elucidate gene function in the mouse. Gene trapping is an insertional mutagenesis strategy that is being undertaken by multiple research groups, both academic and private, in an effort to introduce mutations across the mouse genome. Large-scale, publicly funded gene trap programs have been initiated in several countries with the International Gene Trap Consortium coordinating certain efforts and resources. We outline the methodology of mammalian gene trapping and how it can be used to identify genes expressed in both primitive and definitive blood cells and to discover hemopoietic regulator genes. Mouse mutants with hematopoietic phenotypes derived using gene trapping are described. The efforts of the large-scale gene trapping consortia have now led to the availability of libraries of mutagenized ES cell clones. The identity of the trapped locus in each of these clones can be identified by sequence-based searching via the world wide web. This resource provides an extraordinary tool for all researchers wishing to use mouse genetics to understand gene function.

  17. Site-specific selfish genes as tools for the control and genetic engineering of natural populations.

    Science.gov (United States)

    Burt, Austin

    2003-05-07

    Site-specific selfish genes exploit host functions to copy themselves into a defined target DNA sequence, and include homing endonuclease genes, group II introns and some LINE-like transposable elements. If such genes can be engineered to target new host sequences, then they can be used to manipulate natural populations, even if the number of individuals released is a small fraction of the entire population. For example, a genetic load sufficient to eradicate a population can be imposed in fewer than 20 generations, if the target is an essential host gene, the knockout is recessive and the selfish gene has an appropriate promoter. There will be selection for resistance, but several strategies are available for reducing the likelihood of it evolving. These genes may also be used to genetically engineer natural populations, by means of population-wide gene knockouts, gene replacements and genetic transformations. By targeting sex-linked loci just prior to meiosis one may skew the population sex ratio, and by changing the promoter one may limit the spread of the gene to neighbouring populations. The proposed constructs are evolutionarily stable in the face of the mutations most likely to arise during their spread, and strategies are also available for reversing the manipulations.

  18. Integration of multiple networks and pathways identifies cancer driver genes in pan-cancer analysis.

    Science.gov (United States)

    Cava, Claudia; Bertoli, Gloria; Colaprico, Antonio; Olsen, Catharina; Bontempi, Gianluca; Castiglioni, Isabella

    2018-01-06

    Modern high-throughput genomic technologies represent a comprehensive hallmark of molecular changes in pan-cancer studies. Although different cancer gene signatures have been revealed, the mechanism of tumourigenesis has yet to be completely understood. Pathways and networks are important tools to explain the role of genes in functional genomic studies. However, few methods consider the functional non-equal roles of genes in pathways and the complex gene-gene interactions in a network. We present a novel method in pan-cancer analysis that identifies de-regulated genes with a functional role by integrating pathway and network data. A pan-cancer analysis of 7158 tumour/normal samples from 16 cancer types identified 895 genes with a central role in pathways and de-regulated in cancer. Comparing our approach with 15 current tools that identify cancer driver genes, we found that 35.6% of the 895 genes identified by our method have been found as cancer driver genes with at least 2/15 tools. Finally, we applied a machine learning algorithm on 16 independent GEO cancer datasets to validate the diagnostic role of cancer driver genes for each cancer. We obtained a list of the top-ten cancer driver genes for each cancer considered in this study. Our analysis 1) confirmed that there are several known cancer driver genes in common among different types of cancer, 2) highlighted that cancer driver genes are able to regulate crucial pathways.

  19. Evolutionary signatures amongst disease genes permit novel methods for gene prioritization and construction of informative gene-based networks.

    Directory of Open Access Journals (Sweden)

    Nolan Priedigkeit

    2015-02-01

    Full Text Available Genes involved in the same function tend to have similar evolutionary histories, in that their rates of evolution covary over time. This coevolutionary signature, termed Evolutionary Rate Covariation (ERC, is calculated using only gene sequences from a set of closely related species and has demonstrated potential as a computational tool for inferring functional relationships between genes. To further define applications of ERC, we first established that roughly 55% of genetic diseases posses an ERC signature between their contributing genes. At a false discovery rate of 5% we report 40 such diseases including cancers, developmental disorders and mitochondrial diseases. Given these coevolutionary signatures between disease genes, we then assessed ERC's ability to prioritize known disease genes out of a list of unrelated candidates. We found that in the presence of an ERC signature, the true disease gene is effectively prioritized to the top 6% of candidates on average. We then apply this strategy to a melanoma-associated region on chromosome 1 and identify MCL1 as a potential causative gene. Furthermore, to gain global insight into disease mechanisms, we used ERC to predict molecular connections between 310 nominally distinct diseases. The resulting "disease map" network associates several diseases with related pathogenic mechanisms and unveils many novel relationships between clinically distinct diseases, such as between Hirschsprung's disease and melanoma. Taken together, these results demonstrate the utility of molecular evolution as a gene discovery platform and show that evolutionary signatures can be used to build informative gene-based networks.

  20. Investigating Gene Function in Cereal Rust Fungi by Plant-Mediated Virus-Induced Gene Silencing.

    Science.gov (United States)

    Panwar, Vinay; Bakkeren, Guus

    2017-01-01

    Cereal rust fungi are destructive pathogens, threatening grain production worldwide. Targeted breeding for resistance utilizing host resistance genes has been effective. However, breakdown of resistance occurs frequently and continued efforts are needed to understand how these fungi overcome resistance and to expand the range of available resistance genes. Whole genome sequencing, transcriptomic and proteomic studies followed by genome-wide computational and comparative analyses have identified large repertoire of genes in rust fungi among which are candidates predicted to code for pathogenicity and virulence factors. Some of these genes represent defence triggering avirulence effectors. However, functions of most genes still needs to be assessed to understand the biology of these obligate biotrophic pathogens. Since genetic manipulations such as gene deletion and genetic transformation are not yet feasible in rust fungi, performing functional gene studies is challenging. Recently, Host-induced gene silencing (HIGS) has emerged as a useful tool to characterize gene function in rust fungi while infecting and growing in host plants. We utilized Barley stripe mosaic virus-mediated virus induced gene silencing (BSMV-VIGS) to induce HIGS of candidate rust fungal genes in the wheat host to determine their role in plant-fungal interactions. Here, we describe the methods for using BSMV-VIGS in wheat for functional genomics study in cereal rust fungi.

  1. Genes on B chromosomes: Old questions revisited with new tools

    Czech Academy of Sciences Publication Activity Database

    Banaei-Moghaddam, A.M.; Martis, M.M.; Macas, Jiří; Gundlach, H.; Himmelbach, A.; Altschmied, L.; Mayer, K. F. X.; Houben, A.

    2015-01-01

    Roč. 1849, č. 1 (2015), s. 64-70 ISSN 1874-9399 R&D Projects: GA ČR GBP501/12/G090 Institutional support: RVO:60077344 Keywords : Gene regulation * genome evolution * junk DNA * pseudogene * transcription Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 5.373, year: 2015

  2. Gene delivery to the lungs: pulmonary gene therapy for cystic fibrosis.

    Science.gov (United States)

    Villate-Beitia, Ilia; Zarate, Jon; Puras, Gustavo; Pedraz, José Luis

    2017-07-01

    Cystic fibrosis (CF) is a monogenic autosomal recessive disorder where the defective gene, the cystic fibrosis transmembrane conductance regulator (CFTR), is well identified. Moreover, the respiratory tract can be targeted through noninvasive aerosolized formulations for inhalation. Therefore, gene therapy is considered a plausible strategy to address this disease. Conventional gene therapy strategies rely on the addition of a correct copy of the CFTR gene into affected cells in order to restore the channel activity. In recent years, genome correction strategies have emerged, such as zinc-finger nucleases, transcription activator-like effector nucleases and clustered regularly interspaced short palindromic repeats associated to Cas9 nucleases. These gene editing tools aim to repair the mutated gene at its original genomic locus with high specificity. Besides, the success of gene therapy critically depends on the nucleic acids carriers. To date, several clinical studies have been carried out to add corrected copies of the CFTR gene into target cells using viral and non-viral vectors, some of them with encouraging results. Regarding genome editing systems, preliminary in vitro studies have been performed in order to repair the CFTR gene. In this review, after briefly introducing the basis of CF, we discuss the up-to-date gene therapy strategies to address the disease. The review focuses on the main factors to take into consideration when developing gene delivery strategies, such as the design of vectors and plasmid DNA, in vitro/in vivo tests, translation to human use, administration methods, manufacturing conditions and regulatory issues.

  3. RatMap--rat genome tools and data.

    Science.gov (United States)

    Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M; Ståhl, Fredrik

    2005-01-01

    The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB-Genetics at Goteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided.

  4. RatMap—rat genome tools and data

    Science.gov (United States)

    Petersen, Greta; Johnson, Per; Andersson, Lars; Klinga-Levan, Karin; Gómez-Fabre, Pedro M.; Ståhl, Fredrik

    2005-01-01

    The rat genome database RatMap (http://ratmap.org or http://ratmap.gen.gu.se) has been one of the main resources for rat genome information since 1994. The database is maintained by CMB–Genetics at Göteborg University in Sweden and provides information on rat genes, polymorphic rat DNA-markers and rat quantitative trait loci (QTLs), all curated at RatMap. The database is under the supervision of the Rat Gene and Nomenclature Committee (RGNC); thus much attention is paid to rat gene nomenclature. RatMap presents information on rat idiograms, karyotypes and provides a unified presentation of the rat genome sequence and integrated rat linkage maps. A set of tools is also available to facilitate the identification and characterization of rat QTLs, as well as the estimation of exon/intron number and sizes in individual rat genes. Furthermore, comparative gene maps of rat in regard to mouse and human are provided. PMID:15608244

  5. Restricted gene flow and fine-scale population structuring in tool using New Caledonian crows

    Science.gov (United States)

    Rutz, C.; Ryder, T. B.; Fleischer, R. C.

    2012-04-01

    New Caledonian crows Corvus moneduloides are the most prolific avian tool users. It has been suggested that some aspects of their complex tool use behaviour are under the influence of cultural processes, involving the social transmission—and perhaps even progressive refinement—of tool designs. Using microsatellite and mt-haplotype profiling of crows from three distinct habitats (dry forest, farmland and beachside habitat), we show that New Caledonian crow populations can exhibit significant fine-scale genetic structuring. Our finding that some sites of cultural isolation of crow groups. Restricted movement of birds between local populations at such small spatial scales, especially across habitat boundaries, illustrates how specific tool designs could be preserved over time, and how tool technologies of different crow groups could diverge due to drift and local selection pressures. Young New Caledonian crows have an unusually long juvenile dependency period, during which they acquire complex tool-related foraging skills. We suggest that the resulting delayed natal dispersal drives population-divergence patterns in this species. Our work provides essential context for future studies that examine the genetic makeup of crow populations across larger geographic areas, including localities with suspected cultural differences in crow tool technologies.

  6. Transgene expression in microalgae – from tools to applications

    Directory of Open Access Journals (Sweden)

    Lior eDoron

    2016-04-01

    Full Text Available Microalgae comprise a biodiverse group of photosynthetic organisms that reside in water sources and sediments. The green microalgae Chlamydomonas reinhardtii was adopted as a useful model organism for studying various physiological systems. Its ability to grow under both photosynthetic and heterotrophic conditions allows efficient growth of non-photosynthetic mutants, making Chlamydomonas a useful genetic tool to study photosynthesis. In addition, this green alga can grow as haploid or diploid cells, similar to yeast, providing a powerful genetic system. As a result, easy and efficient transformation systems have been developed for Chlamydomonas, targeting both the chloroplast and nuclear genomes. Since microalgae comprise a rich repertoire of species that offer variable advantages for biotech and biomed industries, gene transfer technologies were further developed for many microalgae to allow for the expression of foreign proteins of interest. Expressing foreign genes in the chloroplast enables the targeting of foreign DNA to specific sites by homologous recombination. Chloroplast transformation also allows for the introduction of genes encoding several enzymes from a complex pathway, possibly as an operon. Expressing foreign proteins in the chloroplast can also be achieved by introducing the target gene into the nuclear genome, with the protein product bearing a targeting signal that directs import of the transgene-product into the chloroplast, like other endogenous chloroplast proteins. Integration of foreign genes into the nuclear genome is mostly random, resulting in large variability between different clones, such that extensive screening is required. The use of different selection modalities is also described, with special emphasis on the use of herbicides and metabolic markers which are considered to be friendly to the environment, as compared to drug-resistance genes that are commonly used. Finally, despite the development of a wide

  7. Molecular tools for carotenogenesis analysis in the zygomycete Mucor circinelloides.

    Science.gov (United States)

    Torres-Martínez, Santiago; Ruiz-Vázquez, Rosa M; Garre, Victoriano; López-García, Sergio; Navarro, Eusebio; Vila, Ana

    2012-01-01

    The carotene producer fungus Mucor circinelloides is the zygomycete more amenable to genetic manipulations by using molecular tools. Since the initial development of an effective procedure of genetic transformation, more than two decades ago, the availability of new molecular approaches such as gene replacement techniques and gene expression inactivation by RNA silencing, in addition to the sequencing of its genome, has made Mucor a valuable organism for the study of a number of processes. Here we describe in detail the main techniques and methods currently used to manipulate M. circinelloides, including transformation, gene replacement, gene silencing, RNAi, and immunoprecipitation.

  8. Systematic enrichment analysis of gene expression profiling studies identifies consensus pathways implicated in colorectal cancer development

    Directory of Open Access Journals (Sweden)

    Jesús Lascorz

    2011-01-01

    Full Text Available Background: A large number of gene expression profiling (GEP studies on colorectal carcinogenesis have been performed but no reliable gene signature has been identified so far due to the lack of reproducibility in the reported genes. There is growing evidence that functionally related genes, rather than individual genes, contribute to the etiology of complex traits. We used, as a novel approach, pathway enrichment tools to define functionally related genes that are consistently up- or down-regulated in colorectal carcinogenesis. Materials and Methods: We started the analysis with 242 unique annotated genes that had been reported by any of three recent meta-analyses covering GEP studies on genes differentially expressed in carcinoma vs normal mucosa. Most of these genes (218, 91.9% had been reported in at least three GEP studies. These 242 genes were submitted to bioinformatic analysis using a total of nine tools to detect enrichment of Gene Ontology (GO categories or Kyoto Encyclopedia of Genes and Genomes (KEGG pathways. As a final consistency criterion the pathway categories had to be enriched by several tools to be taken into consideration. Results: Our pathway-based enrichment analysis identified the categories of ribosomal protein constituents, extracellular matrix receptor interaction, carbonic anhydrase isozymes, and a general category related to inflammation and cellular response as significantly and consistently overrepresented entities. Conclusions: We triaged the genes covered by the published GEP literature on colorectal carcinogenesis and subjected them to multiple enrichment tools in order to identify the consistently enriched gene categories. These turned out to have known functional relationships to cancer development and thus deserve further investigation.

  9. Vector Production in an Academic Environment: A Tool to Assess Production Costs

    Science.gov (United States)

    Boeke, Aaron; Doumas, Patrick; Reeves, Lilith; McClurg, Kyle; Bischof, Daniela; Sego, Lina; Auberry, Alisha; Tatikonda, Mohan

    2013-01-01

    Abstract Generating gene and cell therapy products under good manufacturing practices is a complex process. When determining the cost of these products, researchers must consider the large number of supplies used for manufacturing and the personnel and facility costs to generate vector and maintain a cleanroom facility. To facilitate cost estimates, the Indiana University Vector Production Facility teamed with the Indiana University Kelley School of Business to develop a costing tool that, in turn, provides pricing. The tool is designed in Microsoft Excel and is customizable to meet the needs of other core facilities. It is available from the National Gene Vector Biorepository. The tool allows cost determinations using three different costing methods and was developed in an effort to meet the A21 circular requirements for U.S. core facilities performing work for federally funded projects. The costing tool analysis reveals that the cost of vector production does not have a linear relationship with batch size. For example, increasing the production from 9 to18 liters of a retroviral vector product increases total costs a modest 1.2-fold rather than doubling in total cost. The analysis discussed in this article will help core facilities and investigators plan a cost-effective strategy for gene and cell therapy production. PMID:23360377

  10. Angiosperm phylogeny inferred from multiple genes as a tool for comparative biology.

    Science.gov (United States)

    Soltis, P S; Soltis, D E; Chase, M W

    1999-11-25

    Comparative biology requires a firm phylogenetic foundation to uncover and understand patterns of diversification and evaluate hypotheses of the processes responsible for these patterns. In the angiosperms, studies of diversification in floral form, stamen organization, reproductive biology, photosynthetic pathway, nitrogen-fixing symbioses and life histories have relied on either explicit or implied phylogenetic trees. Furthermore, to understand the evolution of specific genes and gene families, evaluate the extent of conservation of plant genomes and make proper sense of the huge volume of molecular genetic data available for model organisms such as Arabidopsis, Antirrhinum, maize, rice and wheat, a phylogenetic perspective is necessary. Here we report the results of parsimony analyses of DNA sequences of the plastid genes rbcL and atpB and the nuclear 18S rDNA for 560 species of angiosperms and seven non-flowering seed plants and show a well-resolved and well-supported phylogenetic tree for the angiosperms for use in comparative biology.

  11. Discerning molecular interactions: A comprehensive review on biomolecular interaction databases and network analysis tools.

    Science.gov (United States)

    Miryala, Sravan Kumar; Anbarasu, Anand; Ramaiah, Sudha

    2018-02-05

    Computational analysis of biomolecular interaction networks is now gaining a lot of importance to understand the functions of novel genes/proteins. Gene interaction (GI) network analysis and protein-protein interaction (PPI) network analysis play a major role in predicting the functionality of interacting genes or proteins and gives an insight into the functional relationships and evolutionary conservation of interactions among the genes. An interaction network is a graphical representation of gene/protein interactome, where each gene/protein is a node, and interaction between gene/protein is an edge. In this review, we discuss the popular open source databases that serve as data repositories to search and collect protein/gene interaction data, and also tools available for the generation of interaction network, visualization and network analysis. Also, various network analysis approaches like topological approach and clustering approach to study the network properties and functional enrichment server which illustrates the functions and pathway of the genes and proteins has been discussed. Hence the distinctive attribute mentioned in this review is not only to provide an overview of tools and web servers for gene and protein-protein interaction (PPI) network analysis but also to extract useful and meaningful information from the interaction networks. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Genetic manipulation of longevity-related genes as a tool to regulate yeast life span and metabolite production during winemaking.

    Science.gov (United States)

    Orozco, Helena; Matallana, Emilia; Aranda, Agustín

    2013-01-02

    Yeast viability and vitality are essential for different industrial processes where the yeast Saccharomyces cerevisiae is used as a biotechnological tool. Therefore, the decline of yeast biological functions during aging may compromise their successful biotechnological use. Life span is controlled by a variety of molecular mechanisms, many of which are connected to stress tolerance and genomic stability, although the metabolic status of a cell has proven a main factor affecting its longevity. Acetic acid and ethanol accumulation shorten chronological life span (CLS), while glycerol extends it. Different age-related gene classes have been modified by deletion or overexpression to test their role in longevity and metabolism. Overexpression of histone deacetylase SIR2 extends CLS and reduces acetate production, while overexpression of SIR2 homolog HST3 shortens CLS, increases the ethanol level, and reduces acetic acid production. HST3 overexpression also enhances ethanol tolerance. Increasing tolerance to oxidative stress by superoxide dismutase SOD2 overexpression has only a moderate positive effect on CLS. CLS during grape juice fermentation has also been studied for mutants on several mRNA binding proteins that are regulators of gene expression at the posttranscriptional level; we found that NGR1 and UTH4 deletions decrease CLS, while PUF3 and PUB1 deletions increase it. Besides, the pub1Δ mutation increases glycerol production and blocks stress granule formation during grape juice fermentation. Surprisingly, factors relating to apoptosis, such as caspase Yca1 or apoptosis-inducing factor Aif1, play a positive role in yeast longevity during winemaking as their deletions shorten CLS. Manipulation of regulators of gene expression at both transcriptional (i.e., sirtuins) and posttranscriptional (i.e., mRNA binding protein Pub1) levels allows to modulate yeast life span during its biotechnological use. Due to links between aging and metabolism, it also influences the

  13. Induced mutations of rust resistance genes in wheat

    International Nuclear Information System (INIS)

    McIntosh, R.A.

    1983-01-01

    Induced mutations are being used as a tool to study genes for resistance in wheat. It was found that Pm1 can be separated from Lr20 and Sr15, but these two react like a single pleiotropic gene. Mutants were further examined in crosses and backmutations have been attempted. (author)

  14. Construction of coffee transcriptome networks based on gene annotation semantics

    Directory of Open Access Journals (Sweden)

    Castillo Luis F.

    2012-12-01

    Full Text Available Gene annotation is a process that encompasses multiple approaches on the analysis of nucleic acids or protein sequences in order to assign structural and functional characteristics to gene models. When thousands of gene models are being described in an organism genome, construction and visualization of gene networks impose novel challenges in the understanding of complex expression patterns and the generation of new knowledge in genomics research. In order to take advantage of accumulated text data after conventional gene sequence analysis, this work applied semantics in combination with visualization tools to build transcriptome networks from a set of coffee gene annotations. A set of selected coffee transcriptome sequences, chosen by the quality of the sequence comparison reported by Basic Local Alignment Search Tool (BLAST and Interproscan, were filtered out by coverage, identity, length of the query, and e-values. Meanwhile, term descriptors for molecular biology and biochemistry were obtained along the Wordnet dictionary in order to construct a Resource Description Framework (RDF using Ruby scripts and Methontology to find associations between concepts. Relationships between sequence annotations and semantic concepts were graphically represented through a total of 6845 oriented vectors, which were reduced to 745 non-redundant associations. A large gene network connecting transcripts by way of relational concepts was created where detailed connections remain to be validated for biological significance based on current biochemical and genetics frameworks. Besides reusing text information in the generation of gene connections and for data mining purposes, this tool development opens the possibility to visualize complex and abundant transcriptome data, and triggers the formulation of new hypotheses in metabolic pathways analysis.

  15. Identification of Candidate B-Lymphoma Genes by Cross-Species Gene Expression Profiling

    Science.gov (United States)

    Tompkins, Van S.; Han, Seong-Su; Olivier, Alicia; Syrbu, Sergei; Bair, Thomas; Button, Anna; Jacobus, Laura; Wang, Zebin; Lifton, Samuel; Raychaudhuri, Pradip; Morse, Herbert C.; Weiner, George; Link, Brian; Smith, Brian J.; Janz, Siegfried

    2013-01-01

    Comparative genome-wide expression profiling of malignant tumor counterparts across the human-mouse species barrier has a successful track record as a gene discovery tool in liver, breast, lung, prostate and other cancers, but has been largely neglected in studies on neoplasms of mature B-lymphocytes such as diffuse large B cell lymphoma (DLBCL) and Burkitt lymphoma (BL). We used global gene expression profiles of DLBCL-like tumors that arose spontaneously in Myc-transgenic C57BL/6 mice as a phylogenetically conserved filter for analyzing the human DLBCL transcriptome. The human and mouse lymphomas were found to have 60 concordantly deregulated genes in common, including 8 genes that Cox hazard regression analysis associated with overall survival in a published landmark dataset of DLBCL. Genetic network analysis of the 60 genes followed by biological validation studies indicate FOXM1 as a candidate DLBCL and BL gene, supporting a number of studies contending that FOXM1 is a therapeutic target in mature B cell tumors. Our findings demonstrate the value of the “mouse filter” for genomic studies of human B-lineage neoplasms for which a vast knowledge base already exists. PMID:24130802

  16. Investigating the effect of paralogs on microarray gene-set analysis

    LENUS (Irish Health Repository)

    Faure, Andre J

    2011-01-24

    Abstract Background In order to interpret the results obtained from a microarray experiment, researchers often shift focus from analysis of individual differentially expressed genes to analyses of sets of genes. These gene-set analysis (GSA) methods use previously accumulated biological knowledge to group genes into sets and then aim to rank these gene sets in a way that reflects their relative importance in the experimental situation in question. We suspect that the presence of paralogs affects the ability of GSA methods to accurately identify the most important sets of genes for subsequent research. Results We show that paralogs, which typically have high sequence identity and similar molecular functions, also exhibit high correlation in their expression patterns. We investigate this correlation as a potential confounding factor common to current GSA methods using Indygene http:\\/\\/www.cbio.uct.ac.za\\/indygene, a web tool that reduces a supplied list of genes so that it includes no pairwise paralogy relationships above a specified sequence similarity threshold. We use the tool to reanalyse previously published microarray datasets and determine the potential utility of accounting for the presence of paralogs. Conclusions The Indygene tool efficiently removes paralogy relationships from a given dataset and we found that such a reduction, performed prior to GSA, has the ability to generate significantly different results that often represent novel and plausible biological hypotheses. This was demonstrated for three different GSA approaches when applied to the reanalysis of previously published microarray datasets and suggests that the redundancy and non-independence of paralogs is an important consideration when dealing with GSA methodologies.

  17. Suppression of Arabidopsis genes by terminator-less transgene constructs

    Science.gov (United States)

    Transgene-mediated gene silencing is an important biotechnological and research tool. There are several RNAi-mediated techniques available for silencing genes in plants. The basis of all these techniques is to generate double stranded RNA precursors in the cell, which are recognized by the cellula...

  18. A systemic gene silencing method suitable for high throughput, reverse genetic analyses of gene function in fern gametophytes

    Directory of Open Access Journals (Sweden)

    Tanurdzic Milos

    2004-04-01

    Full Text Available Abstract Background Ceratopteris richardii is a useful experimental system for studying gametophyte development and sexual reproduction in plants. However, few tools for cloning mutant genes or disrupting gene function exist for this species. The feasibility of systemic gene silencing as a reverse genetics tool was examined in this study. Results Several DNA constructs targeting a Ceratopteris protoporphyrin IX magnesium chelatase (CrChlI gene that is required for chlorophyll biosynthesis were each introduced into young gametophytes by biolistic delivery. Their transient expression in individual cells resulted in a colorless cell phenotype that affected most cells of the mature gametophyte, including the meristem and gametangia. The colorless phenotype was associated with a 7-fold decrease in the abundance of the endogenous transcript. While a construct designed to promote the transient expression of a CrChlI double stranded, potentially hairpin-forming RNA was found to be the most efficient in systemically silencing the endogenous gene, a plasmid containing the CrChlI cDNA insert alone was sufficient to induce silencing. Bombarded, colorless hermaphroditic gametophytes produced colorless embryos following self-fertilization, demonstrating that the silencing signal could be transmitted through gametogenesis and fertilization. Bombardment of young gametophytes with constructs targeting the Ceratopteris filamentous temperature sensitive (CrFtsZ and uroporphyrin dehydrogenase (CrUrod genes also produced the expected mutant phenotypes. Conclusion A method that induces the systemic silencing of target genes in the Ceratopteris gametophyte is described. It provides a simple, inexpensive and rapid means to test the functions of genes involved in gametophyte development, especially those involved in cellular processes common to all plants.

  19. Cotton Leaf Curl Multan Betasatellite DNA as a Tool to Deliver and Express the Human B-Cell Lymphoma 2 (Bcl-2) Gene in Plants.

    Science.gov (United States)

    Kharazmi, Sara; Ataie Kachoie, Elham; Behjatnia, Seyed Ali Akbar

    2016-05-01

    The betasatellite DNA associated with Cotton leaf curl Multan virus (CLCuMB) contains a single complementary-sense ORF, βC1, which is a pathogenicity determinant. CLCuMB was able to replicate in plants in the presence of diverse helper geminiviruses, including Tomato leaf curl virus-Australia (TLCV-Au), Iranian isolate of Tomato yellow leaf curl virus (TYLCV-[Ab]), and Beet curly top virus (BCTV-Svr), and can be used as a plant gene delivery vector. To test the hypothesis that CLCuMB has the potential to act as an animal gene delivery vector, a specific insertion construct was produced by the introduction of a human B-cell lymphoma 2 (Bcl-2) cDNA into a mutant DNA of CLCuMB in which the βC1 was deleted (β∆C1). The recombinant βΔC1-Bcl-2 construct was successfully replicated in tomato and tobacco plants in the presence of TLCV-Au, BCTV-Svr and TYLCV-[Ab]. Real-time PCR and Western blot analyses of plants containing the replicative forms of recombinant βΔC1-Bcl-2 DNA showed that Bcl-2 gene was expressed in an acceptable level in these plants, indicating that β∆C1 can be used as a tool to deliver and express animal genes in plants. This CLCuMB-based system, having its own promoter activity, offers the possibility of production of animal recombinant proteins in plants.

  20. GENES IN SPORT AND DOPING

    Directory of Open Access Journals (Sweden)

    Andrzej Pokrywka

    2013-06-01

    Full Text Available Genes control biological processes such as muscle production of energy, mitochondria biogenesis, bone formation erythropoiesis, angiogenesis, vasodilation, neurogenesis, etc. DNA profiling for athletes reveals genetic variations that may be associated with endurance ability, muscle performance and power exercise, tendon susceptibility to injuries and psychological aptitude. Already, over 200 genes relating to physical performance have been identified by several research groups. Athletes’ genotyping is developing as a tool for the formulation of personalized training and nutritional programmes to optimize sport training as well as for the prediction of exercise-related injuries. On the other hand, development of molecular technology and gene therapy creates a risk of non-therapeutic use of cells, genes and genetic elements to improve athletic performance. Therefore, the World Anti-Doping Agency decided to include prohibition of gene doping within their World Anti-Doping Code in 2003. In this review article, we will provide a current overview of genes for use in athletes’ genotyping and gene doping possibilities, including their development and detection techniques.

  1. Characterization of the MLO gene family in Rosaceae and gene expression analysis in Malus domestica.

    Science.gov (United States)

    Pessina, Stefano; Pavan, Stefano; Catalano, Domenico; Gallotta, Alessandra; Visser, Richard G F; Bai, Yuling; Malnoy, Mickael; Schouten, Henk J

    2014-07-22

    Powdery mildew (PM) is a major fungal disease of thousands of plant species, including many cultivated Rosaceae. PM pathogenesis is associated with up-regulation of MLO genes during early stages of infection, causing down-regulation of plant defense pathways. Specific members of the MLO gene family act as PM-susceptibility genes, as their loss-of-function mutations grant durable and broad-spectrum resistance. We carried out a genome-wide characterization of the MLO gene family in apple, peach and strawberry, and we isolated apricot MLO homologs through a PCR-approach. Evolutionary relationships between MLO homologs were studied and syntenic blocks constructed. Homologs that are candidates for being PM susceptibility genes were inferred by phylogenetic relationships with functionally characterized MLO genes and, in apple, by monitoring their expression following inoculation with the PM causal pathogen Podosphaera leucotricha. Genomic tools available for Rosaceae were exploited in order to characterize the MLO gene family. Candidate MLO susceptibility genes were identified. In follow-up studies it can be investigated whether silencing or a loss-of-function mutations in one or more of these candidate genes leads to PM resistance.

  2. Transposons As Tools for Functional Genomics in Vertebrate Models.

    Science.gov (United States)

    Kawakami, Koichi; Largaespada, David A; Ivics, Zoltán

    2017-11-01

    Genetic tools and mutagenesis strategies based on transposable elements are currently under development with a vision to link primary DNA sequence information to gene functions in vertebrate models. By virtue of their inherent capacity to insert into DNA, transposons can be developed into powerful tools for chromosomal manipulations. Transposon-based forward mutagenesis screens have numerous advantages including high throughput, easy identification of mutated alleles, and providing insight into genetic networks and pathways based on phenotypes. For example, the Sleeping Beauty transposon has become highly instrumental to induce tumors in experimental animals in a tissue-specific manner with the aim of uncovering the genetic basis of diverse cancers. Here, we describe a battery of mutagenic cassettes that can be applied in conjunction with transposon vectors to mutagenize genes, and highlight versatile experimental strategies for the generation of engineered chromosomes for loss-of-function as well as gain-of-function mutagenesis for functional gene annotation in vertebrate models, including zebrafish, mice, and rats. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. The use of molecular imaging of gene expression by radiotracers in gene therapy

    International Nuclear Information System (INIS)

    Richard-Fiardo, P.; Franken, P.R.; Harrington, K.J.; Vassaux, G.; Cambien, B.

    2011-01-01

    Introduction: Progress with gene-based therapies has been hampered by difficulties in monitoring the biodistribution and kinetics of vector-mediated gene expression. Recent developments in non-invasive imaging have allowed researchers and clinicians to assess the location, magnitude and persistence of gene expression in animals and humans. Such advances should eventually lead to improvement in the efficacy and safety of current clinical protocols for future treatments. Areas Covered: The molecular imaging techniques for monitoring gene therapy in the living subject, with a specific highlight on the key reporter gene approaches that have been developed and validated in preclinical models using the latest imaging modalities. The applications of molecular imaging to biotherapy, with a particular emphasis on monitoring of gene and vector biodistribution and on image-guided radiotherapy. Expert Opinion: Among the reporter gene/probe combinations that have been described so far, one stands out, in our view, as the most versatile and easy to implement: the Na/I symporter. This strategy, exploiting more than 50 years of experience in the treatment of differentiated thyroid carcinomas, has been validated in different types of experimental cancers and with different types of oncolytic viruses and is likely to become a key tool in the implementation of human gene therapy. (authors)

  4. Retroviruses as tools to study the immune system.

    Science.gov (United States)

    Lois, C; Refaeli, Y; Qin, X F; Van Parijs, L

    2001-08-01

    Retrovirus-based vectors provide an efficient means to introduce and express genes in cells of the immune system and have become a popular tool to study immune function. They are easy to manipulate and provide stable, long-term gene expression because they integrate into the genome. Current retroviral vectors do have limitations that affect their usefulness in certain applications. However, recent advances suggest a number of ways in which these vectors might be improved to extend their utility in immunological research.

  5. Genetic manipulation in Sulfolobus islandicus and functional analysis of DNA repair genes

    DEFF Research Database (Denmark)

    Zhang, Changyi; Tian, Bin; Li, Suming

    2013-01-01

    Recently, a novel gene-deletion method was developed for the crenarchaeal model Sulfolobus islandicus, which is a suitable tool for addressing gene essentiality in depth. Using this technique, we have investigated functions of putative DNA repair genes by constructing deletion mutants and studying...

  6. Gene discovery in Triatoma infestans

    Directory of Open Access Journals (Sweden)

    de Burgos Nelia

    2011-03-01

    Full Text Available Abstract Background Triatoma infestans is the most relevant vector of Chagas disease in the southern cone of South America. Since its genome has not yet been studied, sequencing of Expressed Sequence Tags (ESTs is one of the most powerful tools for efficiently identifying large numbers of expressed genes in this insect vector. Results In this work, we generated 826 ESTs, resulting in an increase of 47% in the number of ESTs available for T. infestans. These ESTs were assembled in 471 unique sequences, 151 of which represent 136 new genes for the Reduviidae family. Conclusions Among the putative new genes for the Reduviidae family, we identified and described an interesting subset of genes involved in development and reproduction, which constitute potential targets for insecticide development.

  7. LEGO: a novel method for gene set over-representation analysis by incorporating network-based gene weights.

    Science.gov (United States)

    Dong, Xinran; Hao, Yun; Wang, Xiao; Tian, Weidong

    2016-01-11

    Pathway or gene set over-representation analysis (ORA) has become a routine task in functional genomics studies. However, currently widely used ORA tools employ statistical methods such as Fisher's exact test that reduce a pathway into a list of genes, ignoring the constitutive functional non-equivalent roles of genes and the complex gene-gene interactions. Here, we develop a novel method named LEGO (functional Link Enrichment of Gene Ontology or gene sets) that takes into consideration these two types of information by incorporating network-based gene weights in ORA analysis. In three benchmarks, LEGO achieves better performance than Fisher and three other network-based methods. To further evaluate LEGO's usefulness, we compare LEGO with five gene expression-based and three pathway topology-based methods using a benchmark of 34 disease gene expression datasets compiled by a recent publication, and show that LEGO is among the top-ranked methods in terms of both sensitivity and prioritization for detecting target KEGG pathways. In addition, we develop a cluster-and-filter approach to reduce the redundancy among the enriched gene sets, making the results more interpretable to biologists. Finally, we apply LEGO to two lists of autism genes, and identify relevant gene sets to autism that could not be found by Fisher.

  8. The Songbird Neurogenomics (SoNG Initiative: Community-based tools and strategies for study of brain gene function and evolution

    Directory of Open Access Journals (Sweden)

    Lewin Harris A

    2008-03-01

    Full Text Available Abstract Background Songbirds hold great promise for biomedical, environmental and evolutionary research. A complete draft sequence of the zebra finch genome is imminent, yet a need remains for application of genomic resources within a research community traditionally focused on ethology and neurobiological methods. In response, we developed a core set of genomic tools and a novel collaborative strategy to probe gene expression in diverse songbird species and natural contexts. Results We end-sequenced cDNAs from zebra finch brain and incorporated additional sequences from community sources into a database of 86,784 high quality reads. These assembled into 31,658 non-redundant contigs and singletons, which we annotated via BLAST search of chicken and human databases. The results are publicly available in the ESTIMA:Songbird database. We produced a spotted cDNA microarray with 20,160 addresses representing 17,214 non-redundant products of an estimated 11,500–15,000 genes, validating it by analysis of immediate-early gene (zenk gene activation following song exposure and by demonstrating effective cross hybridization to genomic DNAs of other songbird species in the Passerida Parvorder. Our assembly was also used in the design of the "Lund-zfa" Affymetrix array representing ~22,000 non-redundant sequences. When the two arrays were hybridized to cDNAs from the same set of male and female zebra finch brain samples, both arrays detected a common set of regulated transcripts with a Pearson correlation coefficient of 0.895. To stimulate use of these resources by the songbird research community and to maintain consistent technical standards, we devised a "Community Collaboration" mechanism whereby individual birdsong researchers develop experiments and provide tissues, but a single individual in the community is responsible for all RNA extractions, labelling and microarray hybridizations. Conclusion Immediately, these results set the foundation for a

  9. Digital Signal Processing and Control for the Study of Gene Networks

    Science.gov (United States)

    Shin, Yong-Jun

    2016-04-01

    Thanks to the digital revolution, digital signal processing and control has been widely used in many areas of science and engineering today. It provides practical and powerful tools to model, simulate, analyze, design, measure, and control complex and dynamic systems such as robots and aircrafts. Gene networks are also complex dynamic systems which can be studied via digital signal processing and control. Unlike conventional computational methods, this approach is capable of not only modeling but also controlling gene networks since the experimental environment is mostly digital today. The overall aim of this article is to introduce digital signal processing and control as a useful tool for the study of gene networks.

  10. Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology.

    Science.gov (United States)

    Cock, Peter J A; Grüning, Björn A; Paszkiewicz, Konrad; Pritchard, Leighton

    2013-01-01

    The Galaxy Project offers the popular web browser-based platform Galaxy for running bioinformatics tools and constructing simple workflows. Here, we present a broad collection of additional Galaxy tools for large scale analysis of gene and protein sequences. The motivating research theme is the identification of specific genes of interest in a range of non-model organisms, and our central example is the identification and prediction of "effector" proteins produced by plant pathogens in order to manipulate their host plant. This functional annotation of a pathogen's predicted capacity for virulence is a key step in translating sequence data into potential applications in plant pathology. This collection includes novel tools, and widely-used third-party tools such as NCBI BLAST+ wrapped for use within Galaxy. Individual bioinformatics software tools are typically available separately as standalone packages, or in online browser-based form. The Galaxy framework enables the user to combine these and other tools to automate organism scale analyses as workflows, without demanding familiarity with command line tools and scripting. Workflows created using Galaxy can be saved and are reusable, so may be distributed within and between research groups, facilitating the construction of a set of standardised, reusable bioinformatic protocols. The Galaxy tools and workflows described in this manuscript are open source and freely available from the Galaxy Tool Shed (http://usegalaxy.org/toolshed or http://toolshed.g2.bx.psu.edu).

  11. Analysis of gene expression profile microarray data in complex regional pain syndrome.

    Science.gov (United States)

    Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

    2017-09-01

    The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.

  12. xSyn: A Software Tool for Identifying Sophisticated 3-Way Interactions From Cancer Expression Data

    Directory of Open Access Journals (Sweden)

    Baishali Bandyopadhyay

    2017-08-01

    Full Text Available Background: Constructing gene co-expression networks from cancer expression data is important for investigating the genetic mechanisms underlying cancer. However, correlation coefficients or linear regression models are not able to model sophisticated relationships among gene expression profiles. Here, we address the 3-way interaction that 2 genes’ expression levels are clustered in different space locations under the control of a third gene’s expression levels. Results: We present xSyn, a software tool for identifying such 3-way interactions from cancer gene expression data based on an optimization procedure involving the usage of UPGMA (Unweighted Pair Group Method with Arithmetic Mean and synergy. The effectiveness is demonstrated by application to 2 real gene expression data sets. Conclusions: xSyn is a useful tool for decoding the complex relationships among gene expression profiles. xSyn is available at http://www.bdxconsult.com/xSyn.html .

  13. The BDGP gene disruption project: Single transposon insertions associated with 40 percent of Drosophila genes

    Energy Technology Data Exchange (ETDEWEB)

    Bellen, Hugo J.; Levis, Robert W.; Liao, Guochun; He, Yuchun; Carlson, Joseph W.; Tsang, Garson; Evans-Holm, Martha; Hiesinger, P. Robin; Schulze, Karen L.; Rubin, Gerald M.; Hoskins, Roger A.; Spradling, Allan C.

    2004-01-13

    The Berkeley Drosophila Genome Project (BDGP) strives to disrupt each Drosophila gene by the insertion of a single transposable element. As part of this effort, transposons in more than 30,000 fly strains were localized and analyzed relative to predicted Drosophila gene structures. Approximately 6,300 lines that maximize genomic coverage were selected to be sent to the Bloomington Stock Center for public distribution, bringing the size of the BDGP gene disruption collection to 7,140 lines. It now includes individual lines predicted to disrupt 5,362 of the 13,666 currently annotated Drosophila genes (39 percent). Other lines contain an insertion at least 2 kb from others in the collection and likely mutate additional incompletely annotated or uncharacterized genes and chromosomal regulatory elements. The remaining strains contain insertions likely to disrupt alternative gene promoters or to allow gene mis-expression. The expanded BDGP gene disruption collection provides a public resource that will facilitate the application of Drosophila genetics to diverse biological problems. Finally, the project reveals new insight into how transposons interact with a eukaryotic genome and helps define optimal strategies for using insertional mutagenesis as a genomic tool.

  14. Engaging Students in a Bioinformatics Activity to Introduce Gene Structure and Function

    Directory of Open Access Journals (Sweden)

    Barbara J. May

    2013-02-01

    Full Text Available Bioinformatics spans many fields of biological research and plays a vital role in mining and analyzing data. Therefore, there is an ever-increasing need for students to understand not only what can be learned from this data, but also how to use basic bioinformatics tools.  This activity is designed to provide secondary and undergraduate biology students to a hands-on activity meant to explore and understand gene structure with the use of basic bioinformatic tools.  Students are provided an “unknown” sequence from which they are asked to use a free online gene finder program to identify the gene. Students then predict the putative function of this gene with the use of additional online databases.

  15. PET/CT imaging of human somatostatin receptor 2 (hsstr2) as reporter gene for gene therapy

    International Nuclear Information System (INIS)

    Hofmann, M.; Gazdhar, A.; Weitzel, T.; Schmid, R.; Krause, T.

    2006-01-01

    Localized information on region-selective gene expression in small animals is widely obtained by use of reporter genes inducing light emission. Using these reporter genes for imaging deep inside the human body fluorescent probes are hindered by attenuation, scattering and possible fluorescence quenching. This can be overcome by use of radio-peptide receptors as reporter genes. Therefore, the feasibility of the somatostatin receptor 2 expression vector system for expression imaging was checked against a control vector containing luciferase gene. For in vivo transduction of vector DNA into the rat forelimb muscles the in vivo electroporation technique was chosen because of its high regio-selectivity. The gene expression was imaged by high-sensitive CCD camera (luciferase activity) and by PET/CT using a Ga-68-DOTATOC as radio peptide probe. The relative sstr2 expression was enhanced by gene transduction at maximum to a factor of 15. The PET/CT images could be fully quantified. The above demonstrated feasibility of radio-peptide PET/CT reporter gene imaging may serve in the future as a tool for full quantitative understanding of regional gene expression, especially in large animals and humans

  16. PET/CT imaging of human somatostatin receptor 2 (hsstr2) as reporter gene for gene therapy

    Energy Technology Data Exchange (ETDEWEB)

    Hofmann, M. [Molecular Imaging and Therapy Group (MIT-Bern), Clinic of Nuclear Medicine, Inselspital, Medical School Bern (Switzerland)]. E-mail: Michael.Hofmann@insel.ch; Gazdhar, A. [Division of Pulmonary Medicine, University Hospital Bern (Switzerland); Weitzel, T. [Molecular Imaging and Therapy Group (MIT-Bern), Clinic of Nuclear Medicine, Inselspital, Medical School Bern (Switzerland); Schmid, R. [Division of Thoracic Surgery, University Hospital Bern (Switzerland); Krause, T. [Molecular Imaging and Therapy Group (MIT-Bern), Clinic of Nuclear Medicine, Inselspital, Medical School Bern (Switzerland)

    2006-12-20

    Localized information on region-selective gene expression in small animals is widely obtained by use of reporter genes inducing light emission. Using these reporter genes for imaging deep inside the human body fluorescent probes are hindered by attenuation, scattering and possible fluorescence quenching. This can be overcome by use of radio-peptide receptors as reporter genes. Therefore, the feasibility of the somatostatin receptor 2 expression vector system for expression imaging was checked against a control vector containing luciferase gene. For in vivo transduction of vector DNA into the rat forelimb muscles the in vivo electroporation technique was chosen because of its high regio-selectivity. The gene expression was imaged by high-sensitive CCD camera (luciferase activity) and by PET/CT using a Ga-68-DOTATOC as radio peptide probe. The relative sstr2 expression was enhanced by gene transduction at maximum to a factor of 15. The PET/CT images could be fully quantified. The above demonstrated feasibility of radio-peptide PET/CT reporter gene imaging may serve in the future as a tool for full quantitative understanding of regional gene expression, especially in large animals and human000.

  17. BGDB: a database of bivalent genes.

    Science.gov (United States)

    Li, Qingyan; Lian, Shuabin; Dai, Zhiming; Xiang, Qian; Dai, Xianhua

    2013-01-01

    Bivalent gene is a gene marked with both H3K4me3 and H3K27me3 epigenetic modification in the same area, and is proposed to play a pivotal role related to pluripotency in embryonic stem (ES) cells. Identification of these bivalent genes and understanding their functions are important for further research of lineage specification and embryo development. So far, lots of genome-wide histone modification data were generated in mouse and human ES cells. These valuable data make it possible to identify bivalent genes, but no comprehensive data repositories or analysis tools are available for bivalent genes currently. In this work, we develop BGDB, the database of bivalent genes. The database contains 6897 bivalent genes in human and mouse ES cells, which are manually collected from scientific literature. Each entry contains curated information, including genomic context, sequences, gene ontology and other relevant information. The web services of BGDB database were implemented with PHP + MySQL + JavaScript, and provide diverse query functions. Database URL: http://dailab.sysu.edu.cn/bgdb/

  18. Quantum selfish gene (biological evolution in terms of quantum mechanics)

    OpenAIRE

    Ozhigov, Yuri I.

    2013-01-01

    I propose to treat the biological evolution of genoms by means of quantum mechanical tools. We start with the concept of meta- gene, which specifies the "selfish gene" of R.Dawkins. Meta- gene encodes the abstract living unity, which can live relatively independently of the others, and can contain a few real creatures. Each population of living creatures we treat as the wave function on meta- genes, which module squared is the total number of creatures with the given meta-gene, and the phase ...

  19. Human transporter database: comprehensive knowledge and discovery tools in the human transporter genes.

    Directory of Open Access Journals (Sweden)

    Adam Y Ye

    Full Text Available Transporters are essential in homeostatic exchange of endogenous and exogenous substances at the systematic, organic, cellular, and subcellular levels. Gene mutations of transporters are often related to pharmacogenetics traits. Recent developments in high throughput technologies on genomics, transcriptomics and proteomics allow in depth studies of transporter genes in normal cellular processes and diverse disease conditions. The flood of high throughput data have resulted in urgent need for an updated knowledgebase with curated, organized, and annotated human transporters in an easily accessible way. Using a pipeline with the combination of automated keywords query, sequence similarity search and manual curation on transporters, we collected 1,555 human non-redundant transporter genes to develop the Human Transporter Database (HTD (http://htd.cbi.pku.edu.cn. Based on the extensive annotations, global properties of the transporter genes were illustrated, such as expression patterns and polymorphisms in relationships with their ligands. We noted that the human transporters were enriched in many fundamental biological processes such as oxidative phosphorylation and cardiac muscle contraction, and significantly associated with Mendelian and complex diseases such as epilepsy and sudden infant death syndrome. Overall, HTD provides a well-organized interface to facilitate research communities to search detailed molecular and genetic information of transporters for development of personalized medicine.

  20. Contemporary molecular tools in microbial ecology and their application to advancing biotechnology

    KAUST Repository

    Rashid, Mamoon; Stingl, Ulrich

    2015-01-01

    Novel methods in microbial ecology are revolutionizing our understanding of the structure and function of microbes in the environment, but concomitant advances in applications of these tools to biotechnology are mostly lagging behind. After more than a century of efforts to improve microbial culturing techniques, about 70–80% of microbial diversity – recently called the “microbial dark matter” – remains uncultured. In early attempts to identify and sample these so far uncultured taxonomic lineages, methods that amplify and sequence ribosomal RNA genes were extensively used. Recent developments in cell separation techniques, DNA amplification, and high-throughput DNA sequencing platforms have now made the discovery of genes/genomes of uncultured microorganisms from different environments possible through the use of metagenomic techniques and single-cell genomics. When used synergistically, these metagenomic and single-cell techniques create a powerful tool to study microbial diversity. These genomics techniques have already been successfully exploited to identify sources for i) novel enzymes or natural products for biotechnology applications, ii) novel genes from extremophiles, and iii) whole genomes or operons from uncultured microbes. More can be done to utilize these tools more efficiently in biotechnology.

  1. Contemporary molecular tools in microbial ecology and their application to advancing biotechnology.

    Science.gov (United States)

    Rashid, Mamoon; Stingl, Ulrich

    2015-12-01

    Novel methods in microbial ecology are revolutionizing our understanding of the structure and function of microbes in the environment, but concomitant advances in applications of these tools to biotechnology are mostly lagging behind. After more than a century of efforts to improve microbial culturing techniques, about 70-80% of microbial diversity - recently called the "microbial dark matter" - remains uncultured. In early attempts to identify and sample these so far uncultured taxonomic lineages, methods that amplify and sequence ribosomal RNA genes were extensively used. Recent developments in cell separation techniques, DNA amplification, and high-throughput DNA sequencing platforms have now made the discovery of genes/genomes of uncultured microorganisms from different environments possible through the use of metagenomic techniques and single-cell genomics. When used synergistically, these metagenomic and single-cell techniques create a powerful tool to study microbial diversity. These genomics techniques have already been successfully exploited to identify sources for i) novel enzymes or natural products for biotechnology applications, ii) novel genes from extremophiles, and iii) whole genomes or operons from uncultured microbes. More can be done to utilize these tools more efficiently in biotechnology. Copyright © 2015 Elsevier Inc. All rights reserved.

  2. Contemporary molecular tools in microbial ecology and their application to advancing biotechnology

    KAUST Repository

    Rashid, Mamoon

    2015-09-25

    Novel methods in microbial ecology are revolutionizing our understanding of the structure and function of microbes in the environment, but concomitant advances in applications of these tools to biotechnology are mostly lagging behind. After more than a century of efforts to improve microbial culturing techniques, about 70–80% of microbial diversity – recently called the “microbial dark matter” – remains uncultured. In early attempts to identify and sample these so far uncultured taxonomic lineages, methods that amplify and sequence ribosomal RNA genes were extensively used. Recent developments in cell separation techniques, DNA amplification, and high-throughput DNA sequencing platforms have now made the discovery of genes/genomes of uncultured microorganisms from different environments possible through the use of metagenomic techniques and single-cell genomics. When used synergistically, these metagenomic and single-cell techniques create a powerful tool to study microbial diversity. These genomics techniques have already been successfully exploited to identify sources for i) novel enzymes or natural products for biotechnology applications, ii) novel genes from extremophiles, and iii) whole genomes or operons from uncultured microbes. More can be done to utilize these tools more efficiently in biotechnology.

  3. In Vivo Imaging of mdrla Gene Expression

    National Research Council Canada - National Science Library

    Synold, Timothy W

    2005-01-01

    .... With the advent of new bioimaging technology and the advancement of efficient gene targeting strategies, they found an opportunity to apply these state-of-the-art molecular tools to their problem...

  4. Selection and validation of reference genes for quantitative gene expression analyses in various tissues and seeds at different developmental stages in Bixa orellana L.

    Science.gov (United States)

    Moreira, Viviane S; Soares, Virgínia L F; Silva, Raner J S; Sousa, Aurizangela O; Otoni, Wagner C; Costa, Marcio G C

    2018-05-01

    Bixa orellana L., popularly known as annatto, produces several secondary metabolites of pharmaceutical and industrial interest, including bixin, whose molecular basis of biosynthesis remain to be determined. Gene expression analysis by quantitative real-time PCR (qPCR) is an important tool to advance such knowledge. However, correct interpretation of qPCR data requires the use of suitable reference genes in order to reduce experimental variations. In the present study, we have selected four different candidates for reference genes in B. orellana , coding for 40S ribosomal protein S9 (RPS9), histone H4 (H4), 60S ribosomal protein L38 (RPL38) and 18S ribosomal RNA (18SrRNA). Their expression stabilities in different tissues (e.g. flower buds, flowers, leaves and seeds at different developmental stages) were analyzed using five statistical tools (NormFinder, geNorm, BestKeeper, ΔCt method and RefFinder). The results indicated that RPL38 is the most stable gene in different tissues and stages of seed development and 18SrRNA is the most unstable among the analyzed genes. In order to validate the candidate reference genes, we have analyzed the relative expression of a target gene coding for carotenoid cleavage dioxygenase 1 (CCD1) using the stable RPL38 and the least stable gene, 18SrRNA , for normalization of the qPCR data. The results demonstrated significant differences in the interpretation of the CCD1 gene expression data, depending on the reference gene used, reinforcing the importance of the correct selection of reference genes for normalization.

  5. Advances in detection systems of gene and chromosome abnormalities

    International Nuclear Information System (INIS)

    Yatagai, Takeo

    2002-01-01

    This review is described from the aspect of radiation biology. For analysis at gene level, oxidative lesion of DNA like 7,8-dihydro-8-oxoguanine formation and its repair by DNA polymerase η etc in bacteria, yeast and mammalian cells are suggested to be a useful index of radiation mutation. Transgenic mice with E. coli and/or phage gene as a reporter can be a tool for gene analysis for specific organ mutation: data obtained by irradiation of X-ray, γ-ray and accelerated carbon beam to the mouse gpt delta are presented. For analysis from gene to chromosome levels, loss of heterozygosity of a specific gene is a key for analysis of chromosome aberration at the molecular level. Studies in yeast and mammalian cells are presented. The author also described data of gene mutation in TK6 cells irradiated by 2 Gy of X-ray and 10 cGy of carbon beam (135 MeV/u) generated by ring-cycrotron. Human-hamster hybrid cell is an alternative tool. Concerning significance at the individual level, the author quoted studies of irradiation of parent mice resulting in increased incidence of somatic cell mutation and of cancer in offspring. Future systems for gene mutation will be a use of transgenic mice or of markers like a specific cancer. (K.H.)

  6. Identification of key pathways and genes influencing prognosis in bladder urothelial carcinoma

    Directory of Open Access Journals (Sweden)

    Ning X

    2017-03-01

    Full Text Available Xin Ning, Yaoliang Deng Department of Urology, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Province, People’s Republic of China Background: Genomic profiling can be used to identify the predictive effect of genomic subsets for determining prognosis in bladder urothelial carcinoma (BUC after radical cystectomy. This study aimed to investigate potential gene and pathway markers associated with prognosis in BUC.Methods: A microarray dataset of BUC was obtained from The Cancer Genome Atlas database. Differentially expressed genes (DEGs were identified by DESeq of the R platform. Kaplan–Meier analysis was applied for prognostic markers. Key pathways and genes were identified using bioinformatics tools, such as gene set enrichment analysis, gene ontology, the Kyoto Encyclopedia of Genes and Genomes, gene multiple association network integration algorithm (GeneMANIA, Search Tool for the Retrieval of Interacting Genes/Proteins, and Molecular Complex Detection.Results: A comparative gene set enrichment analysis of tumor and adjacent normal tissues suggested BUC tumorigenesis resulted mainly from enrichment of cell cycle and DNA damage and repair-related biological processes and pathways, including TP53 and mitotic recombination. Two hundred and fifty-six genes were identified as potential prognosis-related DEGs. Gene ontology and Kyoto Encyclopedia of Genes and Genomes analyses showed that the potential prognosis-related DEGs were enriched in angiogenesis, including the cyclic adenosine monophosphate biosynthetic process, cyclic guanosine monophosphate-protein kinase G, mitogen-activated protein kinase, Rap1, and phosphoinositide-3-kinase-AKT signaling pathway. Nine hub genes, TAGLN, ACTA2, MYH11, CALD1, MYLK, GEM, PRELP, TPM2, and OGN, were identified from the intersection of protein–protein interaction and GeneMANIA networks. Module analysis of protein–protein interaction and GeneMANIA networks mainly showed

  7. Gene ARMADA: an integrated multi-analysis platform for microarray data implemented in MATLAB.

    Science.gov (United States)

    Chatziioannou, Aristotelis; Moulos, Panagiotis; Kolisis, Fragiskos N

    2009-10-27

    The microarray data analysis realm is ever growing through the development of various tools, open source and commercial. However there is absence of predefined rational algorithmic analysis workflows or batch standardized processing to incorporate all steps, from raw data import up to the derivation of significantly differentially expressed gene lists. This absence obfuscates the analytical procedure and obstructs the massive comparative processing of genomic microarray datasets. Moreover, the solutions provided, heavily depend on the programming skills of the user, whereas in the case of GUI embedded solutions, they do not provide direct support of various raw image analysis formats or a versatile and simultaneously flexible combination of signal processing methods. We describe here Gene ARMADA (Automated Robust MicroArray Data Analysis), a MATLAB implemented platform with a Graphical User Interface. This suite integrates all steps of microarray data analysis including automated data import, noise correction and filtering, normalization, statistical selection of differentially expressed genes, clustering, classification and annotation. In its current version, Gene ARMADA fully supports 2 coloured cDNA and Affymetrix oligonucleotide arrays, plus custom arrays for which experimental details are given in tabular form (Excel spreadsheet, comma separated values, tab-delimited text formats). It also supports the analysis of already processed results through its versatile import editor. Besides being fully automated, Gene ARMADA incorporates numerous functionalities of the Statistics and Bioinformatics Toolboxes of MATLAB. In addition, it provides numerous visualization and exploration tools plus customizable export data formats for seamless integration by other analysis tools or MATLAB, for further processing. Gene ARMADA requires MATLAB 7.4 (R2007a) or higher and is also distributed as a stand-alone application with MATLAB Component Runtime. Gene ARMADA provides a

  8. PGASO: A synthetic biology tool for engineering a cellulolytic yeast

    Directory of Open Access Journals (Sweden)

    Chang Jui-Jen

    2012-07-01

    Full Text Available Abstract Background To achieve an economical cellulosic ethanol production, a host that can do both cellulosic saccharification and ethanol fermentation is desirable. However, to engineer a non-cellulolytic yeast to be such a host requires synthetic biology techniques to transform multiple enzyme genes into its genome. Results A technique, named Promoter-based Gene Assembly and Simultaneous Overexpression (PGASO, that employs overlapping oligonucleotides for recombinatorial assembly of gene cassettes with individual promoters, was developed. PGASO was applied to engineer Kluyveromycesmarxianus KY3, which is a thermo- and toxin-tolerant yeast. We obtained a recombinant strain, called KR5, that is capable of simultaneously expressing exoglucanase and endoglucanase (both of Trichodermareesei, a beta-glucosidase (from a cow rumen fungus, a neomycin phosphotransferase, and a green fluorescent protein. High transformation efficiency and accuracy were achieved as ~63% of the transformants was confirmed to be correct. KR5 can utilize beta-glycan, cellobiose or CMC as the sole carbon source for growth and can directly convert cellobiose and beta-glycan to ethanol. Conclusions This study provides the first example of multi-gene assembly in a single step in a yeast species other than Saccharomyces cerevisiae. We successfully engineered a yeast host with a five-gene cassette assembly and the new host is capable of co-expressing three types of cellulase genes. Our study shows that PGASO is an efficient tool for simultaneous expression of multiple enzymes in the kefir yeast KY3 and that KY3 can serve as a host for developing synthetic biology tools.

  9. Genomic Islands: an overview of current software tools and future improvements

    Directory of Open Access Journals (Sweden)

    Soares Siomar de Castro

    2016-03-01

    Full Text Available Microbes are highly diverse and widely distributed organisms. They account for ~60% of Earth’s biomass and new predictions point for the existence of 1011 to 1012 species, which are constantly sharing genes through several different mechanisms. Genomic Islands (GI are critical in this context, as they are large regions acquired through horizontal gene transfer. Also, they present common features like genomic signature deviation, transposase genes, flanking tRNAs and insertion sequences. GIs carry large numbers of genes related to specific lifestyle and are commonly classified in Pathogenicity, Resistance, Metabolic or Symbiotic Islands. With the advent of the next-generation sequencing technologies and the deluge of genomic data, many software tools have been developed that aim to tackle the problem of GI prediction and they are all based on the prediction of GI common features. However, there is still room for the development of new software tools that implements new approaches, such as, machine learning and pangenomics based analyses. Finally, GIs will always hold a potential application in every newly invented genomic approach as they are directly responsible for much of the genomic plasticity of bacteria.

  10. Genomic Islands: an overview of current software tools and future improvements.

    Science.gov (United States)

    Soares, Siomar de Castro; Oliveira, Letícia de Castro; Jaiswal, Arun Kumar; Azevedo, Vasco

    2016-03-01

    Microbes are highly diverse and widely distributed organisms. They account for ~60% of Earth's biomass and new predictions point for the existence of 1011 to 1012 species, which are constantly sharing genes through several different mechanisms. Genomic Islands (GI) are critical in this context, as they are large regions acquired through horizontal gene transfer. Also, they present common features like genomic signature deviation, transposase genes, flanking tRNAs and insertion sequences. GIs carry large numbers of genes related to specific lifestyle and are commonly classified in Pathogenicity, Resistance, Metabolic or Symbiotic Islands. With the advent of the next-generation sequencing technologies and the deluge of genomic data, many software tools have been developed that aim to tackle the problem of GI prediction and they are all based on the prediction of GI common features. However, there is still room for the development of new software tools that implements new approaches, such as, machine learning and pangenomics based analyses. Finally, GIs will always hold a potential application in every newly invented genomic approach as they are directly responsible for much of the genomic plasticity of bacteria.

  11. Drug-loaded nanoparticles induce gene expression in human pluripotent stem cell derivatives

    Science.gov (United States)

    Gajbhiye, Virendra; Escalante, Leah; Chen, Guojun; Laperle, Alex; Zheng, Qifeng; Steyer, Benjamin; Gong, Shaoqin; Saha, Krishanu

    2013-12-01

    Tissue engineering and advanced manufacturing of human stem cells requires a suite of tools to control gene expression spatiotemporally in culture. Inducible gene expression systems offer cell-extrinsic control, typically through addition of small molecules, but small molecule inducers typically contain few functional groups for further chemical modification. Doxycycline (DXC), a potent small molecule inducer of tetracycline (Tet) transgene systems, was conjugated to a hyperbranched dendritic polymer (Boltorn H40) and subsequently reacted with polyethylene glycol (PEG). The resulting PEG-H40-DXC nanoparticle exhibited pH-sensitive drug release behavior and successfully controlled gene expression in stem-cell-derived fibroblasts with a Tet-On system. While free DXC inhibited fibroblast proliferation and matrix metalloproteinase (MMP) activity, PEG-H40-DXC nanoparticles maintained higher fibroblast proliferation levels and MMP activity. The results demonstrate that the PEG-H40-DXC nanoparticle system provides an effective tool to controlling gene expression in human stem cell derivatives.Tissue engineering and advanced manufacturing of human stem cells requires a suite of tools to control gene expression spatiotemporally in culture. Inducible gene expression systems offer cell-extrinsic control, typically through addition of small molecules, but small molecule inducers typically contain few functional groups for further chemical modification. Doxycycline (DXC), a potent small molecule inducer of tetracycline (Tet) transgene systems, was conjugated to a hyperbranched dendritic polymer (Boltorn H40) and subsequently reacted with polyethylene glycol (PEG). The resulting PEG-H40-DXC nanoparticle exhibited pH-sensitive drug release behavior and successfully controlled gene expression in stem-cell-derived fibroblasts with a Tet-On system. While free DXC inhibited fibroblast proliferation and matrix metalloproteinase (MMP) activity, PEG-H40-DXC nanoparticles maintained

  12. Array2BIO: from microarray expression data to functional annotation of co-regulated genes

    Directory of Open Access Journals (Sweden)

    Rasley Amy

    2006-06-01

    Full Text Available Abstract Background There are several isolated tools for partial analysis of microarray expression data. To provide an integrative, easy-to-use and automated toolkit for the analysis of Affymetrix microarray expression data we have developed Array2BIO, an application that couples several analytical methods into a single web based utility. Results Array2BIO converts raw intensities into probe expression values, automatically maps those to genes, and subsequently identifies groups of co-expressed genes using two complementary approaches: (1 comparative analysis of signal versus control and (2 clustering analysis of gene expression across different conditions. The identified genes are assigned to functional categories based on Gene Ontology classification and KEGG protein interaction pathways. Array2BIO reliably handles low-expressor genes and provides a set of statistical methods for quantifying expression levels, including Benjamini-Hochberg and Bonferroni multiple testing corrections. An automated interface with the ECR Browser provides evolutionary conservation analysis for the identified gene loci while the interconnection with Crème allows prediction of gene regulatory elements that underlie observed expression patterns. Conclusion We have developed Array2BIO – a web based tool for rapid comprehensive analysis of Affymetrix microarray expression data, which also allows users to link expression data to Dcode.org comparative genomics tools and integrates a system for translating co-expression data into mechanisms of gene co-regulation. Array2BIO is publicly available at http://array2bio.dcode.org.

  13. A Partial Least Square Approach for Modeling Gene-gene and Gene-environment Interactions When Multiple Markers Are Genotyped

    Science.gov (United States)

    Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C.

    2008-01-01

    Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense SNPs in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches: the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey’s 1-df model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women’s Health Initiative (WHI), this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with BMI. PMID:18615621

  14. A partial least-square approach for modeling gene-gene and gene-environment interactions when multiple markers are genotyped.

    Science.gov (United States)

    Wang, Tao; Ho, Gloria; Ye, Kenny; Strickler, Howard; Elston, Robert C

    2009-01-01

    Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense single nucleotype polymorphisms (SNPs) in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross-locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene-gene interactions. This approach can be similarly applied for modeling gene-environment interactions. We compare this method with other approaches, the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey's one-degree-of-freedom model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case-control data from the Women's Health Initiative, this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with body mass index.

  15. Genetic Tool Development for a New Host for Biotechnology, the Thermotolerant Bacterium Bacillus coagulans

    NARCIS (Netherlands)

    Kovacs, Akos T.; van Hartskamp, Mariska; Kuipers, Oscar P.; van Kranenburg, Richard

    Bacillus coagulans has good potential as an industrial production organism for platform chemicals from renewable resources but has limited genetic tools available. Here, we present a targeted gene disruption system using the Cre-lox system, development of a LacZ reporter assay for monitoring gene

  16. A gene network bioinformatics analysis for pemphigoid autoimmune blistering diseases.

    Science.gov (United States)

    Barone, Antonio; Toti, Paolo; Giuca, Maria Rita; Derchi, Giacomo; Covani, Ugo

    2015-07-01

    In this theoretical study, a text mining search and clustering analysis of data related to genes potentially involved in human pemphigoid autoimmune blistering diseases (PAIBD) was performed using web tools to create a gene/protein interaction network. The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database was employed to identify a final set of PAIBD-involved genes and to calculate the overall significant interactions among genes: for each gene, the weighted number of links, or WNL, was registered and a clustering procedure was performed using the WNL analysis. Genes were ranked in class (leader, B, C, D and so on, up to orphans). An ontological analysis was performed for the set of 'leader' genes. Using the above-mentioned data network, 115 genes represented the final set; leader genes numbered 7 (intercellular adhesion molecule 1 (ICAM-1), interferon gamma (IFNG), interleukin (IL)-2, IL-4, IL-6, IL-8 and tumour necrosis factor (TNF)), class B genes were 13, whereas the orphans were 24. The ontological analysis attested that the molecular action was focused on extracellular space and cell surface, whereas the activation and regulation of the immunity system was widely involved. Despite the limited knowledge of the present pathologic phenomenon, attested by the presence of 24 genes revealing no protein-protein direct or indirect interactions, the network showed significant pathways gathered in several subgroups: cellular components, molecular functions, biological processes and the pathologic phenomenon obtained from the Kyoto Encyclopaedia of Genes and Genomes (KEGG) database. The molecular basis for PAIBD was summarised and expanded, which will perhaps give researchers promising directions for the identification of new therapeutic targets.

  17. Canine candidate genes for dilated cardiomyopathy: annotation of and polymorphic markers for 14 genes.

    Science.gov (United States)

    Wiersma, Anje C; Leegwater, Peter Aj; van Oost, Bernard A; Ollier, William E; Dukes-McEwan, Joanna

    2007-10-19

    Dilated cardiomyopathy is a myocardial disease occurring in humans and domestic animals and is characterized by dilatation of the left ventricle, reduced systolic function and increased sphericity of the left ventricle. Dilated cardiomyopathy has been observed in several, mostly large and giant, dog breeds, such as the Dobermann and the Great Dane. A number of genes have been identified, which are associated with dilated cardiomyopathy in the human, mouse and hamster. These genes mainly encode structural proteins of the cardiac myocyte. We present the annotation of, and marker development for, 14 of these genes of the dog genome, i.e. alpha-cardiac actin, caveolin 1, cysteine-rich protein 3, desmin, lamin A/C, LIM-domain binding factor 3, myosin heavy polypeptide 7, phospholamban, sarcoglycan delta, titin cap, alpha-tropomyosin, troponin I, troponin T and vinculin. A total of 33 Single Nucleotide Polymorphisms were identified for these canine genes and 11 polymorphic microsatellite repeats were developed. The presented polymorphisms provide a tool to investigate the role of the corresponding genes in canine Dilated Cardiomyopathy by linkage analysis or association studies.

  18. Synthetic biology in mammalian cells: Next generation research tools and therapeutics

    Science.gov (United States)

    Lienert, Florian; Lohmueller, Jason J; Garg, Abhishek; Silver, Pamela A

    2014-01-01

    Recent progress in DNA manipulation and gene circuit engineering has greatly improved our ability to programme and probe mammalian cell behaviour. These advances have led to a new generation of synthetic biology research tools and potential therapeutic applications. Programmable DNA-binding domains and RNA regulators are leading to unprecedented control of gene expression and elucidation of gene function. Rebuilding complex biological circuits such as T cell receptor signalling in isolation from their natural context has deepened our understanding of network motifs and signalling pathways. Synthetic biology is also leading to innovative therapeutic interventions based on cell-based therapies, protein drugs, vaccines and gene therapies. PMID:24434884

  19. Computational identification of putative cytochrome P450 genes in ...

    African Journals Online (AJOL)

    In this work, a computational study of expressed sequence tags (ESTs) of soybean was performed by data mining methods and bio-informatics tools and as a result 78 putative P450 genes were identified, including 57 new ones. These genes were classified into five clans and 20 families by sequence similarities and among ...

  20. Genetic manipulation of longevity-related genes as a tool to regulate yeast life span and metabolite production during winemaking

    Directory of Open Access Journals (Sweden)

    Orozco Helena

    2013-01-01

    Full Text Available Abstract Background Yeast viability and vitality are essential for different industrial processes where the yeast Saccharomyces cerevisiae is used as a biotechnological tool. Therefore, the decline of yeast biological functions during aging may compromise their successful biotechnological use. Life span is controlled by a variety of molecular mechanisms, many of which are connected to stress tolerance and genomic stability, although the metabolic status of a cell has proven a main factor affecting its longevity. Acetic acid and ethanol accumulation shorten chronological life span (CLS, while glycerol extends it. Results Different age-related gene classes have been modified by deletion or overexpression to test their role in longevity and metabolism. Overexpression of histone deacetylase SIR2 extends CLS and reduces acetate production, while overexpression of SIR2 homolog HST3 shortens CLS, increases the ethanol level, and reduces acetic acid production. HST3 overexpression also enhances ethanol tolerance. Increasing tolerance to oxidative stress by superoxide dismutase SOD2 overexpression has only a moderate positive effect on CLS. CLS during grape juice fermentation has also been studied for mutants on several mRNA binding proteins that are regulators of gene expression at the posttranscriptional level; we found that NGR1 and UTH4 deletions decrease CLS, while PUF3 and PUB1 deletions increase it. Besides, the pub1Δ mutation increases glycerol production and blocks stress granule formation during grape juice fermentation. Surprisingly, factors relating to apoptosis, such as caspase Yca1 or apoptosis-inducing factor Aif1, play a positive role in yeast longevity during winemaking as their deletions shorten CLS. Conclusions Manipulation of regulators of gene expression at both transcriptional (i.e., sirtuins and posttranscriptional (i.e., mRNA binding protein Pub1 levels allows to modulate yeast life span during its biotechnological use. Due to

  1. Identification of suitable reference genes for gene expression studies of shoulder instability.

    Directory of Open Access Journals (Sweden)

    Mariana Ferreira Leal

    Full Text Available Shoulder instability is a common shoulder injury, and patients present with plastic deformation of the glenohumeral capsule. Gene expression analysis may be a useful tool for increasing the general understanding of capsule deformation, and reverse-transcription quantitative polymerase chain reaction (RT-qPCR has become an effective method for such studies. Although RT-qPCR is highly sensitive and specific, it requires the use of suitable reference genes for data normalization to guarantee meaningful and reproducible results. In the present study, we evaluated the suitability of a set of reference genes using samples from the glenohumeral capsules of individuals with and without shoulder instability. We analyzed the expression of six commonly used reference genes (ACTB, B2M, GAPDH, HPRT1, TBP and TFRC in the antero-inferior, antero-superior and posterior portions of the glenohumeral capsules of cases and controls. The stability of the candidate reference gene expression was determined using four software packages: NormFinder, geNorm, BestKeeper and DataAssist. Overall, HPRT1 was the best single reference gene, and HPRT1 and B2M composed the best pair of reference genes from different analysis groups, including simultaneous analysis of all tissue samples. GenEx software was used to identify the optimal number of reference genes to be used for normalization and demonstrated that the accumulated standard deviation resulting from the use of 2 reference genes was similar to that resulting from the use of 3 or more reference genes. To identify the optimal combination of reference genes, we evaluated the expression of COL1A1. Although the use of different reference gene combinations yielded variable normalized quantities, the relative quantities within sample groups were similar and confirmed that no obvious differences were observed when using 2, 3 or 4 reference genes. Consequently, the use of 2 stable reference genes for normalization, especially

  2. Visualizing conserved gene location across microbe genomes

    Science.gov (United States)

    Shaw, Chris D.

    2009-01-01

    This paper introduces an analysis-based zoomable visualization technique for displaying the location of genes across many related species of microbes. The purpose of this visualizatiuon is to enable a biologist to examine the layout of genes in the organism of interest with respect to the gene organization of related organisms. During the genomic annotation process, the ability to observe gene organization in common with previously annotated genomes can help a biologist better confirm the structure and function of newly analyzed microbe DNA sequences. We have developed a visualization and analysis tool that enables the biologist to observe and examine gene organization among genomes, in the context of the primary sequence of interest. This paper describes the visualization and analysis steps, and presents a case study using a number of Rickettsia genomes.

  3. Bioinformatics tools for quantitative and functional metagenome and metatranscriptome data analysis in microbes.

    Science.gov (United States)

    Niu, Sheng-Yong; Yang, Jinyu; McDermaid, Adam; Zhao, Jing; Kang, Yu; Ma, Qin

    2017-05-08

    Metagenomic and metatranscriptomic sequencing approaches are more frequently being used to link microbiota to important diseases and ecological changes. Many analyses have been used to compare the taxonomic and functional profiles of microbiota across habitats or individuals. While a large portion of metagenomic analyses focus on species-level profiling, some studies use strain-level metagenomic analyses to investigate the relationship between specific strains and certain circumstances. Metatranscriptomic analysis provides another important insight into activities of genes by examining gene expression levels of microbiota. Hence, combining metagenomic and metatranscriptomic analyses will help understand the activity or enrichment of a given gene set, such as drug-resistant genes among microbiome samples. Here, we summarize existing bioinformatics tools of metagenomic and metatranscriptomic data analysis, the purpose of which is to assist researchers in deciding the appropriate tools for their microbiome studies. Additionally, we propose an Integrated Meta-Function mapping pipeline to incorporate various reference databases and accelerate functional gene mapping procedures for both metagenomic and metatranscriptomic analyses. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  4. A Critical Perspective On Microarray Breast Cancer Gene Expression Profiling

    NARCIS (Netherlands)

    Sontrop, H.M.J.

    2015-01-01

    Microarrays offer biologists an exciting tool that allows the simultaneous assessment of gene expression levels for thousands of genes at once. At the time of their inception, microarrays were hailed as the new dawn in cancer biology and oncology practice with the hope that within a decade diseases

  5. DNA-mediated gene transfer into ataxia-telangiectasia cells

    International Nuclear Information System (INIS)

    Crescenzi, M.; Pulciani, S.; Carbonari, M.; Tedesco, L.; Russo, G.; Gaetano, C.; Fiorilli, M.

    1986-01-01

    The complete description of the genetic lesion(s) underlying the AT mutation might, therefore, highlight not only a DNA-repair pathwa, but also an important aspect of the physiology of lymphocytes. DNA-mediated gene transfer into eukaryotic cells has proved a powerful tool for the molecular cloning of certain mammalian genes. The possibility to clone a given gene using this technology depends, basically, on the availability of a selectable marker associated with the expression of the transfected gene in the recipient cell. Recently, a human DNA repair gene has been cloned in CHO mutant cells by taking advantage of the increased resistance to ultraviolet radiation of the transformants. As a preliminary step toward the molecular cloning of the AT gene(s), the authors have attempted to confer radioresistance to AT cells by transfection with normal human DNA

  6. BRAKER1: Unsupervised RNA-Seq-Based Genome Annotation with GeneMark-ET and AUGUSTUS.

    Science.gov (United States)

    Hoff, Katharina J; Lange, Simone; Lomsadze, Alexandre; Borodovsky, Mark; Stanke, Mario

    2016-03-01

    Gene finding in eukaryotic genomes is notoriously difficult to automate. The task is to design a work flow with a minimal set of tools that would reach state-of-the-art performance across a wide range of species. GeneMark-ET is a gene prediction tool that incorporates RNA-Seq data into unsupervised training and subsequently generates ab initio gene predictions. AUGUSTUS is a gene finder that usually requires supervised training and uses information from RNA-Seq reads in the prediction step. Complementary strengths of GeneMark-ET and AUGUSTUS provided motivation for designing a new combined tool for automatic gene prediction. We present BRAKER1, a pipeline for unsupervised RNA-Seq-based genome annotation that combines the advantages of GeneMark-ET and AUGUSTUS. As input, BRAKER1 requires a genome assembly file and a file in bam-format with spliced alignments of RNA-Seq reads to the genome. First, GeneMark-ET performs iterative training and generates initial gene structures. Second, AUGUSTUS uses predicted genes for training and then integrates RNA-Seq read information into final gene predictions. In our experiments, we observed that BRAKER1 was more accurate than MAKER2 when it is using RNA-Seq as sole source for training and prediction. BRAKER1 does not require pre-trained parameters or a separate expert-prepared training step. BRAKER1 is available for download at http://bioinf.uni-greifswald.de/bioinf/braker/ and http://exon.gatech.edu/GeneMark/ katharina.hoff@uni-greifswald.de or borodovsky@gatech.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  7. Investigation of next-generation sequencing data of Klebsiella pneumoniae using web-based tools.

    Science.gov (United States)

    Brhelova, Eva; Antonova, Mariya; Pardy, Filip; Kocmanova, Iva; Mayer, Jiri; Racil, Zdenek; Lengerova, Martina

    2017-11-01

    Rapid identification and characterization of multidrug-resistant Klebsiella pneumoniae strains is necessary due to the increasing frequency of severe infections in patients. The decreasing cost of next-generation sequencing enables us to obtain a comprehensive overview of genetic information in one step. The aim of this study is to demonstrate and evaluate the utility and scope of the application of web-based databases to next-generation sequenced (NGS) data. The whole genomes of 11 clinical Klebsiella pneumoniae isolates were sequenced using Illumina MiSeq. Selected web-based tools were used to identify a variety of genetic characteristics, such as acquired antimicrobial resistance genes, multilocus sequence types, plasmid replicons, and identify virulence factors, such as virulence genes, cps clusters, urease-nickel clusters and efflux systems. Using web-based tools hosted by the Center for Genomic Epidemiology, we detected resistance to 8 main antimicrobial groups with at least 11 acquired resistance genes. The isolates were divided into eight sequence types (ST11, 23, 37, 323, 433, 495 and 562, and a new one, ST1646). All of the isolates carried replicons of large plasmids. Capsular types, virulence factors and genes coding AcrAB and OqxAB efflux pumps were detected using BIGSdb-Kp, whereas the selected virulence genes, identified in almost all of the isolates, were detected using CLC Genomic Workbench software. Applying appropriate web-based online tools to NGS data enables the rapid extraction of comprehensive information that can be used for more efficient diagnosis and treatment of patients, while data processing is free of charge, easy and time-efficient.

  8. A Methodology for the Development of RESTful Semantic Web Services for Gene Expression Analysis.

    Directory of Open Access Journals (Sweden)

    Gabriela D A Guardia

    Full Text Available Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.

  9. A Methodology for the Development of RESTful Semantic Web Services for Gene Expression Analysis.

    Science.gov (United States)

    Guardia, Gabriela D A; Pires, Luís Ferreira; Vêncio, Ricardo Z N; Malmegrim, Kelen C R; de Farias, Cléver R G

    2015-01-01

    Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS) Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.

  10. Analysis of a Novel 17q25 Cell Cycle Gene Homolog: Is it a Breast Tumor Suppressor Gene?

    National Research Council Canada - National Science Library

    Kalikin, Linda

    2000-01-01

    ... of these molecular reagents into successful tools for the medical management of breast cancer. We hypothesize that a 350 kb region on 17q25 detected by our allelic imbalance studies harbors a novel breast tumor suppressor gene...

  11. In Vitro and In Vivo Effective Gene Delivery with Novel Liposomal Bubbles

    Science.gov (United States)

    Nishiie, Norihito; Suzuki, Ryo; Oda, Yusuke; Hirata, Keiichi; Taira, Yuichiro; Utoguchi, Naoki; Negishi, Yoichi; Maruyama, Kazuo

    2010-03-01

    Microbubbles, which were ultrasound contrast agents, could improve the transfection efficiency by cavitation with ultrasound exposure. However, conventional microbubbles had some problems regarding size and targeting ability. To solve these problems, we paid attention to liposomes that had many advantages as drug, antigen and gene delivery carriers. Because they can easily be controlled their size and added a targeting function. And we succeeded to prepare novel liposomal bubbles (Bubble liposomes) entrapping perfluoropropane which was utilized for contrast enhancement in ultrasonography. In this study, we assessed the feasibility of Bubble liposomes as gene delivery tools utilized cavitation by ultrasound exposure. In vitro gene delivery, Bubble liposomes could deliver plasmid DNA to many cell types such as tumor cells, T cell line and endothelial cells without cytotoxicity. In vivo gene delivery, Bubble liposomes could effectively deliver plasmid DNA into mouse femoral artery. This method was more effectively than conventional lipofection. We conclude that Bubble liposomes are unique and efficient gene delivery tools in vitro and in vivo.

  12. Gene Fusion Markup Language: a prototype for exchanging gene fusion data.

    Science.gov (United States)

    Kalyana-Sundaram, Shanker; Shanmugam, Achiraman; Chinnaiyan, Arul M

    2012-10-16

    An avalanche of next generation sequencing (NGS) studies has generated an unprecedented amount of genomic structural variation data. These studies have also identified many novel gene fusion candidates with more detailed resolution than previously achieved. However, in the excitement and necessity of publishing the observations from this recently developed cutting-edge technology, no community standardization approach has arisen to organize and represent the data with the essential attributes in an interchangeable manner. As transcriptome studies have been widely used for gene fusion discoveries, the current non-standard mode of data representation could potentially impede data accessibility, critical analyses, and further discoveries in the near future. Here we propose a prototype, Gene Fusion Markup Language (GFML) as an initiative to provide a standard format for organizing and representing the significant features of gene fusion data. GFML will offer the advantage of representing the data in a machine-readable format to enable data exchange, automated analysis interpretation, and independent verification. As this database-independent exchange initiative evolves it will further facilitate the formation of related databases, repositories, and analysis tools. The GFML prototype is made available at http://code.google.com/p/gfml-prototype/. The Gene Fusion Markup Language (GFML) presented here could facilitate the development of a standard format for organizing, integrating and representing the significant features of gene fusion data in an inter-operable and query-able fashion that will enable biologically intuitive access to gene fusion findings and expedite functional characterization. A similar model is envisaged for other NGS data analyses.

  13. aeGEPUCI: a database of gene expression in the dengue vector mosquito, Aedes aegypti

    Directory of Open Access Journals (Sweden)

    James Anthony A

    2010-10-01

    Full Text Available Abstract Background Aedes aegypti is the principal vector of dengue and yellow fever viruses. The availability of the sequenced and annotated genome enables genome-wide analyses of gene expression in this mosquito. The large amount of data resulting from these analyses requires efficient cataloguing before it becomes useful as the basis for new insights into gene expression patterns and studies of the underlying molecular mechanisms for generating these patterns. Findings We provide a publicly-accessible database and data-mining tool, aeGEPUCI, that integrates 1 microarray analyses of sex- and stage-specific gene expression in Ae. aegypti, 2 functional gene annotation, 3 genomic sequence data, and 4 computational sequence analysis tools. The database can be used to identify genes expressed in particular stages and patterns of interest, and to analyze putative cis-regulatory elements (CREs that may play a role in coordinating these patterns. The database is accessible from the address http://www.aegep.bio.uci.edu. Conclusions The combination of gene expression, function and sequence data coupled with integrated sequence analysis tools allows for identification of expression patterns and streamlines the development of CRE predictions and experiments to assess how patterns of expression are coordinated at the molecular level.

  14. Comparison of two next-generation sequencing kits for diagnosis of epileptic disorders with a user-friendly tool for displaying gene coverage, DeCovA

    Directory of Open Access Journals (Sweden)

    Sarra Dimassi

    2015-12-01

    Full Text Available In recent years, molecular genetics has been playing an increasing role in the diagnostic process of monogenic epilepsies. Knowing the genetic basis of one patient's epilepsy provides accurate genetic counseling and may guide therapeutic options. Genetic diagnosis of epilepsy syndromes has long been based on Sanger sequencing and search for large rearrangements using MLPA or DNA arrays (array-CGH or SNP-array. Recently, next-generation sequencing (NGS was demonstrated to be a powerful approach to overcome the wide clinical and genetic heterogeneity of epileptic disorders. Coverage is critical for assessing the quality and accuracy of results from NGS. However, it is often a difficult parameter to display in practice. The aim of the study was to compare two library-building methods (Haloplex, Agilent and SeqCap EZ, Roche for a targeted panel of 41 genes causing monogenic epileptic disorders. We included 24 patients, 20 of whom had known disease-causing mutations. For each patient both libraries were built in parallel and sequenced on an Ion Torrent Personal Genome Machine (PGM. To compare coverage and depth, we developed a simple homemade tool, named DeCovA (Depth and Coverage Analysis. DeCovA displays the sequencing depth of each base and the coverage of target genes for each genomic position. The fraction of each gene covered at different thresholds could be easily estimated. None of the two methods used, namely NextGene and Ion Reporter, were able to identify all the known mutations/CNVs displayed by the 20 patients. Variant detection rate was globally similar for the two techniques and DeCovA showed that failure to detect a mutation was mainly related to insufficient coverage.

  15. msBiodat analysis tool, big data analysis for high-throughput experiments.

    Science.gov (United States)

    Muñoz-Torres, Pau M; Rokć, Filip; Belužic, Robert; Grbeša, Ivana; Vugrek, Oliver

    2016-01-01

    Mass spectrometry (MS) are a group of a high-throughput techniques used to increase knowledge about biomolecules. They produce a large amount of data which is presented as a list of hundreds or thousands of proteins. Filtering those data efficiently is the first step for extracting biologically relevant information. The filtering may increase interest by merging previous data with the data obtained from public databases, resulting in an accurate list of proteins which meet the predetermined conditions. In this article we present msBiodat Analysis Tool, a web-based application thought to approach proteomics to the big data analysis. With this tool, researchers can easily select the most relevant information from their MS experiments using an easy-to-use web interface. An interesting feature of msBiodat analysis tool is the possibility of selecting proteins by its annotation on Gene Ontology using its Gene Id, ensembl or UniProt codes. The msBiodat analysis tool is a web-based application that allows researchers with any programming experience to deal with efficient database querying advantages. Its versatility and user-friendly interface makes easy to perform fast and accurate data screening by using complex queries. Once the analysis is finished, the result is delivered by e-mail. msBiodat analysis tool is freely available at http://msbiodata.irb.hr.

  16. Development of a software tool and criteria evaluation for efficient design of small interfering RNA

    International Nuclear Information System (INIS)

    Chaudhary, Aparna; Srivastava, Sonam; Garg, Sanjeev

    2011-01-01

    Research highlights: → The developed tool predicted siRNA constructs with better thermodynamic stability and total score based on positional and other criteria. → Off-target silencing below score 30 were observed for the best siRNA constructs for different genes. → Immunostimulation and cytotoxicity motifs considered and penalized in the developed tool. → Both positional and compositional criteria were observed to be important. -- Abstract: RNA interference can be used as a tool for gene silencing mediated by small interfering RNAs (siRNA). The critical step in effective and specific RNAi processing is the selection of suitable constructs. Major design criteria, i.e., Reynolds's design rules, thermodynamic stability, internal repeats, immunostimulatory motifs were emphasized and implemented in the siRNA design tool. The tool provides thermodynamic stability score, GC content and a total score based on other design criteria in the output. The viability of the tool was established with different datasets. In general, the siRNA constructs produced by the tool had better thermodynamic score and positional properties. Comparable thermodynamic scores and better total scores were observed with the existing tools. Moreover, the results generated had comparable off-target silencing effect. Criteria evaluations with additional criteria were achieved in WEKA.

  17. Virus-induced gene silencing (VIGS) as a reverse genetic tool to study development of symbiotic root nodules

    DEFF Research Database (Denmark)

    Kjær, Gabriela Didina Constantin; Grønlund, Mette; Stougaard, Jens

    2008-01-01

    Virus-induced gene silencing (VIGS) can provide a shortcut to plants with altered expression of specific genes. Here, we report that VIGS of the Nodule inception gene (Nin) can alter the nodulation phenotype and Nin gene expression in Pisum sativum. PsNin was chosen as target because of the disti...

  18. Emerging use of gene expression microarrays in plant physiology.

    Science.gov (United States)

    Wullschleger, Stan D; Difazio, Stephen P

    2003-01-01

    Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.

  19. Gene replacement in Penicillium roqueforti.

    Science.gov (United States)

    Goarin, Anne; Silar, Philippe; Malagnac, Fabienne

    2015-05-01

    Most cheese-making filamentous fungi lack suitable molecular tools to improve their biotechnology potential. Penicillium roqueforti, a species of high industrial importance, would benefit from functional data yielded by molecular genetic approaches. This work provides the first example of gene replacement by homologous recombination in P. roqueforti, demonstrating that knockout experiments can be performed in this fungus. To do so, we improved the existing transformation method to integrate transgenes into P. roqueforti genome. In the meantime, we cloned the PrNiaD gene, which encodes a NADPH-dependent nitrate reductase that reduces nitrate to nitrite. Then, we performed a deletion of the PrNiaD gene from P. roqueforti strain AGO. The ΔPrNiaD mutant strain is more resistant to chlorate-containing medium than the wild-type strain, but did not grow on nitrate-containing medium. Because genomic data are now available, we believe that generating selective deletions of candidate genes will be a key step to open the way for a comprehensive exploration of gene function in P. roqueforti.

  20. An Independent Filter for Gene Set Testing Based on Spectral Enrichment

    NARCIS (Netherlands)

    Frost, H Robert; Li, Zhigang; Asselbergs, Folkert W; Moore, Jason H

    2015-01-01

    Gene set testing has become an indispensable tool for the analysis of high-dimensional genomic data. An important motivation for testing gene sets, rather than individual genomic variables, is to improve statistical power by reducing the number of tested hypotheses. Given the dramatic growth in

  1. Bacterial metal resistance genes and metal bioavailability in contaminated sediments

    International Nuclear Information System (INIS)

    Roosa, Stéphanie; Wattiez, Ruddy; Prygiel, Emilie; Lesven, Ludovic; Billon, Gabriel; Gillan, David C.

    2014-01-01

    In bacteria a metal may be defined as bioavailable if it crosses the cytoplasmic membrane to reach the cytoplasm. Once inside the cell, specific metal resistance systems may be triggered. In this research, specific metal resistance genes were used to estimate metal bioavailability in sediment microbial communities. Gene levels were measured by quantitative PCR and correlated to metals in sediments using five different protocols to estimate dissolved, particle-adsorbed and occluded metals. The best correlations were obtained with czcA (a Cd/Zn/Co efflux pump) and Cd/Zn adsorbed or occluded in particles. Only adsorbed Co was correlated to czcA levels. We concluded that the measurement of czcA gene levels by quantitative PCR is a promising tool which may complement the classical approaches used to estimate Cd/Zn/Co bioavailability in sediment compartments. - Highlights: • Metal resistance genes were used to estimate metal bioavailability in sediments. • Gene levels were correlated to metals using 5 different metal extraction protocols. • CzcA gene levels determined by quantitative PCR is a promising tool for Cd/Zn/Co. - Capsule Bacterial czcA is a potential biomarker of Cd, Zn and Co bioavailability in aquatic sediments as shown by quantitative PCR and sequential metal extraction

  2. Large-scale gene function analysis with the PANTHER classification system.

    Science.gov (United States)

    Mi, Huaiyu; Muruganujan, Anushya; Casagrande, John T; Thomas, Paul D

    2013-08-01

    The PANTHER (protein annotation through evolutionary relationship) classification system (http://www.pantherdb.org/) is a comprehensive system that combines gene function, ontology, pathways and statistical analysis tools that enable biologists to analyze large-scale, genome-wide data from sequencing, proteomics or gene expression experiments. The system is built with 82 complete genomes organized into gene families and subfamilies, and their evolutionary relationships are captured in phylogenetic trees, multiple sequence alignments and statistical models (hidden Markov models or HMMs). Genes are classified according to their function in several different ways: families and subfamilies are annotated with ontology terms (Gene Ontology (GO) and PANTHER protein class), and sequences are assigned to PANTHER pathways. The PANTHER website includes a suite of tools that enable users to browse and query gene functions, and to analyze large-scale experimental data with a number of statistical tests. It is widely used by bench scientists, bioinformaticians, computer scientists and systems biologists. In the 2013 release of PANTHER (v.8.0), in addition to an update of the data content, we redesigned the website interface to improve both user experience and the system's analytical capability. This protocol provides a detailed description of how to analyze genome-wide experimental data with the PANTHER classification system.

  3. SELANSI: a toolbox for simulation of stochastic gene regulatory networks.

    Science.gov (United States)

    Pájaro, Manuel; Otero-Muras, Irene; Vázquez, Carlos; Alonso, Antonio A

    2018-03-01

    Gene regulation is inherently stochastic. In many applications concerning Systems and Synthetic Biology such as the reverse engineering and the de novo design of genetic circuits, stochastic effects (yet potentially crucial) are often neglected due to the high computational cost of stochastic simulations. With advances in these fields there is an increasing need of tools providing accurate approximations of the stochastic dynamics of gene regulatory networks (GRNs) with reduced computational effort. This work presents SELANSI (SEmi-LAgrangian SImulation of GRNs), a software toolbox for the simulation of stochastic multidimensional gene regulatory networks. SELANSI exploits intrinsic structural properties of gene regulatory networks to accurately approximate the corresponding Chemical Master Equation with a partial integral differential equation that is solved by a semi-lagrangian method with high efficiency. Networks under consideration might involve multiple genes with self and cross regulations, in which genes can be regulated by different transcription factors. Moreover, the validity of the method is not restricted to a particular type of kinetics. The tool offers total flexibility regarding network topology, kinetics and parameterization, as well as simulation options. SELANSI runs under the MATLAB environment, and is available under GPLv3 license at https://sites.google.com/view/selansi. antonio@iim.csic.es. © The Author(s) 2017. Published by Oxford University Press.

  4. Validation of suitable reference genes for quantitative gene expression analysis in Panax ginseng

    Directory of Open Access Journals (Sweden)

    Meizhen eWang

    2016-01-01

    Full Text Available Reverse transcription-qPCR (RT-qPCR has become a popular method for gene expression studies. Its results require data normalization by housekeeping genes. No single gene is proved to be stably expressed under all experimental conditions. Therefore, systematic evaluation of reference genes is necessary. With the aim to identify optimum reference genes for RT-qPCR analysis of gene expression in different tissues of Panax ginseng and the seedlings grown under heat stress, we investigated the expression stability of eight candidate reference genes, including elongation factor 1-beta (EF1-β, elongation factor 1-gamma (EF1-γ, eukaryotic translation initiation factor 3G (IF3G, eukaryotic translation initiation factor 3B (IF3B, actin (ACT, actin11 (ACT11, glyceraldehyde-3-phosphate dehydrogenase (GAPDH and cyclophilin ABH-like protein (CYC, using four widely used computational programs: geNorm, Normfinder, BestKeeper, and the comparative ΔCt method. The results were then integrated using the web-based tool RefFinder. As a result, EF1-γ, IF3G and EF1-β were the three most stable genes in different tissues of P. ginseng, while IF3G, ACT11 and GAPDH were the top three-ranked genes in seedlings treated with heat. Using three better reference genes alone or in combination as internal control, we examined the expression profiles of MAR, a multiple function-associated mRNA-like non-coding RNA (mlncRNA in P. ginseng. Taken together, we recommended EF1-γ/IF3G and IF3G/ACT11 as the suitable pair of reference genes for RT-qPCR analysis of gene expression in different tissues of P. ginseng and the seedlings grown under heat stress, respectively. The results serve as a foundation for future studies on P. ginseng functional genomics.

  5. Mining gene expression data by interpreting principal components

    Directory of Open Access Journals (Sweden)

    Mortazavi Ali

    2006-04-01

    Full Text Available Abstract Background There are many methods for analyzing microarray data that group together genes having similar patterns of expression over all conditions tested. However, in many instances the biologically important goal is to identify relatively small sets of genes that share coherent expression across only some conditions, rather than all or most conditions as required in traditional clustering; e.g. genes that are highly up-regulated and/or down-regulated similarly across only a subset of conditions. Equally important is the need to learn which conditions are the decisive ones in forming such gene sets of interest, and how they relate to diverse conditional covariates, such as disease diagnosis or prognosis. Results We present a method for automatically identifying such candidate sets of biologically relevant genes using a combination of principal components analysis and information theoretic metrics. To enable easy use of our methods, we have developed a data analysis package that facilitates visualization and subsequent data mining of the independent sources of significant variation present in gene microarray expression datasets (or in any other similarly structured high-dimensional dataset. We applied these tools to two public datasets, and highlight sets of genes most affected by specific subsets of conditions (e.g. tissues, treatments, samples, etc.. Statistically significant associations for highlighted gene sets were shown via global analysis for Gene Ontology term enrichment. Together with covariate associations, the tool provides a basis for building testable hypotheses about the biological or experimental causes of observed variation. Conclusion We provide an unsupervised data mining technique for diverse microarray expression datasets that is distinct from major methods now in routine use. In test uses, this method, based on publicly available gene annotations, appears to identify numerous sets of biologically relevant genes. It

  6. IIS--Integrated Interactome System: a web-based platform for the annotation, analysis and visualization of protein-metabolite-gene-drug interactions by integrating a variety of data sources and tools.

    Science.gov (United States)

    Carazzolle, Marcelo Falsarella; de Carvalho, Lucas Miguel; Slepicka, Hugo Henrique; Vidal, Ramon Oliveira; Pereira, Gonçalo Amarante Guimarães; Kobarg, Jörg; Meirelles, Gabriela Vaz

    2014-01-01

    High-throughput screening of physical, genetic and chemical-genetic interactions brings important perspectives in the Systems Biology field, as the analysis of these interactions provides new insights into protein/gene function, cellular metabolic variations and the validation of therapeutic targets and drug design. However, such analysis depends on a pipeline connecting different tools that can automatically integrate data from diverse sources and result in a more comprehensive dataset that can be properly interpreted. We describe here the Integrated Interactome System (IIS), an integrative platform with a web-based interface for the annotation, analysis and visualization of the interaction profiles of proteins/genes, metabolites and drugs of interest. IIS works in four connected modules: (i) Submission module, which receives raw data derived from Sanger sequencing (e.g. two-hybrid system); (ii) Search module, which enables the user to search for the processed reads to be assembled into contigs/singlets, or for lists of proteins/genes, metabolites and drugs of interest, and add them to the project; (iii) Annotation module, which assigns annotations from several databases for the contigs/singlets or lists of proteins/genes, generating tables with automatic annotation that can be manually curated; and (iv) Interactome module, which maps the contigs/singlets or the uploaded lists to entries in our integrated database, building networks that gather novel identified interactions, protein and metabolite expression/concentration levels, subcellular localization and computed topological metrics, GO biological processes and KEGG pathways enrichment. This module generates a XGMML file that can be imported into Cytoscape or be visualized directly on the web. We have developed IIS by the integration of diverse databases following the need of appropriate tools for a systematic analysis of physical, genetic and chemical-genetic interactions. IIS was validated with yeast two

  7. Pharmspresso: a text mining tool for extraction of pharmacogenomic concepts and relationships from full text.

    Science.gov (United States)

    Garten, Yael; Altman, Russ B

    2009-02-05

    Pharmacogenomics studies the relationship between genetic variation and the variation in drug response phenotypes. The field is rapidly gaining importance: it promises drugs targeted to particular subpopulations based on genetic background. The pharmacogenomics literature has expanded rapidly, but is dispersed in many journals. It is challenging, therefore, to identify important associations between drugs and molecular entities--particularly genes and gene variants, and thus these critical connections are often lost. Text mining techniques can allow us to convert the free-style text to a computable, searchable format in which pharmacogenomic concepts (such as genes, drugs, polymorphisms, and diseases) are identified, and important links between these concepts are recorded. Availability of full text articles as input into text mining engines is key, as literature abstracts often do not contain sufficient information to identify these pharmacogenomic associations. Thus, building on a tool called Textpresso, we have created the Pharmspresso tool to assist in identifying important pharmacogenomic facts in full text articles. Pharmspresso parses text to find references to human genes, polymorphisms, drugs and diseases and their relationships. It presents these as a series of marked-up text fragments, in which key concepts are visually highlighted. To evaluate Pharmspresso, we used a gold standard of 45 human-curated articles. Pharmspresso identified 78%, 61%, and 74% of target gene, polymorphism, and drug concepts, respectively. Pharmspresso is a text analysis tool that extracts pharmacogenomic concepts from the literature automatically and thus captures our current understanding of gene-drug interactions in a computable form. We have made Pharmspresso available at http://pharmspresso.stanford.edu.

  8. The Medicago truncatula gene expression atlas web server

    Directory of Open Access Journals (Sweden)

    Tang Yuhong

    2009-12-01

    Full Text Available Abstract Background Legumes (Leguminosae or Fabaceae play a major role in agriculture. Transcriptomics studies in the model legume species, Medicago truncatula, are instrumental in helping to formulate hypotheses about the role of legume genes. With the rapid growth of publically available Affymetrix GeneChip Medicago Genome Array GeneChip data from a great range of tissues, cell types, growth conditions, and stress treatments, the legume research community desires an effective bioinformatics system to aid efforts to interpret the Medicago genome through functional genomics. We developed the Medicago truncatula Gene Expression Atlas (MtGEA web server for this purpose. Description The Medicago truncatula Gene Expression Atlas (MtGEA web server is a centralized platform for analyzing the Medicago transcriptome. Currently, the web server hosts gene expression data from 156 Affymetrix GeneChip® Medicago genome arrays in 64 different experiments, covering a broad range of developmental and environmental conditions. The server enables flexible, multifaceted analyses of transcript data and provides a range of additional information about genes, including different types of annotation and links to the genome sequence, which help users formulate hypotheses about gene function. Transcript data can be accessed using Affymetrix probe identification number, DNA sequence, gene name, functional description in natural language, GO and KEGG annotation terms, and InterPro domain number. Transcripts can also be discovered through co-expression or differential expression analysis. Flexible tools to select a subset of experiments and to visualize and compare expression profiles of multiple genes have been implemented. Data can be downloaded, in part or full, in a tabular form compatible with common analytical and visualization software. The web server will be updated on a regular basis to incorporate new gene expression data and genome annotation, and is accessible

  9. Gene Therapy for the Inner Ear: Challenges and Promises

    OpenAIRE

    Ryan, Allen F.; Dazert, Stefan

    2009-01-01

    Since the recognition of genes as the discrete units of heritability, and of DNA as their molecular substrate, the utilization of genes for therapeutic purposes has been recognized as a potential means of correcting genetic disorders. The tools of molecular biology, which allow the manipulation of DNA sequence, provided the means to put this concept into practice. However, progress in the implementation of these ideas has been slow. Here we review the history of the idea of gene therapy and t...

  10. A Tool for Evaluating Strategies for Grouping of Biological Data

    OpenAIRE

    Jakoniene, Vaida; Lambrix, Patrick

    2007-01-01

    During the last decade an enormous amount of biological data has been generated and techniques and tools to analyze this data have been developed. Many of these tools use some form of grouping and are used in, for instance, data integration, data cleaning, prediction of protein functionality, and correlation of genes based on microarray data. A number of aspects influence the quality of the grouping results: the data sources, the grouping attributes and the algorithms implementing the groupin...

  11. Building ProteomeTools based on a complete synthetic human proteome

    Science.gov (United States)

    Zolg, Daniel P.; Wilhelm, Mathias; Schnatbaum, Karsten; Zerweck, Johannes; Knaute, Tobias; Delanghe, Bernard; Bailey, Derek J.; Gessulat, Siegfried; Ehrlich, Hans-Christian; Weininger, Maximilian; Yu, Peng; Schlegl, Judith; Kramer, Karl; Schmidt, Tobias; Kusebauch, Ulrike; Deutsch, Eric W.; Aebersold, Ruedi; Moritz, Robert L.; Wenschuh, Holger; Moehring, Thomas; Aiche, Stephan; Huhmer, Andreas; Reimer, Ulf; Kuster, Bernhard

    2018-01-01

    The ProteomeTools project builds molecular and digital tools from the human proteome to facilitate biomedical and life science research. Here, we report the generation and multimodal LC-MS/MS analysis of >330,000 synthetic tryptic peptides representing essentially all canonical human gene products and exemplify the utility of this data. The resource will be extended to >1 million peptides and all data will be shared with the community via ProteomicsDB and proteomeXchange. PMID:28135259

  12. A multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors for functional gene analysis.

    Science.gov (United States)

    Weber, Kristoffer; Bartsch, Udo; Stocking, Carol; Fehse, Boris

    2008-04-01

    Functional gene analysis requires the possibility of overexpression, as well as downregulation of one, or ideally several, potentially interacting genes. Lentiviral vectors are well suited for this purpose as they ensure stable expression of complementary DNAs (cDNAs), as well as short-hairpin RNAs (shRNAs), and can efficiently transduce a wide spectrum of cell targets when packaged within the coat proteins of other viruses. Here we introduce a multicolor panel of novel lentiviral "gene ontology" (LeGO) vectors designed according to the "building blocks" principle. Using a wide spectrum of different fluorescent markers, including drug-selectable enhanced green fluorescent protein (eGFP)- and dTomato-blasticidin-S resistance fusion proteins, LeGO vectors allow simultaneous analysis of multiple genes and shRNAs of interest within single, easily identifiable cells. Furthermore, each functional module is flanked by unique cloning sites, ensuring flexibility and individual optimization. The efficacy of these vectors for analyzing multiple genes in a single cell was demonstrated in several different cell types, including hematopoietic, endothelial, and neural stem and progenitor cells, as well as hepatocytes. LeGO vectors thus represent a valuable tool for investigating gene networks using conditional ectopic expression and knock-down approaches simultaneously.

  13. Dendrimers as Potential Therapeutic Tools in HIV Inhibition

    Directory of Open Access Journals (Sweden)

    Xiangbo Li

    2013-07-01

    Full Text Available The present treatments for HIV transfection include chemical agents and gene therapies. Although many chemical drugs, peptides and genes have been developed for HIV inhibition, a variety of non-ignorable drawbacks limited the efficiency of these materials. In this review, we discuss the application of dendrimers as both therapeutic agents and non-viral vectors of chemical agents and genes for HIV treatment. On the one hand, dendrimers with functional end groups combine with the gp120 of HIV and CD4 molecule of host cell to suppress the attachment of HIV to the host cell. Some of the dendrimers are capable of intruding into the cell and interfere with the later stages of HIV replication as well. On the other hand, dendrimers are also able to transfer chemical drugs and genes into the host cells, which conspicuously increase the anti-HIV activity of these materials. Dendrimers as therapeutic tools provide a potential treatment for HIV infection.

  14. Positional RNA-Seq identifies candidate genes for phenotypic engineering of sexual traits

    NARCIS (Netherlands)

    Arbore, Roberto; Sekii, Kiyono; Beisel, Christian; Ladurner, Peter; Berezikov, Eugene; Schaerer, Lukas

    2015-01-01

    Introduction: RNA interference (RNAi) of trait-specific genes permits the manipulation of specific phenotypic traits ("phenotypic engineering") and thus represents a powerful tool to test trait function in evolutionary studies. The identification of suitable candidate genes, however, often relies on

  15. Serial Expression Analysis: a web tool for the analysis of serial gene expression data

    Science.gov (United States)

    Nueda, Maria José; Carbonell, José; Medina, Ignacio; Dopazo, Joaquín; Conesa, Ana

    2010-01-01

    Serial transcriptomics experiments investigate the dynamics of gene expression changes associated with a quantitative variable such as time or dosage. The statistical analysis of these data implies the study of global and gene-specific expression trends, the identification of significant serial changes, the comparison of expression profiles and the assessment of transcriptional changes in terms of cellular processes. We have created the SEA (Serial Expression Analysis) suite to provide a complete web-based resource for the analysis of serial transcriptomics data. SEA offers five different algorithms based on univariate, multivariate and functional profiling strategies framed within a user-friendly interface and a project-oriented architecture to facilitate the analysis of serial gene expression data sets from different perspectives. SEA is available at sea.bioinfo.cipf.es. PMID:20525784

  16. Development and Application of Camelid Molecular Cytogenetic Tools

    Science.gov (United States)

    Avila, Felipe; Das, Pranab J.; Kutzler, Michelle; Owens, Elaine; Perelman, Polina; Rubes, Jiri; Hornak, Miroslav; Johnson, Warren E.

    2014-01-01

    Cytogenetic chromosome maps offer molecular tools for genome analysis and clinical cytogenetics and are of particular importance for species with difficult karyotypes, such as camelids (2n = 74). Building on the available human–camel zoo-fluorescence in situ hybridization (FISH) data, we developed the first cytogenetic map for the alpaca (Lama pacos, LPA) genome by isolating and identifying 151 alpaca bacterial artificial chromosome (BAC) clones corresponding to 44 specific genes. The genes were mapped by FISH to 31 alpaca autosomes and the sex chromosomes; 11 chromosomes had 2 markers, which were ordered by dual-color FISH. The STS gene mapped to Xpter/Ypter, demarcating the pseudoautosomal region, whereas no markers were assigned to chromosomes 14, 21, 22, 28, and 36. The chromosome-specific markers were applied in clinical cytogenetics to identify LPA20, the major histocompatibility complex (MHC)-carrying chromosome, as a part of an autosomal translocation in a sterile male llama (Lama glama, LGL; 2n = 73,XY). FISH with LPAX BACs and LPA36 paints, as well as comparative genomic hybridization, were also used to investigate the origin of the minute chromosome, an abnormally small LPA36 in infertile female alpacas. This collection of cytogenetically mapped markers represents a new tool for camelid clinical cytogenetics and has applications for the improvement of the alpaca genome map and sequence assembly. PMID:23109720

  17. Association of candidate genes with drought tolerance traits in diverse perennial ryegrass accessions

    Science.gov (United States)

    Xiaoqing Yu; Guihua Bai; Shuwei Liu; Na Luo; Ying Wang; Douglas S. Richmond; Paula M. Pijut; Scott A. Jackson; Jianming Yu; Yiwei. Jiang

    2013-01-01

    Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought tolerance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted in 192 diverse...

  18. Mitochondrial tRNA gene translocations in highly eusocial bees

    Directory of Open Access Journals (Sweden)

    Daniela Silvestre

    2006-01-01

    Full Text Available Mitochondrial gene rearrangement events, especially involving tRNA genes, have been described more frequently as more complete mitochondrial genome sequences are becoming available. In the present work, we analyzed mitochondrial tRNA gene rearrangements between two bee species belonging to the tribes Apini and Meliponini within the "corbiculate Apidae". Eleven tRNA genes are in different genome positions or strands. The molecular events responsible for each translocation are explained. Considering the high number of rearrangements observed, the data presented here contradict the general rule of high gene order conservation among closely related organisms, and also represent a powerful molecular tool to help solve questions about phylogeny and evolution in bees.

  19. Exploring Valid Reference Genes for Quantitative Real-time PCR Analysis in Plutella xylostella (Lepidoptera: Plutellidae)

    Science.gov (United States)

    Fu, Wei; Xie, Wen; Zhang, Zhuo; Wang, Shaoli; Wu, Qingjun; Liu, Yong; Zhou, Xiaomao; Zhou, Xuguo; Zhang, Youjun

    2013-01-01

    Abstract: Quantitative real-time PCR (qRT-PCR), a primary tool in gene expression analysis, requires an appropriate normalization strategy to control for variation among samples. The best option is to compare the mRNA level of a target gene with that of reference gene(s) whose expression level is stable across various experimental conditions. In this study, expression profiles of eight candidate reference genes from the diamondback moth, Plutella xylostella, were evaluated under diverse experimental conditions. RefFinder, a web-based analysis tool, integrates four major computational programs including geNorm, Normfinder, BestKeeper, and the comparative ΔCt method to comprehensively rank the tested candidate genes. Elongation factor 1 (EF1) was the most suited reference gene for the biotic factors (development stage, tissue, and strain). In contrast, although appropriate reference gene(s) do exist for several abiotic factors (temperature, photoperiod, insecticide, and mechanical injury), we were not able to identify a single universal reference gene. Nevertheless, a suite of candidate reference genes were specifically recommended for selected experimental conditions. Our finding is the first step toward establishing a standardized qRT-PCR analysis of this agriculturally important insect pest. PMID:23983612

  20. Genome-wide target profiling of piggyBac and Tol2 in HEK 293: pros and cons for gene discovery and gene therapy

    Science.gov (United States)

    2011-01-01

    Background DNA transposons have emerged as indispensible tools for manipulating vertebrate genomes with applications ranging from insertional mutagenesis and transgenesis to gene therapy. To fully explore the potential of two highly active DNA transposons, piggyBac and Tol2, as mammalian genetic tools, we have conducted a side-by-side comparison of the two transposon systems in the same setting to evaluate their advantages and disadvantages for use in gene therapy and gene discovery. Results We have observed that (1) the Tol2 transposase (but not piggyBac) is highly sensitive to molecular engineering; (2) the piggyBac donor with only the 40 bp 3'-and 67 bp 5'-terminal repeat domain is sufficient for effective transposition; and (3) a small amount of piggyBac transposases results in robust transposition suggesting the piggyBac transpospase is highly active. Performing genome-wide target profiling on data sets obtained by retrieving chromosomal targeting sequences from individual clones, we have identified several piggyBac and Tol2 hotspots and observed that (4) piggyBac and Tol2 display a clear difference in targeting preferences in the human genome. Finally, we have observed that (5) only sites with a particular sequence context can be targeted by either piggyBac or Tol2. Conclusions The non-overlapping targeting preference of piggyBac and Tol2 makes them complementary research tools for manipulating mammalian genomes. PiggyBac is the most promising transposon-based vector system for achieving site-specific targeting of therapeutic genes due to the flexibility of its transposase for being molecularly engineered. Insights from this study will provide a basis for engineering piggyBac transposases to achieve site-specific therapeutic gene targeting. PMID:21447194

  1. The Hematopoietic Expression Viewer: expanding mobile apps as a scientific tool.

    Science.gov (United States)

    James, Regis A; Rao, Mitchell M; Chen, Edward S; Goodell, Margaret A; Shaw, Chad A

    2012-07-15

    Many important data in current biological science comprise hundreds, thousands or more individual results. These massive data require computational tools to navigate results and effectively interact with the content. Mobile device apps are an increasingly important tool in the everyday lives of scientists and non-scientists alike. These software present individuals with compact and efficient tools to interact with complex data at meetings or other locations remote from their main computing environment. We believe that apps will be important tools for biologists, geneticists and physicians to review content while participating in biomedical research or practicing medicine. We have developed a prototype app for displaying gene expression data using the iOS platform. To present the software engineering requirements, we review the model-view-controller schema for Apple's iOS. We apply this schema to a simple app for querying locally developed microarray gene expression data. The challenge of this application is to balance between storing content locally within the app versus obtaining it dynamically via a network connection. The Hematopoietic Expression Viewer is available at http://www.shawlab.org/he_viewer. The source code for this project and any future information on how to obtain the app can be accessed at http://www.shawlab.org/he_viewer.

  2. Spatial gene expression quantification: a tool for analysis of in situ hybridizations in sea anemone Nematostella vectensis

    Directory of Open Access Journals (Sweden)

    Botman Daniel

    2012-10-01

    Full Text Available Abstract Background Spatial gene expression quantification is required for modeling gene regulation in developing organisms. The fruit fly Drosophila melanogaster is the model system most widely applied for spatial gene expression analysis due to its unique embryonic properties: the shape does not change significantly during its early cleavage cycles and most genes are differentially expressed along a straight axis. This system of development is quite exceptional in the animal kingdom. In the sea anemone Nematostella vectensis the embryo changes its shape during early development; there are cell divisions and cell movement, like in most other metazoans. Nematostella is an attractive case study for spatial gene expression since its transparent body wall makes it accessible to various imaging techniques. Findings Our new quantification method produces standardized gene expression profiles from raw or annotated Nematostella in situ hybridizations by measuring the expression intensity along its cell layer. The procedure is based on digital morphologies derived from high-resolution fluorescence pictures. Additionally, complete descriptions of nonsymmetric expression patterns have been constructed by transforming the gene expression images into a three-dimensional representation. Conclusions We created a standard format for gene expression data, which enables quantitative analysis of in situ hybridizations from embryos with various shapes in different developmental stages. The obtained expression profiles are suitable as input for optimization of gene regulatory network models, and for correlation analysis of genes from dissimilar Nematostella morphologies. This approach is potentially applicable to many other metazoan model organisms and may also be suitable for processing data from three-dimensional imaging techniques.

  3. Human microRNA target analysis and gene ontology clustering by GOmir, a novel stand-alone application.

    Science.gov (United States)

    Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia

    2009-06-16

    microRNAs (miRNAs) are single-stranded RNA molecules of about 20-23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA.

  4. Canine candidate genes for dilated cardiomyopathy: annotation of and polymorphic markers for 14 genes

    Directory of Open Access Journals (Sweden)

    van Oost Bernard A

    2007-10-01

    Full Text Available Abstract Background Dilated cardiomyopathy is a myocardial disease occurring in humans and domestic animals and is characterized by dilatation of the left ventricle, reduced systolic function and increased sphericity of the left ventricle. Dilated cardiomyopathy has been observed in several, mostly large and giant, dog breeds, such as the Dobermann and the Great Dane. A number of genes have been identified, which are associated with dilated cardiomyopathy in the human, mouse and hamster. These genes mainly encode structural proteins of the cardiac myocyte. Results We present the annotation of, and marker development for, 14 of these genes of the dog genome, i.e. α-cardiac actin, caveolin 1, cysteine-rich protein 3, desmin, lamin A/C, LIM-domain binding factor 3, myosin heavy polypeptide 7, phospholamban, sarcoglycan δ, titin cap, α-tropomyosin, troponin I, troponin T and vinculin. A total of 33 Single Nucleotide Polymorphisms were identified for these canine genes and 11 polymorphic microsatellite repeats were developed. Conclusion The presented polymorphisms provide a tool to investigate the role of the corresponding genes in canine Dilated Cardiomyopathy by linkage analysis or association studies.

  5. A TAD further: exogenous control of gene activation.

    Science.gov (United States)

    Mapp, Anna K; Ansari, Aseem Z

    2007-01-23

    Designer molecules that can be used to impose exogenous control on gene transcription, artificial transcription factors (ATFs), are highly desirable as mechanistic probes of gene regulation, as potential therapeutic agents, and as components of cell-based devices. Recently, several advances have been made in the design of ATFs that activate gene transcription (activator ATFs), including reports of small-molecule-based systems and ATFs that exhibit potent activity. However, the many open mechanistic questions about transcriptional activators, in particular, the structure and function of the transcriptional activation domain (TAD), have hindered rapid development of synthetic ATFs. A compelling need thus exists for chemical tools and insights toward a more detailed portrait of the dynamic process of gene activation.

  6. A new measure for functional similarity of gene products based on Gene Ontology

    Directory of Open Access Journals (Sweden)

    Lengauer Thomas

    2006-06-01

    Full Text Available Abstract Background Gene Ontology (GO is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role. Results We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures; simRel and funSim. One measure (simRel is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families. Conclusion The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.

  7. Comparative GO: a web application for comparative gene ontology and gene ontology-based gene selection in bacteria.

    Directory of Open Access Journals (Sweden)

    Mario Fruzangohar

    Full Text Available The primary means of classifying new functions for genes and proteins relies on Gene Ontology (GO, which defines genes/proteins using a controlled vocabulary in terms of their Molecular Function, Biological Process and Cellular Component. The challenge is to present this information to researchers to compare and discover patterns in multiple datasets using visually comprehensible and user-friendly statistical reports. Importantly, while there are many GO resources available for eukaryotes, there are none suitable for simultaneous, graphical and statistical comparison between multiple datasets. In addition, none of them supports comprehensive resources for bacteria. By using Streptococcus pneumoniae as a model, we identified and collected GO resources including genes, proteins, taxonomy and GO relationships from NCBI, UniProt and GO organisations. Then, we designed database tables in PostgreSQL database server and developed a Java application to extract data from source files and loaded into database automatically. We developed a PHP web application based on Model-View-Control architecture, used a specific data structure as well as current and novel algorithms to estimate GO graphs parameters. We designed different navigation and visualization methods on the graphs and integrated these into graphical reports. This tool is particularly significant when comparing GO groups between multiple samples (including those of pathogenic bacteria from different sources simultaneously. Comparing GO protein distribution among up- or down-regulated genes from different samples can improve understanding of biological pathways, and mechanism(s of infection. It can also aid in the discovery of genes associated with specific function(s for investigation as a novel vaccine or therapeutic targets.http://turing.ersa.edu.au/BacteriaGO.

  8. Genotyping microarray (gene chip) for the ABCR (ABCA4) gene.

    Science.gov (United States)

    Jaakson, K; Zernant, J; Külm, M; Hutchinson, A; Tonisson, N; Glavac, D; Ravnik-Glavac, M; Hawlina, M; Meltzer, M R; Caruso, R C; Testa, F; Maugeri, A; Hoyng, C B; Gouras, P; Simonelli, F; Lewis, R A; Lupski, J R; Cremers, F P M; Allikmets, R

    2003-11-01

    Genetic variation in the ABCR (ABCA4) gene has been associated with five distinct retinal phenotypes, including Stargardt disease/fundus flavimaculatus (STGD/FFM), cone-rod dystrophy (CRD), and age-related macular degeneration (AMD). Comparative genetic analyses of ABCR variation and diagnostics have been complicated by substantial allelic heterogeneity and by differences in screening methods. To overcome these limitations, we designed a genotyping microarray (gene chip) for ABCR that includes all approximately 400 disease-associated and other variants currently described, enabling simultaneous detection of all known ABCR variants. The ABCR genotyping microarray (the ABCR400 chip) was constructed by the arrayed primer extension (APEX) technology. Each sequence change in ABCR was included on the chip by synthesis and application of sequence-specific oligonucleotides. We validated the chip by screening 136 confirmed STGD patients and 96 healthy controls, each of whom we had analyzed previously by single strand conformation polymorphism (SSCP) technology and/or heteroduplex analysis. The microarray was >98% effective in determining the existing genetic variation and was comparable to direct sequencing in that it yielded many sequence changes undetected by SSCP. In STGD patient cohorts, the efficiency of the array to detect disease-associated alleles was between 54% and 78%, depending on the ethnic composition and degree of clinical and molecular characterization of a cohort. In addition, chip analysis suggested a high carrier frequency (up to 1:10) of ABCR variants in the general population. The ABCR genotyping microarray is a robust, cost-effective, and comprehensive screening tool for variation in one gene in which mutations are responsible for a substantial fraction of retinal disease. The ABCR chip is a prototype for the next generation of screening and diagnostic tools in ophthalmic genetics, bridging clinical and scientific research. Copyright 2003 Wiley

  9. Analysis of gene evolution and metabolic pathways using the Candida Gene Order Browser

    LENUS (Irish Health Repository)

    Fitzpatrick, David A

    2010-05-10

    Abstract Background Candida species are the most common cause of opportunistic fungal infection worldwide. Recent sequencing efforts have provided a wealth of Candida genomic data. We have developed the Candida Gene Order Browser (CGOB), an online tool that aids comparative syntenic analyses of Candida species. CGOB incorporates all available Candida clade genome sequences including two Candida albicans isolates (SC5314 and WO-1) and 8 closely related species (Candida dubliniensis, Candida tropicalis, Candida parapsilosis, Lodderomyces elongisporus, Debaryomyces hansenii, Pichia stipitis, Candida guilliermondii and Candida lusitaniae). Saccharomyces cerevisiae is also included as a reference genome. Results CGOB assignments of homology were manually curated based on sequence similarity and synteny. In total CGOB includes 65617 genes arranged into 13625 homology columns. We have also generated improved Candida gene sets by merging\\/removing partial genes in each genome. Interrogation of CGOB revealed that the majority of tandemly duplicated genes are under strong purifying selection in all Candida species. We identified clusters of adjacent genes involved in the same metabolic pathways (such as catabolism of biotin, galactose and N-acetyl glucosamine) and we showed that some clusters are species or lineage-specific. We also identified one example of intron gain in C. albicans. Conclusions Our analysis provides an important resource that is now available for the Candida community. CGOB is available at http:\\/\\/cgob.ucd.ie.

  10. Comparison of Nasal Epithelial Smoking-Induced Gene Expression on Affymetrix Exon 1.0 and Gene 1.0 ST Arrays

    Directory of Open Access Journals (Sweden)

    Xiaoling Zhang

    2013-01-01

    Full Text Available We have previously defined the impact of tobacco smoking on nasal epithelium gene expression using Affymetrix Exon 1.0 ST arrays. In this paper, we compared the performance of the Affymetrix GeneChip Human Gene 1.0 ST array with the Human Exon 1.0 ST array for detecting nasal smoking-related gene expression changes. RNA collected from the nasal epithelium of five current smokers and five never smokers was hybridized to both arrays. While the intersample correlation within each array platform was relatively higher in the Gene array than that in the Exon array, the majority of the genes most changed by smoking were tightly correlated between platforms. Although neither array dataset was powered to detect differentially expressed genes (DEGs at a false discovery rate (FDR <0.05, we identified more DEGs than expected by chance using the Gene ST array. These findings suggest that while both platforms show a high degree of correlation for detecting smoking-induced differential gene expression changes, the Gene ST array may be a more cost-effective platform in a clinical setting for gene-level genomewide expression profiling and an effective tool for exploring the host response to cigarette smoking and other inhaled toxins.

  11. Emerging Use of Gene Expression Microarrays in Plant Physiology

    Directory of Open Access Journals (Sweden)

    Stephen P. Difazio

    2006-04-01

    Full Text Available Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology were selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.

  12. From gene engineering to gene modulation and manipulation: can we prevent or detect gene doping in sports?

    Science.gov (United States)

    Fischetto, Giuseppe; Bermon, Stéphane

    2013-10-01

    During the last 2 decades, progress in deciphering the human gene map as well as the discovery of specific defective genes encoding particular proteins in some serious human diseases have resulted in attempts to treat sick patients with gene therapy. There has been considerable focus on human recombinant proteins which were gene-engineered and produced in vitro (insulin, growth hormone, insulin-like growth factor-1, erythropoietin). Unfortunately, these substances and methods also became improper tools for unscrupulous athletes. Biomedical research has focused on the possible direct insertion of gene material into the body, in order to replace some defective genes in vivo and/or to promote long-lasting endogenous synthesis of deficient proteins. Theoretically, diabetes, anaemia, muscular dystrophies, immune deficiency, cardiovascular diseases and numerous other illnesses could benefit from such innovative biomedical research, though much work remains to be done. Considering recent findings linking specific genotypes and physical performance, it is tempting to submit the young athletic population to genetic screening or, alternatively, to artificial gene expression modulation. Much research is already being conducted in order to achieve a safe transfer of genetic material to humans. This is of critical importance since uncontrolled production of the specifically coded protein, with serious secondary adverse effects (polycythaemia, acute cardiovascular problems, cancer, etc.), could occur. Other unpredictable reactions (immunogenicity of vectors or DNA-vector complex, autoimmune anaemia, production of wild genetic material) also remain possible at the individual level. Some new substances (myostatin blockers or anti-myostatin antibodies), although not gene material, might represent a useful and well-tolerated treatment to prevent progression of muscular dystrophies. Similarly, other molecules, in the roles of gene or metabolic activators [5-aminoimidazole-4

  13. Gene Prediction in Metagenomic Fragments with Deep Learning

    Directory of Open Access Journals (Sweden)

    Shao-Wu Zhang

    2017-01-01

    Full Text Available Next generation sequencing technologies used in metagenomics yield numerous sequencing fragments which come from thousands of different species. Accurately identifying genes from metagenomics fragments is one of the most fundamental issues in metagenomics. In this article, by fusing multifeatures (i.e., monocodon usage, monoamino acid usage, ORF length coverage, and Z-curve features and using deep stacking networks learning model, we present a novel method (called Meta-MFDL to predict the metagenomic genes. The results with 10 CV and independent tests show that Meta-MFDL is a powerful tool for identifying genes from metagenomic fragments.

  14. The molecular genetic linkage map of the model legume Medicago truncatula: an essential tool for comparative legume genomics and the isolation of agronomically important genes

    Directory of Open Access Journals (Sweden)

    Ané Jean-Michel

    2002-01-01

    Full Text Available Abstract Background The legume Medicago truncatula has emerged as a model plant for the molecular and genetic dissection of various plant processes involved in rhizobial, mycorrhizal and pathogenic plant-microbe interactions. Aiming to develop essential tools for such genetic approaches, we have established the first genetic map of this species. Two parental homozygous lines were selected from the cultivar Jemalong and from the Algerian natural population (DZA315 on the basis of their molecular and phenotypic polymorphism. Results An F2 segregating population of 124 individuals between these two lines was obtained using an efficient manual crossing technique established for M. truncatula and was used to construct a genetic map. This map spans 1225 cM (average 470 kb/cM and comprises 289 markers including RAPD, AFLP, known genes and isoenzymes arranged in 8 linkage groups (2n = 16. Markers are uniformly distributed throughout the map and segregation distortion is limited to only 3 linkage groups. By mapping a number of common markers, the eight linkage groups are shown to be homologous to those of diploid alfalfa (M. sativa, implying a good level of macrosynteny between the two genomes. Using this M. truncatula map and the derived F3 populations, we were able to map the Mtsym6 symbiotic gene on linkage group 8 and the SPC gene, responsible for the direction of pod coiling, on linkage group 7. Conclusions These results demonstrate that Medicago truncatula is amenable to diploid genetic analysis and they open the way to map-based cloning of symbiotic or other agronomically-important genes using this model plant.

  15. Conditional gene expression in the mouse using a Sleeping Beauty gene-trap transposon

    Directory of Open Access Journals (Sweden)

    Hackett Perry B

    2006-06-01

    Full Text Available Abstract Background Insertional mutagenesis techniques with transposable elements have been popular among geneticists studying model organisms from E. coli to Drosophila and, more recently, the mouse. One such element is the Sleeping Beauty (SB transposon that has been shown in several studies to be an effective insertional mutagen in the mouse germline. SB transposon vector studies have employed different functional elements and reporter molecules to disrupt and report the expression of endogenous mouse genes. We sought to generate a transposon system that would be capable of reporting the expression pattern of a mouse gene while allowing for conditional expression of a gene of interest in a tissue- or temporal-specific pattern. Results Here we report the systematic development and testing of a transposon-based gene-trap system incorporating the doxycycline-repressible Tet-Off (tTA system that is capable of activating the expression of genes under control of a Tet response element (TRE promoter. We demonstrate that the gene trap system is fully functional in vitro by introducing the "gene-trap tTA" vector into human cells by transposition and identifying clones that activate expression of a TRE-luciferase transgene in a doxycycline-dependent manner. In transgenic mice, we mobilize gene-trap tTA vectors, discover parameters that can affect germline mobilization rates, and identify candidate gene insertions to demonstrate the in vivo functionality of the vector system. We further demonstrate that the gene-trap can act as a reporter of endogenous gene expression and it can be coupled with bioluminescent imaging to identify genes with tissue-specific expression patterns. Conclusion Akin to the GAL4/UAS system used in the fly, we have made progress developing a tool for mutating and revealing the expression of mouse genes by generating the tTA transactivator in the presence of a secondary TRE-regulated reporter molecule. A vector like the gene

  16. CoPub: a literature-based keyword enrichment tool for microarray data analysis.

    Science.gov (United States)

    Frijters, Raoul; Heupers, Bart; van Beek, Pieter; Bouwhuis, Maurice; van Schaik, René; de Vlieg, Jacob; Polman, Jan; Alkema, Wynand

    2008-07-01

    Medline is a rich information source, from which links between genes and keywords describing biological processes, pathways, drugs, pathologies and diseases can be extracted. We developed a publicly available tool called CoPub that uses the information in the Medline database for the biological interpretation of microarray data. CoPub allows batch input of multiple human, mouse or rat genes and produces lists of keywords from several biomedical thesauri that are significantly correlated with the set of input genes. These lists link to Medline abstracts in which the co-occurring input genes and correlated keywords are highlighted. Furthermore, CoPub can graphically visualize differentially expressed genes and over-represented keywords in a network, providing detailed insight in the relationships between genes and keywords, and revealing the most influential genes as highly connected hubs. CoPub is freely accessible at http://services.nbic.nl/cgi-bin/copub/CoPub.pl.

  17. RNA interference: learning gene knock-down from cell physiology

    Directory of Open Access Journals (Sweden)

    Provenzano Maurizio

    2004-11-01

    Full Text Available Summary Over the past decade RNA interference (RNAi has emerged as a natural mechanism for silencing gene expression. This ancient cellular antiviral response can be exploited to allow specific inhibition of the function of any chosen target gene. RNAi is proving to be an invaluable research tool, allowing much more rapid characterization of the function of known genes. More importantly, RNAi technology considerably bolsters functional genomics to aid in the identification of novel genes involved in disease processes. This review briefly describes the molecular principles underlying the biology of RNAi phenomenon and discuss the main technical issues regarding optimization of RNAi experimental design.

  18. RNA interference: learning gene knock-down from cell physiology

    Science.gov (United States)

    Mocellin, Simone; Provenzano, Maurizio

    2004-01-01

    Over the past decade RNA interference (RNAi) has emerged as a natural mechanism for silencing gene expression. This ancient cellular antiviral response can be exploited to allow specific inhibition of the function of any chosen target gene. RNAi is proving to be an invaluable research tool, allowing much more rapid characterization of the function of known genes. More importantly, RNAi technology considerably bolsters functional genomics to aid in the identification of novel genes involved in disease processes. This review briefly describes the molecular principles underlying the biology of RNAi phenomenon and discuss the main technical issues regarding optimization of RNAi experimental design. PMID:15555080

  19. Gene Overexpression Resources in Cereals for Functional Genomics and Discovery of Useful Genes

    Directory of Open Access Journals (Sweden)

    Kiyomi Abe

    2016-09-01

    Full Text Available Identification and elucidation of functions of plant genes is valuable for both basic and applied research. In addition to natural variation in model plants, numerous loss-of-function resources have been produced by mutagenesis with chemicals, irradiation, or insertions of transposable elements or T-DNA. However, we may be unable to observe loss-of-function phenotypes for genes with functionally redundant homologs, and for those essential for growth and development. To offset such disadvantages, gain-of-function transgenic resources have been exploited. Activation-tagged lines have been generated using obligatory overexpression of endogenous genes by random insertion of an enhancer. Recent progress in DNA sequencing technology and bioinformatics has enabled the preparation of genomewide collections of full-length cDNAs (fl-cDNAs in some model species. Using the fl-cDNA clones, a novel gain-of-function strategy, Fl-cDNA OvereXpressor gene (FOX-hunting system, has been developed. A mutant phenotype in a FOX line can be directly attributed to the overexpressed fl-cDNA. Investigating a large population of FOX lines could reveal important genes conferring favorable phenotypes for crop breeding. Alternatively, a unique loss-of-function approach Chimeric REpressor gene Silencing Technology (CRES-T has been developed. In CRES-T, overexpression of a chimeric repressor, composed of the coding sequence of a transcription factor (TF and short peptide designated as the repression domain, could interfere with the action of endogenous TF in plants. Although plant TFs usually consist of gene families, CRES-T is effective, in principle, even for the TFs with functional redundancy. In this review, we focus on the current status of the gene-overexpression strategies and resources for identifying and elucidating novel functions of cereal genes. We discuss the potential of these research tools for identifying useful genes and phenotypes for application in crop

  20. GECKO: a complete large-scale gene expression analysis platform

    Directory of Open Access Journals (Sweden)

    Heuer Michael

    2004-12-01

    Full Text Available Abstract Background Gecko (Gene Expression: Computation and Knowledge Organization is a complete, high-capacity centralized gene expression analysis system, developed in response to the needs of a distributed user community. Results Based on a client-server architecture, with a centralized repository of typically many tens of thousands of Affymetrix scans, Gecko includes automatic processing pipelines for uploading data from remote sites, a data base, a computational engine implementing ~ 50 different analysis tools, and a client application. Among available analysis tools are clustering methods, principal component analysis, supervised classification including feature selection and cross-validation, multi-factorial ANOVA, statistical contrast calculations, and various post-processing tools for extracting data at given error rates or significance levels. On account of its open architecture, Gecko also allows for the integration of new algorithms. The Gecko framework is very general: non-Affymetrix and non-gene expression data can be analyzed as well. A unique feature of the Gecko architecture is the concept of the Analysis Tree (actually, a directed acyclic graph, in which all successive results in ongoing analyses are saved. This approach has proven invaluable in allowing a large (~ 100 users and distributed community to share results, and to repeatedly return over a span of years to older and potentially very complex analyses of gene expression data. Conclusions The Gecko system is being made publicly available as free software http://sourceforge.net/projects/geckoe. In totality or in parts, the Gecko framework should prove useful to users and system developers with a broad range of analysis needs.

  1. SSTAR, a Stand-Alone Easy-To-Use Antimicrobial Resistance Gene Predictor.

    Science.gov (United States)

    de Man, Tom J B; Limbago, Brandi M

    2016-01-01

    We present the easy-to-use Sequence Search Tool for Antimicrobial Resistance, SSTAR. It combines a locally executed BLASTN search against a customizable database with an intuitive graphical user interface for identifying antimicrobial resistance (AR) genes from genomic data. Although the database is initially populated from a public repository of acquired resistance determinants (i.e., ARG-ANNOT), it can be customized for particular pathogen groups and resistance mechanisms. For instance, outer membrane porin sequences associated with carbapenem resistance phenotypes can be added, and known intrinsic mechanisms can be included. Unique about this tool is the ability to easily detect putative new alleles and truncated versions of existing AR genes. Variants and potential new alleles are brought to the attention of the user for further investigation. For instance, SSTAR is able to identify modified or truncated versions of porins, which may be of great importance in carbapenemase-negative carbapenem-resistant Enterobacteriaceae. SSTAR is written in Java and is therefore platform independent and compatible with both Windows and Unix operating systems. SSTAR and its manual, which includes a simple installation guide, are freely available from https://github.com/tomdeman-bio/Sequence-Search-Tool-for-Antimicrobial-Resistance-SSTAR-. IMPORTANCE Whole-genome sequencing (WGS) is quickly becoming a routine method for identifying genes associated with antimicrobial resistance (AR). However, for many microbiologists, the use and analysis of WGS data present a substantial challenge. We developed SSTAR, software with a graphical user interface that enables the identification of known AR genes from WGS and has the unique capacity to easily detect new variants of known AR genes, including truncated protein variants. Current software solutions do not notify the user when genes are truncated and, therefore, likely nonfunctional, which makes phenotype predictions less accurate. SSTAR

  2. A new web-based data mining tool for the identification of candidate genes for human genetic disorders

    NARCIS (Netherlands)

    Driel, van M.A.; Cuelenaere, K.; Kemmeren, P.P.C.W.; Leunissen, J.A.M.; Brunner, H.G.

    2003-01-01

    To identify the gene underlying a human genetic disorder can be difficult and time-consuming. Typically, positional data delimit a chromosomal region that contains between 20 and 200 genes. The choice then lies between sequencing large numbers of genes, or setting priorities by combining positional

  3. Inferring gene networks from discrete expression data

    KAUST Repository

    Zhang, L.

    2013-07-18

    The modeling of gene networks from transcriptional expression data is an important tool in biomedical research to reveal signaling pathways and to identify treatment targets. Current gene network modeling is primarily based on the use of Gaussian graphical models applied to continuous data, which give a closedformmarginal likelihood. In this paper,we extend network modeling to discrete data, specifically data from serial analysis of gene expression, and RNA-sequencing experiments, both of which generate counts of mRNAtranscripts in cell samples.We propose a generalized linear model to fit the discrete gene expression data and assume that the log ratios of the mean expression levels follow a Gaussian distribution.We restrict the gene network structures to decomposable graphs and derive the graphs by selecting the covariance matrix of the Gaussian distribution with the hyper-inverse Wishart priors. Furthermore, we incorporate prior network models based on gene ontology information, which avails existing biological information on the genes of interest. We conduct simulation studies to examine the performance of our discrete graphical model and apply the method to two real datasets for gene network inference. © The Author 2013. Published by Oxford University Press. All rights reserved.

  4. A tool to ascertain taxonomic relatedness based on features derived ...

    Indian Academy of Sciences (India)

    MADHU

    gene to investigate the evolutionary relationships. They ... 1Environmental Genomics Unit, National Environmental Engineering ... However, use of 16S rRNA data does have some problems, ... This tool undergoes unsupervised learning and is particularly .... to be conducted (Buchala et al. ... using two layers of connections.

  5. Gene Editing and CRISPR Therapeutics: Strategies Taught by Cell and Gene Therapy.

    Science.gov (United States)

    Ramirez, Juan C

    2017-01-01

    A few years ago, we assisted in the demonstration for the first time of the revolutionary idea of a type of adaptive-immune system in the bacteria kingdom. This system, named CRISPR, and variants engineered in the lab, have been demonstrated as functional with extremely high frequency and fidelity in almost all eukaryotic cells studied to date. The capabilities of this RNA-guided nuclease have added to the interest that was announced with the advent of previous technologies for genome editing tools, such as ZFN and TALEN. The capabilities exhibited by these gene editors, opens up a novel scenario that indicates the promise of a next-generation medicine based on precision and personalized objectives, mostly due to the change in the paradigm regarding gene-surgery. This has certainly attracted, like never before, the attention of the biotech business and investor community. This chapter offers a brief overview of some of the factors that have contributed to a rapid entry into the biotech and pharmaceutical company's pipeline, focusing on how cell and gene therapies (CGT), collectively known as advanced therapies, have become the driving forces toward the therapeutic uses of gene editing technology. The sum of all those efforts for more than 30years has contributed to the new paradigm of considering genes as medicines. Copyright © 2017. Published by Elsevier Inc.

  6. NuGO contributions to GenePattern

    NARCIS (Netherlands)

    Groot, de P.J.; Reiff, C.; Mayer, C.; Müller, M.R.

    2008-01-01

    NuGO, the European Nutrigenomics Organization, utilizes 31 powerful computers for, e.g., data storage and analysis. These so-called black boxes (NBXses) are located at the sites of different partners. NuGO decided to use GenePattern as the preferred genomic analysis tool on each NBX. To handle the

  7. GDdom: An Online Tool for Calculation of Dominant Marker Gene Diversity.

    Science.gov (United States)

    Abuzayed, Mazen; El-Dabba, Nourhan; Frary, Anne; Doganlar, Sami

    2017-04-01

    Gene diversity (GD), also called polymorphism information content, is a commonly used measure of molecular marker polymorphism. Calculation of GD for dominant markers such as AFLP, RAPD, and multilocus SSRs is valuable for researchers. To meet this need, we developed a free online computer program, GDdom, which provides easy, quick, and accurate calculation of dominant marker GD with a commonly used formula. Results are presented in tabular form for quick interpretation.

  8. Development of an Efficient Genome Editing Tool in Bacillus licheniformis Using CRISPR-Cas9 Nickase.

    Science.gov (United States)

    Li, Kaifeng; Cai, Dongbo; Wang, Zhangqian; He, Zhili; Chen, Shouwen

    2018-03-15

    Bacillus strains are important industrial bacteria that can produce various biochemical products. However, low transformation efficiencies and a lack of effective genome editing tools have hindered its widespread application. Recently, clustered regularly interspaced short palindromic repeat (CRISPR)-Cas9 techniques have been utilized in many organisms as genome editing tools because of their high efficiency and easy manipulation. In this study, an efficient genome editing method was developed for Bacillus licheniformis using a CRISPR-Cas9 nickase integrated into the genome of B. licheniformis DW2 with overexpression driven by the P43 promoter. The yvmC gene was deleted using the CRISPR-Cas9n technique with homology arms of 1.0 kb as a representative example, and an efficiency of 100% was achieved. In addition, two genes were simultaneously disrupted with an efficiency of 11.6%, and the large DNA fragment bacABC (42.7 kb) was deleted with an efficiency of 79.0%. Furthermore, the heterologous reporter gene aprN , which codes for nattokinase in Bacillus subtilis , was inserted into the chromosome of B. licheniformis with an efficiency of 76.5%. The activity of nattokinase in the DWc9nΔ7/pP43SNT-S sacC strain reached 59.7 fibrinolytic units (FU)/ml, which was 25.7% higher than that of DWc9n/pP43SNT-S sacC Finally, the engineered strain DWc9nΔ7 (Δ epr Δ wprA Δ mpr Δ aprE Δ vpr Δ bprA Δ bacABC ), with multiple disrupted genes, was constructed using the CRISPR-Cas9n technique. Taken together, we have developed an efficient genome editing tool based on CRISPR-Cas9n in B. licheniformis This tool could be applied to strain improvement for future research. IMPORTANCE As important industrial bacteria, Bacillus strains have attracted significant attention due to their production of biological products. However, genetic manipulation of these bacteria is difficult. The CRISPR-Cas9 system has been applied to genome editing in some bacteria, and CRISPR-Cas9n was proven to

  9. A comprehensive aligned nifH gene database: a multipurpose tool for studies of nitrogen-fixing bacteria.

    Science.gov (United States)

    Gaby, John Christian; Buckley, Daniel H

    2014-01-01

    We describe a nitrogenase gene sequence database that facilitates analysis of the evolution and ecology of nitrogen-fixing organisms. The database contains 32 954 aligned nitrogenase nifH sequences linked to phylogenetic trees and associated sequence metadata. The database includes 185 linked multigene entries including full-length nifH, nifD, nifK and 16S ribosomal RNA (rRNA) gene sequences. Evolutionary analyses enabled by the multigene entries support an ancient horizontal transfer of nitrogenase genes between Archaea and Bacteria and provide evidence that nifH has a different history of horizontal gene transfer from the nifDK enzyme core. Further analyses show that lineages in nitrogenase cluster I and cluster III have different rates of substitution within nifD, suggesting that nifD is under different selection pressure in these two lineages. Finally, we find that that the genetic divergence of nifH and 16S rRNA genes does not correlate well at sequence dissimilarity values used commonly to define microbial species, as stains having <3% sequence dissimilarity in their 16S rRNA genes can have up to 23% dissimilarity in nifH. The nifH database has a number of uses including phylogenetic and evolutionary analyses, the design and assessment of primers/probes and the evaluation of nitrogenase sequence diversity. Database URL: http://www.css.cornell.edu/faculty/buckley/nifh.htm.

  10. A five-gene hedgehog signature developed as a patient preselection tool for hedgehog inhibitor therapy in medulloblastoma.

    Science.gov (United States)

    Shou, Yaping; Robinson, Douglas M; Amakye, Dereck D; Rose, Kristine L; Cho, Yoon-Jae; Ligon, Keith L; Sharp, Thad; Haider, Asifa S; Bandaru, Raj; Ando, Yuichi; Geoerger, Birgit; Doz, François; Ashley, David M; Hargrave, Darren R; Casanova, Michela; Tawbi, Hussein A; Rodon, Jordi; Thomas, Anne L; Mita, Alain C; MacDonald, Tobey J; Kieran, Mark W

    2015-02-01

    Distinct molecular subgroups of medulloblastoma, including hedgehog (Hh) pathway-activated disease, have been reported. We identified and clinically validated a five-gene Hh signature assay that can be used to preselect patients with Hh pathway-activated medulloblastoma. Gene characteristics of the Hh medulloblastoma subgroup were identified through published bioinformatic analyses. Thirty-two genes shown to be differentially expressed in fresh-frozen and formalin-fixed paraffin-embedded tumor samples and reproducibly analyzed by RT-PCR were measured in matched samples. These data formed the basis for building a multi-gene logistic regression model derived through elastic net methods from which the five-gene Hh signature emerged after multiple iterations. On the basis of signature gene expression levels, the model computed a propensity score to determine Hh activation using a threshold set a priori. The association between Hh activation status and tumor response to the Hh pathway inhibitor sonidegib (LDE225) was analyzed. Five differentially expressed genes in medulloblastoma (GLI1, SPHK1, SHROOM2, PDLIM3, and OTX2) were found to associate with Hh pathway activation status. In an independent validation study, Hh activation status of 25 medulloblastoma samples showed 100% concordance between the five-gene signature and Affymetrix profiling. Further, in medulloblastoma samples from 50 patients treated with sonidegib, all 6 patients who responded were found to have Hh-activated tumors. Three patients with Hh-activated tumors had stable or progressive disease. No patients with Hh-nonactivated tumors responded. This five-gene Hh signature can robustly identify Hh-activated medulloblastoma and may be used to preselect patients who might benefit from sonidegib treatment. ©2014 American Association for Cancer Research.

  11. Gene Ontology

    Directory of Open Access Journals (Sweden)

    Gaston K. Mazandu

    2012-01-01

    Full Text Available The wide coverage and biological relevance of the Gene Ontology (GO, confirmed through its successful use in protein function prediction, have led to the growth in its popularity. In order to exploit the extent of biological knowledge that GO offers in describing genes or groups of genes, there is a need for an efficient, scalable similarity measure for GO terms and GO-annotated proteins. While several GO similarity measures exist, none adequately addresses all issues surrounding the design and usage of the ontology. We introduce a new metric for measuring the distance between two GO terms using the intrinsic topology of the GO-DAG, thus enabling the measurement of functional similarities between proteins based on their GO annotations. We assess the performance of this metric using a ROC analysis on human protein-protein interaction datasets and correlation coefficient analysis on the selected set of protein pairs from the CESSM online tool. This metric achieves good performance compared to the existing annotation-based GO measures. We used this new metric to assess functional similarity between orthologues, and show that it is effective at determining whether orthologues are annotated with similar functions and identifying cases where annotation is inconsistent between orthologues.

  12. Hunting down frame shifts: Ecological analysis of diverse functional gene sequences

    Directory of Open Access Journals (Sweden)

    Michal eStrejcek

    2015-11-01

    Full Text Available Functional gene ecological analyses using amplicon sequencing can be challenging as translated sequences are often burdened with shifted reading frames. The aim of this work was to evaluate several bioinformatics tools designed to correct errors which arise during sequencing in an effort to reduce the number of frame-shifts (FS. Genes encoding for alpha subunits of biphenyl (bphA and benzoate (benA dioxygenases were used as model sequences. FrameBot, a FS correction tool, was able to reduce the number of detected FS to zero. However, up to 43.1% of sequences were discarded by FrameBot as non-specific targets. Therefore, we proposed a de novo mode of FrameBot for FS correction, which works on a similar basis as common chimera identifying platforms and is not dependent on reference sequences. By nature of FrameBot de novo design, it is crucial to provide it with data as error free as possible. We tested the ability of several publicly available correction tools to decrease the number of errors in the data sets. The combination of Maximum Expected Error (MEE filtering and single linkage pre-clustering (SLP proved the most efficient read procession. Applying FrameBot de novo on the processed data enabled analysis of BphA sequences with minimal losses of potentially functional sequences not homologous to those previously known. This experiment also demonstrated the extensive diversity of dioxygenases in soil. A script which performs FrameBot de novo is presented in the supplementary material to the study and the tool was implemented into FunGene Pipeline available at http://fungene.cme.msu.edu/FunGenePipeline/ and https://github.com/rdpstaff/Framebot.

  13. BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal M.; Alam, Intikhab; Bajic, Vladimir B.

    2015-01-01

    We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/

  14. SQUAT: A web tool to mine human, murine and avian SAGE data

    Directory of Open Access Journals (Sweden)

    Besson Jérémy

    2008-09-01

    Full Text Available Abstract Background There is an increasing need in transcriptome research for gene expression data and pattern warehouses. It is of importance to integrate in these warehouses both raw transcriptomic data, as well as some properties encoded in these data, like local patterns. Description We have developed an application called SQUAT (SAGE Querying and Analysis Tools which is available at: http://bsmc.insa-lyon.fr/squat/. This database gives access to both raw SAGE data and patterns mined from these data, for three species (human, mouse and chicken. This database allows to make simple queries like "In which biological situations is my favorite gene expressed?" as well as much more complex queries like: ≪what are the genes that are frequently co-over-expressed with my gene of interest in given biological situations?≫. Connections with external web databases enrich biological interpretations, and enable sophisticated queries. To illustrate the power of SQUAT, we show and analyze the results of three different queries, one of which led to a biological hypothesis that was experimentally validated. Conclusion SQUAT is a user-friendly information retrieval platform, which aims at bringing some of the state-of-the-art mining tools to biologists.

  15. Selection of reference genes for gene expression studies in heart failure for left and right ventricles.

    Science.gov (United States)

    Li, Mengmeng; Rao, Man; Chen, Kai; Zhou, Jianye; Song, Jiangping

    2017-07-15

    Real-time quantitative reverse transcriptase-PCR (qRT-PCR) is a feasible tool for determining gene expression profiles, but the accuracy and reliability of the results depends on the stable expression of selected housekeeping genes in different samples. By far, researches on stable housekeeping genes in human heart failure samples are rare. Moreover the effect of heart failure on the expression of housekeeping genes in right and left ventricles is yet to be studied. Therefore we aim to provide stable housekeeping genes for both ventricles in heart failure and normal heart samples. In this study, we selected seven commonly used housekeeping genes as candidates. By using the qRT-PCR, the expression levels of ACTB, RAB7A, GAPDH, REEP5, RPL5, PSMB4 and VCP in eight heart failure and four normal heart samples were assessed. The stability of candidate housekeeping genes was evaluated by geNorm and Normfinder softwares. GAPDH showed the least variation in all heart samples. Results also indicated the difference of gene expression existed in heart failure left and right ventricles. GAPDH had the highest expression stability in both heart failure and normal heart samples. We also propose using different sets of housekeeping genes for left and right ventricles respectively. The combination of RPL5, GAPDH and PSMB4 is suitable for the right ventricle and the combination of GAPDH, REEP5 and RAB7A is suitable for the left ventricle. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Q-Bank Phytoplasma: A DNA Barcoding Tool for Phytoplasma Identification

    DEFF Research Database (Denmark)

    Contaldo, Nicoletta; Paltrinieri, Samanta; Makarova, Olga

    2015-01-01

    DNA barcoding is an identification method based on comparison of a short DNA sequence with known sequences from a database. A DNA barcoding tool has been developed for phytoplasma identification. This phytoplasma DNA barcoding protocol based on the tuf gene has been shown to identify phytoplasmas...

  17. Usage of U7 snRNA in gene therapy of hemoglobin C disorder ...

    African Journals Online (AJOL)

    Here, a bioinformatic analysis was performed to study the effect of co-expression between human Hb C b-globin chain gene and U7.623. The gene ontological results show that full recovery of hemoglobin function and biological process can be derived. This confirms that U7 snRNA can be a good tool for gene therapy in Hb ...

  18. Design of magnetic gene complexes as effective and serum resistant gene delivery systems for mesenchymal stem cells.

    Science.gov (United States)

    Zhang, Tian-Yuan; Wu, Jia-He; Xu, Qian-Hao; Wang, Xia-Rong; Lu, Jingxiong; Hu, Ying; Jo, Jun-Ichiro; Yamamoto, Masaya; Ling, Daishun; Tabata, Yasuhiko; Gao, Jian-Qing

    2017-03-30

    Gene engineered mesenchymal stem cells (MSCs) have been proposed as promising tools for their various applications in biomedicine. Nevertheless, the lack of an effective and safe way to genetically modify these stem cells is still a major obstacle in the current studies. Herein, we designed novel magnetic complexes by assembling cationized pullulan derivatives with magnetic iron oxide nanoparticles for delivering target genes to MSCs. Results showed that this complexes achieved effective gene expression with the assistance of external magnetic field, and resisted the adverse effect induced by serum proteins on the gene delivery. Moreover, neither significant cytotoxicity nor the interference on the osteogenic differentiation to MSCs were observed after magnetofection. Further studies revealed that this effective and serum resistant gene transfection was partly due to the accelerated and enhanced intracellular uptake process driven by external magnetic field. To conclude, the current study presented a novel option for genetic modification of MSCs in an effective, relatively safe and serum compatible way. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. eap Gene as novel target for specific identification of Staphylococcus aureus.

    Science.gov (United States)

    Hussain, Muzaffar; von Eiff, Christof; Sinha, Bhanu; Joost, Insa; Herrmann, Mathias; Peters, Georg; Becker, Karsten

    2008-02-01

    The cell surface-associated extracellular adherence protein (Eap) mediates adherence of Staphylococcus aureus to host extracellular matrix components and inhibits inflammation, wound healing, and angiogenesis. A well-characterized collection of S. aureus and non-S. aureus staphylococcal isolates (n = 813) was tested for the presence of the Eap-encoding gene (eap) by PCR to investigate the use of the eap gene as a specific diagnostic tool for identification of S. aureus. Whereas all 597 S. aureus isolates were eap positive, this gene was not detectable in 216 non-S. aureus staphylococcal isolates comprising 47 different species and subspecies of coagulase-negative staphylococci and non-S. aureus coagulase-positive or coagulase-variable staphylococci. Furthermore, non-S. aureus isolates did not express Eap homologs, as verified on the transcriptional and protein levels. Based on these data, the sensitivity and specificity of the newly developed PCR targeting the eap gene were both 100%. Thus, the unique occurrence of Eap in S. aureus offers a promising tool particularly suitable for molecular diagnostics of this pathogen.

  20. The implication of assessing a polymorphism in estrogen receptor alpha gene in the risk assessment of osteoporosis using a screening tool for osteoporosis in Asians.

    Science.gov (United States)

    Ongphiphadhanakul, Boonsong; Chanprasertyothin, Suwannee; Payattikul, Penpan; Saetung, Sunee; Rajatanavin, Rajata

    2003-10-01

    Both genetic and environmental factors interact to determine bone mass and the risk for developing postmenopausal osteoporosis. Recently, an Asian-specific tool, the Osteoporosis Self-Assessment Tool for Asians (OSTA), has been developed to assess the risk of osteoporosis in women. An index is calculated by multiplying the difference in body weight in kilograms and age in years by 0.2 and disregarding the decimal digits. The risk of osteoporosis is classified as high, intermediate or low according to the OSTA index less than -4, -4 to -1 and greater than -1. In the present study we examined how a single nucleotide polymorphism (SNP) in exon 8 of the estrogen receptor alpha (ERalpha) gene affected the predictive value of the OSTA index. Subjects consisted of 358 postmenopausal women who were at least 55 years old. BMDs were measured by DXA, and the SNP in the ERalpha gene was assessed by PCR-RFLP. When considering both the OSTA index and ERalpha genotype in a logistic regression model, it was found that both the OSTA index and the ERalpha genotype independently contributed to the risk of osteoporosis. The odds ratios were 1.58 (95% CI 1.26-1.91) and 2.51 (95% CI 1.42-4.44) for one unit decrement in the OSTA index and each copy of the A allele of the ERalpha genotype, respectively. The joint effect conformed more to a multiplicative model of interaction than an additive model. This suggests that persons with the high-risk genotype are at far greater risk of developing osteoporosis with advancing age or decreasing body weight, the two variables from which the OSTA index is derived. Targeting preventive measures for osteoporosis subjects with risk factors and also disease-susceptibility alleles is likely to be more cost effective.

  1. PRGdb 3.0: a comprehensive platform for prediction and analysis of plant disease resistance genes.

    Science.gov (United States)

    Osuna-Cruz, Cristina M; Paytuvi-Gallart, Andreu; Di Donato, Antimo; Sundesha, Vicky; Andolfo, Giuseppe; Aiese Cigliano, Riccardo; Sanseverino, Walter; Ercolano, Maria R

    2018-01-04

    The Plant Resistance Genes database (PRGdb; http://prgdb.org) has been redesigned with a new user interface, new sections, new tools and new data for genetic improvement, allowing easy access not only to the plant science research community but also to breeders who want to improve plant disease resistance. The home page offers an overview of easy-to-read search boxes that streamline data queries and directly show plant species for which data from candidate or cloned genes have been collected. Bulk data files and curated resistance gene annotations are made available for each plant species hosted. The new Gene Model view offers detailed information on each cloned resistance gene structure to highlight shared attributes with other genes. PRGdb 3.0 offers 153 reference resistance genes and 177 072 annotated candidate Pathogen Receptor Genes (PRGs). Compared to the previous release, the number of putative genes has been increased from 106 to 177 K from 76 sequenced Viridiplantae and algae genomes. The DRAGO 2 tool, which automatically annotates and predicts (PRGs) from DNA and amino acid with high accuracy and sensitivity, has been added. BLAST search has been implemented to offer users the opportunity to annotate and compare their own sequences. The improved section on plant diseases displays useful information linked to genes and genomes to connect complementary data and better address specific needs. Through, a revised and enlarged collection of data, the development of new tools and a renewed portal, PRGdb 3.0 engages the plant science community in developing a consensus plan to improve knowledge and strategies to fight diseases that afflict main crops and other plants. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Citrate synthase gene sequence: a new tool for phylogenetic analysis and identification of Ehrlichia.

    Science.gov (United States)

    Inokuma, H; Brouqui, P; Drancourt, M; Raoult, D

    2001-09-01

    The sequence of the citrate synthase gene (gltA) of 13 ehrlichial species (Ehrlichia chaffeensis, Ehrlichia canis, Ehrlichia muris, an Ehrlichia species recently detected from Ixodes ovatus, Cowdria ruminantium, Ehrlichia phagocytophila, Ehrlichia equi, the human granulocytic ehrlichiosis [HGE] agent, Anaplasma marginale, Anaplasma centrale, Ehrlichia sennetsu, Ehrlichia risticii, and Neorickettsia helminthoeca) have been determined by degenerate PCR and the Genome Walker method. The ehrlichial gltA genes are 1,197 bp (E. sennetsu and E. risticii) to 1,254 bp (A. marginale and A. centrale) long, and GC contents of the gene vary from 30.5% (Ehrlichia sp. detected from I. ovatus) to 51.0% (A. centrale). The percent identities of the gltA nucleotide sequences among ehrlichial species were 49.7% (E. risticii versus A. centrale) to 99.8% (HGE agent versus E. equi). The percent identities of deduced amino acid sequences were 44.4% (E. sennetsu versus E. muris) to 99.5% (HGE agent versus E. equi), whereas the homology range of 16S rRNA genes was 83.5% (E. risticii versus the Ehrlichia sp. detected from I. ovatus) to 99.9% (HGE agent, E. equi, and E. phagocytophila). The architecture of the phylogenetic trees constructed by gltA nucleotide sequences or amino acid sequences was similar to that derived from the 16S rRNA gene sequences but showed more-significant bootstrap values. Based upon the alignment analysis of the ehrlichial gltA sequences, two sets of primers were designed to amplify tick-borne Ehrlichia and Neorickettsia genogroup Ehrlichia (N. helminthoeca, E. sennetsu, and E. risticii), respectively. Tick-borne Ehrlichia species were specifically identified by restriction fragment length polymorphism (RFLP) patterns of AcsI and XhoI with the exception of E. muris and the very closely related ehrlichia derived from I. ovatus for which sequence analysis of the PCR product is needed. Similarly, Neorickettsia genogroup Ehrlichia species were specifically identified by

  3. De-repressing LncRNA-Targeted Genes to Upregulate Gene Expression: Focus on Small Molecule Therapeutics

    Directory of Open Access Journals (Sweden)

    Roya Pedram Fatemi

    2014-01-01

    Full Text Available Non-protein coding RNAs (ncRNAs make up the overwhelming majority of transcripts in the genome and have recently gained attention for their complex regulatory role in cells, including the regulation of protein-coding genes. Furthermore, ncRNAs play an important role in normal development and their expression levels are dysregulated in several diseases. Recently, several long noncoding RNAs (lncRNAs have been shown to alter the epigenetic status of genomic loci and suppress the expression of target genes. This review will present examples of such a mechanism and focus on the potential to target lncRNAs for achieving therapeutic gene upregulation by de-repressing genes that are epigenetically silenced in various diseases. Finally, the potential to target lncRNAs, through their interactions with epigenetic enzymes, using various tools, such as small molecules, viral vectors and antisense oligonucleotides, will be discussed. We suggest that small molecule modulators of a novel class of drug targets, lncRNA-protein interactions, have great potential to treat some cancers, cardiovascular disease, and neurological disorders.

  4. FMAj: a tool for high content analysis of muscle dynamics in Drosophila metamorphosis.

    Science.gov (United States)

    Kuleesha, Yadav; Puah, Wee Choo; Lin, Feng; Wasser, Martin

    2014-01-01

    During metamorphosis in Drosophila melanogaster, larval muscles undergo two different developmental fates; one population is removed by cell death, while the other persistent subset undergoes morphological remodeling and survives to adulthood. Thanks to the ability to perform live imaging of muscle development in transparent pupae and the power of genetics, metamorphosis in Drosophila can be used as a model to study the regulation of skeletal muscle mass. However, time-lapse microscopy generates sizeable image data that require new tools for high throughput image analysis. We performed targeted gene perturbation in muscles and acquired 3D time-series images of muscles in metamorphosis using laser scanning confocal microscopy. To quantify the phenotypic effects of gene perturbations, we designed the Fly Muscle Analysis tool (FMAj) which is based on the ImageJ and MySQL frameworks for image processing and data storage, respectively. The image analysis pipeline of FMAj contains three modules. The first module assists in adding annotations to time-lapse datasets, such as genotypes, experimental parameters and temporal reference points, which are used to compare different datasets. The second module performs segmentation and feature extraction of muscle cells and nuclei. Users can provide annotations to the detected objects, such as muscle identities and anatomical information. The third module performs comparative quantitative analysis of muscle phenotypes. We applied our tool to the phenotypic characterization of two atrophy related genes that were silenced by RNA interference. Reduction of Drosophila Tor (Target of Rapamycin) expression resulted in enhanced atrophy compared to control, while inhibition of the autophagy factor Atg9 caused suppression of atrophy and enlarged muscle fibers of abnormal morphology. FMAj enabled us to monitor the progression of atrophic and hypertrophic phenotypes of individual muscles throughout metamorphosis. We designed a new tool to

  5. FMAj: a tool for high content analysis of muscle dynamics in Drosophila metamorphosis

    Science.gov (United States)

    2014-01-01

    Background During metamorphosis in Drosophila melanogaster, larval muscles undergo two different developmental fates; one population is removed by cell death, while the other persistent subset undergoes morphological remodeling and survives to adulthood. Thanks to the ability to perform live imaging of muscle development in transparent pupae and the power of genetics, metamorphosis in Drosophila can be used as a model to study the regulation of skeletal muscle mass. However, time-lapse microscopy generates sizeable image data that require new tools for high throughput image analysis. Results We performed targeted gene perturbation in muscles and acquired 3D time-series images of muscles in metamorphosis using laser scanning confocal microscopy. To quantify the phenotypic effects of gene perturbations, we designed the Fly Muscle Analysis tool (FMAj) which is based on the ImageJ and MySQL frameworks for image processing and data storage, respectively. The image analysis pipeline of FMAj contains three modules. The first module assists in adding annotations to time-lapse datasets, such as genotypes, experimental parameters and temporal reference points, which are used to compare different datasets. The second module performs segmentation and feature extraction of muscle cells and nuclei. Users can provide annotations to the detected objects, such as muscle identities and anatomical information. The third module performs comparative quantitative analysis of muscle phenotypes. We applied our tool to the phenotypic characterization of two atrophy related genes that were silenced by RNA interference. Reduction of Drosophila Tor (Target of Rapamycin) expression resulted in enhanced atrophy compared to control, while inhibition of the autophagy factor Atg9 caused suppression of atrophy and enlarged muscle fibers of abnormal morphology. FMAj enabled us to monitor the progression of atrophic and hypertrophic phenotypes of individual muscles throughout metamorphosis

  6. Positive-negative-selection-mediated gene targeting in rice

    Directory of Open Access Journals (Sweden)

    Zenpei eShimatani

    2015-01-01

    Full Text Available Gene targeting (GT refers to the designed modification of genomic sequence(s through homologous recombination (HR. GT is a powerful tool both for the study of gene function and for molecular breeding. However, in transformation of higher plants, non-homologous end joining (NHEJ occurs overwhelmingly in somatic cells, masking HR-mediated GT. Positive-negative selection (PNS is an approach for finding HR-mediated GT events because it can eliminate NHEJ effectively by expression of a negative-selection marker gene. In rice—a major crop worldwide—reproducible PNS-mediated GT of endogenous genes has now been successfully achieved. The procedure is based on strong PNS using diphtheria toxin A-fragment as a negative marker, and has succeeded in the directed modification of several endogenous rice genes in various ways. In addition to gene knock-outs and knock-ins, a nucleotide substitution in a target gene was also achieved recently. This review presents a summary of the development of the rice PNS system, highlighting its advantages. Different types of gene modification and gene editing aimed at developing new plant breeding technology (NPBT based on PNS are discussed.

  7. Genetic Tool Development for a New Host for Biotechnology, the Thermotolerant Bacterium Bacillus coagulans▿ †

    Science.gov (United States)

    Kovács, Ákos T.; van Hartskamp, Mariska; Kuipers, Oscar P.; van Kranenburg, Richard

    2010-01-01

    Bacillus coagulans has good potential as an industrial production organism for platform chemicals from renewable resources but has limited genetic tools available. Here, we present a targeted gene disruption system using the Cre-lox system, development of a LacZ reporter assay for monitoring gene transcription, and heterologous d-lactate dehydrogenase expression. PMID:20400555

  8. Construction of heterologous gene expression cassettes for the development of recombinant Clostridium beijerinckii.

    Science.gov (United States)

    Oh, Young Hoon; Eom, Gyeong Tae; Kang, Kyoung Hee; Joo, Jeong Chan; Jang, Young-Ah; Choi, Jae Woo; Song, Bong Keun; Lee, Seung Hwan; Park, Si Jae

    2016-04-01

    Gene-expression cassettes for the construction of recombinant Clostridium beijerinckii were developed as potential tools for metabolic engineering of C. beijerinckii. Gene expression cassettes containing ColE1 origin and pAMB origin along with the erythromycin resistance gene were constructed, in which promoters from Escherichia coli, Lactococcus lactis, Ralstonia eutropha, C. acetobutylicum, and C. beijerinckii are examined as potential promoters in C. beijerinckii. Zymogram analysis of the cell extracts and comparison of lipase activities of the recombinant C. beijerinckii strains expressing Pseudomonas fluorescens tliA gene suggested that the tliA gene was functionally expressed by all the examined promoters with different expression level. Also, recombinant C. beijerinckii expressing C. beijerinckii secondary alcohol dehydrogenase by the constructed expression cassettes successfully produced 2-propanol from glucose. The best promoter for TliA expression was the R. eutropha phaP promoter while that for 2-propanol production was the putative C. beijerinckii pta promoter. Gene expression cassettes developed in this study may be useful tools for the construction of recombinant C. beijerinckii strains as host strains for the valuable chemicals and fuels from renewable resources.

  9. Reference gene selection for quantitative real-time PCR analysis in virus infected cells: SARS corona virus, Yellow fever virus, Human Herpesvirus-6, Camelpox virus and Cytomegalovirus infections

    Directory of Open Access Journals (Sweden)

    Müller Marcel A

    2005-02-01

    Full Text Available Abstract Ten potential reference genes were compared for their use in experiments investigating cellular mRNA expression of virus infected cells. Human cell lines were infected with Cytomegalovirus, Human Herpesvirus-6, Camelpox virus, SARS coronavirus or Yellow fever virus. The expression levels of these genes and the viral replication were determined by real-time PCR. Genes were ranked by the BestKeeper tool, the GeNorm tool and by criteria we reported previously. Ranking lists of the genes tested were tool dependent. However, over all, β-actin is an unsuitable as reference gene, whereas TATA-Box binding protein and peptidyl-prolyl-isomerase A are stable reference genes for expression studies in virus infected cells.

  10. Novel candidate genes important for asthma and hypertension comorbidity revealed from associative gene networks.

    Science.gov (United States)

    Saik, Olga V; Demenkov, Pavel S; Ivanisenko, Timofey V; Bragina, Elena Yu; Freidin, Maxim B; Goncharova, Irina A; Dosenko, Victor E; Zolotareva, Olga I; Hofestaedt, Ralf; Lavrik, Inna N; Rogaev, Evgeny I; Ivanisenko, Vladimir A

    2018-02-13

    biological processes related to the functioning of central nervous system. The application of methods of reconstruction and analysis of gene networks is a productive tool for studying the molecular mechanisms of comorbid conditions. The method put forth to rank genes by their importance to the comorbid condition of asthma and hypertension was employed that resulted in prediction of 10 genes, playing the key role in the development of the comorbid condition. The results can be utilised to plan experiments for identification of novel candidate genes along with searching for novel pharmacological targets.

  11. Joint mapping of genes and conditions via multidimensional unfolding analysis

    Directory of Open Access Journals (Sweden)

    Engelen Kristof

    2007-06-01

    Full Text Available Abstract Background Microarray compendia profile the expression of genes in a number of experimental conditions. Such data compendia are useful not only to group genes and conditions based on their similarity in overall expression over profiles but also to gain information on more subtle relations between genes and conditions. Getting a clear visual overview of all these patterns in a single easy-to-grasp representation is a useful preliminary analysis step: We propose to use for this purpose an advanced exploratory method, called multidimensional unfolding. Results We present a novel algorithm for multidimensional unfolding that overcomes both general problems and problems that are specific for the analysis of gene expression data sets. Applying the algorithm to two publicly available microarray compendia illustrates its power as a tool for exploratory data analysis: The unfolding analysis of a first data set resulted in a two-dimensional representation which clearly reveals temporal regulation patterns for the genes and a meaningful structure for the time points, while the analysis of a second data set showed the algorithm's ability to go beyond a mere identification of those genes that discriminate between different patient or tissue types. Conclusion Multidimensional unfolding offers a useful tool for preliminary explorations of microarray data: By relying on an easy-to-grasp low-dimensional geometric framework, relations among genes, among conditions and between genes and conditions are simultaneously represented in an accessible way which may reveal interesting patterns in the data. An additional advantage of the method is that it can be applied to the raw data without necessitating the choice of suitable genewise transformations of the data.

  12. Gene hunting: molecular analysis of the chicken genome

    NARCIS (Netherlands)

    Crooijmans, R.P.M.A.

    2000-01-01

    This dissertation describes the development of molecular tools to identify genes that are involved in production and health traits in poultry. To unravel the chicken genome, fluorescent molecular markers (microsatellite markers) were developed and optimized to perform high throughput

  13. A cross-species alignment tool (CAT)

    DEFF Research Database (Denmark)

    Li, Heng; Guan, Liang; Liu, Tao

    2007-01-01

    BACKGROUND: The main two sorts of automatic gene annotation frameworks are ab initio and alignment-based, the latter splitting into two sub-groups. The first group is used for intra-species alignments, among which are successful ones with high specificity and speed. The other group contains more...... sensitive methods which are usually applied in aligning inter-species sequences. RESULTS: Here we present a new algorithm called CAT (for Cross-species Alignment Tool). It is designed to align mRNA sequences to mammalian-sized genomes. CAT is implemented using C scripts and is freely available on the web...... at http://xat.sourceforge.net/. CONCLUSIONS: Examined from different angles, CAT outperforms other extant alignment tools. Tested against all available mouse-human and zebrafish-human orthologs, we demonstrate that CAT combines the specificity and speed of the best intra-species algorithms, like BLAT...

  14. Modified T-cells (using TCR and CTAs, chimeric antigen receptor (CAR and other molecular tools in recent gene therapy

    Directory of Open Access Journals (Sweden)

    A.S. Odiba

    2018-07-01

    Full Text Available T-cell-based cancer immunotherapy by the transfer of cloned TCRs that are isolated from tumor penetrating T-cells becomes a possibility through NY-ESOc259; a human-derived affinity-enhanced TCR that provides a level of sufficiency in long-term safety and efficacy. NY-ESOc259 recognizes a peptide common to CTAs (LAGE-1 and NY-ESO-1 in melanoma. Risks associated with insertion related transformation in gene therapy have been alleviated through strategies that include the engineering of transcription activator like effector nucleases (TALEN, RNA-guided nucleases (CRISPR/Cas9, Zinc-finger nucleases (ZFN. Cancer immunotherapy based on the genetic modification of autologous T-cells (dependent on the engineered autologous CD8+ T-cells, designed to distinguish and destroy cells bearing tumor-specific antigens via a CAR is able to exterminate B-cell leukemias and lymphomas that are resilient to conventional therapies. A tool with a very large reservoir of potentials in molecular therapy strategy is the Pluripotent Stem Cells (PSC, with pluripotency factors that include Klf4, Sox2, c-Myc, Oct4, differentiating into disease-associated cell phenotypes of three germ layers, comprising of mesoderm (e.g. cardiac cells, blood and muscle, endoderm (liver, pancreas and ectoderm (epidermis, neurons. It finds good application in disease modelling as well as therapeutic options in the restoration of CGD by using AAVS1 as the vector where the therapeutic cassette is integrated into the locus to restore superoxide production in the granulocytes. Fascinatingly, Clinical trial involving iPSC are already underway where scientists have plans to use iPSC-derived cells to treat macular degeneration (a devastating age-related eye disease. Application of these findings has redefined incurable diseases disorders as curable. Keywords: Clinical trials, Disorders, Gene therapy, Molecular biology, Pharmacotherapy, Vector

  15. Interrogating the topological robustness of gene regulatory circuits by randomization.

    Directory of Open Access Journals (Sweden)

    Bin Huang

    2017-03-01

    Full Text Available One of the most important roles of cells is performing their cellular tasks properly for survival. Cells usually achieve robust functionality, for example, cell-fate decision-making and signal transduction, through multiple layers of regulation involving many genes. Despite the combinatorial complexity of gene regulation, its quantitative behavior has been typically studied on the basis of experimentally verified core gene regulatory circuitry, composed of a small set of important elements. It is still unclear how such a core circuit operates in the presence of many other regulatory molecules and in a crowded and noisy cellular environment. Here we report a new computational method, named random circuit perturbation (RACIPE, for interrogating the robust dynamical behavior of a gene regulatory circuit even without accurate measurements of circuit kinetic parameters. RACIPE generates an ensemble of random kinetic models corresponding to a fixed circuit topology, and utilizes statistical tools to identify generic properties of the circuit. By applying RACIPE to simple toggle-switch-like motifs, we observed that the stable states of all models converge to experimentally observed gene state clusters even when the parameters are strongly perturbed. RACIPE was further applied to a proposed 22-gene network of the Epithelial-to-Mesenchymal Transition (EMT, from which we identified four experimentally observed gene states, including the states that are associated with two different types of hybrid Epithelial/Mesenchymal phenotypes. Our results suggest that dynamics of a gene circuit is mainly determined by its topology, not by detailed circuit parameters. Our work provides a theoretical foundation for circuit-based systems biology modeling. We anticipate RACIPE to be a powerful tool to predict and decode circuit design principles in an unbiased manner, and to quantitatively evaluate the robustness and heterogeneity of gene expression.

  16. KirCII- promising tool for polyketide diversification

    DEFF Research Database (Denmark)

    Musiol-Kroll, Ewa Maria; Härtner, Thomas; Kulik, Andreas

    2014-01-01

    Kirromycin is produced by Streptomyces collinus Tü 365. This compound is synthesized by a large assembly line of type I polyketide synthases and non-ribosomal peptide synthetases (PKS I/NRPS), encoded by the genes kirAI-kirAVI and kirB. The PKSs KirAI-KirAV have no acyltransferase domains integra...... introducing the non-native substrates in an in vivo context. Thus, KirCII represents a promising tool for polyketide diversification....

  17. Proteomics as a Tool for Understanding Schizophrenia

    OpenAIRE

    Martins-de-Souza, Daniel

    2011-01-01

    Schizophrenia is likely to be a multifactorial disorder, consequence of alterations in gene and protein expression since the neurodevelopment that together to environmental factors will trigger the establishment of the disease. In the post-genomic era, proteomics has emerged as a promising strategy for revealing disease and treatment biomarkers as well as a tool for the comprehension of the mechanisms of schizophrenia pathobiology. Here, there is a discussion of the potential pathways and str...

  18. Jane: a new tool for the cophylogeny reconstruction problem.

    Science.gov (United States)

    Conow, Chris; Fielder, Daniel; Ovadia, Yaniv; Libeskind-Hadas, Ran

    2010-02-03

    This paper describes the theory and implementation of a new software tool, called Jane, for the study of historical associations. This problem arises in parasitology (associations of hosts and parasites), molecular systematics (associations of orderings and genes), and biogeography (associations of regions and orderings). The underlying problem is that of reconciling pairs of trees subject to biologically plausible events and costs associated with these events. Existing software tools for this problem have strengths and limitations, and the new Jane tool described here provides functionality that complements existing tools. The Jane software tool uses a polynomial time dynamic programming algorithm in conjunction with a genetic algorithm to find very good, and often optimal, solutions even for relatively large pairs of trees. The tool allows the user to provide rich timing information on both the host and parasite trees. In addition the user can limit host switch distance and specify multiple host switch costs by specifying regions in the host tree and costs for host switches between pairs of regions. Jane also provides a graphical user interface that allows the user to interactively experiment with modifications to the solutions found by the program. Jane is shown to be a useful tool for cophylogenetic reconstruction. Its functionality complements existing tools and it is therefore likely to be of use to researchers in the areas of parasitology, molecular systematics, and biogeography.

  19. Postnatal Cardiac Gene Editing Using CRISPR/Cas9 With AAV9-Mediated Delivery of Short Guide RNAs Results in Mosaic Gene Disruption.

    Science.gov (United States)

    Johansen, Anne Katrine; Molenaar, Bas; Versteeg, Danielle; Leitoguinho, Ana Rita; Demkes, Charlotte; Spanjaard, Bastiaan; de Ruiter, Hesther; Akbari Moqadam, Farhad; Kooijman, Lieneke; Zentilin, Lorena; Giacca, Mauro; van Rooij, Eva

    2017-10-27

    CRISPR/Cas9 (clustered regularly interspaced palindromic repeats/CRISPR-associated protein 9)-based DNA editing has rapidly evolved as an attractive tool to modify the genome. Although CRISPR/Cas9 has been extensively used to manipulate the germline in zygotes, its application in postnatal gene editing remains incompletely characterized. To evaluate the feasibility of CRISPR/Cas9-based cardiac genome editing in vivo in postnatal mice. We generated cardiomyocyte-specific Cas9 mice and demonstrated that Cas9 expression does not affect cardiac function or gene expression. As a proof-of-concept, we delivered short guide RNAs targeting 3 genes critical for cardiac physiology, Myh6 , Sav1 , and Tbx20 , using a cardiotropic adeno-associated viral vector 9. Despite a similar degree of DNA disruption and subsequent mRNA downregulation, only disruption of Myh6 was sufficient to induce a cardiac phenotype, irrespective of short guide RNA exposure or the level of Cas9 expression. DNA sequencing analysis revealed target-dependent mutations that were highly reproducible across mice resulting in differential rates of in- and out-of-frame mutations. Finally, we applied a dual short guide RNA approach to effectively delete an important coding region of Sav1 , which increased the editing efficiency. Our results indicate that the effect of postnatal CRISPR/Cas9-based cardiac gene editing using adeno-associated virus serotype 9 to deliver a single short guide RNA is target dependent. We demonstrate a mosaic pattern of gene disruption, which hinders the application of the technology to study gene function. Further studies are required to expand the versatility of CRISPR/Cas9 as a robust tool to study novel cardiac gene functions in vivo. © 2017 American Heart Association, Inc.

  20. SoFoCles: feature filtering for microarray classification based on gene ontology.

    Science.gov (United States)

    Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A

    2010-02-01

    Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.

  1. Systematic identification of human housekeeping genes possibly useful as references in gene expression studies.

    Science.gov (United States)

    Caracausi, Maria; Piovesan, Allison; Antonaros, Francesca; Strippoli, Pierluigi; Vitale, Lorenza; Pelleri, Maria Chiara

    2017-09-01

    The ideal reference, or control, gene for the study of gene expression in a given organism should be expressed at a medium‑high level for easy detection, should be expressed at a constant/stable level throughout different cell types and within the same cell type undergoing different treatments, and should maintain these features through as many different tissues of the organism. From a biological point of view, these theoretical requirements of an ideal reference gene appear to be best suited to housekeeping (HK) genes. Recent advancements in the quality and completeness of human expression microarray data and in their statistical analysis may provide new clues toward the quantitative standardization of human gene expression studies in biology and medicine, both cross‑ and within‑tissue. The systematic approach used by the present study is based on the Transcriptome Mapper tool and exploits the automated reassignment of probes to corresponding genes, intra‑ and inter‑sample normalization, elaboration and representation of gene expression values in linear form within an indexed and searchable database with a graphical interface recording quantitative levels of expression, expression variability and cross‑tissue width of expression for more than 31,000 transcripts. The present study conducted a meta‑analysis of a pool of 646 expression profile data sets from 54 different human tissues and identified actin γ 1 as the HK gene that best fits the combination of all the traditional criteria to be used as a reference gene for general use; two ribosomal protein genes, RPS18 and RPS27, and one aquaporin gene, POM121 transmembrane nucleporin C, were also identified. The present study provided a list of tissue‑ and organ‑specific genes that may be most suited for the following individual tissues/organs: Adipose tissue, bone marrow, brain, heart, kidney, liver, lung, ovary, skeletal muscle and testis; and also provides in these cases a representative

  2. BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal M.

    2015-08-18

    Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON’s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/

  3. BEACON: automated tool for Bacterial GEnome Annotation ComparisON.

    Science.gov (United States)

    Kalkatawi, Manal; Alam, Intikhab; Bajic, Vladimir B

    2015-08-18

    Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON's utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27%, while the number of genes without any function assignment is reduced. We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .

  4. Isolation of Genes from Chromosome Region Ip31 Involved in the Development of Breast Cancer

    National Research Council Canada - National Science Library

    Cowell, John

    2000-01-01

    .... Using gene analysis tools, we have been able to demonstrate that few full-length genes are located in this region and that the ESTs from the databases are clustered to a proximal position of the contig...

  5. Geochip: A high throughput genomic tool for linking community structure to functions

    Energy Technology Data Exchange (ETDEWEB)

    Van Nostrand, Joy D.; Liang, Yuting; He, Zhili; Li, Guanghe; Zhou, Jizhong

    2009-01-30

    GeoChip is a comprehensive functional gene array that targets key functional genes involved in the geochemical cycling of N, C, and P, sulfate reduction, metal resistance and reduction, and contaminant degradation. Studies have shown the GeoChip to be a sensitive, specific, and high-throughput tool for microbial community analysis that has the power to link geochemical processes with microbial community structure. However, several challenges remain regarding the development and applications of microarrays for microbial community analysis.

  6. Gene transcription in sea otters (Enhydra lutris); development of a diagnostic tool for sea otter and ecosystem health

    Science.gov (United States)

    Bowen, Lizabeth; Miles, A. Keith; Murray, Michael; Haulena, Martin; Tuttle, Judy; van Bonn, William; Adams, Lance; Bodkin, James L.; Ballachey, Brenda E.; Estes, James A.; Tinker, M. Tim; Keister, Robin; Stott, Jeffrey L.

    2012-01-01

    Gene transcription analysis for diagnosing or monitoring wildlife health requires the ability to distinguish pathophysiological change from natural variation. Herein, we describe methodology for the development of quantitative real-time polymerase chain reaction (qPCR) assays to measure differential transcript levels of multiple immune function genes in the sea otter (Enhydra lutris); sea otter-specific qPCR primer sequences for the genes of interest are defined. We establish a ‘reference’ range of transcripts for each gene in a group of clinically healthy captive and free-ranging sea otters. The 10 genes of interest represent multiple physiological systems that play a role in immuno-modulation, inflammation, cell protection, tumour suppression, cellular stress response, xenobiotic metabolizing enzymes, antioxidant enzymes and cell–cell adhesion. The cycle threshold (CT) measures for most genes were normally distributed; the complement cytolysis inhibitor was the exception. The relative enumeration of multiple gene transcripts in simple peripheral blood samples expands the diagnostic capability currently available to assess the health of sea otters in situ and provides a better understanding of the state of their environment.

  7. Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

    Science.gov (United States)

    Hassani-Pak, Keywan; Rawlings, Christopher

    2017-06-13

    Genetics and "omics" studies designed to uncover genotype to phenotype relationships often identify large numbers of potential candidate genes, among which the causal genes are hidden. Scientists generally lack the time and technical expertise to review all relevant information available from the literature, from key model species and from a potentially wide range of related biological databases in a variety of data formats with variable quality and coverage. Computational tools are needed for the integration and evaluation of heterogeneous information in order to prioritise candidate genes and components of interaction networks that, if perturbed through potential interventions, have a positive impact on the biological outcome in the whole organism without producing negative side effects. Here we review several bioinformatics tools and databases that play an important role in biological knowledge discovery and candidate gene prioritization. We conclude with several key challenges that need to be addressed in order to facilitate biological knowledge discovery in the future.

  8. Importing statistical measures into Artemis enhances gene identification in the Leishmania genome project

    Directory of Open Access Journals (Sweden)

    McDonagh Paul D

    2003-06-01

    Full Text Available Abstract Background Seattle Biomedical Research Institute (SBRI as part of the Leishmania Genome Network (LGN is sequencing chromosomes of the trypanosomatid protozoan species Leishmania major. At SBRI, chromosomal sequence is annotated using a combination of trained and untrained non-consensus gene-prediction algorithms with ARTEMIS, an annotation platform with rich and user-friendly interfaces. Results Here we describe a methodology used to import results from three different protein-coding gene-prediction algorithms (GLIMMER, TESTCODE and GENESCAN into the ARTEMIS sequence viewer and annotation tool. Comparison of these methods, along with the CODONUSAGE algorithm built into ARTEMIS, shows the importance of combining methods to more accurately annotate the L. major genomic sequence. Conclusion An improvised and powerful tool for gene prediction has been developed by importing data from widely-used algorithms into an existing annotation platform. This approach is especially fruitful in the Leishmania genome project where there is large proportion of novel genes requiring manual annotation.

  9. Tools to covisualize and coanalyze proteomic data with genomes and transcriptomes: validation of genes and alternative mRNA splicing

    DEFF Research Database (Denmark)

    Pang, Chi; Tay, Aidan; Aya, Carlos

    2014-01-01

    contigs, along with RNA-seq reads. This is done in the Integrated Genome Viewer (IGV). A Results Analyzer reports the precise base position where LC-MS/MS-derived peptides cover genes or gene isoforms, on the chromosomes or contigs where this occurs. In prokaryotes, the PG Nexus pipeline facilitates...... the validation of genes, where annotation or gene prediction is available, or the discovery of genes using a "virtual protein"-based unbiased approach. We illustrate this with a comprehensive proteogenomics analysis of two strains of Campylobacter concisus . For higher eukaryotes, the PG Nexus facilitates gene...

  10. Gene Ontology Consortium: going forward.

    Science.gov (United States)

    2015-01-01

    The Gene Ontology (GO; http://www.geneontology.org) is a community-based bioinformatics resource that supplies information about gene product function using ontologies to represent biological knowledge. Here we describe improvements and expansions to several branches of the ontology, as well as updates that have allowed us to more efficiently disseminate the GO and capture feedback from the research community. The Gene Ontology Consortium (GOC) has expanded areas of the ontology such as cilia-related terms, cell-cycle terms and multicellular organism processes. We have also implemented new tools for generating ontology terms based on a set of logical rules making use of templates, and we have made efforts to increase our use of logical definitions. The GOC has a new and improved web site summarizing new developments and documentation, serving as a portal to GO data. Users can perform GO enrichment analysis, and search the GO for terms, annotations to gene products, and associated metadata across multiple species using the all-new AmiGO 2 browser. We encourage and welcome the input of the research community in all biological areas in our continued effort to improve the Gene Ontology. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Host-Induced Gene Silencing of Rice Blast Fungus Magnaporthe oryzae Pathogenicity Genes Mediated by the Brome Mosaic Virus.

    Science.gov (United States)

    Zhu, Lin; Zhu, Jian; Liu, Zhixue; Wang, Zhengyi; Zhou, Cheng; Wang, Hong

    2017-09-26

    Magnaporthe oryzae is a devastating plant pathogen, which has a detrimental impact on rice production worldwide. Despite its agronomical importance, some newly-emerging pathotypes often overcome race-specific disease resistance rapidly. It is thus desirable to develop a novel strategy for the long-lasting resistance of rice plants to ever-changing fungal pathogens. Brome mosaic virus (BMV)-induced RNA interference (RNAi) has emerged as a useful tool to study host-resistance genes for rice blast protection. Planta-generated silencing of targeted genes inside biotrophic pathogens can be achieved by expression of M. oryzae -derived gene fragments in the BMV-mediated gene silencing system, a technique termed host-induced gene silencing (HIGS). In this study, the effectiveness of BMV-mediated HIGS in M. oryzae was examined by targeting three predicted pathogenicity genes, MoABC1, MoMAC1 and MoPMK1 . Systemic generation of fungal gene-specific small interfering RNA (siRNA) molecules induced by inoculation of BMV viral vectors inhibited disease development and reduced the transcription of targeted fungal genes after subsequent M. oryzae inoculation. Combined introduction of fungal gene sequences in sense and antisense orientation mediated by the BMV silencing vectors significantly enhanced the efficiency of this host-generated trans-specific RNAi, implying that these fungal genes played crucial roles in pathogenicity. Collectively, our results indicated that BMV-HIGS system was a great strategy for protecting host plants against the invasion of pathogenic fungi.

  12. Gene expression patterns combined with network analysis identify hub genes associated with bladder cancer.

    Science.gov (United States)

    Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia

    2015-06-01

    To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. Gene analogue finder: a GRID solution for finding functionally analogous gene products

    Directory of Open Access Journals (Sweden)

    Licciulli Flavio

    2007-09-01

    Full Text Available Abstract Background To date more than 2,1 million gene products from more than 100000 different species have been described specifying their function, the processes they are involved in and their cellular localization using a very well defined and structured vocabulary, the gene ontology (GO. Such vast, well defined knowledge opens the possibility of compare gene products at the level of functionality, finding gene products which have a similar function or are involved in similar biological processes without relying on the conventional sequence similarity approach. Comparisons within such a large space of knowledge are highly data and computing intensive. For this reason this project was based upon the use of the computational GRID, a technology offering large computing and storage resources. Results We have developed a tool, GENe AnaloGue FINdEr (ENGINE that parallelizes the search process and distributes the calculation and data over the computational GRID, splitting the process into many sub-processes and joining the calculation and the data on the same machine and therefore completing the whole search in about 3 days instead of occupying one single machine for more than 5 CPU years. The results of the functional comparison contain potential functional analogues for more than 79000 gene products from the most important species. 46% of the analyzed gene products are well enough described for such an analysis to individuate functional analogues, such as well-known members of the same gene family, or gene products with similar functions which would never have been associated by standard methods. Conclusion ENGINE has produced a list of potential functionally analogous relations between gene products within and between species using, in place of the sequence, the gene description of the GO, thus demonstrating the potential of the GO. However, the current limiting factor is the quality of the associations of many gene products from non

  14. GOBO: gene expression-based outcome for breast cancer online.

    Directory of Open Access Journals (Sweden)

    Markus Ringnér

    Full Text Available Microarray-based gene expression analysis holds promise of improving prognostication and treatment decisions for breast cancer patients. However, the heterogeneity of breast cancer emphasizes the need for validation of prognostic gene signatures in larger sample sets stratified into relevant subgroups. Here, we describe a multifunctional user-friendly online tool, GOBO (http://co.bmc.lu.se/gobo, allowing a range of different analyses to be performed in an 1881-sample breast tumor data set, and a 51-sample breast cancer cell line set, both generated on Affymetrix U133A microarrays. GOBO supports a wide range of applications including: 1 rapid assessment of gene expression levels in subgroups of breast tumors and cell lines, 2 identification of co-expressed genes for creation of potential metagenes, 3 association with outcome for gene expression levels of single genes, sets of genes, or gene signatures in multiple subgroups of the 1881-sample breast cancer data set. The design and implementation of GOBO facilitate easy incorporation of additional query functions and applications, as well as additional data sets irrespective of tumor type and array platform.

  15. Virus-induced gene silencing in diverse maize lines using the Brome Mosaic virus-based silencing vector

    Science.gov (United States)

    Virus-induced gene silencing (VIGS) is a widely used tool for gene function studies in many plant species, though its use in monocots has been limited. Using a Brome mosaic virus (BMV) vector designed to silence the maize phytoene desaturase gene, a genetically diverse set of maize inbred lines was ...

  16. Tools for neuroanatomy and neurogenetics in Drosophila

    Energy Technology Data Exchange (ETDEWEB)

    Pfeiffer, Barret D.; Jenett, Arnim; Hammonds, Ann S.; Ngo, Teri-T B.; Misra, Sima; Murphy, Christine; Scully, Audra; Carlson, Joseph W.; Wan, Kenneth H.; Laverty, Todd R.; Mungall, Chris; Svirskas, Rob; Kadonaga, James T.; Doe, Chris Q.; Eisen, Michael B.; Celniker, Susan E.; Rubin, Gerald M.

    2008-08-11

    We demonstrate the feasibility of generating thousands of transgenic Drosophila melanogaster lines in which the expression of an exogenous gene is reproducibly directed to distinct small subsets of cells in the adult brain. We expect the expression patterns produced by the collection of 5,000 lines that we are currently generating to encompass all neurons in the brain in a variety of intersecting patterns. Overlapping 3-kb DNA fragments from the flanking noncoding and intronic regions of genes thought to have patterned expression in the adult brain were inserted into a defined genomic location by site-specific recombination. These fragments were then assayed for their ability to function as transcriptional enhancers in conjunction with a synthetic core promoter designed to work with a wide variety of enhancer types. An analysis of 44 fragments from four genes found that >80% drive expression patterns in the brain; the observed patterns were, on average, comprised of <100 cells. Our results suggest that the D. melanogaster genome contains >50,000 enhancers and that multiple enhancers drive distinct subsets of expression of a gene in each tissue and developmental stage. We expect that these lines will be valuable tools for neuroanatomy as well as for the elucidation of neuronal circuits and information flow in the fly brain.

  17. MVisAGe Identifies Concordant and Discordant Genomic Alterations of Driver Genes in Squamous Tumors.

    Science.gov (United States)

    Walter, Vonn; Du, Ying; Danilova, Ludmila; Hayward, Michele C; Hayes, D Neil

    2018-06-15

    Integrated analyses of multiple genomic datatypes are now common in cancer profiling studies. Such data present opportunities for numerous computational experiments, yet analytic pipelines are limited. Tools such as the cBioPortal and Regulome Explorer, although useful, are not easy to access programmatically or to implement locally. Here, we introduce the MVisAGe R package, which allows users to quantify gene-level associations between two genomic datatypes to investigate the effect of genomic alterations (e.g., DNA copy number changes on gene expression). Visualizing Pearson/Spearman correlation coefficients according to the genomic positions of the underlying genes provides a powerful yet novel tool for conducting exploratory analyses. We demonstrate its utility by analyzing three publicly available cancer datasets. Our approach highlights canonical oncogenes in chr11q13 that displayed the strongest associations between expression and copy number, including CCND1 and CTTN , genes not identified by copy number analysis in the primary reports. We demonstrate highly concordant usage of shared oncogenes on chr3q, yet strikingly diverse oncogene usage on chr11q as a function of HPV infection status. Regions of chr19 that display remarkable associations between methylation and gene expression were identified, as were previously unreported miRNA-gene expression associations that may contribute to the epithelial-to-mesenchymal transition. Significance: This study presents an important bioinformatics tool that will enable integrated analyses of multiple genomic datatypes. Cancer Res; 78(12); 3375-85. ©2018 AACR . ©2018 American Association for Cancer Research.

  18. CRISPR-Cas9: a promising tool for gene editing on induced pluripotent stem cells.

    Science.gov (United States)

    Kim, Eun Ji; Kang, Ki Ho; Ju, Ji Hyeon

    2017-01-01

    Recent advances in genome editing with programmable nucleases have opened up new avenues for multiple applications, from basic research to clinical therapy. The ease of use of the technology-and particularly clustered regularly interspaced short palindromic repeats (CRISPR)-will allow us to improve our understanding of genomic variation in disease processes via cellular and animal models. Here, we highlight the progress made in correcting gene mutations in monogenic hereditary disorders and discuss various CRISPR-associated applications, such as cancer research, synthetic biology, and gene therapy using induced pluripotent stem cells. The challenges, ethical issues, and future prospects of CRISPR-based systems for human research are also discussed.

  19. CRISPR-Cas9: a promising tool for gene editing on induced pluripotent stem cells

    Science.gov (United States)

    Kim, Eun Ji; Kang, Ki Ho; Ju, Ji Hyeon

    2017-01-01

    Recent advances in genome editing with programmable nucleases have opened up new avenues for multiple applications, from basic research to clinical therapy. The ease of use of the technology—and particularly clustered regularly interspaced short palindromic repeats (CRISPR)—will allow us to improve our understanding of genomic variation in disease processes via cellular and animal models. Here, we highlight the progress made in correcting gene mutations in monogenic hereditary disorders and discuss various CRISPR-associated applications, such as cancer research, synthetic biology, and gene therapy using induced pluripotent stem cells. The challenges, ethical issues, and future prospects of CRISPR-based systems for human research are also discussed. PMID:28049282

  20. Tools for the Validation of Genomes and Transcriptomes with Proteomics data

    DEFF Research Database (Denmark)

    Pang, Chi Nam Ignatius; Aya, Carlos; Tay, Aidan

    data generated from protein mass spectrometry. We are developing a set of tools which allow users to: •Co-visualise genomics, transcriptomics, and proteomics data using the Integrated Genomics Viewer (IGV).1 •Validate the existence of genes and mRNAs using peptides identified from mass spectrometry...

  1. Transgenic tools to characterize neuronal properties of discrete populations of zebrafish neurons.

    Science.gov (United States)

    Satou, Chie; Kimura, Yukiko; Hirata, Hiromi; Suster, Maximiliano L; Kawakami, Koichi; Higashijima, Shin-ichi

    2013-09-01

    The developing nervous system consists of a variety of cell types. Transgenic animals expressing reporter genes in specific classes of neuronal cells are powerful tools for the study of neuronal network formation. We generated a wide variety of transgenic zebrafish that expressed reporter genes in specific classes of neurons or neuronal progenitors. These include lines in which neurons of specific neurotransmitter phenotypes expressed fluorescent proteins or Gal4, and lines in which specific subsets of the dorsal progenitor domain in the spinal cord expressed fluorescent proteins. Using these, we examined domain organization in the developing dorsal spinal cord, and found that there are six progenitor domains in zebrafish, which is similar to the domain organization in mice. We also systematically characterized neurotransmitter properties of the neurons that are produced from each domain. Given that reporter gene expressions occurs in a wide area of the nervous system in the lines generated, these transgenic fish should serve as powerful tools for the investigation of not only the neurons in the dorsal spinal cord but also neuronal structures and functions in many other regions of the nervous system.

  2. Annotating novel genes by integrating synthetic lethals and genomic information

    Directory of Open Access Journals (Sweden)

    Faty Mahamadou

    2008-01-01

    Full Text Available Abstract Background Large scale screening for synthetic lethality serves as a common tool in yeast genetics to systematically search for genes that play a role in specific biological processes. Often the amounts of data resulting from a single large scale screen far exceed the capacities of experimental characterization of every identified target. Thus, there is need for computational tools that select promising candidate genes in order to reduce the number of follow-up experiments to a manageable size. Results We analyze synthetic lethality data for arp1 and jnm1, two spindle migration genes, in order to identify novel members in this process. To this end, we use an unsupervised statistical method that integrates additional information from biological data sources, such as gene expression, phenotypic profiling, RNA degradation and sequence similarity. Different from existing methods that require large amounts of synthetic lethal data, our method merely relies on synthetic lethality information from two single screens. Using a Multivariate Gaussian Mixture Model, we determine the best subset of features that assign the target genes to two groups. The approach identifies a small group of genes as candidates involved in spindle migration. Experimental testing confirms the majority of our candidates and we present she1 (YBL031W as a novel gene involved in spindle migration. We applied the statistical methodology also to TOR2 signaling as another example. Conclusion We demonstrate the general use of Multivariate Gaussian Mixture Modeling for selecting candidate genes for experimental characterization from synthetic lethality data sets. For the given example, integration of different data sources contributes to the identification of genetic interaction partners of arp1 and jnm1 that play a role in the same biological process.

  3. A random variance model for detection of differential gene expression in small microarray experiments.

    Science.gov (United States)

    Wright, George W; Simon, Richard M

    2003-12-12

    Microarray techniques provide a valuable way of characterizing the molecular nature of disease. Unfortunately expense and limited specimen availability often lead to studies with small sample sizes. This makes accurate estimation of variability difficult, since variance estimates made on a gene by gene basis will have few degrees of freedom, and the assumption that all genes share equal variance is unlikely to be true. We propose a model by which the within gene variances are drawn from an inverse gamma distribution, whose parameters are estimated across all genes. This results in a test statistic that is a minor variation of those used in standard linear models. We demonstrate that the model assumptions are valid on experimental data, and that the model has more power than standard tests to pick up large changes in expression, while not increasing the rate of false positives. This method is incorporated into BRB-ArrayTools version 3.0 (http://linus.nci.nih.gov/BRB-ArrayTools.html). ftp://linus.nci.nih.gov/pub/techreport/RVM_supplement.pdf

  4. Gene-associated markers provide tools for tackling illegal fishing and false eco-certification

    DEFF Research Database (Denmark)

    Eg Nielsen, Einar; Cariani, Alessia; Aoidh, Eoin Mac

    2012-01-01

    certificates and eco-labels are urgently needed. Here we show that, by using gene-associated single nucleotide polymorphisms, individual marine fish can be assigned back to population of origin with unprecedented high levels of precision. By applying high differentiation single nucleotide polymorphism assays...

  5. Gene therapy for the inner ear: challenges and promises.

    Science.gov (United States)

    Ryan, Allen F; Dazert, Stefan

    2009-01-01

    Since the recognition of genes as the discrete units of heritability, and of DNA as their molecular substrate, the utilization of genes for therapeutic purposes has been recognized as a potential means of correcting genetic disorders. The tools of molecular biology, which allow the manipulation of DNA sequence, provided the means to put this concept into practice. However, progress in the implementation of these ideas has been slow. Here we review the history of the idea of gene therapy and the complexity of genetic disorders. We also discuss the requirements for sequence-based therapy to be accomplished for different types of inherited diseases, as well as the methods available for gene manipulation. The challenges that have limited the applications of gene therapy are reviewed, as are ethical concerns. Finally, we discuss the promise of gene therapy to address inherited and acquired disorders of the inner ear. Copyright (c) 2009 S. Karger AG, Basel.

  6. GeneTrailExpress: a web-based pipeline for the statistical evaluation of microarray experiments

    Directory of Open Access Journals (Sweden)

    Kohlbacher Oliver

    2008-12-01

    Full Text Available Abstract Background High-throughput methods that allow for measuring the expression of thousands of genes or proteins simultaneously have opened new avenues for studying biochemical processes. While the noisiness of the data necessitates an extensive pre-processing of the raw data, the high dimensionality requires effective statistical analysis methods that facilitate the identification of crucial biological features and relations. For these reasons, the evaluation and interpretation of expression data is a complex, labor-intensive multi-step process. While a variety of tools for normalizing, analysing, or visualizing expression profiles has been developed in the last years, most of these tools offer only functionality for accomplishing certain steps of the evaluation pipeline. Results Here, we present a web-based toolbox that provides rich functionality for all steps of the evaluation pipeline. Our tool GeneTrailExpress offers besides standard normalization procedures powerful statistical analysis methods for studying a large variety of biological categories and pathways. Furthermore, an integrated graph visualization tool, BiNA, enables the user to draw the relevant biological pathways applying cutting-edge graph-layout algorithms. Conclusion Our gene expression toolbox with its interactive visualization of the pathways and the expression values projected onto the nodes will simplify the analysis and interpretation of biochemical pathways considerably.

  7. High-performance web services for querying gene and variant annotation.

    Science.gov (United States)

    Xin, Jiwen; Mark, Adam; Afrasiabi, Cyrus; Tsueng, Ginger; Juchler, Moritz; Gopal, Nikhil; Stupp, Gregory S; Putman, Timothy E; Ainscough, Benjamin J; Griffith, Obi L; Torkamani, Ali; Whetzel, Patricia L; Mungall, Christopher J; Mooney, Sean D; Su, Andrew I; Wu, Chunlei

    2016-05-06

    Efficient tools for data management and integration are essential for many aspects of high-throughput biology. In particular, annotations of genes and human genetic variants are commonly used but highly fragmented across many resources. Here, we describe MyGene.info and MyVariant.info, high-performance web services for querying gene and variant annotation information. These web services are currently accessed more than three million times permonth. They also demonstrate a generalizable cloud-based model for organizing and querying biological annotation information. MyGene.info and MyVariant.info are provided as high-performance web services, accessible at http://mygene.info and http://myvariant.info . Both are offered free of charge to the research community.

  8. Development of an anhydrotetracycline-inducible gene expression system for solvent-producing Clostridium acetobutylicum: A useful tool for strain engineering.

    Science.gov (United States)

    Dong, Hongjun; Tao, Wenwen; Zhang, Yanping; Li, Yin

    2012-01-01

    Clostridium acetobutylicum is an important solvent (acetone-butanol-ethanol) producing bacterium. However, a stringent, effective, and convenient-to-use inducible gene expression system that can be used for regulating the gene expression strength in C. acetobutylicum is currently not available. Here, we report an anhydrotetracycline-inducible gene expression system for solvent-producing bacterium C. acetobutylicum. This system consists of a functional chloramphenicol acetyltransferase gene promoter containing tet operators (tetO), Pthl promoter (thiolase gene promoter from C. acetobutylicum) controlling TetR repressor expression cassette, and the chemical inducer anhydrotetracycline (aTc). The optimized system, designated as pGusA2-2tetO1, allows gene regulation in an inducer aTc concentration-dependent way, with an inducibility of over two orders of magnitude. The stringency of TetR repression supports the introduction of the genes encoding counterselective marker into C. acetobutylicum, which can be used to increase the mutant screening efficiency. This aTc-inducible gene expression system will thus increase the genetic manipulation capability for engineering C. acetobutylicum. Copyright © 2011 Elsevier Inc. All rights reserved.

  9. Jane: a new tool for the cophylogeny reconstruction problem

    Directory of Open Access Journals (Sweden)

    Ovadia Yaniv

    2010-02-01

    Full Text Available Abstract Background This paper describes the theory and implementation of a new software tool, called Jane, for the study of historical associations. This problem arises in parasitology (associations of hosts and parasites, molecular systematics (associations of orderings and genes, and biogeography (associations of regions and orderings. The underlying problem is that of reconciling pairs of trees subject to biologically plausible events and costs associated with these events. Existing software tools for this problem have strengths and limitations, and the new Jane tool described here provides functionality that complements existing tools. Results The Jane software tool uses a polynomial time dynamic programming algorithm in conjunction with a genetic algorithm to find very good, and often optimal, solutions even for relatively large pairs of trees. The tool allows the user to provide rich timing information on both the host and parasite trees. In addition the user can limit host switch distance and specify multiple host switch costs by specifying regions in the host tree and costs for host switches between pairs of regions. Jane also provides a graphical user interface that allows the user to interactively experiment with modifications to the solutions found by the program. Conclusions Jane is shown to be a useful tool for cophylogenetic reconstruction. Its functionality complements existing tools and it is therefore likely to be of use to researchers in the areas of parasitology, molecular systematics, and biogeography.

  10. CRISPR/Cas9-mediated gene targeting in Arabidopsis using sequential transformation.

    Science.gov (United States)

    Miki, Daisuke; Zhang, Wenxin; Zeng, Wenjie; Feng, Zhengyan; Zhu, Jian-Kang

    2018-05-17

    Homologous recombination-based gene targeting is a powerful tool for precise genome modification and has been widely used in organisms ranging from yeast to higher organisms such as Drosophila and mouse. However, gene targeting in higher plants, including the most widely used model plant Arabidopsis thaliana, remains challenging. Here we report a sequential transformation method for gene targeting in Arabidopsis. We find that parental lines expressing the bacterial endonuclease Cas9 from the egg cell- and early embryo-specific DD45 gene promoter can improve the frequency of single-guide RNA-targeted gene knock-ins and sequence replacements via homologous recombination at several endogenous sites in the Arabidopsis genome. These heritable gene targeting can be identified by regular PCR. Our approach enables routine and fine manipulation of the Arabidopsis genome.

  11. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E.; Re, Matteo

    2014-01-01

    Objective In the context of “network medicine”, gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. Materials and methods We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. Results The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different “informativeness” embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Conclusions Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further

  12. Automated Eukaryotic Gene Structure Annotation Using EVidenceModeler and the Program to Assemble Spliced Alignments

    Energy Technology Data Exchange (ETDEWEB)

    Haas, B J; Salzberg, S L; Zhu, W; Pertea, M; Allen, J E; Orvis, J; White, O; Buell, C R; Wortman, J R

    2007-12-10

    EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

  13. Major genes and QTL influencing wool production and quality: a review

    Directory of Open Access Journals (Sweden)

    Purvis Ian

    2005-12-01

    Full Text Available Abstract The opportunity exists to utilise our knowledge of major genes that influence the economically important traits in wool sheep. Genes with Mendelian inheritance have been identified for many important traits in wool sheep. Of particular importance are genes influencing pigmentation, wool quality and the keratin proteins, the latter of which are important for the morphology of the wool fibre. Gene mapping studies have identified some chromosomal regions associated with variation in wool quality and production traits. The challenge now is to build on this knowledge base in a cost-effective way to deliver molecular tools that facilitate enhanced genetic improvement programs for wool sheep.

  14. Transcriptome profiling and digital gene expression analysis of genes associated with salinity resistance in peanut

    Directory of Open Access Journals (Sweden)

    Jiongming Sui

    2018-03-01

    Full Text Available Background: Soil salinity can significantly reduce crop production, but the molecular mechanism of salinity tolerance in peanut is poorly understood. A mutant (S1 with higher salinity resistance than its mutagenic parent HY22 (S3 was obtained. Transcriptome sequencing and digital gene expression (DGE analysis were performed with leaves of S1 and S3 before and after plants were irrigated with 250 mM NaCl. Results: A total of 107,725 comprehensive transcripts were assembled into 67,738 unigenes using TIGR Gene Indices clustering tools (TGICL. All unigenes were searched against the euKaryotic Ortholog Groups (KOG, gene ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG databases, and these unigenes were assigned to 26 functional KOG categories, 56 GO terms, 32 KEGG groups, respectively. In total 112 differentially expressed genes (DEGs between S1 and S3 after salinity stress were screened, among them, 86 were responsive to salinity stress in S1 and/or S3. These 86 DEGs included genes that encoded the following kinds of proteins that are known to be involved in resistance to salinity stress: late embryogenesis abundant proteins (LEAs, major intrinsic proteins (MIPs or aquaporins, metallothioneins (MTs, lipid transfer protein (LTP, calcineurin B-like protein-interacting protein kinases (CIPKs, 9-cis-epoxycarotenoid dioxygenase (NCED and oleosins, etc. Of these 86 DEGs, 18 could not be matched with known proteins. Conclusion: The results from this study will be useful for further research on the mechanism of salinity resistance and will provide a useful gene resource for the variety breeding of salinity resistance in peanut. Keywords: Digital gene expression, Gene, Mutant, NaCl, Peanut (Arachis hypogaea L., RNA-seq, Salinity stress, Salinity tolerance, Soil salinity, Transcripts, Unigenes

  15. Nonviral Technologies for Gene Therapy in Cardiovascular Research

    Directory of Open Access Journals (Sweden)

    Cheng-Huang Su

    2008-06-01

    Full Text Available Gene therapy, which is still at an experimental stage, is a technique that attempts to correct or prevent a disease by delivering genes into an individual's cells and tissues. In gene delivery, a vector is a vehicle for transferring genetic material into cells and tissues. Synthetic vectors are considered to be prerequisites for gene delivery, because viral vectors have fundamental problems in relation to safety issues as well as large-scale production. Among the physical approaches, ultrasound with its associated bioeffects such as acoustic cavitation, especially inertial cavitation, can increase the permeability of cell membranes to macromolecules such as plasmid DNA. Microbubbles or ultrasound contrast agents lower the threshold for cavitation by ultrasound energy. Furthermore, ultrasound-enhanced gene delivery using polymers or other nonviral vectors may hold much promise for the future but is currently at the preclinical stage. We all know aging is cruel and inevitable. Currently, among the promising areas for gene therapy in acquired diseases, the incidences of cancer and ischemic cardiovascular diseases are strongly correlated with the aging process. As a result, gene therapy technology may play important roles in these diseases in the future. This brief review focuses on understanding the barriers to gene transfer as well as describing the useful nonviral vectors or tools that are applied to gene delivery and introducing feasible models in terms of ultrasound-based gene delivery.

  16. Fast and sensitive detection of indels induced by precise gene targeting

    DEFF Research Database (Denmark)

    Yang, Zhang; Steentoft, Catharina; Hauge, Camilla

    2015-01-01

    The nuclease-based gene editing tools are rapidly transforming capabilities for altering the genome of cells and organisms with great precision and in high throughput studies. A major limitation in application of precise gene editing lies in lack of sensitive and fast methods to detect...... and characterize the induced DNA changes. Precise gene editing induces double-stranded DNA breaks that are repaired by error-prone non-homologous end joining leading to introduction of insertions and deletions (indels) at the target site. These indels are often small and difficult and laborious to detect...

  17. Landscape genetics as a tool for conservation planning: predicting the effects of landscape change on gene flow.

    Science.gov (United States)

    van Strien, Maarten J; Keller, Daniela; Holderegger, Rolf; Ghazoul, Jaboury; Kienast, Felix; Bolliger, Janine

    2014-03-01

    For conservation managers, it is important to know whether landscape changes lead to increasing or decreasing gene flow. Although the discipline of landscape genetics assesses the influence of landscape elements on gene flow, no studies have yet used landscape-genetic models to predict gene flow resulting from landscape change. A species that has already been severely affected by landscape change is the large marsh grasshopper (Stethophyma grossum), which inhabits moist areas in fragmented agricultural landscapes in Switzerland. From transects drawn between all population pairs within maximum dispersal distance (landscape composition as well as some measures of habitat configuration. Additionally, a complete sampling of all populations in our study area allowed incorporating measures of population topology. These measures together with the landscape metrics formed the predictor variables in linear models with gene flow as response variable (F(ST) and mean pairwise assignment probability). With a modified leave-one-out cross-validation approach, we selected the model with the highest predictive accuracy. With this model, we predicted gene flow under several landscape-change scenarios, which simulated construction, rezoning or restoration projects, and the establishment of a new population. For some landscape-change scenarios, significant increase or decrease in gene flow was predicted, while for others little change was forecast. Furthermore, we found that the measures of population topology strongly increase model fit in landscape genetic analysis. This study demonstrates the use of predictive landscape-genetic models in conservation and landscape planning.

  18. GeneViTo: Visualizing gene-product functional and structural features in genomic datasets

    Directory of Open Access Journals (Sweden)

    Promponas Vasilis J

    2003-10-01

    Full Text Available Abstract Background The availability of increasing amounts of sequence data from completely sequenced genomes boosts the development of new computational methods for automated genome annotation and comparative genomics. Therefore, there is a need for tools that facilitate the visualization of raw data and results produced by bioinformatics analysis, providing new means for interactive genome exploration. Visual inspection can be used as a basis to assess the quality of various analysis algorithms and to aid in-depth genomic studies. Results GeneViTo is a JAVA-based computer application that serves as a workbench for genome-wide analysis through visual interaction. The application deals with various experimental information concerning both DNA and protein sequences (derived from public sequence databases or proprietary data sources and meta-data obtained by various prediction algorithms, classification schemes or user-defined features. Interaction with a Graphical User Interface (GUI allows easy extraction of genomic and proteomic data referring to the sequence itself, sequence features, or general structural and functional features. Emphasis is laid on the potential comparison between annotation and prediction data in order to offer a supplement to the provided information, especially in cases of "poor" annotation, or an evaluation of available predictions. Moreover, desired information can be output in high quality JPEG image files for further elaboration and scientific use. A compilation of properly formatted GeneViTo input data for demonstration is available to interested readers for two completely sequenced prokaryotes, Chlamydia trachomatis and Methanococcus jannaschii. Conclusions GeneViTo offers an inspectional view of genomic functional elements, concerning data stemming both from database annotation and analysis tools for an overall analysis of existing genomes. The application is compatible with Linux or Windows ME-2000-XP operating

  19. Regulation of endogenous human gene expression by ligand-inducible TALE transcription factors.

    Science.gov (United States)

    Mercer, Andrew C; Gaj, Thomas; Sirk, Shannon J; Lamb, Brian M; Barbas, Carlos F

    2014-10-17

    The construction of increasingly sophisticated synthetic biological circuits is dependent on the development of extensible tools capable of providing specific control of gene expression in eukaryotic cells. Here, we describe a new class of synthetic transcription factors that activate gene expression in response to extracellular chemical stimuli. These inducible activators consist of customizable transcription activator-like effector (TALE) proteins combined with steroid hormone receptor ligand-binding domains. We demonstrate that these ligand-responsive TALE transcription factors allow for tunable and conditional control of gene activation and can be used to regulate the expression of endogenous genes in human cells. Since TALEs can be designed to recognize any contiguous DNA sequence, the conditional gene regulatory system described herein will enable the design of advanced synthetic gene networks.

  20. Bioinformatics Database Tools in Analysis of Genetics of Neurodevelopmental Disorders

    Directory of Open Access Journals (Sweden)

    Dibyashree Mallik

    2017-10-01

    Full Text Available Bioinformatics tools are recently used in various sectors of biology. Many questions regarding Neurodevelopmental disorder which arises as a major health issue recently can be solved by using various bioinformatics databases. Schizophrenia is such a mental disorder which is now arises as a major threat in young age people because it is mostly seen in case of people during their late adolescence or early adulthood period. Databases like DISGENET, GWAS, PHARMGKB, and DRUGBANK have huge repository of genes associated with schizophrenia. We found a lot of genes are being associated with schizophrenia, but approximately 200 genes are found to be present in any of these databases. After further screening out process 20 genes are found to be highly associated with each other and are also a common genes in many other diseases also. It is also found that they all are serves as a common targeting gene in many antipsychotic drugs. After analysis of various biological properties, molecular function it is found that these 20 genes are mostly involved in biological regulation process and are having receptor activity. They are belonging mainly to receptor protein class. Among these 20 genes CYP2C9, CYP3A4, DRD2, HTR1A, HTR2A are shown to be a main targeting genes of most of the antipsychotic drugs and are associated with  more than 40% diseases. The basic findings of the present study enumerated that a suitable combined drug can be design by targeting these genes which can be used for the better treatment of schizophrenia.

  1. Simulation of E. coli gene regulation including overlapping cell cycles, growth, division, time delays and noise.

    Directory of Open Access Journals (Sweden)

    Ruoyu Luo

    Full Text Available Due to the complexity of biological systems, simulation of biological networks is necessary but sometimes complicated. The classic stochastic simulation algorithm (SSA by Gillespie and its modified versions are widely used to simulate the stochastic dynamics of biochemical reaction systems. However, it has remained a challenge to implement accurate and efficient simulation algorithms for general reaction schemes in growing cells. Here, we present a modeling and simulation tool, called 'GeneCircuits', which is specifically developed to simulate gene-regulation in exponentially growing bacterial cells (such as E. coli with overlapping cell cycles. Our tool integrates three specific features of these cells that are not generally included in SSA tools: 1 the time delay between the regulation and synthesis of proteins that is due to transcription and translation processes; 2 cell cycle-dependent periodic changes of gene dosage; and 3 variations in the propensities of chemical reactions that have time-dependent reaction rates as a consequence of volume expansion and cell division. We give three biologically relevant examples to illustrate the use of our simulation tool in quantitative studies of systems biology and synthetic biology.

  2. A compilation of Web-based research tools for miRNA analysis.

    Science.gov (United States)

    Shukla, Vaibhav; Varghese, Vinay Koshy; Kabekkodu, Shama Prasada; Mallya, Sandeep; Satyamoorthy, Kapaettu

    2017-09-01

    Since the discovery of microRNAs (miRNAs), a class of noncoding RNAs that regulate the gene expression posttranscriptionally in sequence-specific manner, there has been a release of number of tools useful for both basic and advanced applications. This is because of the significance of miRNAs in many pathophysiological conditions including cancer. Numerous bioinformatics tools that have been developed for miRNA analysis have their utility for detection, expression, function, target prediction and many other related features. This review provides a comprehensive assessment of web-based tools for the miRNA analysis that does not require prior knowledge of any computing languages. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  3. atBioNet– an integrated network analysis tool for genomics and biomarker discovery

    Directory of Open Access Journals (Sweden)

    Ding Yijun

    2012-07-01

    Full Text Available Abstract Background Large amounts of mammalian protein-protein interaction (PPI data have been generated and are available for public use. From a systems biology perspective, Proteins/genes interactions encode the key mechanisms distinguishing disease and health, and such mechanisms can be uncovered through network analysis. An effective network analysis tool should integrate different content-specific PPI databases into a comprehensive network format with a user-friendly platform to identify key functional modules/pathways and the underlying mechanisms of disease and toxicity. Results atBioNet integrates seven publicly available PPI databases into a network-specific knowledge base. Knowledge expansion is achieved by expanding a user supplied proteins/genes list with interactions from its integrated PPI network. The statistically significant functional modules are determined by applying a fast network-clustering algorithm (SCAN: a Structural Clustering Algorithm for Networks. The functional modules can be visualized either separately or together in the context of the whole network. Integration of pathway information enables enrichment analysis and assessment of the biological function of modules. Three case studies are presented using publicly available disease gene signatures as a basis to discover new biomarkers for acute leukemia, systemic lupus erythematosus, and breast cancer. The results demonstrated that atBioNet can not only identify functional modules and pathways related to the studied diseases, but this information can also be used to hypothesize novel biomarkers for future analysis. Conclusion atBioNet is a free web-based network analysis tool that provides a systematic insight into proteins/genes interactions through examining significant functional modules. The identified functional modules are useful for determining underlying mechanisms of disease and biomarker discovery. It can be accessed at: http://www.fda.gov/ScienceResearch/BioinformaticsTools

  4. atBioNet--an integrated network analysis tool for genomics and biomarker discovery.

    Science.gov (United States)

    Ding, Yijun; Chen, Minjun; Liu, Zhichao; Ding, Don; Ye, Yanbin; Zhang, Min; Kelly, Reagan; Guo, Li; Su, Zhenqiang; Harris, Stephen C; Qian, Feng; Ge, Weigong; Fang, Hong; Xu, Xiaowei; Tong, Weida

    2012-07-20

    Large amounts of mammalian protein-protein interaction (PPI) data have been generated and are available for public use. From a systems biology perspective, Proteins/genes interactions encode the key mechanisms distinguishing disease and health, and such mechanisms can be uncovered through network analysis. An effective network analysis tool should integrate different content-specific PPI databases into a comprehensive network format with a user-friendly platform to identify key functional modules/pathways and the underlying mechanisms of disease and toxicity. atBioNet integrates seven publicly available PPI databases into a network-specific knowledge base. Knowledge expansion is achieved by expanding a user supplied proteins/genes list with interactions from its integrated PPI network. The statistically significant functional modules are determined by applying a fast network-clustering algorithm (SCAN: a Structural Clustering Algorithm for Networks). The functional modules can be visualized either separately or together in the context of the whole network. Integration of pathway information enables enrichment analysis and assessment of the biological function of modules. Three case studies are presented using publicly available disease gene signatures as a basis to discover new biomarkers for acute leukemia, systemic lupus erythematosus, and breast cancer. The results demonstrated that atBioNet can not only identify functional modules and pathways related to the studied diseases, but this information can also be used to hypothesize novel biomarkers for future analysis. atBioNet is a free web-based network analysis tool that provides a systematic insight into proteins/genes interactions through examining significant functional modules. The identified functional modules are useful for determining underlying mechanisms of disease and biomarker discovery. It can be accessed at: http://www.fda.gov/ScienceResearch/BioinformaticsTools/ucm285284.htm.

  5. The carnegie protein trap library: a versatile tool for Drosophila developmental studies.

    Science.gov (United States)

    Buszczak, Michael; Paterno, Shelley; Lighthouse, Daniel; Bachman, Julia; Planck, Jamie; Owen, Stephenie; Skora, Andrew D; Nystul, Todd G; Ohlstein, Benjamin; Allen, Anna; Wilhelm, James E; Murphy, Terence D; Levis, Robert W; Matunis, Erika; Srivali, Nahathai; Hoskins, Roger A; Spradling, Allan C

    2007-03-01

    Metazoan physiology depends on intricate patterns of gene expression that remain poorly known. Using transposon mutagenesis in Drosophila, we constructed a library of 7404 protein trap and enhancer trap lines, the Carnegie collection, to facilitate gene expression mapping at single-cell resolution. By sequencing the genomic insertion sites, determining splicing patterns downstream of the enhanced green fluorescent protein (EGFP) exon, and analyzing expression patterns in the ovary and salivary gland, we found that 600-900 different genes are trapped in our collection. A core set of 244 lines trapped different identifiable protein isoforms, while insertions likely to act as GFP-enhancer traps were found in 256 additional genes. At least 8 novel genes were also identified. Our results demonstrate that the Carnegie collection will be useful as a discovery tool in diverse areas of cell and developmental biology and suggest new strategies for greatly increasing the coverage of the Drosophila proteome with protein trap insertions.

  6. Visualization of the Dynamics of Gene Expression in the Living Mouse

    Directory of Open Access Journals (Sweden)

    Amy Ryan

    2004-01-01

    Full Text Available Reporter genes can monitor the status and activity of recombinant genomes in a diverse array of organisms, from bacteria and yeast to plants and animals. We have combined luciferase reporter genes with a conditional gene expression system based on regulatory elements from the lac Operon of Escherichia coli to visualize the dynamics of gene expression in realtime in the living mouse. Using this technology, we have determined the rate of gene induction and repression, the level of target gene activity in response to different doses of inducer, and the schedule of induction during early embryogenesis of both the endogenous and the experimentally manipulated programs of mammalian gene expression associated with the HD/Hdh locus. The combination of in vivo imaging and lac regulation is a powerful tool for generating conditional transgenic mice that can be screened rapidly for optimal regulation and expression patterns, and for monitoring the induction and repression of regulated genes noninvasively in the living animal.

  7. NuGO contributions to GenePattern.

    Science.gov (United States)

    De Groot, P J; Reiff, C; Mayer, C; Müller, M

    2008-12-01

    NuGO, the European Nutrigenomics Organization, utilizes 31 powerful computers for, e.g., data storage and analysis. These so-called black boxes (NBXses) are located at the sites of different partners. NuGO decided to use GenePattern as the preferred genomic analysis tool on each NBX. To handle the custom made Affymetrix NuGO arrays, new NuGO modules are added to GenePattern. These NuGO modules execute the latest Bioconductor version ensuring up-to-date annotations and access to the latest scientific developments. The following GenePattern modules are provided by NuGO: NuGOArrayQualityAnalysis for comprehensive quality control, NuGOExpressionFileCreator for import and normalization of data, LimmaAnalysis for identification of differentially expressed genes, TopGoAnalysis for calculation of GO enrichment, and GetResultForGo for retrieval of information on genes associated with specific GO terms. All together, these NuGO modules allow comprehensive, up-to-date, and user friendly analysis of Affymetrix data. A special feature of the NuGO modules is that for analysis they allow the use of either the standard Affymetrix or the MBNI custom CDF-files, which remap probes based on current knowledge. In both cases a .chip-file is created to enable GSEA analysis. The NuGO GenePattern installations are distributed as binary Ubuntu (.deb) packages via the NuGO repository.

  8. Design and bioinformatics analysis of novel biomimetic peptides as nanocarriers for gene transfer

    Directory of Open Access Journals (Sweden)

    Asia Majidi

    2015-01-01

    Full Text Available Objective(s: The introduction of nucleic acids into cells for therapeutic objectives is significantly hindered by the size and charge of these molecules and therefore requires efficient vectors that assist cellular uptake. For several years great efforts have been devoted to the study of development of recombinant vectors based on biological domains with potential applications in gene therapy. Such vectors have been synthesized in genetically engineered approach, resulting in biomacromolecules with new properties that are not present in nature. Materials and Methods: In this study, we have designed new peptides using homology modeling with the purpose of overcoming the cell barriers for successful gene delivery through Bioinformatics tools. Three different carriers were designed and one of those with better score through Bioinformatics tools was cloned, expressed and its affinity for pDNA was monitored. Results: The resultszz demonstrated that the vector can effectively condense pDNAinto nanoparticles with the average sizes about 100 nm. Conclusion: We hope these peptides can overcome the biological barriers associated with gene transfer, and mediate efficient gene delivery.

  9. Conditions for gene disruption by homologous recombination of exogenous DNA into the Sulfolobus solfataricus genome

    NARCIS (Netherlands)

    Albers, Sonja-Verena; Driessen, Arnold J.M.

    2008-01-01

    The construction of directed gene deletion mutants is an essential tool in molecular biology that allows functional studies on the role of genes in their natural environment. For hyperthermophilic archaea, it has been difficult to obtain a reliable system to construct such mutants. However, during

  10. VISTA - computational tools for comparative genomics

    Energy Technology Data Exchange (ETDEWEB)

    Frazer, Kelly A.; Pachter, Lior; Poliakov, Alexander; Rubin,Edward M.; Dubchak, Inna

    2004-01-01

    Comparison of DNA sequences from different species is a fundamental method for identifying functional elements in genomes. Here we describe the VISTA family of tools created to assist biologists in carrying out this task. Our first VISTA server at http://www-gsd.lbl.gov/VISTA/ was launched in the summer of 2000 and was designed to align long genomic sequences and visualize these alignments with associated functional annotations. Currently the VISTA site includes multiple comparative genomics tools and provides users with rich capabilities to browse pre-computed whole-genome alignments of large vertebrate genomes and other groups of organisms with VISTA Browser, submit their own sequences of interest to several VISTA servers for various types of comparative analysis, and obtain detailed comparative analysis results for a set of cardiovascular genes. We illustrate capabilities of the VISTA site by the analysis of a 180 kilobase (kb) interval on human chromosome 5 that encodes for the kinesin family member3A (KIF3A) protein.

  11. Agrobacterium rhizogenes-mediated transformation of Arachis hypogaea: an efficient tool for functional study of genes

    Directory of Open Access Journals (Sweden)

    Shuai Liu

    2016-09-01

    Full Text Available We have developed a technique for efficient transformation of hairy roots of Arachis hypogaea L. using Agrobacterium rhizogenes K599, and have validated this approach for the investigation of gene function. As a model transgene, AhAREB1, a drought-resistance gene from peanut, was fused to green fluorescent protein, and four parameters that might influence the transformation efficiency were tested. The optimal procedure involved the use of petioles with four expanded leaves as explants, infection by K599 at optical density (OD600 of 0.6 for 15 min and co-cultivation for 2 d, giving transformation efficiencies of up to 91%. Hairy roots from transgenic peanut plants overexpressing AhAREB1 were unaffected by treatment with polyethylene glycol (PEG, demonstrating increased drought tolerance, whereas control roots showed clear signs of plasmolysis. Transgenic roots accumulated less superoxide anion (O2− than control roots under drought conditions. Additionally, transgenic roots displayed upregulation of four stress-response genes encoding WRKY transcription factor (WRKY33, MYB transcription factor (MYB92, abscisic acid receptor (PYL5 and dehydrin 2 (DHN2.

  12. A tool based on Ligation Detection Reaction-Universal Array (LDR-UA) for the characterization of VTEC by identification of virulence-associated and serogroup-specific genes.

    Science.gov (United States)

    Lauri, Andrea; Castiglioni, Bianca; Morabito, Stefano; Tozzoli, Rosangela; Consolandi, Clarissa; Mariani, Paola

    2011-02-01

    Verocytoxigenic Escherichia coli (VTEC) are zoonotic pathogens whose natural reservoir is represented by ruminants, particularly cattle. Infections are mainly acquired by consumption of undercooked contaminated food of animal origin, contact with infected animals and contaminated environment. VTEC O157 is the most frequently isolated serogroup from cases of human disease, however, other VTEC serogroups, such as O26, O111, O145 and O103, are increasingly reported as causing Hemolytic Uremic Syndrome (HUS) worldwide. The identification of VTEC is troublesome, hindering the development of effective prevention strategies. In fact, VTEC are morphologically indistinguishable from harmless E. coli and their pathogenic potential is not strictly dependent on the serogroup, but relies on the presence of a collection of virulence genes. We developed a diagnostic tool for VTEC based on the Ligation Detection Reaction coupled to Universal Array (LDR-UA) for the simultaneous identification of virulence factors and serogroup-associated genes. The method includes the investigation of 40 sites located in 13 fragments from 12 genes (sodCF1/F2, adfO, terB, ehxA, eae, vtx1, vtx2, ihp1, wzx, wbdI, rfbE, dnaK) and was evaluated by performing a trial on a collection of 67 E. coli strains, both VTEC and VT-negative E. coli, as well as on 25 isolates belonging to other related species. Results of this study showed that the LDR-UA technique was specific in identifying the target microorganism. Moreover, due to its higher throughput, the LDR-UA can be a valid and cheaper alternative to real time PCR-based (rt-PCR) methods for VTEC identification. Copyright © 2010 Elsevier Ltd. All rights reserved.

  13. Gene transfer in rodents and primates as a new tool for modeling diseases in animals and assessing functions by in vivo imaging

    Energy Technology Data Exchange (ETDEWEB)

    Deglon, N. [Atomic Energy Commission (CEA), Dept. of Medical Research and MIRCen Program, 91 - Orsay (France)

    2006-07-01

    The identification of disease-causing genes in familial forms of neuro-degenerative disorders and the development of genetic models closely replicating human CNS pathologies have drastically changed our understanding of the molecular events leading to neuronal cell death. If these achievements open new opportunities of therapeutic interventions efficient delivery systems taking into account the specificity of the central nervous system are required to administer therapeutic candidates. In addition, there is a need to develop 1) genetic models in large animals that replicate late stages of the diseases and 2) imaging techniques suitable for longitudinal, quantitative and non-invasive evaluation of disease progression and the evaluation of new therapeutic strategies. Over the last few years, we have investigated the potential of lentiviral vectors as tool to model and treat CNS disorders. The use of lentiviral vectors to create animal model of these pathologies holds various advantages compared to classical transgenic approaches. Viral vectors are versatile, highly flexible tools to perform in vivo studies. Multiple genetic models can be created in a short period of time. High transduction efficiencies as well as robust and sustained trans-gene expression lead to the rapid appearance of functional and behavioral abnormalities and severe neuro-degeneration. Targeted injections in different brain areas can be used to investigate the regional specificity of the neuro-pathology and eliminate potential side effects associated with a widespread over-expression of the trans-gene. Finally, models can be established in different mammalian species including non-human primates, thereby providing an opportunity to assess complex behavioral changes and perform longitudinal follow-up of neuro-pathological alterations by imaging. We have demonstrated the proof of principle of this approach for Huntington's disease. We have shown that the intratriatal injection of lentiviral

  14. Gene transfer in rodents and primates as a new tool for modeling diseases in animals and assessing functions by in vivo imaging

    International Nuclear Information System (INIS)

    Deglon, N.

    2006-01-01

    The identification of disease-causing genes in familial forms of neuro-degenerative disorders and the development of genetic models closely replicating human CNS pathologies have drastically changed our understanding of the molecular events leading to neuronal cell death. If these achievements open new opportunities of therapeutic interventions efficient delivery systems taking into account the specificity of the central nervous system are required to administer therapeutic candidates. In addition, there is a need to develop 1) genetic models in large animals that replicate late stages of the diseases and 2) imaging techniques suitable for longitudinal, quantitative and non-invasive evaluation of disease progression and the evaluation of new therapeutic strategies. Over the last few years, we have investigated the potential of lentiviral vectors as tool to model and treat CNS disorders. The use of lentiviral vectors to create animal model of these pathologies holds various advantages compared to classical transgenic approaches. Viral vectors are versatile, highly flexible tools to perform in vivo studies. Multiple genetic models can be created in a short period of time. High transduction efficiencies as well as robust and sustained trans-gene expression lead to the rapid appearance of functional and behavioral abnormalities and severe neuro-degeneration. Targeted injections in different brain areas can be used to investigate the regional specificity of the neuro-pathology and eliminate potential side effects associated with a widespread over-expression of the trans-gene. Finally, models can be established in different mammalian species including non-human primates, thereby providing an opportunity to assess complex behavioral changes and perform longitudinal follow-up of neuro-pathological alterations by imaging. We have demonstrated the proof of principle of this approach for Huntington's disease. We have shown that the intratriatal injection of lentiviral vector

  15. Selection on the Major Color Gene Melanocortin-1-Receptor Shaped the Evolution of the Melanocortin System Genes

    Directory of Open Access Journals (Sweden)

    Linda Dib

    2017-12-01

    Full Text Available Modular genetic systems and networks have complex evolutionary histories shaped by selection acting on single genes as well as on their integrated function within the network. However, uncovering molecular coevolution requires the detection of coevolving sites in sequences. Detailed knowledge of the functions of each gene in the system is also necessary to identify the selective agents driving coevolution. Using recently developed computational tools, we investigated the effect of positive selection on the coevolution of ten major genes in the melanocortin system, responsible for multiple physiological functions and human diseases. Substitutions driven by positive selection at the melanocortin-1-receptor (MC1R induced more coevolutionary changes on the system than positive selection on other genes in the system. Contrarily, selection on the highly pleiotropic POMC gene, which orchestrates the activation of the different melanocortin receptors, had the lowest coevolutionary influence. MC1R and possibly its main function, melanin pigmentation, seems to have influenced the evolution of the melanocortin system more than functions regulated by MC2-5Rs such as energy homeostasis, glucocorticoid-dependent stress and anti-inflammatory responses. Although replication in other regulatory systems is needed, this suggests that single functional aspects of a genetic network or system can be of higher importance than others in shaping coevolution among the genes that integrate it.

  16. Recombineering strategies for developing next generation BAC transgenic tools for optogenetics and beyond.

    Science.gov (United States)

    Ting, Jonathan T; Feng, Guoping

    2014-01-01

    The development and application of diverse BAC transgenic rodent lines has enabled rapid progress for precise molecular targeting of genetically-defined cell types in the mammalian central nervous system. These transgenic tools have played a central role in the optogenetic revolution in neuroscience. Indeed, an overwhelming proportion of studies in this field have made use of BAC transgenic Cre driver lines to achieve targeted expression of optogenetic probes in the brain. In addition, several BAC transgenic mouse lines have been established for direct cell-type specific expression of Channelrhodopsin-2 (ChR2). While the benefits of these new tools largely outweigh any accompanying challenges, many available BAC transgenic lines may suffer from confounds due in part to increased gene dosage of one or more "extra" genes contained within the large BAC DNA sequences. Here we discuss this under-appreciated issue and propose strategies for developing the next generation of BAC transgenic lines that are devoid of extra genes. Furthermore, we provide evidence that these strategies are simple, reproducible, and do not disrupt the intended cell-type specific transgene expression patterns for several distinct BAC clones. These strategies may be widely implemented for improved BAC transgenesis across diverse disciplines.

  17. Follistatin allows efficient retroviral-mediated gene transfer into rat liver

    International Nuclear Information System (INIS)

    Borgnon, Josephine; Djamouri, Fatima; Lorand, Isabelle; Rico, Virginie Di; Loux, Nathalie; Pages, Jean-Christophe; Franco, Dominique; Capron, Frederique; Weber, Anne

    2005-01-01

    Retroviral vectors are widely used tools for gene therapy. However, in vivo gene transfer is only effective in dividing cells, which, in liver, requires a regenerative stimulus. Follistatin is effective in promoting liver regeneration after 90% and 70% hepatectomy in rats. We studied its efficacy on liver regeneration and retroviral-mediated gene delivery in 50% hepatectomized rats. When human recombinant follistatin was infused into the portal vein immediately after 50% hepatectomy, hepatocyte proliferation was significantly higher than in control 50% hepatectomized rats. A single injection of virus particles administered 23 h after follistatin infusion resulted in more than 20% gene transduction efficiency in hepatocytes compared to 3% in control rats. It is concluded that a single injection of follistatin induces onset of proliferation in 50% hepatectomized rats and allows efficient retroviral-mediated gene transfer to the liver

  18. Lynx web services for annotations and systems analysis of multi-gene disorders.

    Science.gov (United States)

    Sulakhe, Dinanath; Taylor, Andrew; Balasubramanian, Sandhya; Feng, Bo; Xie, Bingqing; Börnigen, Daniela; Dave, Utpal J; Foster, Ian T; Gilliam, T Conrad; Maltsev, Natalia

    2014-07-01

    Lynx is a web-based integrated systems biology platform that supports annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Lynx has integrated multiple classes of biomedical data (genomic, proteomic, pathways, phenotypic, toxicogenomic, contextual and others) from various public databases as well as manually curated data from our group and collaborators (LynxKB). Lynx provides tools for gene list enrichment analysis using multiple functional annotations and network-based gene prioritization. Lynx provides access to the integrated database and the analytical tools via REST based Web Services (http://lynx.ci.uchicago.edu/webservices.html). This comprises data retrieval services for specific functional annotations, services to search across the complete LynxKB (powered by Lucene), and services to access the analytical tools built within the Lynx platform. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. Sugarcane genes related to mitochondrial function

    Directory of Open Access Journals (Sweden)

    Fonseca Ghislaine V.

    2001-01-01

    Full Text Available Mitochondria function as metabolic powerhouses by generating energy through oxidative phosphorylation and have become the focus of renewed interest due to progress in understanding the subtleties of their biogenesis and the discovery of the important roles which these organelles play in senescence, cell death and the assembly of iron-sulfur (Fe/S centers. Using proteins from the yeast Saccharomyces cerevisiae, Homo sapiens and Arabidopsis thaliana we searched the sugarcane expressed sequence tag (SUCEST database for the presence of expressed sequence tags (ESTs with similarity to nuclear genes related to mitochondrial functions. Starting with 869 protein sequences, we searched for sugarcane EST counterparts to these proteins using the basic local alignment search tool TBLASTN similarity searching program run against 260,781 sugarcane ESTs contained in 81,223 clusters. We were able to recover 367 clusters likely to represent sugarcane orthologues of the corresponding genes from S. cerevisiae, H. sapiens and A. thaliana with E-value <= 10-10. Gene products belonging to all functional categories related to mitochondrial functions were found and this allowed us to produce an overview of the nuclear genes required for sugarcane mitochondrial biogenesis and function as well as providing a starting point for detailed analysis of sugarcane gene structure and physiology.

  20. Implementing an online tool for genome-wide validation of survival-associated biomarkers in ovarian-cancer using microarray data from 1287 patients

    DEFF Research Database (Denmark)

    Győrffy, Balázs; Lánczky, András; Szállási, Zoltán

    2012-01-01

    was set up using gene expression data and survival information of 1287 ovarian cancer patients downloaded from Gene Expression Omnibus and The Cancer Genome Atlas (Affymetrix HG-U133A, HG-U133A 2.0, and HG-U133 Plus 2.0 microarrays). After quality control and normalization, only probes present on all......). A Kaplan–Meier survival plot was generated and significance was computed. The tool can be accessed online at www.kmplot.com/ovar. We used this integrative data analysis tool to validate the prognostic power of 37 biomarkers identified in the literature. Of these, CA125 (MUC16; P=3.7x10–5, hazard ratio (HR...... biomarker validation platform that mines all available microarray data to assess the prognostic power of 22 277 genes in 1287 ovarian cancer patients. We specifically used this tool to evaluate the effect of 37 previously published biomarkers on ovarian cancer prognosis....

  1. Global Metabolic Reconstruction and Metabolic Gene Evolution in the Cattle Genome

    Science.gov (United States)

    Kim, Woonsu; Park, Hyesun; Seo, Seongwon

    2016-01-01

    The sequence of cattle genome provided a valuable opportunity to systematically link genetic and metabolic traits of cattle. The objectives of this study were 1) to reconstruct genome-scale cattle-specific metabolic pathways based on the most recent and updated cattle genome build and 2) to identify duplicated metabolic genes in the cattle genome for better understanding of metabolic adaptations in cattle. A bioinformatic pipeline of an organism for amalgamating genomic annotations from multiple sources was updated. Using this, an amalgamated cattle genome database based on UMD_3.1, was created. The amalgamated cattle genome database is composed of a total of 33,292 genes: 19,123 consensus genes between NCBI and Ensembl databases, 8,410 and 5,493 genes only found in NCBI or Ensembl, respectively, and 266 genes from NCBI scaffolds. A metabolic reconstruction of the cattle genome and cattle pathway genome database (PGDB) was also developed using Pathway Tools, followed by an intensive manual curation. The manual curation filled or revised 68 pathway holes, deleted 36 metabolic pathways, and added 23 metabolic pathways. Consequently, the curated cattle PGDB contains 304 metabolic pathways, 2,460 reactions including 2,371 enzymatic reactions, and 4,012 enzymes. Furthermore, this study identified eight duplicated genes in 12 metabolic pathways in the cattle genome compared to human and mouse. Some of these duplicated genes are related with specific hormone biosynthesis and detoxifications. The updated genome-scale metabolic reconstruction is a useful tool for understanding biology and metabolic characteristics in cattle. There has been significant improvements in the quality of cattle genome annotations and the MetaCyc database. The duplicated metabolic genes in the cattle genome compared to human and mouse implies evolutionary changes in the cattle genome and provides a useful information for further research on understanding metabolic adaptations of cattle. PMID

  2. Why Choose This One? Factors in Scientists' Selection of Bioinformatics Tools

    Science.gov (United States)

    Bartlett, Joan C.; Ishimura, Yusuke; Kloda, Lorie A.

    2011-01-01

    Purpose: The objective was to identify and understand the factors involved in scientists' selection of preferred bioinformatics tools, such as databases of gene or protein sequence information (e.g., GenBank) or programs that manipulate and analyse biological data (e.g., BLAST). Methods: Eight scientists maintained research diaries for a two-week…

  3. Exploring the relationship between fractal features and bacterial essential genes

    International Nuclear Information System (INIS)

    Yu Yong-Ming; Yang Li-Cai; Zhao Lu-Lu; Liu Zhi-Ping; Zhou Qian

    2016-01-01

    Essential genes are indispensable for the survival of an organism in optimal conditions. Rapid and accurate identifications of new essential genes are of great theoretical and practical significance. Exploring features with predictive power is fundamental for this. Here, we calculate six fractal features from primary gene and protein sequences and then explore their relationship with gene essentiality by statistical analysis and machine learning-based methods. The models are applied to all the currently available identified genes in 27 bacteria from the database of essential genes (DEG). It is found that the fractal features of essential genes generally differ from those of non-essential genes. The fractal features are used to ascertain the parameters of two machine learning classifiers: Naïve Bayes and Random Forest. The area under the curve (AUC) of both classifiers show that each fractal feature is satisfactorily discriminative between essential genes and non-essential genes individually. And, although significant correlations exist among fractal features, gene essentiality can also be reliably predicted by various combinations of them. Thus, the fractal features analyzed in our study can be used not only to construct a good essentiality classifier alone, but also to be significant contributors for computational tools identifying essential genes. (paper)

  4. Primers-4-Yeast: a comprehensive web tool for planning primers for Saccharomyces cerevisiae.

    Science.gov (United States)

    Yofe, Ido; Schuldiner, Maya

    2014-02-01

    The budding yeast Saccharomyces cerevisiae is a key model organism of functional genomics, due to its ease and speed of genetic manipulations. In fact, in this yeast, the requirement for homologous sequences for recombination purposes is so small that 40 base pairs (bp) are sufficient. Hence, an enormous variety of genetic manipulations can be performed by simply planning primers with the correct homology, using a defined set of transformation plasmids. Although designing primers for yeast transformations and for the verification of their correct insertion is a common task in all yeast laboratories, primer planning is usually done manually and a tool that would enable easy, automated primer planning for the yeast research community is still lacking. Here we introduce Primers-4-Yeast, a web tool that allows primers to be designed in batches for S. cerevisiae gene-targeting transformations, and for the validation of correct insertions. This novel tool enables fast, automated, accurate primer planning for large sets of genes, introduces consistency in primer planning and is therefore suggested to serve as a standard in yeast research. Primers-4-Yeast is available at: http://www.weizmann.ac.il/Primers-4-Yeast Copyright © 2013 John Wiley & Sons, Ltd.

  5. Development of a versatile enrichment analysis tool reveals associations between the maternal brain and mental health disorders, including autism

    Science.gov (United States)

    2013-01-01

    Background A recent study of lateral septum (LS) suggested a large number of autism-related genes with altered expression in the postpartum state. However, formally testing the findings for enrichment of autism-associated genes proved to be problematic with existing software. Many gene-disease association databases have been curated which are not currently incorporated in popular, full-featured enrichment tools, and the use of custom gene lists in these programs can be difficult to perform and interpret. As a simple alternative, we have developed the Modular Single-set Enrichment Test (MSET), a minimal tool that enables one to easily evaluate expression data for enrichment of any conceivable gene list of interest. Results The MSET approach was validated by testing several publicly available expression data sets for expected enrichment in areas of autism, attention deficit hyperactivity disorder (ADHD), and arthritis. Using nine independent, unique autism gene lists extracted from association databases and two recent publications, a striking consensus of enrichment was detected within gene expression changes in LS of postpartum mice. A network of 160 autism-related genes was identified, representing developmental processes such as synaptic plasticity, neuronal morphogenesis, and differentiation. Additionally, maternal LS displayed enrichment for genes associated with bipolar disorder, schizophrenia, ADHD, and depression. Conclusions The transition to motherhood includes the most fundamental social bonding event in mammals and features naturally occurring changes in sociability. Some individuals with autism, schizophrenia, or other mental health disorders exhibit impaired social traits. Genes involved in these deficits may also contribute to elevated sociability in the maternal brain. To date, this is the first study to show a significant, quantitative link between the maternal brain and mental health disorders using large scale gene expression data. Thus, the

  6. Religious coalition opposes gene patents.

    Science.gov (United States)

    James, J S

    1995-05-19

    The biotechnology industry is concerned about a coalition of mainstream religious leaders, working with Jeremy Rifkin of the Foundation of Economic Trends, who oppose the patenting of human and animal life forms, body parts, and genes. The coalition called a press conference on May 18 to ask the government to prohibit the current patenting practices for genetic engineering. The biotechnology industry argues that patents indicate that a company's research tool has significant value, and encourages capitalists to invest their dollars in the development of new treatments for diseases. They also argue that the 29 biotech drugs that are on the market have been developed as a result of patents on genes. Although most business leaders are united in opposing restrictions, many scientists are divided, citing both religious and scientific reasons.

  7. Biobibliometrics (UGDH-TP53-BRCA1) Genes Connections in the Possible Relationship Between Breast Cancer and EEG.

    Science.gov (United States)

    Martzoukos, Yannis; Papavlasopoulos, Sozon; Poulos, Marios; Syrrou, Maria

    2017-01-01

    In recent years there has been an increasingly amount of data stored in biomedical Databases due to the breakthroughs in biology and bioinformatics, biomedical information is growing exponentially making efficient information retrieval from scientist more and more challenging. New Scientific fields as Bioinformatics seem to be the tool needed to extract scientifically important data based on experimental results and information provided by papers and journals. In this paper we are going to implement a custom made IT system in order to find connections between genes in the breast cancer pathways such the BRCA1 with the electrical energy in the human brain with UGDH gene via the TP53 tumor gene. The proposed system will be able to identify the appearance of each gene ID and compare the coexistence of two genes in PubMed articles/papers. The final system could become a useful tool against the struggle of scientists and medical professionals in the near future.

  8. A comparative gene expression database for invertebrates

    Directory of Open Access Journals (Sweden)

    Ormestad Mattias

    2011-08-01

    Full Text Available Abstract Background As whole genome and transcriptome sequencing gets cheaper and faster, a great number of 'exotic' animal models are emerging, rapidly adding valuable data to the ever-expanding Evo-Devo field. All these new organisms serve as a fantastic resource for the research community, but the sheer amount of data, some published, some not, makes detailed comparison of gene expression patterns very difficult to summarize - a problem sometimes even noticeable within a single lab. The need to merge existing data with new information in an organized manner that is publicly available to the research community is now more necessary than ever. Description In order to offer a homogenous way of storing and handling gene expression patterns from a variety of organisms, we have developed the first web-based comparative gene expression database for invertebrates that allows species-specific as well as cross-species gene expression comparisons. The database can be queried by gene name, developmental stage and/or expression domains. Conclusions This database provides a unique tool for the Evo-Devo research community that allows the retrieval, analysis and comparison of gene expression patterns within or among species. In addition, this database enables a quick identification of putative syn-expression groups that can be used to initiate, among other things, gene regulatory network (GRN projects.

  9. Gene Therapy for Pancreatic Cancer: Specificity, Issues and Hopes.

    Science.gov (United States)

    Rouanet, Marie; Lebrin, Marine; Gross, Fabian; Bournet, Barbara; Cordelier, Pierre; Buscail, Louis

    2017-06-08

    A recent death projection has placed pancreatic ductal adenocarcinoma as the second cause of death by cancer in 2030. The prognosis for pancreatic cancer is very poor and there is a great need for new treatments that can change this poor outcome. Developments of therapeutic innovations in combination with conventional chemotherapy are needed urgently. Among innovative treatments the gene therapy offers a promising avenue. The present review gives an overview of the general strategy of gene therapy as well as the limitations and stakes of the different experimental in vivo models, expression vectors (synthetic and viral), molecular tools (interference RNA, genome editing) and therapeutic genes (tumor suppressor genes, antiangiogenic and pro-apoptotic genes, suicide genes). The latest developments in pancreatic carcinoma gene therapy are described including gene-based tumor cell sensitization to chemotherapy, vaccination and adoptive immunotherapy (chimeric antigen receptor T-cells strategy). Nowadays, there is a specific development of oncolytic virus therapies including oncolytic adenoviruses, herpes virus, parvovirus or reovirus. A summary of all published and on-going phase-1 trials is given. Most of them associate gene therapy and chemotherapy or radiochemotherapy. The first results are encouraging for most of the trials but remain to be confirmed in phase 2 trials.

  10. ToTem: a tool for variant calling pipeline optimization.

    Science.gov (United States)

    Tom, Nikola; Tom, Ondrej; Malcikova, Jitka; Pavlova, Sarka; Kubesova, Blanka; Rausch, Tobias; Kolarik, Miroslav; Benes, Vladimir; Bystry, Vojtech; Pospisilova, Sarka

    2018-06-26

    High-throughput bioinformatics analyses of next generation sequencing (NGS) data often require challenging pipeline optimization. The key problem is choosing appropriate tools and selecting the best parameters for optimal precision and recall. Here we introduce ToTem, a tool for automated pipeline optimization. ToTem is a stand-alone web application with a comprehensive graphical user interface (GUI). ToTem is written in Java and PHP with an underlying connection to a MySQL database. Its primary role is to automatically generate, execute and benchmark different variant calling pipeline settings. Our tool allows an analysis to be started from any level of the process and with the possibility of plugging almost any tool or code. To prevent an over-fitting of pipeline parameters, ToTem ensures the reproducibility of these by using cross validation techniques that penalize the final precision, recall and F-measure. The results are interpreted as interactive graphs and tables allowing an optimal pipeline to be selected, based on the user's priorities. Using ToTem, we were able to optimize somatic variant calling from ultra-deep targeted gene sequencing (TGS) data and germline variant detection in whole genome sequencing (WGS) data. ToTem is a tool for automated pipeline optimization which is freely available as a web application at  https://totem.software .

  11. Assays for noninvasive imaging of reporter gene expression

    International Nuclear Information System (INIS)

    Gambhir, S.S.; Barrio, J.R.; Herschman, H.R.; Phelps, M.E.

    1999-01-01

    Repeated, noninvasive imaging of reporter gene expression is emerging as a valuable tool for monitoring the expression of genes in animals and humans. Monitoring of organ/cell transplantation in living animals and humans, and the assessment of environmental, behavioral, and pharmacologic modulation of gene expression in transgenic animals should soon be possible. The earliest clinical application is likely to be monitoring human gene therapy in tumors transduced with the herpes simplex virus type 1 thymidine kinase (HSV1-tk) suicide gene. Several candidate assays for imaging reporter gene expression have been studied, utilizing cytosine deaminase (CD), HSV1-tk, and dopamine 2 receptor (D2R) as reporter genes. For the HSV1-tk reporter gene, both uracil nucleoside derivatives (e.g., 5-iodo-2'-fluoro-2'-deoxy-1-β-D-arabinofuranosyl-5-iodouracil [FIAU] labeled with 124 I, 131 I ) and acycloguanosine derivatives {e.g., 8-[ 18 F]fluoro-9-[[2-hydroxy-1-(hydroxymethyl)ethoxy]methyl]guanine (8-[ 18 F]-fluoroganciclovir) ([ 18 F]FGCV), 9-[(3-[ 18 F]fluoro-1-hydroxy-2-propoxy)methyl]guanine ([ 18 F]FHPG)} have been investigated as reporter probes. For the D2R reporter gene, a derivative of spiperone {3-(2'-[ 18 F]-Fluoroethyl)spiperone ([ 18 F]FESP)} has been used with positron emission tomography (PET) imaging. In this review, the principles and specific assays for imaging reporter gene expression are presented and discussed. Specific examples utilizing adenoviral-mediated delivery of a reporter gene as well as tumors expressing reporter genes are discussed

  12. Engineered coryneform bacteria as a bio-tool for arsenic remediation.

    Science.gov (United States)

    Villadangos, Almudena F; Ordóñez, Efrén; Pedre, Brandán; Messens, Joris; Gil, Jose A; Mateos, Luis M

    2014-12-01

    Despite current remediation efforts, arsenic contamination in water sources is still a major health problem, highlighting the need for new approaches. In this work, strains of the nonpathogenic and highly arsenic-resistant bacterium Corynebacterium glutamicum were used as inexpensive tools to accumulate inorganic arsenic, either as arsenate (As(V)) or arsenite (As(III)) species. The assays made use of "resting cells" from these strains, which were assessed under well-established conditions and compared with C. glutamicum background controls. The two mutant As(V)-accumulating strains were those used in a previously published study: (i) ArsC1/C2, in which the gene/s encoding the mycothiol-dependent arsenate reductases is/are disrupted, and (ii) MshA/C mutants unable to produce mycothiol, the low molecular weight thiol essential for arsenate reduction. The As(III)-accumulating strains were either those lacking the arsenite permease activities (Acr3-1 and Acr3-2) needed in As(III) release or recombinant strains overexpressing the aquaglyceroporin genes (glpF) from Corynebacterium diphtheriae or Streptomyces coelicolor, to improve As(III) uptake. Both genetically modified strains accumulated 30-fold more As(V) and 15-fold more As(III) than the controls. The arsenic resistance of the modified strains was inversely proportional to their metal accumulation ability. Our results provide the basis for investigations into the use of these modified C. glutamicum strains as a new bio-tool in arsenic remediation efforts.

  13. Viral Cre-LoxP tools aid genome engineering in mammalian cells.

    Science.gov (United States)

    Sengupta, Ranjita; Mendenhall, Amy; Sarkar, Nandita; Mukherjee, Chandreyee; Afshari, Amirali; Huang, Joseph; Lu, Biao

    2017-01-01

    Targeted nucleases have transformed genome editing technology, providing more efficient methods to make targeted changes in mammalian genome. In parallel, there is an increasing demand of Cre-LoxP technology for complex genome manipulation such as large deletion, addition, gene fusion and conditional removal of gene sequences at the target site. However, an efficient and easy-to-use Cre-recombinase delivery system remains lacking. We designed and constructed two sets of expression vectors for Cre-recombinase using two highly efficient viral systems, the integrative lentivirus and non-integrative adeno associated virus. We demonstrate the effectiveness of those methods in Cre-delivery into stably-engineered HEK293 cells harboring LoxP-floxed red fluorescent protein (RFP) and puromycin (Puro) resistant reporters. The delivered Cre recombinase effectively excised the floxed RFP-Puro either directly or conditionally, therefore validating the function of these molecular tools. Given the convenient options of two selections markers, these viral-based systems offer a robust and easy-to-use tool for advanced genome editing, expanding complicated genome engineering to a variety of cell types and conditions. We have developed and functionally validated two viral-based Cre-recombinase delivery systems for efficient genome manipulation in various mammalian cells. The ease of gene delivery with the built-in reporters and inducible element enables live cell monitoring, drug selection and temporal knockout, broadening applications of genome editing.

  14. Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion.

    Science.gov (United States)

    Ni, ZhouXian; Ye, YouJu; Bai, Tiandao; Xu, Meng; Xu, Li-An

    2017-09-11

    The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between "IRa" and "IRb". The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses.

  15. MADIBA: A web server toolkit for biological interpretation of Plasmodium and plant gene clusters

    Directory of Open Access Journals (Sweden)

    Louw Abraham I

    2008-02-01

    Full Text Available Abstract Background Microarray technology makes it possible to identify changes in gene expression of an organism, under various conditions. Data mining is thus essential for deducing significant biological information such as the identification of new biological mechanisms or putative drug targets. While many algorithms and software have been developed for analysing gene expression, the extraction of relevant information from experimental data is still a substantial challenge, requiring significant time and skill. Description MADIBA (MicroArray Data Interface for Biological Annotation facilitates the assignment of biological meaning to gene expression clusters by automating the post-processing stage. A relational database has been designed to store the data from gene to pathway for Plasmodium, rice and Arabidopsis. Tools within the web interface allow rapid analyses for the identification of the Gene Ontology terms relevant to each cluster; visualising the metabolic pathways where the genes are implicated, their genomic localisations, putative common transcriptional regulatory elements in the upstream sequences, and an analysis specific to the organism being studied. Conclusion MADIBA is an integrated, online tool that will assist researchers in interpreting their results and understand the meaning of the co-expression of a cluster of genes. Functionality of MADIBA was validated by analysing a number of gene clusters from several published experiments – expression profiling of the Plasmodium life cycle, and salt stress treatments of Arabidopsis and rice. In most of the cases, the same conclusions found by the authors were quickly and easily obtained after analysing the gene clusters with MADIBA.

  16. The GENOTEND chip: a new tool to analyse gene expression in muscles of beef cattle for beef quality prediction

    Directory of Open Access Journals (Sweden)

    Hocquette Jean-Francois

    2012-08-01

    Full Text Available Abstract Background Previous research programmes have described muscle biochemical traits and gene expression levels associated with beef tenderness. One of our results concerning the DNAJA1 gene (an Hsp40 was patented. This study aims to confirm the relationships previously identified between two gene families (heat shock proteins and energy metabolism and beef quality. Results We developed an Agilent chip with specific probes for bovine muscular genes. More than 3000 genes involved in muscle biology or meat quality were selected from genetic, proteomic or transcriptomic studies, or from scientific publications. As far as possible, several probes were used for each gene (e.g. 17 probes for DNAJA1. RNA from Longissimus thoracis muscle samples was hybridised on the chips. Muscles samples were from four groups of Charolais cattle: two groups of young bulls and two groups of steers slaughtered in two different years. Principal component analysis, simple correlation of gene expression levels with tenderness scores, and then multiple regression analysis provided the means to detect the genes within two families (heat shock proteins and energy metabolism which were the most associated with beef tenderness. For the 25 Charolais young bulls slaughtered in year 1, expression levels of DNAJA1 and other genes of the HSP family were related to the initial or overall beef tenderness. Similarly, expression levels of genes involved in fat or energy metabolism were related with the initial or overall beef tenderness but in the year 1 and year 2 groups of young bulls only. Generally, the genes individually correlated with tenderness are not consistent across genders and years indicating the strong influence of rearing conditions on muscle characteristics related to beef quality. However, a group of HSP genes, which explained about 40% of the variability in tenderness in the group of 25 young bulls slaughtered in year 1 (considered as the reference group, was

  17. The GENOTEND chip: a new tool to analyse gene expression in muscles of beef cattle for beef quality prediction.

    Science.gov (United States)

    Hocquette, Jean-Francois; Bernard-Capel, Carine; Vidal, Veronique; Jesson, Beline; Levéziel, Hubert; Renand, Gilles; Cassar-Malek, Isabelle

    2012-08-15

    Previous research programmes have described muscle biochemical traits and gene expression levels associated with beef tenderness. One of our results concerning the DNAJA1 gene (an Hsp40) was patented. This study aims to confirm the relationships previously identified between two gene families (heat shock proteins and energy metabolism) and beef quality. We developed an Agilent chip with specific probes for bovine muscular genes. More than 3000 genes involved in muscle biology or meat quality were selected from genetic, proteomic or transcriptomic studies, or from scientific publications. As far as possible, several probes were used for each gene (e.g. 17 probes for DNAJA1). RNA from Longissimus thoracis muscle samples was hybridised on the chips. Muscles samples were from four groups of Charolais cattle: two groups of young bulls and two groups of steers slaughtered in two different years. Principal component analysis, simple correlation of gene expression levels with tenderness scores, and then multiple regression analysis provided the means to detect the genes within two families (heat shock proteins and energy metabolism) which were the most associated with beef tenderness. For the 25 Charolais young bulls slaughtered in year 1, expression levels of DNAJA1 and other genes of the HSP family were related to the initial or overall beef tenderness. Similarly, expression levels of genes involved in fat or energy metabolism were related with the initial or overall beef tenderness but in the year 1 and year 2 groups of young bulls only. Generally, the genes individually correlated with tenderness are not consistent across genders and years indicating the strong influence of rearing conditions on muscle characteristics related to beef quality. However, a group of HSP genes, which explained about 40% of the variability in tenderness in the group of 25 young bulls slaughtered in year 1 (considered as the reference group), was validated in the groups of 30 Charolais young

  18. Using phylogenetically-informed annotation (PIA) to search for light-interacting genes in transcriptomes from non-model organisms.

    Science.gov (United States)

    Speiser, Daniel I; Pankey, M Sabrina; Zaharoff, Alexander K; Battelle, Barbara A; Bracken-Grissom, Heather D; Breinholt, Jesse W; Bybee, Seth M; Cronin, Thomas W; Garm, Anders; Lindgren, Annie R; Patel, Nipam H; Porter, Megan L; Protas, Meredith E; Rivera, Ajna S; Serb, Jeanne M; Zigler, Kirk S; Crandall, Keith A; Oakley, Todd H

    2014-11-19

    Tools for high throughput sequencing and de novo assembly make the analysis of transcriptomes (i.e. the suite of genes expressed in a tissue) feasible for almost any organism. Yet a challenge for biologists is that it can be difficult to assign identities to gene sequences, especially from non-model organisms. Phylogenetic analyses are one useful method for assigning identities to these sequences, but such methods tend to be time-consuming because of the need to re-calculate trees for every gene of interest and each time a new data set is analyzed. In response, we employed existing tools for phylogenetic analysis to produce a computationally efficient, tree-based approach for annotating transcriptomes or new genomes that we term Phylogenetically-Informed Annotation (PIA), which places uncharacterized genes into pre-calculated phylogenies of gene families. We generated maximum likelihood trees for 109 genes from a Light Interaction Toolkit (LIT), a collection of genes that underlie the function or development of light-interacting structures in metazoans. To do so, we searched protein sequences predicted from 29 fully-sequenced genomes and built trees using tools for phylogenetic analysis in the Osiris package of Galaxy (an open-source workflow management system). Next, to rapidly annotate transcriptomes from organisms that lack sequenced genomes, we repurposed a maximum likelihood-based Evolutionary Placement Algorithm (implemented in RAxML) to place sequences of potential LIT genes on to our pre-calculated gene trees. Finally, we implemented PIA in Galaxy and used it to search for LIT genes in 28 newly-sequenced transcriptomes from the light-interacting tissues of a range of cephalopod mollusks, arthropods, and cubozoan cnidarians. Our new trees for LIT genes are available on the Bitbucket public repository ( http://bitbucket.org/osiris_phylogenetics/pia/ ) and we demonstrate PIA on a publicly-accessible web server ( http://galaxy-dev.cnsi.ucsb.edu/pia/ ). Our new

  19. Accurate, model-based tuning of synthetic gene expression using introns in S. cerevisiae.

    Directory of Open Access Journals (Sweden)

    Ido Yofe

    2014-06-01

    Full Text Available Introns are key regulators of eukaryotic gene expression and present a potentially powerful tool for the design of synthetic eukaryotic gene expression systems. However, intronic control over gene expression is governed by a multitude of complex, incompletely understood, regulatory mechanisms. Despite this lack of detailed mechanistic understanding, here we show how a relatively simple model enables accurate and predictable tuning of synthetic gene expression system in yeast using several predictive intron features such as transcript folding and sequence motifs. Using only natural Saccharomyces cerevisiae introns as regulators, we demonstrate fine and accurate control over gene expression spanning a 100 fold expression range. These results broaden the engineering toolbox of synthetic gene expression systems and provide a framework in which precise and robust tuning of gene expression is accomplished.

  20. Efficient Oligo nucleotide mediated CRISPR-Cas9 Gene Editing in Aspergilli

    DEFF Research Database (Denmark)

    Nødvig, Christina Spuur; Hoof, Jakob Blæsbjerg; Kogle, Martin Engelhard

    2018-01-01

    CRISPR-Cas9 technologies are revolutionizing fungal gene editing. Here we show that survival of specific Cas9/sgRNA mediated DNA double strand breaks (DSBs) depends on the non-homologous end-joining, NHEJ, DNA repair pathway and we use this observation to develop a tool to assess protospacer....... niger, and in A. oryzae indicating that this type of repair may be wide spread in filamentous fungi. Importantly, we demonstrate that by using single-stranded oligo nucleotides for CRISPR-Cas9 mediated gene editing it is possible to introduce specific point mutations as well gene deletions...

  1. Genes and Gene Networks Involved in Sodium Fluoride-Elicited Cell Death Accompanying Endoplasmic Reticulum Stress in Oral Epithelial Cells

    Directory of Open Access Journals (Sweden)

    Yoshiaki Tabuchi

    2014-05-01

    Full Text Available Here, to understand the molecular mechanisms underlying cell death induced by sodium fluoride (NaF, we analyzed gene expression patterns in rat oral epithelial ROE2 cells exposed to NaF using global-scale microarrays and bioinformatics tools. A relatively high concentration of NaF (2 mM induced cell death concomitant with decreases in mitochondrial membrane potential, chromatin condensation and caspase-3 activation. Using 980 probe sets, we identified 432 up-regulated and 548 down-regulated genes, that were differentially expressed by >2.5-fold in the cells treated with 2 mM of NaF and categorized them into 4 groups by K-means clustering. Ingenuity® pathway analysis revealed several gene networks from gene clusters. The gene networks Up-I and Up-II included many up-regulated genes that were mainly associated with the biological function of induction or prevention of cell death, respectively, such as Atf3, Ddit3 and Fos (for Up-I and Atf4 and Hspa5 (for Up-II. Interestingly, knockdown of Ddit3 and Hspa5 significantly increased and decreased the number of viable cells, respectively. Moreover, several endoplasmic reticulum (ER stress-related genes including, Ddit3, Atf4 and Hapa5, were observed in these gene networks. These findings will provide further insight into the molecular mechanisms of NaF-induced cell death accompanying ER stress in oral epithelial cells.

  2. Radiation-modulated gene expression in C. elegans

    International Nuclear Information System (INIS)

    Nelson, G.A.; Bayeta, E.; Perez, C.; Lloyd, E.; Jones, T.; Smith, A.; Tian, J.

    2003-01-01

    Full text: We use the nematode C. elegans to characterize the genotoxic and cytotoxic effects of ionizing radiation with emphasis effects of charged particle radiation and have described the fluence vs. response relationships for mutation, chromosome aberration and certain developmental errors. These endpoints quantify the biological after repair and compensation pathways have completed their work. In order to address the control of these reactions we have turned to gene expression profiling to identify genes that uniquely respond to high LET species or respond differentially as a function of radiation properties. We have employed whole genome microarray methods to map gene expression following exposure to gamma rays, protons and accelerated iron ions. We found that 599 of 17871 genes analyzed showed differential expression 3 hrs after exposure to 3 Gy of at least one radiation types. 193 were up-regulated, 406 were down-regulated, and 90% were affected by only one species of radiation. Genes whose transcription levels responded significantly mapped to definite statistical clusters that were unique for each radiation type. We are now trying to establish the functional relationships of the genes their relevance to mitigation of radiation-induced damage. Three approaches are being used. First, bioinformatics tools are being used to determine the roles of genes in co-regulated gene sets. Second, we are applying the technique of RNA interference to determine whether our radiation-induced genes affect cell survival (measured in terms of embryo survival) and chromosome aberration (intestinal anaphase bridges). Finally we are focussing on the response of the most strongly-regulated gene in our data set. This is the autosomal gene, F36D3.9, whose predicted structure is that of a cysteine protease resembling cathepsin B. An enzymological approach is being used to characterize this gene at the protein level. This work was supported by NASA Cooperative Agreement NCC9-149

  3. DISC1 mouse models as a tool to decipher gene-environment interactions in psychiatric disorders

    Directory of Open Access Journals (Sweden)

    Tyler eCash-Padgett

    2013-09-01

    Full Text Available DISC1 was discovered in a Scottish pedigree in which a chromosomal translocation that breaks this gene segregates with psychiatric disorders, mainly depression and schizophrenia. Linkage and association studies in diverse populations support DISC1 as a susceptibility gene to a variety of neuropsychiatric disorders. Many Disc1 mouse models have been generated to study its neuronal functions. These mouse models display variable phenotypes, some of them relevant to schizophrenia, others to depression.The Disc1 mouse models are popular genetic models for studying gene-environment interactions in schizophrenia. Five different Disc1 models have been combined with environmental factors. The environmental stressors employed can be classified as either early immune activation or later social paradigms. These studies cover major time points along the neurodevelopmental trajectory: prenatal, early postnatal, adolescence, and adulthood. Various combinations of molecular, anatomical and behavioral methods have been used to assess the outcomes. Additionally, three of the studies sought to rescue the resulting abnormalities.Here we provide background on the environmental paradigms used, summarize the results of these studies combining Disc1 mouse models with environmental stressors and discuss what we can learn and how to proceed. A major question is how the genetic and environmental factors determine which psychiatric disorder will be clinically manifested. To address this we can take advantage of the many Disc1 models available and expose them to the same environmental stressor. The complementary experiment would be to expose the same model to different environmental stressors. DISC1 is an ideal gene for this approach, since in the Scottish pedigree the same chromosomal translocation results in different psychiatric conditions.

  4. VCS: Tool for Visualizing Copy Number Variation and Single Nucleotide Polymorphism

    Directory of Open Access Journals (Sweden)

    HyoYoung Kim

    2014-12-01

    Full Text Available Copy number variation (CNV or single nucleotide phlyorphism (SNP is useful genetic resource to aid in understanding complex phenotypes or deseases susceptibility. Although thousands of CNVs and SNPs are currently avaliable in the public databases, they are somewhat difficult to use for analyses without visualization tools. We developed a web-based tool called the VCS (visualization of CNV or SNP to visualize the CNV or SNP detected. The VCS tool can assist to easily interpret a biological meaning from the numerical value of CNV and SNP. The VCS provides six visualization tools: i the enrichment of genome contents in CNV; ii the physical distribution of CNV or SNP on chromosomes; iii the distribution of log2 ratio of CNVs with criteria of interested; iv the number of CNV or SNP per binning unit; v the distribution of homozygosity of SNP genotype; and vi cytomap of genes within CNV or SNP region.

  5. Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes

    OpenAIRE

    Kreiman, Gabriel

    2004-01-01

    Sequence information and high‐throughput methods to measure gene expression levels open the door to explore transcriptional regulation using computational tools. Combinatorial regulation and sparseness of regulatory elements throughout the genome allow organisms to control the spatial and temporal patterns of gene expression. Here we study the organization of cis‐regulatory elements in sets of co‐regulated genes. We build an algorithm to search for combinations of transcription factor binding...

  6. TargetRNA: a tool for predicting targets of small RNA action in bacteria

    OpenAIRE

    Tjaden, Brian

    2008-01-01

    Many small RNA (sRNA) genes in bacteria act as posttranscriptional regulators of target messenger RNAs. Here, we present TargetRNA, a web tool for predicting mRNA targets of sRNA action in bacteria. TargetRNA takes as input a genomic sequence that may correspond to an sRNA gene. TargetRNA then uses a dynamic programming algorithm to search each annotated message in a specified genome for mRNAs that evince basepair-binding potential to the input sRNA sequence. Based on the calculated basepair-...

  7. Immunostimulatory Gene Therapy Using Oncolytic Viruses as Vehicles

    Directory of Open Access Journals (Sweden)

    Angelica Loskog

    2015-11-01

    Full Text Available Immunostimulatory gene therapy has been developed during the past twenty years. The aim of immunostimulatory gene therapy is to tilt the suppressive tumor microenvironment to promote anti-tumor immunity. Hence, like a Trojan horse, the gene vehicle can carry warriors and weapons into enemy territory to combat the tumor from within. The most promising immune stimulators are those activating and sustaining Th1 responses, but even if potent effects were seen in preclinical models, many clinical trials failed to show objective responses in cancer patients. However, with new tools to control ongoing immunosuppression in cancer patients, immunostimulatory gene therapy is now emerging as an interesting option. In parallel, oncolytic viruses have been shown to be safe in patients. To prolong immune stimulation and to increase efficacy, these two fields are now merging and oncolytic viruses are armed with immunostimulatory transgenes. These novel agents are racing towards approval as established cancer immunotherapeutics.

  8. ESTs, cDNA microarrays, and gene expression profiling: tools for dissecting plant physiology and development.

    Science.gov (United States)

    Alba, Rob; Fei, Zhangjun; Payton, Paxton; Liu, Yang; Moore, Shanna L; Debbie, Paul; Cohn, Jonathan; D'Ascenzo, Mark; Gordon, Jeffrey S; Rose, Jocelyn K C; Martin, Gregory; Tanksley, Steven D; Bouzayen, Mondher; Jahn, Molly M; Giovannoni, Jim

    2004-09-01

    Gene expression profiling holds tremendous promise for dissecting the regulatory mechanisms and transcriptional networks that underlie biological processes. Here we provide details of approaches used by others and ourselves for gene expression profiling in plants with emphasis on cDNA microarrays and discussion of both experimental design and downstream analysis. We focus on methods and techniques emphasizing fabrication of cDNA microarrays, fluorescent labeling, cDNA hybridization, experimental design, and data processing. We include specific examples that demonstrate how this technology can be used to further our understanding of plant physiology and development (specifically fruit development and ripening) and for comparative genomics by comparing transcriptome activity in tomato and pepper fruit.

  9. Neuropsychopharmacology and neurogenetic aspects of executive functioning: should reward gene polymorphisms constitute a diagnostic tool to identify individuals at risk for impaired judgment?

    Science.gov (United States)

    Bowirrat, Abdalla; Chen, Thomas J H; Oscar-Berman, Marlene; Madigan, Margaret; Chen, Amanda Lh; Bailey, John A; Braverman, Eric R; Kerner, Mallory; Giordano, John; Morse, Siobhan; Downs, B William; Waite, Roger L; Fornari, Frank; Armaly, Zaher; Blum, Kenneth

    2012-04-01

    Executive functions are processes that act in harmony to control behaviors necessary for maintaining focus and achieving outcomes. Executive dysfunction in neuropsychiatric disorders is attributed to structural or functional pathology of brain networks involving prefrontal cortex (PFC) and its connections with other brain regions. The PFC receives innervations from different neurons associated with a number of neurotransmitters, especially dopamine (DA). Here we review findings on the contribution of PFC DA to higher-order cognitive and emotional behaviors. We suggest that examination of multifactorial interactions of an individual's genetic history, along with environmental risk factors, can assist in the characterization of executive functioning for that individual. Based upon the results of genetic studies, we also propose genetic mapping as a probable diagnostic tool serving as a therapeutic adjunct for augmenting executive functioning capabilities. We conclude that preservation of the neurological underpinnings of executive functions requires the integrity of complex neural systems including the influence of specific genes and associated polymorphisms to provide adequate neurotransmission.

  10. Neuropsychopharmacology and Neurogenetic Aspects of Executive Functioning: Should Reward Gene Polymorphisms Constitute a Diagnostic Tool to Identify Individuals at Risk for Impaired Judgment?

    Science.gov (United States)

    Bowirrat, Abdalla; Chen, Thomas JH; Oscar-Berman, Marlene; Madigan, Margaret; Chen, Amanda LH; Bailey, John A.; Braverman, Eric R.; Kerner, Mallory; Giordano, John; Morse, Siohban; Downs, B. William; Waite, Roger L.; Fornari, Frank; Armaly, Zaher; Blum, Kenneth

    2013-01-01

    Executive functions are processes that act in harmony to control behaviors necessary for maintaining focus and achieving outcomes. Executive dysfunction in neuropsychiatric disorders is attributed to structural or functional pathology of brain networks involving prefrontal cortex (PFC) and its connections with other brain regions. The PFC receives innervations from different neurons associated with a number of neurotransmitters, especially dopamine (DA). Here we review findings on the contribution of PFC DA to higher-order cognitive and emotional behaviors. We suggest examination of multifactorial interactions of an individual’s genetic history, along with environmental risk factors, can assist in the characterization of executive functioning for that individual. Based upon the results of genetic studies we also propose genetic mapping as a probable diagnostic tool serving as a therapeutic adjunct for augmenting executive functioning capabilities. We conclude that preservation of the neurological underpinnings of executive functions requires the integrity of complex neural systems including the influence of specific genes and associated polymorphisms to provide adequate neurotransmission. PMID:22371275

  11. Multi-tissue RNA-seq and transcriptome characterisation of the spiny dogfish shark (Squalus acanthias provides a molecular tool for biological research and reveals new genes involved in osmoregulation.

    Directory of Open Access Journals (Sweden)

    Andres Chana-Munoz

    Full Text Available The spiny dogfish shark (Squalus acanthias is one of the most commonly used cartilaginous fishes in biological research, especially in the fields of nitrogen metabolism, ion transporters and osmoregulation. Nonetheless, transcriptomic data for this organism is scarce. In the present study, a multi-tissue RNA-seq experiment and de novo transcriptome assembly was performed in four different spiny dogfish tissues (brain, liver, kidney and ovary, providing an annotated sequence resource. The characterization of the transcriptome greatly increases the scarce sequence information for shark species. Reads were assembled with the Trinity de novo assembler both within each tissue and across all tissues combined resulting in 362,690 transcripts in the combined assembly which represent 289,515 Trinity genes. BUSCO analysis determined a level of 87% completeness for the combined transcriptome. In total, 123,110 proteins were predicted of which 78,679 and 83,164 had significant hits against the SwissProt and Uniref90 protein databases, respectively. Additionally, 61,215 proteins aligned to known protein domains, 7,208 carried a signal peptide and 15,971 possessed at least one transmembrane region. Based on the annotation, 81,582 transcripts were assigned to gene ontology terms and 42,078 belong to known clusters of orthologous groups (eggNOG. To demonstrate the value of our molecular resource, we show that the improved transcriptome data enhances the current possibilities of osmoregulation research in spiny dogfish by utilizing the novel gene and protein annotations to investigate a set of genes involved in urea synthesis and urea, ammonia and water transport, all of them crucial in osmoregulation. We describe the presence of different gene copies and isoforms of key enzymes involved in this process, including arginases and transporters of urea and ammonia, for which sequence information is currently absent in the databases for this model species. The

  12. Multi-tissue RNA-seq and transcriptome characterisation of the spiny dogfish shark (Squalus acanthias) provides a molecular tool for biological research and reveals new genes involved in osmoregulation.

    Science.gov (United States)

    Chana-Munoz, Andres; Jendroszek, Agnieszka; Sønnichsen, Malene; Kristiansen, Rune; Jensen, Jan K; Andreasen, Peter A; Bendixen, Christian; Panitz, Frank

    2017-01-01

    The spiny dogfish shark (Squalus acanthias) is one of the most commonly used cartilaginous fishes in biological research, especially in the fields of nitrogen metabolism, ion transporters and osmoregulation. Nonetheless, transcriptomic data for this organism is scarce. In the present study, a multi-tissue RNA-seq experiment and de novo transcriptome assembly was performed in four different spiny dogfish tissues (brain, liver, kidney and ovary), providing an annotated sequence resource. The characterization of the transcriptome greatly increases the scarce sequence information for shark species. Reads were assembled with the Trinity de novo assembler both within each tissue and across all tissues combined resulting in 362,690 transcripts in the combined assembly which represent 289,515 Trinity genes. BUSCO analysis determined a level of 87% completeness for the combined transcriptome. In total, 123,110 proteins were predicted of which 78,679 and 83,164 had significant hits against the SwissProt and Uniref90 protein databases, respectively. Additionally, 61,215 proteins aligned to known protein domains, 7,208 carried a signal peptide and 15,971 possessed at least one transmembrane region. Based on the annotation, 81,582 transcripts were assigned to gene ontology terms and 42,078 belong to known clusters of orthologous groups (eggNOG). To demonstrate the value of our molecular resource, we show that the improved transcriptome data enhances the current possibilities of osmoregulation research in spiny dogfish by utilizing the novel gene and protein annotations to investigate a set of genes involved in urea synthesis and urea, ammonia and water transport, all of them crucial in osmoregulation. We describe the presence of different gene copies and isoforms of key enzymes involved in this process, including arginases and transporters of urea and ammonia, for which sequence information is currently absent in the databases for this model species. The transcriptome

  13. An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods.

    Science.gov (United States)

    Valentini, Giorgio; Paccanaro, Alberto; Caniza, Horacio; Romero, Alfonso E; Re, Matteo

    2014-06-01

    In the context of "network medicine", gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different "informativeness" embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further bio-medical investigation. Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both

  14. Computational Identification of Novel Genes: Current and Future Perspectives.

    Science.gov (United States)

    Klasberg, Steffen; Bitard-Feildel, Tristan; Mallet, Ludovic

    2016-01-01

    While it has long been thought that all genomic novelties are derived from the existing material, many genes lacking homology to known genes were found in recent genome projects. Some of these novel genes were proposed to have evolved de novo, ie, out of noncoding sequences, whereas some have been shown to follow a duplication and divergence process. Their discovery called for an extension of the historical hypotheses about gene origination. Besides the theoretical breakthrough, increasing evidence accumulated that novel genes play important roles in evolutionary processes, including adaptation and speciation events. Different techniques are available to identify genes and classify them as novel. Their classification as novel is usually based on their similarity to known genes, or lack thereof, detected by comparative genomics or against databases. Computational approaches are further prime methods that can be based on existing models or leveraging biological evidences from experiments. Identification of novel genes remains however a challenging task. With the constant software and technologies updates, no gold standard, and no available benchmark, evaluation and characterization of genomic novelty is a vibrant field. In this review, the classical and state-of-the-art tools for gene prediction are introduced. The current methods for novel gene detection are presented; the methodological strategies and their limits are discussed along with perspective approaches for further studies.

  15. Preliminary screening of the radiosensitivity-associated genes on colorectal cancer

    International Nuclear Information System (INIS)

    Xing Chungen; Yang Xiaodong; Zhou Liying; Wu Yongyou; Jiang Yinfen; Dai Hong; Lv Xiaodong; Gong Wei

    2007-01-01

    The screening of radiosensitive genes of human colorectal cancer was made by gene chip. Two human colorectal cancer cell lines LOVO and SW480 were cultivated and the total RNA was extracted from at least lxl0 7 cells. Then the gene expression profiling was performed by HG-U133 Plus 2.0 Array and the difference of gene expression has been analyzed. The results shows that there are 16882 genes expressed in LOVO cell and 17114 genes expressed in SW480 cell through gene expression profiling. It has been found that the genes with 2-fold expressed differentially include 908 genes up-regulated and 1312 genes down-regulated. The same genes, such as Fas and NFkB which is up-regulated, Caspas6, and RAD21 which is down-regulated, have been proved to be related to radiosensitivity. The genes with high expression level including CEACAM5, THBS1, SERPINE2, ARL7, HPGD in LOVO cell may also be related to the radiosensitivity. And the genes with high expression level including SCD, NQ01, LYZ, KRT20, ATP1B1 in SW480 cell may be related to the radioresistance of human colorectal cancer. It could be concluded that the radiosensitivity of colorectal cancer can be reflected from gene and protein expression level. And gene expression profiling is a fast and sensitive tool to predict the radiosensitivity and screen radiosensitive genes of colorectal cancer. (authors)

  16. GENEASE: Real time bioinformatics tool for multi-omics and disease ontology exploration, analysis and visualization.

    Science.gov (United States)

    Ghandikota, Sudhir; Hershey, Gurjit K Khurana; Mersha, Tesfaye B

    2018-03-24

    Advances in high-throughput sequencing technologies have made it possible to generate multiple omics data at an unprecedented rate and scale. The accumulation of these omics data far outpaces the rate at which biologists can mine and generate new hypothesis to test experimentally. There is an urgent need to develop a myriad of powerful tools to efficiently and effectively search and filter these resources to address specific post-GWAS functional genomics questions. However, to date, these resources are scattered across several databases and often lack a unified portal for data annotation and analytics. In addition, existing tools to analyze and visualize these databases are highly fragmented, resulting researchers to access multiple applications and manual interventions for each gene or variant in an ad hoc fashion until all the questions are answered. In this study, we present GENEASE, a web-based one-stop bioinformatics tool designed to not only query and explore multi-omics and phenotype databases (e.g., GTEx, ClinVar, dbGaP, GWAS Catalog, ENCODE, Roadmap Epigenomics, KEGG, Reactome, Gene and Phenotype Ontology) in a single web interface but also to perform seamless post genome-wide association downstream functional and overlap analysis for non-coding regulatory variants. GENEASE accesses over 50 different databases in public domain including model organism-specific databases to facilitate gene/variant and disease exploration, enrichment and overlap analysis in real time. It is a user-friendly tool with point-and-click interface containing links for support information including user manual and examples. GENEASE can be accessed freely at http://research.cchmc.org/mershalab/genease_new/login.html. Tesfaye.Mersha@cchmc.org, Sudhir.Ghandikota@cchmc.org. Supplementary data are available at Bioinformatics online.

  17. Transcriptomic network analysis of micronuclei-related genes: a case study

    DEFF Research Database (Denmark)

    van Leeuwen, D. M.; Pedersen, Marie; Knudsen, Lisbeth E.

    2011-01-01

    checkpoint and aneuploidy. The MN-related gene network was tested against a transcriptomics case study associated with MN measurements. In this case study, transcriptomic data from children and adults differentially exposed to ambient air pollution in the Czech Republic were analysed and visualised......Mechanistically relevant information on responses of humans to xenobiotic exposure in relation to chemically induced biological effects, such as micronuclei (MN) formation can be obtained through large-scale transcriptomics studies. Network analysis may enhance the analysis and visualisation...... of such data. Therefore, this study aimed to develop a 'MN formation' network based on a priori knowledge, by using the pathway tool MetaCore. The gene network contained 27 genes and three gene complexes that are related to processes involved in MN formation, e.g. spindle assembly checkpoint, cell cycle...

  18. Application of Statistical Tools for Data Analysis and Interpretation in Rice Plant Pathology

    Directory of Open Access Journals (Sweden)

    Parsuram Nayak

    2018-01-01

    Full Text Available There has been a significant advancement in the application of statistical tools in plant pathology during the past four decades. These tools include multivariate analysis of disease dynamics involving principal component analysis, cluster analysis, factor analysis, pattern analysis, discriminant analysis, multivariate analysis of variance, correspondence analysis, canonical correlation analysis, redundancy analysis, genetic diversity analysis, and stability analysis, which involve in joint regression, additive main effects and multiplicative interactions, and genotype-by-environment interaction biplot analysis. The advanced statistical tools, such as non-parametric analysis of disease association, meta-analysis, Bayesian analysis, and decision theory, take an important place in analysis of disease dynamics. Disease forecasting methods by simulation models for plant diseases have a great potentiality in practical disease control strategies. Common mathematical tools such as monomolecular, exponential, logistic, Gompertz and linked differential equations take an important place in growth curve analysis of disease epidemics. The highly informative means of displaying a range of numerical data through construction of box and whisker plots has been suggested. The probable applications of recent advanced tools of linear and non-linear mixed models like the linear mixed model, generalized linear model, and generalized linear mixed models have been presented. The most recent technologies such as micro-array analysis, though cost effective, provide estimates of gene expressions for thousands of genes simultaneously and need attention by the molecular biologists. Some of these advanced tools can be well applied in different branches of rice research, including crop improvement, crop production, crop protection, social sciences as well as agricultural engineering. The rice research scientists should take advantage of these new opportunities adequately in

  19. BRED: a simple and powerful tool for constructing mutant and recombinant bacteriophage genomes.

    Directory of Open Access Journals (Sweden)

    Laura J Marinelli

    Full Text Available Advances in DNA sequencing technology have facilitated the determination of hundreds of complete genome sequences both for bacteria and their bacteriophages. Some of these bacteria have well-developed and facile genetic systems for constructing mutants to determine gene function, and recombineering is a particularly effective tool. However, generally applicable methods for constructing defined mutants of bacteriophages are poorly developed, in part because of the inability to use selectable markers such as drug resistance genes during viral lytic growth. Here we describe a method for simple and effective directed mutagenesis of bacteriophage genomes using Bacteriophage Recombineering of Electroporated DNA (BRED, in which a highly efficient recombineering system is utilized directly on electroporated phage DNA; no selection is required and mutants can be readily detected by PCR. We describe the use of BRED to construct unmarked gene deletions, in-frame internal deletions, base substitutions, precise gene replacements, and the addition of gene tags.

  20. Bidirectional manipulation of gene expression in adipocytes using CRISPRa and siRNA

    DEFF Research Database (Denmark)

    Lundh, Morten; Pluciñska, Kaja; Isidor, Marie S

    2017-01-01

    OBJECTIVE: Functional investigation of novel gene/protein targets associated with adipocyte differentiation or function heavily relies on efficient and accessible tools to manipulate gene expression in adipocytes in vitro. Recent advances in gene-editing technologies such as CRISPR-Cas9 have...... not only eased gene editing but also greatly facilitated modulation of gene expression without altering the genome. Here, we aimed to develop and validate a competent in vitro adipocyte model of controllable functionality as well as multiplexed gene manipulation in adipocytes, using the CRISPRa "SAM......" system and siRNAs to simultaneously overexpress and silence selected genes in the same cell populations. METHODS: We introduced a stable expression of dCas9-VP64 and MS2-P65, the core components of the CRIPSRa SAM system, in mesenchymal C3H/10T1/2 cells through viral delivery and used guide RNAs...

  1. Molecular analysis of the NDP gene in two families with Norrie disease.

    Science.gov (United States)

    Rivera-Vega, M Refugio; Chiñas-Lopez, Silvet; Vaca, Ana Luisa Jimenez; Arenas-Sordo, M Luz; Kofman-Alfaro, Susana; Messina-Baas, Olga; Cuevas-Covarrubias, Sergio Alberto

    2005-04-01

    To describe the molecular defects in the Norrie disease protein (NDP) gene in two families with Norrie disease (ND). We analysed two families with ND at molecular level through polymerase chain reaction, DNA sequence analysis and GeneScan. Two molecular defects found in the NDP gene were: a missense mutation (265C > G) within codon 97 that resulted in the interchange of arginine by proline, and a partial deletion in the untranslated 3' region of exon 3 of the NDP gene. Clinical findings were more severe in the family that presented the partial deletion. We also diagnosed the carrier status of one daughter through GeneScan; this method proved to be a useful tool for establishing female carriers of ND. Here we report two novel mutations in the NDP gene in Mexican patients and propose that GeneScan is a viable mean of establishing ND carrier status.

  2. In silico tools for the analysis of antibiotic biosynthetic pathways

    DEFF Research Database (Denmark)

    Weber, Tilmann

    2014-01-01

    Natural products of bacteria and fungi are the most important source for antimicrobial drug leads. For decades, such compounds were exclusively found by chemical/bioactivity-guided screening approaches. The rapid progress in sequencing technologies only recently allowed the development of novel...... screening methods based on the genome sequences of potential producing organisms. The basic principle of such genome mining approaches is to identify genes, which are involved in the biosynthesis of such molecules, and to predict the products of the identified pathways. Thus, bioinformatics methods...... and tools are crucial for genome mining. In this review, a comprehensive overview is given on programs and databases for the identification and analysis of antibiotic biosynthesis gene clusters in genomic data....

  3. Annotating the Function of the Human Genome with Gene Ontology and Disease Ontology.

    Science.gov (United States)

    Hu, Yang; Zhou, Wenyang; Ren, Jun; Dong, Lixiang; Wang, Yadong; Jin, Shuilin; Cheng, Liang

    2016-01-01

    Increasing evidences indicated that function annotation of human genome in molecular level and phenotype level is very important for systematic analysis of genes. In this study, we presented a framework named Gene2Function to annotate Gene Reference into Functions (GeneRIFs), in which each functional description of GeneRIFs could be annotated by a text mining tool Open Biomedical Annotator (OBA), and each Entrez gene could be mapped to Human Genome Organisation Gene Nomenclature Committee (HGNC) gene symbol. After annotating all the records about human genes of GeneRIFs, 288,869 associations between 13,148 mRNAs and 7,182 terms, 9,496 associations between 948 microRNAs and 533 terms, and 901 associations between 139 long noncoding RNAs (lncRNAs) and 297 terms were obtained as a comprehensive annotation resource of human genome. High consistency of term frequency of individual gene (Pearson correlation = 0.6401, p = 2.2e - 16) and gene frequency of individual term (Pearson correlation = 0.1298, p = 3.686e - 14) in GeneRIFs and GOA shows our annotation resource is very reliable.

  4. A Next Generation Sequencing custom gene panel as first line diagnostic tool for atypical cases of syndromic obesity: Application in a case of Alström syndrome.

    Science.gov (United States)

    Maltese, Paolo E; Iarossi, Giancarlo; Ziccardi, Lucia; Colombo, Leonardo; Buzzonetti, Luca; Crinò, Antonino; Tezzele, Silvia; Bertelli, Matteo

    2018-02-01

    Obesity phenotype can be manifested as an isolated trait or accompanied by multisystem disorders as part of a syndromic picture. In both situations, same molecular pathways may be involved to different degrees. This evidence is stronger in syndromic obesity, in which phenotypes of different syndromes may overlap. In these cases, genetic testing can unequivocally provide a final diagnosis. Here we describe a patient who met the diagnostic criteria for Alström syndrome only during adolescence. Genetic testing was requested at 25 years of age for a final confirmation of the diagnosis. The genetic diagnosis of Alström syndrome was obtained through a Next Generation Sequencing genetic test approach using a custom-designed gene panel of 47 genes associated with syndromic and non-syndromic obesity. Genetic analysis revealed a novel homozygous frameshift variant p.(Arg1550Lysfs*10) on exon 8 of the ALMS1 gene. This case shows the need for a revision of the diagnostic criteria guidelines, as a consequence of the recent advent of massive parallel sequencing technology. Indications for genetic testing reported in these currently accepted diagnostic criteria for Alström syndrome, were drafted when sequencing was expensive and time consuming. Nowadays, Next Generation Sequencing testing could be considered as first line diagnostic tool not only for Alström syndrome but, more generally, for all those atypical or not clearly distinguishable cases of syndromic obesity, thus avoiding delayed diagnosis and treatments. Early diagnosis permits a better follow-up and pre-symptomatic interventions. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  5. Tool path in torus tool CNC machining

    Directory of Open Access Journals (Sweden)

    XU Ying

    2016-10-01

    Full Text Available This paper is about tool path in torus tool CNC machining.The mathematical model of torus tool is established.The tool path planning algorithm is determined through calculation of the cutter location,boundary discretization,calculation of adjacent tool path and so on,according to the conversion formula,the cutter contact point will be converted to the cutter location point and then these points fit a toolpath.Lastly,the path planning algorithm is implemented by using Matlab programming.The cutter location points for torus tool are calculated by Matlab,and then fit these points to a toolpath.While using UG software,another tool path of free surface is simulated of the same data.It is drew compared the two tool paths that using torus tool is more efficient.

  6. Wetlab-2 - Quantitative PCR Tools for Spaceflight Studies of Gene Expression Aboard the International Space Station

    Science.gov (United States)

    Schonfeld, Julie E.

    2015-01-01

    Wetlab-2 is a research platform for conducting real-time quantitative gene expression analysis aboard the International Space Station. The system enables spaceflight genomic studies involving a wide variety of biospecimen types in the unique microgravity environment of space. Currently, gene expression analyses of space flown biospecimens must be conducted post flight after living cultures or frozen or chemically fixed samples are returned to Earth from the space station. Post-flight analysis is limited for several reasons. First, changes in gene expression can be transient, changing over a timescale of minutes. The delay between sampling on Earth can range from days to months, and RNA may degrade during this period of time, even in fixed or frozen samples. Second, living organisms that return to Earth may quickly re-adapt to terrestrial conditions. Third, forces exerted on samples during reentry and return to Earth may affect results. Lastly, follow up experiments designed in response to post-flight results must wait for a new flight opportunity to be tested.

  7. Isolation and characterization of a floral homeotic gene in Fraxinus nigra causing earlier flowering and homeotic alterations in transgenic Arabidopsis

    Science.gov (United States)

    Jun Hyung Lee; Paula M. Pijut

    2017-01-01

    Reproductive sterility, which can be obtained by manipulating floral organ identity genes, is an important tool for gene containment of genetically engineered trees. In Arabidopsis, AGAMOUS (AG) is the only C-class gene responsible for both floral meristem determinacy and floral organ identity, and its mutations produce...

  8. Array data extractor (ADE): a LabVIEW program to extract and merge gene array data.

    Science.gov (United States)

    Kurtenbach, Stefan; Kurtenbach, Sarah; Zoidl, Georg

    2013-12-01

    Large data sets from gene expression array studies are publicly available offering information highly valuable for research across many disciplines ranging from fundamental to clinical research. Highly advanced bioinformatics tools have been made available to researchers, but a demand for user-friendly software allowing researchers to quickly extract expression information for multiple genes from multiple studies persists. Here, we present a user-friendly LabVIEW program to automatically extract gene expression data for a list of genes from multiple normalized microarray datasets. Functionality was tested for 288 class A G protein-coupled receptors (GPCRs) and expression data from 12 studies comparing normal and diseased human hearts. Results confirmed known regulation of a beta 1 adrenergic receptor and further indicate novel research targets. Although existing software allows for complex data analyses, the LabVIEW based program presented here, "Array Data Extractor (ADE)", provides users with a tool to retrieve meaningful information from multiple normalized gene expression datasets in a fast and easy way. Further, the graphical programming language used in LabVIEW allows applying changes to the program without the need of advanced programming knowledge.

  9. Analysis and prediction of gene splice sites in four Aspergillus genomes

    DEFF Research Database (Denmark)

    Wang, Kai; Ussery, David; Brunak, Søren

    2009-01-01

    Several Aspergillus fungal genomic sequences have been published, with many more in progress. Obviously, it is essential to have high-quality, consistently annotated sets of proteins from each of the genomes, in order to make meaningful comparisons. We have developed a dedicated, publicly available......, splice site prediction program called NetAspGene, for the genus Aspergillus. Gene sequences from Aspergillus fumigatus, the most common mould pathogen, were used to build and test our model. Compared to many animals and plants, Aspergillus contains smaller introns; thus we have applied a larger window...... better splice site prediction than other available tools. NetAspGene will be very helpful for the study in Aspergillus splice sites and especially in alternative splicing. A webpage for NetAspGene is publicly available at http://www.cbs.dtu.dk/services/NetAspGene....

  10. FurIOS: a web-based tool for identification of Vibrionaceae species using the fur gene

    DEFF Research Database (Denmark)

    Machado, Henrique; Cardoso, Joao; Giubergia, Sonia

    2017-01-01

    -sequence. The input is a DNA sequence that can be uploaded on the web service; the output is a table containing the strain identifier, e-value, and percentage of identity for each of the matches with rows colored in green for hits with high probability of being the same species. The service is available on the web at......: http://www.cbs.dtu.dk/services/furIOS-1.0/. The fur-sequences can be derived either from genome sequences or from PCR-amplification of the genomic region encoding the fur gene. We have used 191 strains identified as Vibrionaceae based on 16S rRNA gene sequence to test the PCR method and the web service...

  11. Magnetic nanoparticles for targeted therapeutic gene delivery and magnetic-inducing heating on hepatoma

    International Nuclear Information System (INIS)

    Yuan, Chenyan; Zhang, Jia; Li, Hongbo; Zhang, Hao; Wang, Ling; Zhang, Dongsheng; An, Yanli

    2014-01-01

    Gene therapy holds great promise for treating cancers, but their clinical applications are being hampered due to uncontrolled gene delivery and expression. To develop a targeted, safe and efficient tumor therapy system, we constructed a tissue-specific suicide gene delivery system by using magnetic nanoparticles (MNPs) as carriers for the combination of gene therapy and hyperthermia on hepatoma. The suicide gene was hepatoma-targeted and hypoxia-enhanced, and the MNPs possessed the ability to elevate temperature to the effective range for tumor hyperthermia as imposed on an alternating magnetic field (AMF). The tumoricidal effects of targeted gene therapy associated with hyperthermia were evaluated in vitro and in vivo. The experiment demonstrated that hyperthermia combined with a targeted gene therapy system proffer an effective tool for tumor therapy with high selectivity and the synergistic effect of hepatoma suppression. (paper)

  12. GeneChip expression profiling reveals the alterations of energy metabolism related genes in osteocytes under large gradient high magnetic fields.

    Science.gov (United States)

    Wang, Yang; Chen, Zhi-Hao; Yin, Chun; Ma, Jian-Hua; Li, Di-Jie; Zhao, Fan; Sun, Yu-Long; Hu, Li-Fang; Shang, Peng; Qian, Ai-Rong

    2015-01-01

    The diamagnetic levitation as a novel ground-based model for simulating a reduced gravity environment has recently been applied in life science research. In this study a specially designed superconducting magnet with a large gradient high magnetic field (LG-HMF), which can provide three apparent gravity levels (μ-g, 1-g, and 2-g), was used to simulate a space-like gravity environment. Osteocyte, as the most important mechanosensor in bone, takes a pivotal position in mediating the mechano-induced bone remodeling. In this study, the effects of LG-HMF on gene expression profiling of osteocyte-like cell line MLO-Y4 were investigated by Affymetrix DNA microarray. LG-HMF affected osteocyte gene expression profiling. Differentially expressed genes (DEGs) and data mining were further analyzed by using bioinfomatic tools, such as DAVID, iReport. 12 energy metabolism related genes (PFKL, AK4, ALDOC, COX7A1, STC1, ADM, CA9, CA12, P4HA1, APLN, GPR35 and GPR84) were further confirmed by real-time PCR. An integrated gene interaction network of 12 DEGs was constructed. Bio-data mining showed that genes involved in glucose metabolic process and apoptosis changed notablly. Our results demostrated that LG-HMF affected the expression of energy metabolism related genes in osteocyte. The identification of sensitive genes to special environments may provide some potential targets for preventing and treating bone loss or osteoporosis.

  13. GeneChip expression profiling reveals the alterations of energy metabolism related genes in osteocytes under large gradient high magnetic fields.

    Directory of Open Access Journals (Sweden)

    Yang Wang

    Full Text Available The diamagnetic levitation as a novel ground-based model for simulating a reduced gravity environment has recently been applied in life science research. In this study a specially designed superconducting magnet with a large gradient high magnetic field (LG-HMF, which can provide three apparent gravity levels (μ-g, 1-g, and 2-g, was used to simulate a space-like gravity environment. Osteocyte, as the most important mechanosensor in bone, takes a pivotal position in mediating the mechano-induced bone remodeling. In this study, the effects of LG-HMF on gene expression profiling of osteocyte-like cell line MLO-Y4 were investigated by Affymetrix DNA microarray. LG-HMF affected osteocyte gene expression profiling. Differentially expressed genes (DEGs and data mining were further analyzed by using bioinfomatic tools, such as DAVID, iReport. 12 energy metabolism related genes (PFKL, AK4, ALDOC, COX7A1, STC1, ADM, CA9, CA12, P4HA1, APLN, GPR35 and GPR84 were further confirmed by real-time PCR. An integrated gene interaction network of 12 DEGs was constructed. Bio-data mining showed that genes involved in glucose metabolic process and apoptosis changed notablly. Our results demostrated that LG-HMF affected the expression of energy metabolism related genes in osteocyte. The identification of sensitive genes to special environments may provide some potential targets for preventing and treating bone loss or osteoporosis.

  14. Identification of mechanosensitive genes during skeletal development: alteration of genes associated with cytoskeletal rearrangement and cell signalling pathways.

    Science.gov (United States)

    Rolfe, Rebecca A; Nowlan, Niamh C; Kenny, Elaine M; Cormican, Paul; Morris, Derek W; Prendergast, Patrick J; Kelly, Daniel; Murphy, Paula

    2014-01-20

    Mechanical stimulation is necessary for regulating correct formation of the skeleton. Here we test the hypothesis that mechanical stimulation of the embryonic skeletal system impacts expression levels of genes implicated in developmentally important signalling pathways in a genome wide approach. We use a mutant mouse model with altered mechanical stimulation due to the absence of limb skeletal muscle (Splotch-delayed) where muscle-less embryos show specific defects in skeletal elements including delayed ossification, changes in the size and shape of cartilage rudiments and joint fusion. We used Microarray and RNA sequencing analysis tools to identify differentially expressed genes between muscle-less and control embryonic (TS23) humerus tissue. We found that 680 independent genes were down-regulated and 452 genes up-regulated in humeri from muscle-less Spd embryos compared to littermate controls (at least 2-fold; corrected p-value ≤0.05). We analysed the resulting differentially expressed gene sets using Gene Ontology annotations to identify significant enrichment of genes associated with particular biological processes, showing that removal of mechanical stimuli from muscle contractions affected genes associated with development and differentiation, cytoskeletal architecture and cell signalling. Among cell signalling pathways, the most strongly disturbed was Wnt signalling, with 34 genes including 19 pathway target genes affected. Spatial gene expression analysis showed that both a Wnt ligand encoding gene (Wnt4) and a pathway antagonist (Sfrp2) are up-regulated specifically in the developing joint line, while the expression of a Wnt target gene, Cd44, is no longer detectable in muscle-less embryos. The identification of 84 genes associated with the cytoskeleton that are down-regulated in the absence of muscle indicates a number of candidate genes that are both mechanoresponsive and potentially involved in mechanotransduction, converting a mechanical stimulus

  15. CRISPR-Cas Targeting of Host Genes as an Antiviral Strategy.

    Science.gov (United States)

    Chen, Shuliang; Yu, Xiao; Guo, Deyin

    2018-01-16

    Currently, a new gene editing tool-the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) associated (Cas) system-is becoming a promising approach for genetic manipulation at the genomic level. This simple method, originating from the adaptive immune defense system in prokaryotes, has been developed and applied to antiviral research in humans. Based on the characteristics of virus-host interactions and the basic rules of nucleic acid cleavage or gene activation of the CRISPR-Cas system, it can be used to target both the virus genome and host factors to clear viral reservoirs and prohibit virus infection or replication. Here, we summarize recent progress of the CRISPR-Cas technology in editing host genes as an antiviral strategy.

  16. Quantification of differential gene expression by multiplexed targeted resequencing of cDNA

    Science.gov (United States)

    Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.

    2017-01-01

    Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677

  17. PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.

    Science.gov (United States)

    Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay

    2015-12-01

    A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.

  18. Serum as a modulator of lipoplex-mediated gene transfection : dependence of amphiphile, cell type and complex stability

    NARCIS (Netherlands)

    Audouy, S; Molema, G; de Leij, L; Hoekstra, D

    2000-01-01

    Background Cationic liposomes belong to the family of non-viral vectors for gene delivery. Despite several drawbacks, such as low efficiency compared to viruses and inactivation by serum, cationic liposomes remain a promising tool for gene therapy. Therefore further investigation of the mechanism of

  19. Unstable Expression of Commonly Used Reference Genes in Rat Pancreatic Islets Early after Isolation Affects Results of Gene Expression Studies.

    Directory of Open Access Journals (Sweden)

    Lucie Kosinová

    Full Text Available The use of RT-qPCR provides a powerful tool for gene expression studies; however, the proper interpretation of the obtained data is crucially dependent on accurate normalization based on stable reference genes. Recently, strong evidence has been shown indicating that the expression of many commonly used reference genes may vary significantly due to diverse experimental conditions. The isolation of pancreatic islets is a complicated procedure which creates severe mechanical and metabolic stress leading possibly to cellular damage and alteration of gene expression. Despite of this, freshly isolated islets frequently serve as a control in various gene expression and intervention studies. The aim of our study was to determine expression of 16 candidate reference genes and one gene of interest (F3 in isolated rat pancreatic islets during short-term cultivation in order to find a suitable endogenous control for gene expression studies. We compared the expression stability of the most commonly used reference genes and evaluated the reliability of relative and absolute quantification using RT-qPCR during 0-120 hrs after isolation. In freshly isolated islets, the expression of all tested genes was markedly depressed and it increased several times throughout the first 48 hrs of cultivation. We observed significant variability among samples at 0 and 24 hrs but substantial stabilization from 48 hrs onwards. During the first 48 hrs, relative quantification failed to reflect the real changes in respective mRNA concentrations while in the interval 48-120 hrs, the relative expression generally paralleled the results determined by absolute quantification. Thus, our data call into question the suitability of relative quantification for gene expression analysis in pancreatic islets during the first 48 hrs of cultivation, as the results may be significantly affected by unstable expression of reference genes. However, this method could provide reliable information

  20. TabSQL: a MySQL tool to facilitate mapping user data to public databases.

    Science.gov (United States)

    Xia, Xiao-Qin; McClelland, Michael; Wang, Yipeng

    2010-06-23

    With advances in high-throughput genomics and proteomics, it is challenging for biologists to deal with large data files and to map their data to annotations in public databases. We developed TabSQL, a MySQL-based application tool, for viewing, filtering and querying data files with large numbers of rows. TabSQL provides functions for downloading and installing table files from public databases including the Gene Ontology database (GO), the Ensembl databases, and genome databases from the UCSC genome bioinformatics site. Any other database that provides tab-delimited flat files can also be imported. The downloaded gene annotation tables can be queried together with users' data in TabSQL using either a graphic interface or command line. TabSQL allows queries across the user's data and public databases without programming. It is a convenient tool for biologists to annotate and enrich their data.

  1. Ligand-Modified Human Serum Albumin Nanoparticles for Enhanced Gene Delivery.

    Science.gov (United States)

    Look, Jennifer; Wilhelm, Nadine; von Briesen, Hagen; Noske, Nadja; Günther, Christine; Langer, Klaus; Gorjup, Erwin

    2015-09-08

    The development of nonviral gene delivery systems is a great challenge to enable safe gene therapy. In this study, ligand-modified nanoparticles based on human serum albumin (HSA) were developed and optimized for an efficient gene therapy. Different glutaraldehyde cross-linking degrees were investigated to optimize the HSA nanoparticles for gene delivery. The peptide sequence arginine-glycine-aspartate (RGD) and the HIV-1 transactivator of transduction sequence (Tat) are well-known as promising targeting ligands. Plasmid DNA loaded HSA nanoparticles were covalently modified on their surface with these different ligands. The transfection potential of the obtained plasmid DNA loaded RGD- and Tat-modified nanoparticles was investigated in vitro, and optimal incubation conditions for these preparations were studied. It turned out that Tat-modified HSA nanoparticles with the lowest cross-linking degree of 20% showed the highest transfection potential. Taken together, ligand-functionalized HSA nanoparticles represent promising tools for efficient and safe gene therapy.

  2. Gene annotation from scientific literature using mappings between keyword systems.

    Science.gov (United States)

    Pérez, Antonio J; Perez-Iratxeta, Carolina; Bork, Peer; Thode, Guillermo; Andrade, Miguel A

    2004-09-01

    The description of genes in databases by keywords helps the non-specialist to quickly grasp the properties of a gene and increases the efficiency of computational tools that are applied to gene data (e.g. searching a gene database for sequences related to a particular biological process). However, the association of keywords to genes or protein sequences is a difficult process that ultimately implies examination of the literature related to a gene. To support this task, we present a procedure to derive keywords from the set of scientific abstracts related to a gene. Our system is based on the automated extraction of mappings between related terms from different databases using a model of fuzzy associations that can be applied with all generality to any pair of linked databases. We tested the system by annotating genes of the SWISS-PROT database with keywords derived from the abstracts linked to their entries (stored in the MEDLINE database of scientific references). The performance of the annotation procedure was much better for SWISS-PROT keywords (recall of 47%, precision of 68%) than for Gene Ontology terms (recall of 8%, precision of 67%). The algorithm can be publicly accessed and used for the annotation of sequences through a web server at http://www.bork.embl.de/kat

  3. WGDB: Wood Gene Database with search interface.

    Science.gov (United States)

    Goyal, Neha; Ginwal, H S

    2014-01-01

    Wood quality can be defined in terms of particular end use with the involvement of several traits. Over the last fifteen years researchers have assessed the wood quality traits in forest trees. The wood quality was categorized as: cell wall biochemical traits, fibre properties include the microfibril angle, density and stiffness in loblolly pine [1]. The user friendly and an open-access database has been developed named Wood Gene Database (WGDB) for describing the wood genes along the information of protein and published research articles. It contains 720 wood genes from species namely Pinus, Deodar, fast growing trees namely Poplar, Eucalyptus. WGDB designed to encompass the majority of publicly accessible genes codes for cellulose, hemicellulose and lignin in tree species which are responsive to wood formation and quality. It is an interactive platform for collecting, managing and searching the specific wood genes; it also enables the data mining relate to the genomic information specifically in Arabidopsis thaliana, Populus trichocarpa, Eucalyptus grandis, Pinus taeda, Pinus radiata, Cedrus deodara, Cedrus atlantica. For user convenience, this database is cross linked with public databases namely NCBI, EMBL & Dendrome with the search engine Google for making it more informative and provides bioinformatics tools named BLAST,COBALT. The database is freely available on www.wgdb.in.

  4. GOMA: functional enrichment analysis tool based on GO modules

    Institute of Scientific and Technical Information of China (English)

    Qiang Huang; Ling-Yun Wu; Yong Wang; Xiang-Sun Zhang

    2013-01-01

    Analyzing the function of gene sets is a critical step in interpreting the results of high-throughput experiments in systems biology.A variety of enrichment analysis tools have been developed in recent years,but most output a long list of significantly enriched terms that are often redundant,making it difficult to extract the most meaningful functions.In this paper,we present GOMA,a novel enrichment analysis method based on the new concept of enriched functional Gene Ontology (GO) modules.With this method,we systematically revealed functional GO modules,i.e.,groups of functionally similar GO terms,via an optimization model and then ranked them by enrichment scores.Our new method simplifies enrichment analysis results by reducing redundancy,thereby preventing inconsistent enrichment results among functionally similar terms and providing more biologically meaningful results.

  5. Use of deep whole-genome sequencing data to identify structure risk variants in breast cancer susceptibility genes.

    Science.gov (United States)

    Guo, Xingyi; Shi, Jiajun; Cai, Qiuyin; Shu, Xiao-Ou; He, Jing; Wen, Wanqing; Allen, Jamie; Pharoah, Paul; Dunning, Alison; Hunter, David J; Kraft, Peter; Easton, Douglas F; Zheng, Wei; Long, Jirong

    2018-03-01

    Functional disruptions of susceptibility genes by large genomic structure variant (SV) deletions in germlines are known to be associated with cancer risk. However, few studies have been conducted to systematically search for SV deletions in breast cancer susceptibility genes. We analysed deep (> 30x) whole-genome sequencing (WGS) data generated in blood samples from 128 breast cancer patients of Asian and European descent with either a strong family history of breast cancer or early cancer onset disease. To identify SV deletions in known or suspected breast cancer susceptibility genes, we used multiple SV calling tools including Genome STRiP, Delly, Manta, BreakDancer and Pindel. SV deletions were detected by at least three of these bioinformatics tools in five genes. Specifically, we identified heterozygous deletions covering a fraction of the coding regions of BRCA1 (with approximately 80kb in two patients), and TP53 genes (with ∼1.6 kb in two patients), and of intronic regions (∼1 kb) of the PALB2 (one patient), PTEN (three patients) and RAD51C genes (one patient). We confirmed the presence of these deletions using real-time quantitative PCR (qPCR). Our study identified novel SV deletions in breast cancer susceptibility genes and the identification of such SV deletions may improve clinical testing.

  6. Molecular studies on the function of tumor suppressor gene in gastrointestinal cancer

    International Nuclear Information System (INIS)

    Kim, You Cheoul

    1993-01-01

    Cancer of stomach, colon and liver are a group of the most common cancer in Korea. However, results with current therapeutic modalities are still unsatisfactory. The intensive efforts have been made to understand basic pathogenesis and to find better therapeutic tools for the treatment of this miserable disease. We studies the alteration of tumor suppressor gene in various Gastrointestinal cancer in Korea. Results showed that genetic alteration of Rb gene was in 83% of colorectal cancer. Our results suggest that genetic alteration of Rb gene is crucially involved in the tumorigenesis of colorectum in Korea. (Author)

  7. PASTEC: an automatic transposable element classification tool.

    Directory of Open Access Journals (Sweden)

    Claire Hoede

    Full Text Available SUMMARY: The classification of transposable elements (TEs is key step towards deciphering their potential impact on the genome. However, this process is often based on manual sequence inspection by TE experts. With the wealth of genomic sequences now available, this task requires automation, making it accessible to most scientists. We propose a new tool, PASTEC, which classifies TEs by searching for structural features and similarities. This tool outperforms currently available software for TE classification. The main innovation of PASTEC is the search for HMM profiles, which is useful for inferring the classification of unknown TE on the basis of conserved functional domains of the proteins. In addition, PASTEC is the only tool providing an exhaustive spectrum of possible classifications to the order level of the Wicker hierarchical TE classification system. It can also automatically classify other repeated elements, such as SSR (Simple Sequence Repeats, rDNA or potential repeated host genes. Finally, the output of this new tool is designed to facilitate manual curation by providing to biologists with all the evidence accumulated for each TE consensus. AVAILABILITY: PASTEC is available as a REPET module or standalone software (http://urgi.versailles.inra.fr/download/repet/REPET_linux-x64-2.2.tar.gz. It requires a Unix-like system. There are two standalone versions: one of which is parallelized (requiring Sun grid Engine or Torque, and the other of which is not.

  8. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes

    Science.gov (United States)

    Seo, Minseok; Shin, Su-kyung; Kwon, Eun-Young; Kim, Sung-Eun; Bae, Yun-Jung; Lee, Seungyeoun; Sung, Mi-Kyung; Choi, Myung-Sook; Park, Taesung

    2016-01-01

    Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs) among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs). However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods. Through analysis of

  9. A Model-Based Joint Identification of Differentially Expressed Genes and Phenotype-Associated Genes.

    Directory of Open Access Journals (Sweden)

    Samuel Sunghwan Cho

    Full Text Available Over the last decade, many analytical methods and tools have been developed for microarray data. The detection of differentially expressed genes (DEGs among different treatment groups is often a primary purpose of microarray data analysis. In addition, association studies investigating the relationship between genes and a phenotype of interest such as survival time are also popular in microarray data analysis. Phenotype association analysis provides a list of phenotype-associated genes (PAGs. However, it is sometimes necessary to identify genes that are both DEGs and PAGs. We consider the joint identification of DEGs and PAGs in microarray data analyses. The first approach we used was a naïve approach that detects DEGs and PAGs separately and then identifies the genes in an intersection of the list of PAGs and DEGs. The second approach we considered was a hierarchical approach that detects DEGs first and then chooses PAGs from among the DEGs or vice versa. In this study, we propose a new model-based approach for the joint identification of DEGs and PAGs. Unlike the previous two-step approaches, the proposed method identifies genes simultaneously that are DEGs and PAGs. This method uses standard regression models but adopts different null hypothesis from ordinary regression models, which allows us to perform joint identification in one-step. The proposed model-based methods were evaluated using experimental data and simulation studies. The proposed methods were used to analyze a microarray experiment in which the main interest lies in detecting genes that are both DEGs and PAGs, where DEGs are identified between two diet groups and PAGs are associated with four phenotypes reflecting the expression of leptin, adiponectin, insulin-like growth factor 1, and insulin. Model-based approaches provided a larger number of genes, which are both DEGs and PAGs, than other methods. Simulation studies showed that they have more power than other methods

  10. Genomic variation in Salmonella enterica core genes for epidemiological typing

    DEFF Research Database (Denmark)

    Leekitcharoenphon, Pimlapas; Lukjancenko, Oksana; Rundsten, Carsten Friis

    2012-01-01

    Background: Technological advances in high throughput genome sequencing are making whole genome sequencing (WGS) available as a routine tool for bacterial typing. Standardized procedures for identification of relevant genes and of variation are needed to enable comparison between studies and over...... genomes and evaluate their value as typing targets, comparing whole genome typing and traditional methods such as 16S and MLST. A consensus tree based on variation of core genes gives much better resolution than 16S and MLST; the pan-genome family tree is similar to the consensus tree, but with higher...... that there is a positive selection towards mutations leading to amino acid changes. Conclusions: Genomic variation within the core genome is useful for investigating molecular evolution and providing candidate genes for bacterial genome typing. Identification of genes with different degrees of variation is important...

  11. Gene Expression Analysis of Plum pox virus (Sharka) Susceptibility/Resistance in Apricot (Prunus armeniaca L.).

    Science.gov (United States)

    Rubio, Manuel; Ballester, Ana Rosa; Olivares, Pedro Manuel; Castro de Moura, Manuel; Dicenta, Federico; Martínez-Gómez, Pedro

    2015-01-01

    RNA-Seq has proven to be a very powerful tool in the analysis of the Plum pox virus (PPV, sharka disease)/Prunus interaction. This technique is an important complementary tool to other means of studying genomics. In this work an analysis of gene expression of resistance/susceptibility to PPV in apricot is performed. RNA-Seq has been applied to analyse the gene expression changes induced by PPV infection in leaves from two full-sib apricot genotypes, "Rojo Pasión" and "Z506-7", resistant and susceptible to PPV, respectively. Transcriptomic analyses revealed the existence of more than 2,000 genes related to the pathogen response and resistance to PPV in apricot. These results showed that the response to infection by the virus in the susceptible genotype is associated with an induction of genes involved in pathogen resistance such as the allene oxide synthase, S-adenosylmethionine synthetase 2 and the major MLP-like protein 423. Over-expression of the Dicer protein 2a may indicate the suppression of a gene silencing mechanism of the plant by PPV HCPro and P1 PPV proteins. On the other hand, there were 164 genes involved in resistance mechanisms that have been identified in apricot, 49 of which are located in the PPVres region (scaffold 1 positions from 8,050,804 to 8,244,925), which is responsible for PPV resistance in apricot. Among these genes in apricot there are several MATH domain-containing genes, although other genes inside (Pleiotropic drug resistance 9 gene) or outside (CAP, Cysteine-rich secretory proteins, Antigen 5 and Pathogenesis-related 1 protein; and LEA, Late embryogenesis abundant protein) PPVres region could also be involved in the resistance.

  12. Analysis of aberrant methylation on promoter sequences of tumor suppressor genes and total DNA in sputum samples: a promising tool for early detection of COPD and lung cancer in smokers

    Directory of Open Access Journals (Sweden)

    Guzmán Leda

    2012-07-01

    Full Text Available Abstract Background Chronic obstructive pulmonary disease (COPD is a disorder associated to cigarette smoke and lung cancer (LC. Since epigenetic changes in oncogenes and tumor suppressor genes (TSGs are clearly important in the development of LC. In this study, we hypothesize that tobacco smokers are susceptible for methylation in the promoter region of TSGs in airway epithelial cells when compared with non-smoker subjects. The purpose of this study was to investigate the usefulness of detection of genes promoter methylation in sputum specimens, as a complementary tool to identify LC biomarkers among smokers with early COPD. Methods We determined the amount of DNA in induced sputum from patients with COPD (n = 23, LC (n = 26, as well as in healthy subjects (CTR (n = 33, using a commercial kit for DNA purification, followed by absorbance measurement at 260 nm. The frequency of CDKN2A, CDH1 and MGMT promoter methylation in the same groups was determined by methylation-specific polymerase chain reaction (MSP. The Fisher’s exact test was employed to compare frequency of results between different groups. Results DNA concentration was 7.4 and 5.8 times higher in LC and COPD compared to the (CTR (p  Conclusions We provide evidence that aberrant methylation of TSGs in samples of induced sputum is a useful tool for early diagnostic of lung diseases (LC and COPD in smoker subjects. Virtual slides The abstract MUST finish with the following text: Virtual Slides The virtual slide(s for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1127865005664160

  13. Engineering bacterial translation initiation - Do we have all the tools we need?

    Science.gov (United States)

    Vigar, Justin R J; Wieden, Hans-Joachim

    2017-11-01

    Reliable tools that allow precise and predictable control over gene expression are critical for the success of nearly all bioengineering applications. Translation initiation is the most regulated phase during protein biosynthesis, and is therefore a promising target for exerting control over gene expression. At the translational level, the copy number of a protein can be fine-tuned by altering the interaction between the translation initiation region of an mRNA and the ribosome. These interactions can be controlled by modulating the mRNA structure using numerous approaches, including small molecule ligands, RNAs, or RNA-binding proteins. A variety of naturally occurring regulatory elements have been repurposed, facilitating advances in synthetic gene regulation strategies. The pursuit of a comprehensive understanding of mechanisms governing translation initiation provides the framework for future engineering efforts. Here we outline state-of-the-art strategies used to predictably control translation initiation in bacteria. We also discuss current limitations in the field and future goals. Due to its function as the rate-determining step, initiation is the ideal point to exert effective translation regulation. Several engineering tools are currently available to rationally design the initiation characteristics of synthetic mRNAs. However, improvements are required to increase the predictability, effectiveness, and portability of these tools. Predictable and reliable control over translation initiation will allow greater predictability when designing, constructing, and testing genetic circuits. The ability to build more complex circuits predictably will advance synthetic biology and contribute to our fundamental understanding of the underlying principles of these processes. "This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier

  14. Along the Central Dogma-Controlling Gene Expression with Small Molecules.

    Science.gov (United States)

    Schneider-Poetsch, Tilman; Yoshida, Minoru

    2018-05-04

    The central dogma of molecular biology, that DNA is transcribed into RNA and RNA translated into protein, was coined in the early days of modern biology. Back in the 1950s and 1960s, bacterial genetics first opened the way toward understanding life as the genetically encoded interaction of macromolecules. As molecular biology progressed and our knowledge of gene control deepened, it became increasingly clear that expression relied on many more levels of regulation. In the process of dissecting mechanisms of gene expression, specific small-molecule inhibitors played an important role and became valuable tools of investigation. Small molecules offer significant advantages over genetic tools, as they allow inhibiting a process at any desired time point, whereas mutating or altering the gene of an important regulator would likely result in a dead organism. With the advent of modern sequencing technology, it has become possible to monitor global cellular effects of small-molecule treatment and thereby overcome the limitations of classical biochemistry, which usually looks at a biological system in isolation. This review focuses on several molecules, especially natural products, that have played an important role in dissecting gene expression and have opened up new fields of investigation as well as clinical venues for disease treatment. Expected final online publication date for the Annual Review of Biochemistry Volume 87 is June 20, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

  15. Engineering and Functional Characterization of Fusion Genes Identifies Novel Oncogenic Drivers of Cancer. | Office of Cancer Genomics

    Science.gov (United States)

    Oncogenic gene fusions drive many human cancers, but tools to more quickly unravel their functional contributions are needed. Here we describe methodology permitting fusion gene construction for functional evaluation. Using this strategy, we engineered the known fusion oncogenes, BCR-ABL1, EML4-ALK, and ETV6-NTRK3, as well as 20 previously uncharacterized fusion genes identified in TCGA datasets.

  16. Detection of Horizontal Gene Transfers from Phylogenetic Comparisons

    Science.gov (United States)

    Pylro, Victor Satler; Vespoli, Luciano de Souza; Duarte, Gabriela Frois; Yotoko, Karla Suemy Clemente

    2012-01-01

    Bacterial phylogenies have become one of the most important challenges for microbial ecology. This field started in the mid-1970s with the aim of using the sequence of the small subunit ribosomal RNA (16S) tool to infer bacterial phylogenies. Phylogenetic hypotheses based on other sequences usually give conflicting topologies that reveal different evolutionary histories, which in some cases may be the result of horizontal gene transfer events. Currently, one of the major goals of molecular biology is to understand the role that horizontal gene transfer plays in species adaptation and evolution. In this work, we compared the phylogenetic tree based on 16S with the tree based on dszC, a gene involved in the cleavage of carbon-sulfur bonds. Bacteria of several genera perform this survival task when living in environments lacking free mineral sulfur. The biochemical pathway of the desulphurization process was extensively studied due to its economic importance, since this step is expensive and indispensable in fuel production. Our results clearly show that horizontal gene transfer events could be detected using common phylogenetic methods with gene sequences obtained from public sequence databases. PMID:22675653

  17. APPRIS 2017: principal isoforms for multiple gene sets

    Science.gov (United States)

    Rodriguez-Rivas, Juan; Di Domenico, Tomás; Vázquez, Jesús; Valencia, Alfonso

    2018-01-01

    Abstract The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. PMID:29069475

  18. Molecular Biology at the Cutting Edge: A Review on CRISPR/CAS9 Gene Editing for Undergraduates

    Science.gov (United States)

    Thurtle-Schmidt, Deborah M.; Lo, Te-Wen

    2018-01-01

    Disrupting a gene to determine its effect on an organism's phenotype is an indispensable tool in molecular biology. Such techniques are critical for understanding how a gene product contributes to the development and cellular identity of organisms. The explosion of genomic sequencing technologies combined with recent advances in genome-editing…

  19. Ribosomal protein gene knockdown causes developmental defects in zebrafish.

    Directory of Open Access Journals (Sweden)

    Tamayo Uechi

    Full Text Available The ribosomal proteins (RPs form the majority of cellular proteins and are mandatory for cellular growth. RP genes have been linked, either directly or indirectly, to various diseases in humans. Mutations in RP genes are also associated with tissue-specific phenotypes, suggesting a possible role in organ development during early embryogenesis. However, it is not yet known how mutations in a particular RP gene result in specific cellular changes, or how RP genes might contribute to human diseases. The development of animal models with defects in RP genes will be essential for studying these questions. In this study, we knocked down 21 RP genes in zebrafish by using morpholino antisense oligos to inhibit their translation. Of these 21, knockdown of 19 RPs resulted in the development of morphants with obvious deformities. Although mutations in RP genes, like other housekeeping genes, would be expected to result in nonspecific developmental defects with widespread phenotypes, we found that knockdown of some RP genes resulted in phenotypes specific to each gene, with varying degrees of abnormality in the brain, body trunk, eyes, and ears at about 25 hours post fertilization. We focused further on the organogenesis of the brain. Each knocked-down gene that affected the morphogenesis of the brain produced a different pattern of abnormality. Among the 7 RP genes whose knockdown produced severe brain phenotypes, 3 human orthologs are located within chromosomal regions that have been linked to brain-associated diseases, suggesting a possible involvement of RP genes in brain or neurological diseases. The RP gene knockdown system developed in this study could be a powerful tool for studying the roles of ribosomes in human diseases.

  20. Search for 5'-leader regulatory RNA structures based on gene annotation aided by the RiboGap database.

    Science.gov (United States)

    Naghdi, Mohammad Reza; Smail, Katia; Wang, Joy X; Wade, Fallou; Breaker, Ronald R; Perreault, Jonathan

    2017-03-15

    The discovery of noncoding RNAs (ncRNAs) and their importance for gene regulation led us to develop bioinformatics tools to pursue the discovery of novel ncRNAs. Finding ncRNAs de novo is challenging, first due to the difficulty of retrieving large numbers of sequences for given gene activities, and second due to exponential demands on calculation needed for comparative genomics on a large scale. Recently, several tools for the prediction of conserved RNA secondary structure were developed, but many of them are not designed to uncover new ncRNAs, or are too slow for conducting analyses on a large scale. Here we present various approaches using the database RiboGap as a primary tool for finding known ncRNAs and for uncovering simple sequence motifs with regulatory roles. This database also can be used to easily extract intergenic sequences of eubacteria and archaea to find conserved RNA structures upstream of given genes. We also show how to extend analysis further to choose the best candidate ncRNAs for experimental validation. Copyright © 2017 Elsevier Inc. All rights reserved.