WorldWideScience

Sample records for integrating microrna annotation

  1. miRBase: integrating microRNA annotation and deep-sequencing data.

    Science.gov (United States)

    Kozomara, Ana; Griffiths-Jones, Sam

    2011-01-01

    miRBase is the primary online repository for all microRNA sequences and annotation. The current release (miRBase 16) contains over 15,000 microRNA gene loci in over 140 species, and over 17,000 distinct mature microRNA sequences. Deep-sequencing technologies have delivered a sharp rise in the rate of novel microRNA discovery. We have mapped reads from short RNA deep-sequencing experiments to microRNAs in miRBase and developed web interfaces to view these mappings. The user can view all read data associated with a given microRNA annotation, filter reads by experiment and count, and search for microRNAs by tissue- and stage-specific expression. These data can be used as a proxy for relative expression levels of microRNA sequences, provide detailed evidence for microRNA annotations and alternative isoforms of mature microRNAs, and allow us to revisit previous annotations. miRBase is available online at: http://www.mirbase.org/.

  2. miRBase: annotating high confidence microRNAs using deep sequencing data.

    Science.gov (United States)

    Kozomara, Ana; Griffiths-Jones, Sam

    2014-01-01

    We describe an update of the miRBase database (http://www.mirbase.org/), the primary microRNA sequence repository. The latest miRBase release (v20, June 2013) contains 24 521 microRNA loci from 206 species, processed to produce 30 424 mature microRNA products. The rate of deposition of novel microRNAs and the number of researchers involved in their discovery continue to increase, driven largely by small RNA deep sequencing experiments. In the face of these increases, and a range of microRNA annotation methods and criteria, maintaining the quality of the microRNA sequence data set is a significant challenge. Here, we describe recent developments of the miRBase database to address this issue. In particular, we describe the collation and use of deep sequencing data sets to assign levels of confidence to miRBase entries. We now provide a high confidence subset of miRBase entries, based on the pattern of mapped reads. The high confidence microRNA data set is available alongside the complete microRNA collection at http://www.mirbase.org/. We also describe embedding microRNA-specific Wikipedia pages on the miRBase website to encourage the microRNA community to contribute and share textual and functional information.

  3. DeepBase: annotation and discovery of microRNAs and other noncoding RNAs from deep-sequencing data.

    Science.gov (United States)

    Yang, Jian-Hua; Qu, Liang-Hu

    2012-01-01

    Recent advances in high-throughput deep-sequencing technology have produced large numbers of short and long RNA sequences and enabled the detection and profiling of known and novel microRNAs (miRNAs) and other noncoding RNAs (ncRNAs) at unprecedented sensitivity and depth. In this chapter, we describe the use of deepBase, a database that we have developed to integrate all public deep-sequencing data and to facilitate the comprehensive annotation and discovery of miRNAs and other ncRNAs from these data. deepBase provides an integrative, interactive, and versatile web graphical interface to evaluate miRBase-annotated miRNA genes and other known ncRNAs, explores the expression patterns of miRNAs and other ncRNAs, and discovers novel miRNAs and other ncRNAs from deep-sequencing data. deepBase also provides a deepView genome browser to comparatively analyze these data at multiple levels. deepBase is available at http://deepbase.sysu.edu.cn/.

  4. Annotation of mammalian primary microRNAs

    Directory of Open Access Journals (Sweden)

    Enright Anton J

    2008-11-01

    Full Text Available Abstract Background MicroRNAs (miRNAs are important regulators of gene expression and have been implicated in development, differentiation and pathogenesis. Hundreds of miRNAs have been discovered in mammalian genomes. Approximately 50% of mammalian miRNAs are expressed from introns of protein-coding genes; the primary transcript (pri-miRNA is therefore assumed to be the host transcript. However, very little is known about the structure of pri-miRNAs expressed from intergenic regions. Here we annotate transcript boundaries of miRNAs in human, mouse and rat genomes using various transcription features. The 5' end of the pri-miRNA is predicted from transcription start sites, CpG islands and 5' CAGE tags mapped in the upstream flanking region surrounding the precursor miRNA (pre-miRNA. The 3' end of the pri-miRNA is predicted based on the mapping of polyA signals, and supported by cDNA/EST and ditags data. The predicted pri-miRNAs are also analyzed for promoter and insulator-associated regulatory regions. Results We define sets of conserved and non-conserved human, mouse and rat pre-miRNAs using bidirectional BLAST and synteny analysis. Transcription features in their flanking regions are used to demarcate the 5' and 3' boundaries of the pri-miRNAs. The lengths and boundaries of primary transcripts are highly conserved between orthologous miRNAs. A significant fraction of pri-miRNAs have lengths between 1 and 10 kb, with very few introns. We annotate a total of 59 pri-miRNA structures, which include 82 pre-miRNAs. 36 pri-miRNAs are conserved in all 3 species. In total, 18 of the confidently annotated transcripts express more than one pre-miRNA. The upstream regions of 54% of the predicted pri-miRNAs are found to be associated with promoter and insulator regulatory sequences. Conclusion Little is known about the primary transcripts of intergenic miRNAs. Using comparative data, we are able to identify the boundaries of a significant proportion of

  5. Revised annotation of Plutella xylostella microRNAs and their genome-wide target identification.

    Science.gov (United States)

    Etebari, K; Asgari, S

    2016-12-01

    The diamondback moth, Plutella xylostella, is the most devastating pest of brassica crops worldwide. Although 128 mature microRNAs (miRNAs) have been annotated from this species in miRBase, there is a need to extend and correct the current P. xylostella miRNA repertoire as a result of its recently improved genome assembly and more available small RNA sequence data. We used our new ultra-deep sequence data and bioinformatics to re-annotate the P. xylostella genome for high confidence miRNAs with the correct 5p and 3p arm features. Furthermore, all the P. xylostella annotated genes were also screened to identify potential miRNA binding sites using three target-predicting algorithms. In total, 203 mature miRNAs were annotated, including 33 novel miRNAs. We identified 7691 highly confident binding sites for 160 pxy-miRNAs. The data provided here will facilitate future studies involving functional analyses of P. xylostella miRNAs as a platform to introduce novel approaches for sustainable management of this destructive pest. © 2016 The Royal Entomological Society.

  6. Supporting Keyword Search for Image Retrieval with Integration of Probabilistic Annotation

    Directory of Open Access Journals (Sweden)

    Tie Hua Zhou

    2015-05-01

    Full Text Available The ever-increasing quantities of digital photo resources are annotated with enriching vocabularies to form semantic annotations. Photo-sharing social networks have boosted the need for efficient and intuitive querying to respond to user requirements in large-scale image collections. In order to help users formulate efficient and effective image retrieval, we present a novel integration of a probabilistic model based on keyword query architecture that models the probability distribution of image annotations: allowing users to obtain satisfactory results from image retrieval via the integration of multiple annotations. We focus on the annotation integration step in order to specify the meaning of each image annotation, thus leading to the most representative annotations of the intent of a keyword search. For this demonstration, we show how a probabilistic model has been integrated to semantic annotations to allow users to intuitively define explicit and precise keyword queries in order to retrieve satisfactory image results distributed in heterogeneous large data sources. Our experiments on SBU (collected by Stony Brook University database show that (i our integrated annotation contains higher quality representatives and semantic matches; and (ii the results indicating annotation integration can indeed improve image search result quality.

  7. Graph-based sequence annotation using a data integration approach.

    Science.gov (United States)

    Pesch, Robert; Lysenko, Artem; Hindle, Matthew; Hassani-Pak, Keywan; Thiele, Ralf; Rawlings, Christopher; Köhler, Jacob; Taubert, Jan

    2008-08-25

    The automated annotation of data from high throughput sequencing and genomics experiments is a significant challenge for bioinformatics. Most current approaches rely on sequential pipelines of gene finding and gene function prediction methods that annotate a gene with information from different reference data sources. Each function prediction method contributes evidence supporting a functional assignment. Such approaches generally ignore the links between the information in the reference datasets. These links, however, are valuable for assessing the plausibility of a function assignment and can be used to evaluate the confidence in a prediction. We are working towards a novel annotation system that uses the network of information supporting the function assignment to enrich the annotation process for use by expert curators and predicting the function of previously unannotated genes. In this paper we describe our success in the first stages of this development. We present the data integration steps that are needed to create the core database of integrated reference databases (UniProt, PFAM, PDB, GO and the pathway database Ara-Cyc) which has been established in the ONDEX data integration system. We also present a comparison between different methods for integration of GO terms as part of the function assignment pipeline and discuss the consequences of this analysis for improving the accuracy of gene function annotation. The methods and algorithms presented in this publication are an integral part of the ONDEX system which is freely available from http://ondex.sf.net/.

  8. Graph-based sequence annotation using a data integration approach

    Directory of Open Access Journals (Sweden)

    Pesch Robert

    2008-06-01

    Full Text Available The automated annotation of data from high throughput sequencing and genomics experiments is a significant challenge for bioinformatics. Most current approaches rely on sequential pipelines of gene finding and gene function prediction methods that annotate a gene with information from different reference data sources. Each function prediction method contributes evidence supporting a functional assignment. Such approaches generally ignore the links between the information in the reference datasets. These links, however, are valuable for assessing the plausibility of a function assignment and can be used to evaluate the confidence in a prediction. We are working towards a novel annotation system that uses the network of information supporting the function assignment to enrich the annotation process for use by expert curators and predicting the function of previously unannotated genes. In this paper we describe our success in the first stages of this development. We present the data integration steps that are needed to create the core database of integrated reference databases (UniProt, PFAM, PDB, GO and the pathway database Ara- Cyc which has been established in the ONDEX data integration system. We also present a comparison between different methods for integration of GO terms as part of the function assignment pipeline and discuss the consequences of this analysis for improving the accuracy of gene function annotation.

  9. Improving Microbial Genome Annotations in an Integrated Database Context

    Science.gov (United States)

    Chen, I-Min A.; Markowitz, Victor M.; Chu, Ken; Anderson, Iain; Mavromatis, Konstantinos; Kyrpides, Nikos C.; Ivanova, Natalia N.

    2013-01-01

    Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG) family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/. PMID:23424620

  10. Improving microbial genome annotations in an integrated database context.

    Directory of Open Access Journals (Sweden)

    I-Min A Chen

    Full Text Available Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/.

  11. Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data

    Directory of Open Access Journals (Sweden)

    Merchant Sabeeha S

    2011-07-01

    Full Text Available Abstract Background Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. Description The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of

  12. LocusTrack: Integrated visualization of GWAS results and genomic annotation.

    Science.gov (United States)

    Cuellar-Partida, Gabriel; Renteria, Miguel E; MacGregor, Stuart

    2015-01-01

    Genome-wide association studies (GWAS) are an important tool for the mapping of complex traits and diseases. Visual inspection of genomic annotations may be used to generate insights into the biological mechanisms underlying GWAS-identified loci. We developed LocusTrack, a web-based application that annotates and creates plots of regional GWAS results and incorporates user-specified tracks that display annotations such as linkage disequilibrium (LD), phylogenetic conservation, chromatin state, and other genomic and regulatory elements. Currently, LocusTrack can integrate annotation tracks from the UCSC genome-browser as well as from any tracks provided by the user. LocusTrack is an easy-to-use application and can be accessed at the following URL: http://gump.qimr.edu.au/general/gabrieC/LocusTrack/. Users can upload and manage GWAS results and select from and/or provide annotation tracks using simple and intuitive menus. LocusTrack scripts and associated data can be downloaded from the website and run locally.

  13. Saint: a lightweight integration environment for model annotation.

    Science.gov (United States)

    Lister, Allyson L; Pocock, Matthew; Taschuk, Morgan; Wipat, Anil

    2009-11-15

    Saint is a web application which provides a lightweight annotation integration environment for quantitative biological models. The system enables modellers to rapidly mark up models with biological information derived from a range of data sources. Saint is freely available for use on the web at http://www.cisban.ac.uk/saint. The web application is implemented in Google Web Toolkit and Tomcat, with all major browsers supported. The Java source code is freely available for download at http://saint-annotate.sourceforge.net. The Saint web server requires an installation of libSBML and has been tested on Linux (32-bit Ubuntu 8.10 and 9.04).

  14. SoFIA: a data integration framework for annotating high-throughput datasets.

    Science.gov (United States)

    Childs, Liam Harold; Mamlouk, Soulafa; Brandt, Jörgen; Sers, Christine; Leser, Ulf

    2016-09-01

    Integrating heterogeneous datasets from several sources is a common bioinformatics task that often requires implementing a complex workflow intermixing database access, data filtering, format conversions, identifier mapping, among further diverse operations. Data integration is especially important when annotating next generation sequencing data, where a multitude of diverse tools and heterogeneous databases can be used to provide a large variety of annotation for genomic locations, such a single nucleotide variants or genes. Each tool and data source is potentially useful for a given project and often more than one are used in parallel for the same purpose. However, software that always produces all available data is difficult to maintain and quickly leads to an excess of data, creating an information overload rather than the desired goal-oriented and integrated result. We present SoFIA, a framework for workflow-driven data integration with a focus on genomic annotation. SoFIA conceptualizes workflow templates as comprehensive workflows that cover as many data integration operations as possible in a given domain. However, these templates are not intended to be executed as a whole; instead, when given an integration task consisting of a set of input data and a set of desired output data, SoFIA derives a minimal workflow that completes the task. These workflows are typically fast and create exactly the information a user wants without requiring them to do any implementation work. Using a comprehensive genome annotation template, we highlight the flexibility, extensibility and power of the framework using real-life case studies. https://github.com/childsish/sofia/releases/latest under the GNU General Public License liam.childs@hu-berlin.de Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  15. Structuring osteosarcoma knowledge: an osteosarcoma-gene association database based on literature mining and manual annotation.

    Science.gov (United States)

    Poos, Kathrin; Smida, Jan; Nathrath, Michaela; Maugg, Doris; Baumhoer, Daniel; Neumann, Anna; Korsching, Eberhard

    2014-01-01

    Osteosarcoma (OS) is the most common primary bone cancer exhibiting high genomic instability. This genomic instability affects multiple genes and microRNAs to a varying extent depending on patient and tumor subtype. Massive research is ongoing to identify genes including their gene products and microRNAs that correlate with disease progression and might be used as biomarkers for OS. However, the genomic complexity hampers the identification of reliable biomarkers. Up to now, clinico-pathological factors are the key determinants to guide prognosis and therapeutic treatments. Each day, new studies about OS are published and complicate the acquisition of information to support biomarker discovery and therapeutic improvements. Thus, it is necessary to provide a structured and annotated view on the current OS knowledge that is quick and easily accessible to researchers of the field. Therefore, we developed a publicly available database and Web interface that serves as resource for OS-associated genes and microRNAs. Genes and microRNAs were collected using an automated dictionary-based gene recognition procedure followed by manual review and annotation by experts of the field. In total, 911 genes and 81 microRNAs related to 1331 PubMed abstracts were collected (last update: 29 October 2013). Users can evaluate genes and microRNAs according to their potential prognostic and therapeutic impact, the experimental procedures, the sample types, the biological contexts and microRNA target gene interactions. Additionally, a pathway enrichment analysis of the collected genes highlights different aspects of OS progression. OS requires pathways commonly deregulated in cancer but also features OS-specific alterations like deregulated osteoclast differentiation. To our knowledge, this is the first effort of an OS database containing manual reviewed and annotated up-to-date OS knowledge. It might be a useful resource especially for the bone tumor research community, as specific

  16. Profiling microRNAs in lung tissue from pigs infected with Actinobacillus pleuropneumoniae

    DEFF Research Database (Denmark)

    Podolska, Agnieszka; Anthon, Christian; Bak, Mads

    2012-01-01

    significantly up-regulated in the necrotic sample and 12 were down-regulated. The expression analysis of a number of candidates revealed microRNAs of potential importance in the innate immune response. MiR-155, a known key player in inflammation, was found expressed in both samples. Moreover, miR-664-5p, mi......R-451 and miR-15a appear as very promising candidates for microRNAs involved in response to pathogen infection. Conclusions: This is the first study revealing significant differences in composition and expression profiles of miRNAs in lungs infected with a bacterial pathogen. Our results extend......Background: MicroRNAs (miRNAs) are a class of non-protein-coding genes that play a crucial regulatory role in mammalian development and disease. Whereas a large number of miRNAs have been annotated at the structural level during the latest years, functional annotation is sparse. Actinobacillus...

  17. mESAdb: microRNA expression and sequence analysis database.

    Science.gov (United States)

    Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen

    2011-01-01

    microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data.

  18. Annotation Of Novel And Conserved MicroRNA Genes In The Build 10 Sus scrofa Reference Genome And Determination Of Their Expression Levels In Ten Different Tissues

    DEFF Research Database (Denmark)

    Thomsen, Bo; Nielsen, Mathilde; Hedegaard, Jakob

    The DNA template used in the pig genome sequencing project was provided by a Duroc pig named TJ Tabasco. In an effort to annotate microRNA (miRNA) genes in the reference genome we have conducted deep sequencing to determine the miRNA transcriptomes in ten different tissues isolated from Pinky......, a genetically identical clone of TJ Tabasco. The purpose was to generate miRNA sequences that are highly homologous to the reference genome sequence, which along with computational prediction will improve confidence in the genomic annotation of miRNA genes. Based on homology searches of the sequence data...... against miRBase, we identified more than 600 conserved known miRNA/miRNA*, which is a significant increase relative to the 211 porcine miRNA/miRNA* deposited in the current version of miRBase. Furthermore, the genome-wide transcript profiles provided important information on the relative abundance...

  19. Integrative microRNA and proteomic approaches identify novel osteoarthritis genes and their collaborative metabolic and inflammatory networks.

    Directory of Open Access Journals (Sweden)

    Dimitrios Iliopoulos

    Full Text Available BACKGROUND: Osteoarthritis is a multifactorial disease characterized by destruction of the articular cartilage due to genetic, mechanical and environmental components affecting more than 100 million individuals all over the world. Despite the high prevalence of the disease, the absence of large-scale molecular studies limits our ability to understand the molecular pathobiology of osteoathritis and identify targets for drug development. METHODOLOGY/PRINCIPAL FINDINGS: In this study we integrated genetic, bioinformatic and proteomic approaches in order to identify new genes and their collaborative networks involved in osteoarthritis pathogenesis. MicroRNA profiling of patient-derived osteoarthritic cartilage in comparison to normal cartilage, revealed a 16 microRNA osteoarthritis gene signature. Using reverse-phase protein arrays in the same tissues we detected 76 differentially expressed proteins between osteoarthritic and normal chondrocytes. Proteins such as SOX11, FGF23, KLF6, WWOX and GDF15 not implicated previously in the genesis of osteoarthritis were identified. Integration of microRNA and proteomic data with microRNA gene-target prediction algorithms, generated a potential "interactome" network consisting of 11 microRNAs and 58 proteins linked by 414 potential functional associations. Comparison of the molecular and clinical data, revealed specific microRNAs (miR-22, miR-103 and proteins (PPARA, BMP7, IL1B to be highly correlated with Body Mass Index (BMI. Experimental validation revealed that miR-22 regulated PPARA and BMP7 expression and its inhibition blocked inflammatory and catabolic changes in osteoarthritic chondrocytes. CONCLUSIONS/SIGNIFICANCE: Our findings indicate that obesity and inflammation are related to osteoarthritis, a metabolic disease affected by microRNA deregulation. Gene network approaches provide new insights for elucidating the complexity of diseases such as osteoarthritis. The integration of microRNA, proteomic

  20. Bioinformatics resource manager v2.3: an integrated software environment for systems biology with microRNA and cross-species analysis tools

    Directory of Open Access Journals (Sweden)

    Tilton Susan C

    2012-11-01

    Full Text Available Abstract Background MicroRNAs (miRNAs are noncoding RNAs that direct post-transcriptional regulation of protein coding genes. Recent studies have shown miRNAs are important for controlling many biological processes, including nervous system development, and are highly conserved across species. Given their importance, computational tools are necessary for analysis, interpretation and integration of high-throughput (HTP miRNA data in an increasing number of model species. The Bioinformatics Resource Manager (BRM v2.3 is a software environment for data management, mining, integration and functional annotation of HTP biological data. In this study, we report recent updates to BRM for miRNA data analysis and cross-species comparisons across datasets. Results BRM v2.3 has the capability to query predicted miRNA targets from multiple databases, retrieve potential regulatory miRNAs for known genes, integrate experimentally derived miRNA and mRNA datasets, perform ortholog mapping across species, and retrieve annotation and cross-reference identifiers for an expanded number of species. Here we use BRM to show that developmental exposure of zebrafish to 30 uM nicotine from 6–48 hours post fertilization (hpf results in behavioral hyperactivity in larval zebrafish and alteration of putative miRNA gene targets in whole embryos at developmental stages that encompass early neurogenesis. We show typical workflows for using BRM to integrate experimental zebrafish miRNA and mRNA microarray datasets with example retrievals for zebrafish, including pathway annotation and mapping to human ortholog. Functional analysis of differentially regulated (p Conclusions BRM provides the ability to mine complex data for identification of candidate miRNAs or pathways that drive phenotypic outcome and, therefore, is a useful hypothesis generation tool for systems biology. The miRNA workflow in BRM allows for efficient processing of multiple miRNA and mRNA datasets in a single

  1. Bioinformatics resource manager v2.3: an integrated software environment for systems biology with microRNA and cross-species analysis tools

    Science.gov (United States)

    2012-01-01

    Background MicroRNAs (miRNAs) are noncoding RNAs that direct post-transcriptional regulation of protein coding genes. Recent studies have shown miRNAs are important for controlling many biological processes, including nervous system development, and are highly conserved across species. Given their importance, computational tools are necessary for analysis, interpretation and integration of high-throughput (HTP) miRNA data in an increasing number of model species. The Bioinformatics Resource Manager (BRM) v2.3 is a software environment for data management, mining, integration and functional annotation of HTP biological data. In this study, we report recent updates to BRM for miRNA data analysis and cross-species comparisons across datasets. Results BRM v2.3 has the capability to query predicted miRNA targets from multiple databases, retrieve potential regulatory miRNAs for known genes, integrate experimentally derived miRNA and mRNA datasets, perform ortholog mapping across species, and retrieve annotation and cross-reference identifiers for an expanded number of species. Here we use BRM to show that developmental exposure of zebrafish to 30 uM nicotine from 6–48 hours post fertilization (hpf) results in behavioral hyperactivity in larval zebrafish and alteration of putative miRNA gene targets in whole embryos at developmental stages that encompass early neurogenesis. We show typical workflows for using BRM to integrate experimental zebrafish miRNA and mRNA microarray datasets with example retrievals for zebrafish, including pathway annotation and mapping to human ortholog. Functional analysis of differentially regulated (p<0.05) gene targets in BRM indicates that nicotine exposure disrupts genes involved in neurogenesis, possibly through misregulation of nicotine-sensitive miRNAs. Conclusions BRM provides the ability to mine complex data for identification of candidate miRNAs or pathways that drive phenotypic outcome and, therefore, is a useful hypothesis

  2. Virtual Ribosome - a comprehensive DNA translation tool with support for integration of sequence feature annotation

    DEFF Research Database (Denmark)

    Wernersson, Rasmus

    2006-01-01

    of alternative start codons. ( ii) Integration of sequences feature annotation - in particular, native support for working with files containing intron/ exon structure annotation. The software is available for both download and online use at http://www.cbs.dtu.dk/services/VirtualRibosome/....

  3. An integrated expression atlas of miRNAs and their promoters in human and mouse

    DEFF Research Database (Denmark)

    de Rie, Derek; Abugessaisa, Imad; Alam, Tanvir

    2017-01-01

    MicroRNAs (miRNAs) are short non-coding RNAs with key roles in cellular regulation. As part of the fifth edition of the Functional Annotation of Mammalian Genome (FANTOM5) project, we created an integrated expression atlas of miRNAs and their promoters by deep-sequencing 492 short RNA (sRNA) libr...

  4. The use of semantic similarity measures for optimally integrating heterogeneous Gene Ontology data from large scale annotation pipelines

    Directory of Open Access Journals (Sweden)

    Gaston K Mazandu

    2014-08-01

    Full Text Available With the advancement of new high throughput sequencing technologies, there has been an increase in the number of genome sequencing projects worldwide, which has yielded complete genome sequences of human, animals and plants. Subsequently, several labs have focused on genome annotation, consisting of assigning functions to gene products, mostly using Gene Ontology (GO terms. As a consequence, there is an increased heterogeneity in annotations across genomes due to different approaches used by different pipelines to infer these annotations and also due to the nature of the GO structure itself. This makes a curator's task difficult, even if they adhere to the established guidelines for assessing these protein annotations. Here we develop a genome-scale approach for integrating GO annotations from different pipelines using semantic similarity measures. We used this approach to identify inconsistencies and similarities in functional annotations between orthologs of human and Drosophila melanogaster, to assess the quality of GO annotations derived from InterPro2GO mappings compared to manually annotated GO annotations for the Drosophila melanogaster proteome from a FlyBase dataset and human, and to filter GO annotation data for these proteomes. Results obtained indicate that an efficient integration of GO annotations eliminates redundancy up to 27.08 and 22.32% in the Drosophila melanogaster and human GO annotation datasets, respectively. Furthermore, we identified lack of and missing annotations for some orthologs, and annotation mismatches between InterPro2GO and manual pipelines in these two proteomes, thus requiring further curation. This simplifies and facilitates tasks of curators in assessing protein annotations, reduces redundancy and eliminates inconsistencies in large annotation datasets for ease of comparative functional genomics.

  5. TAM: a method for enrichment and depletion analysis of a microRNA category in a list of microRNAs.

    Science.gov (United States)

    Lu, Ming; Shi, Bing; Wang, Juan; Cao, Qun; Cui, Qinghua

    2010-08-09

    MicroRNAs (miRNAs) are a class of important gene regulators. The number of identified miRNAs has been increasing dramatically in recent years. An emerging major challenge is the interpretation of the genome-scale miRNA datasets, including those derived from microarray and deep-sequencing. It is interesting and important to know the common rules or patterns behind a list of miRNAs, (i.e. the deregulated miRNAs resulted from an experiment of miRNA microarray or deep-sequencing). For the above purpose, this study presents a method and develops a tool (TAM) for annotations of meaningful human miRNAs categories. We first integrated miRNAs into various meaningful categories according to prior knowledge, such as miRNA family, miRNA cluster, miRNA function, miRNA associated diseases, and tissue specificity. Using TAM, given lists of miRNAs can be rapidly annotated and summarized according to the integrated miRNA categorical data. Moreover, given a list of miRNAs, TAM can be used to predict novel related miRNAs. Finally, we confirmed the usefulness and reliability of TAM by applying it to deregulated miRNAs in acute myocardial infarction (AMI) from two independent experiments. TAM can efficiently identify meaningful categories for given miRNAs. In addition, TAM can be used to identify novel miRNA biomarkers. TAM tool, source codes, and miRNA category data are freely available at http://cmbi.bjmu.edu.cn/tam.

  6. NAViGaTing the micronome--using multiple microRNA prediction databases to identify signalling pathway-associated microRNAs.

    Directory of Open Access Journals (Sweden)

    Elize A Shirdel

    2011-02-01

    Full Text Available MicroRNAs are a class of small RNAs known to regulate gene expression at the transcript level, the protein level, or both. Since microRNA binding is sequence-based but possibly structure-specific, work in this area has resulted in multiple databases storing predicted microRNA:target relationships computed using diverse algorithms. We integrate prediction databases, compare predictions to in vitro data, and use cross-database predictions to model the microRNA:transcript interactome--referred to as the micronome--to study microRNA involvement in well-known signalling pathways as well as associations with disease. We make this data freely available with a flexible user interface as our microRNA Data Integration Portal--mirDIP (http://ophid.utoronto.ca/mirDIP.mirDIP integrates prediction databases to elucidate accurate microRNA:target relationships. Using NAViGaTOR to produce interaction networks implicating microRNAs in literature-based, KEGG-based and Reactome-based pathways, we find these signalling pathway networks have significantly more microRNA involvement compared to chance (p<0.05, suggesting microRNAs co-target many genes in a given pathway. Further examination of the micronome shows two distinct classes of microRNAs; universe microRNAs, which are involved in many signalling pathways; and intra-pathway microRNAs, which target multiple genes within one signalling pathway. We find universe microRNAs to have more targets (p<0.0001, to be more studied (p<0.0002, and to have higher degree in the KEGG cancer pathway (p<0.0001, compared to intra-pathway microRNAs.Our pathway-based analysis of mirDIP data suggests microRNAs are involved in intra-pathway signalling. We identify two distinct classes of microRNAs, suggesting a hierarchical organization of microRNAs co-targeting genes both within and between pathways, and implying differential involvement of universe and intra-pathway microRNAs at the disease level.

  7. Large-scale identification of microRNA targets in murine Dgcr8-deficient embryonic stem cell lines.

    Directory of Open Access Journals (Sweden)

    Matthew P A Davis

    Full Text Available Small RNAs such as microRNAs play important roles in embryonic stem cell maintenance and differentiation. A broad range of microRNAs is expressed in embryonic stem cells while only a fraction of their targets have been identified. We have performed large-scale identification of embryonic stem cell microRNA targets using a murine embryonic stem cell line deficient in the expression of Dgcr8. These cells are heavily depleted for microRNAs, allowing us to reintroduce specific microRNA duplexes and identify refined target sets. We used deep sequencing of small RNAs, mRNA expression profiling and bioinformatics analysis of microRNA seed matches in 3' UTRs to identify target transcripts. Consequently, we have identified a network of microRNAs that converge on the regulation of several important cellular pathways. Additionally, our experiments have revealed a novel candidate for Dgcr8-independent microRNA genesis and highlighted the challenges currently facing miRNA annotation.

  8. BioVLAB-MMIA: a cloud environment for microRNA and mRNA integrated analysis (MMIA) on Amazon EC2.

    Science.gov (United States)

    Lee, Hyungro; Yang, Youngik; Chae, Heejoon; Nam, Seungyoon; Choi, Donghoon; Tangchaisin, Patanachai; Herath, Chathura; Marru, Suresh; Nephew, Kenneth P; Kim, Sun

    2012-09-01

    MicroRNAs, by regulating the expression of hundreds of target genes, play critical roles in developmental biology and the etiology of numerous diseases, including cancer. As a vast amount of microRNA expression profile data are now publicly available, the integration of microRNA expression data sets with gene expression profiles is a key research problem in life science research. However, the ability to conduct genome-wide microRNA-mRNA (gene) integration currently requires sophisticated, high-end informatics tools, significant expertise in bioinformatics and computer science to carry out the complex integration analysis. In addition, increased computing infrastructure capabilities are essential in order to accommodate large data sets. In this study, we have extended the BioVLAB cloud workbench to develop an environment for the integrated analysis of microRNA and mRNA expression data, named BioVLAB-MMIA. The workbench facilitates computations on the Amazon EC2 and S3 resources orchestrated by the XBaya Workflow Suite. The advantages of BioVLAB-MMIA over the web-based MMIA system include: 1) readily expanded as new computational tools become available; 2) easily modifiable by re-configuring graphic icons in the workflow; 3) on-demand cloud computing resources can be used on an "as needed" basis; 4) distributed orchestration supports complex and long running workflows asynchronously. We believe that BioVLAB-MMIA will be an easy-to-use computing environment for researchers who plan to perform genome-wide microRNA-mRNA (gene) integrated analysis tasks.

  9. Annotating the Function of the Human Genome with Gene Ontology and Disease Ontology.

    Science.gov (United States)

    Hu, Yang; Zhou, Wenyang; Ren, Jun; Dong, Lixiang; Wang, Yadong; Jin, Shuilin; Cheng, Liang

    2016-01-01

    Increasing evidences indicated that function annotation of human genome in molecular level and phenotype level is very important for systematic analysis of genes. In this study, we presented a framework named Gene2Function to annotate Gene Reference into Functions (GeneRIFs), in which each functional description of GeneRIFs could be annotated by a text mining tool Open Biomedical Annotator (OBA), and each Entrez gene could be mapped to Human Genome Organisation Gene Nomenclature Committee (HGNC) gene symbol. After annotating all the records about human genes of GeneRIFs, 288,869 associations between 13,148 mRNAs and 7,182 terms, 9,496 associations between 948 microRNAs and 533 terms, and 901 associations between 139 long noncoding RNAs (lncRNAs) and 297 terms were obtained as a comprehensive annotation resource of human genome. High consistency of term frequency of individual gene (Pearson correlation = 0.6401, p = 2.2e - 16) and gene frequency of individual term (Pearson correlation = 0.1298, p = 3.686e - 14) in GeneRIFs and GOA shows our annotation resource is very reliable.

  10. Ubiquitous Annotation Systems

    DEFF Research Database (Denmark)

    Hansen, Frank Allan

    2006-01-01

    Ubiquitous annotation systems allow users to annotate physical places, objects, and persons with digital information. Especially in the field of location based information systems much work has been done to implement adaptive and context-aware systems, but few efforts have focused on the general...... requirements for linking information to objects in both physical and digital space. This paper surveys annotation techniques from open hypermedia systems, Web based annotation systems, and mobile and augmented reality systems to illustrate different approaches to four central challenges ubiquitous annotation...... systems have to deal with: anchoring, structuring, presentation, and authoring. Through a number of examples each challenge is discussed and HyCon, a context-aware hypermedia framework developed at the University of Aarhus, Denmark, is used to illustrate an integrated approach to ubiquitous annotations...

  11. Towards the integration, annotation and association of historical microarray experiments with RNA-seq.

    Science.gov (United States)

    Chavan, Shweta S; Bauer, Michael A; Peterson, Erich A; Heuck, Christoph J; Johann, Donald J

    2013-01-01

    Transcriptome analysis by microarrays has produced important advances in biomedicine. For instance in multiple myeloma (MM), microarray approaches led to the development of an effective disease subtyping via cluster assignment, and a 70 gene risk score. Both enabled an improved molecular understanding of MM, and have provided prognostic information for the purposes of clinical management. Many researchers are now transitioning to Next Generation Sequencing (NGS) approaches and RNA-seq in particular, due to its discovery-based nature, improved sensitivity, and dynamic range. Additionally, RNA-seq allows for the analysis of gene isoforms, splice variants, and novel gene fusions. Given the voluminous amounts of historical microarray data, there is now a need to associate and integrate microarray and RNA-seq data via advanced bioinformatic approaches. Custom software was developed following a model-view-controller (MVC) approach to integrate Affymetrix probe set-IDs, and gene annotation information from a variety of sources. The tool/approach employs an assortment of strategies to integrate, cross reference, and associate microarray and RNA-seq datasets. Output from a variety of transcriptome reconstruction and quantitation tools (e.g., Cufflinks) can be directly integrated, and/or associated with Affymetrix probe set data, as well as necessary gene identifiers and/or symbols from a diversity of sources. Strategies are employed to maximize the annotation and cross referencing process. Custom gene sets (e.g., MM 70 risk score (GEP-70)) can be specified, and the tool can be directly assimilated into an RNA-seq pipeline. A novel bioinformatic approach to aid in the facilitation of both annotation and association of historic microarray data, in conjunction with richer RNA-seq data, is now assisting with the study of MM cancer biology.

  12. Identification of Novel and Conserved microRNAs in Homalodisca vitripennis, the Glassy-Winged Sharpshooter by Expression Profiling.

    Directory of Open Access Journals (Sweden)

    Raja Sekhar Nandety

    Full Text Available The glassy-winged sharpshooter (GWSS Homalodisca vitripennis (Hemiptera: Cicadellidae, is a xylem-feeding leafhopper and an important vector of the bacterium Xylella fastidiosa; the causal agent of Pierce's disease of grapevines. MicroRNAs are a class of small RNAs that play an important role in the functional development of various organisms including insects. In H. vitripennis, we identified microRNAs using high-throughput deep sequencing of adults followed by computational and manual annotation. A total of 14 novel microRNAs that are not found in the miRBase were identified from adult H. vitripennis. Conserved microRNAs were also found in our datasets. By comparison to our previously determined transcriptome sequence of H. vitripennis, we identified the potential targets of the microRNAs in the transcriptome. This microRNA profile information not only provides a more nuanced understanding of the biological and physiological mechanisms that govern gene expression in H. vitripennis, but may also lead to the identification of novel mechanisms for biorationally designed management strategies through the use of microRNAs.

  13. PANDORA: keyword-based analysis of protein sets by integration of annotation sources.

    Science.gov (United States)

    Kaplan, Noam; Vaaknin, Avishay; Linial, Michal

    2003-10-01

    Recent advances in high-throughput methods and the application of computational tools for automatic classification of proteins have made it possible to carry out large-scale proteomic analyses. Biological analysis and interpretation of sets of proteins is a time-consuming undertaking carried out manually by experts. We have developed PANDORA (Protein ANnotation Diagram ORiented Analysis), a web-based tool that provides an automatic representation of the biological knowledge associated with any set of proteins. PANDORA uses a unique approach of keyword-based graphical analysis that focuses on detecting subsets of proteins that share unique biological properties and the intersections of such sets. PANDORA currently supports SwissProt keywords, NCBI Taxonomy, InterPro entries and the hierarchical classification terms from ENZYME, SCOP and GO databases. The integrated study of several annotation sources simultaneously allows a representation of biological relations of structure, function, cellular location, taxonomy, domains and motifs. PANDORA is also integrated into the ProtoNet system, thus allowing testing thousands of automatically generated clusters. We illustrate how PANDORA enhances the biological understanding of large, non-uniform sets of proteins originating from experimental and computational sources, without the need for prior biological knowledge on individual proteins.

  14. annot8r: GO, EC and KEGG annotation of EST datasets

    Directory of Open Access Journals (Sweden)

    Schmid Ralf

    2008-04-01

    Full Text Available Abstract Background The expressed sequence tag (EST methodology is an attractive option for the generation of sequence data for species for which no completely sequenced genome is available. The annotation and comparative analysis of such datasets poses a formidable challenge for research groups that do not have the bioinformatics infrastructure of major genome sequencing centres. Therefore, there is a need for user-friendly tools to facilitate the annotation of non-model species EST datasets with well-defined ontologies that enable meaningful cross-species comparisons. To address this, we have developed annot8r, a platform for the rapid annotation of EST datasets with GO-terms, EC-numbers and KEGG-pathways. Results annot8r automatically downloads all files relevant for the annotation process and generates a reference database that stores UniProt entries, their associated Gene Ontology (GO, Enzyme Commission (EC and Kyoto Encyclopaedia of Genes and Genomes (KEGG annotation and additional relevant data. For each of GO, EC and KEGG, annot8r extracts a specific sequence subset from the UniProt dataset based on the information stored in the reference database. These three subsets are then formatted for BLAST searches. The user provides the protein or nucleotide sequences to be annotated and annot8r runs BLAST searches against these three subsets. The BLAST results are parsed and the corresponding annotations retrieved from the reference database. The annotations are saved both as flat files and also in a relational postgreSQL results database to facilitate more advanced searches within the results. annot8r is integrated with the PartiGene suite of EST analysis tools. Conclusion annot8r is a tool that assigns GO, EC and KEGG annotations for data sets resulting from EST sequencing projects both rapidly and efficiently. The benefits of an underlying relational database, flexibility and the ease of use of the program make it ideally suited for non

  15. A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data.

    Science.gov (United States)

    Lu, Qiongshi; Hu, Yiming; Sun, Jiehuan; Cheng, Yuwei; Cheung, Kei-Hoi; Zhao, Hongyu

    2015-05-27

    Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu.

  16. DAVID Knowledgebase: a gene-centered database integrating heterogeneous gene annotation resources to facilitate high-throughput gene functional analysis

    Directory of Open Access Journals (Sweden)

    Baseler Michael W

    2007-11-01

    Full Text Available Abstract Background Due to the complex and distributed nature of biological research, our current biological knowledge is spread over many redundant annotation databases maintained by many independent groups. Analysts usually need to visit many of these bioinformatics databases in order to integrate comprehensive annotation information for their genes, which becomes one of the bottlenecks, particularly for the analytic task associated with a large gene list. Thus, a highly centralized and ready-to-use gene-annotation knowledgebase is in demand for high throughput gene functional analysis. Description The DAVID Knowledgebase is built around the DAVID Gene Concept, a single-linkage method to agglomerate tens of millions of gene/protein identifiers from a variety of public genomic resources into DAVID gene clusters. The grouping of such identifiers improves the cross-reference capability, particularly across NCBI and UniProt systems, enabling more than 40 publicly available functional annotation sources to be comprehensively integrated and centralized by the DAVID gene clusters. The simple, pair-wise, text format files which make up the DAVID Knowledgebase are freely downloadable for various data analysis uses. In addition, a well organized web interface allows users to query different types of heterogeneous annotations in a high-throughput manner. Conclusion The DAVID Knowledgebase is designed to facilitate high throughput gene functional analysis. For a given gene list, it not only provides the quick accessibility to a wide range of heterogeneous annotation data in a centralized location, but also enriches the level of biological information for an individual gene. Moreover, the entire DAVID Knowledgebase is freely downloadable or searchable at http://david.abcc.ncifcrf.gov/knowledgebase/.

  17. Improving integrative searching of systems chemical biology data using semantic annotation.

    Science.gov (United States)

    Chen, Bin; Ding, Ying; Wild, David J

    2012-03-08

    Systems chemical biology and chemogenomics are considered critical, integrative disciplines in modern biomedical research, but require data mining of large, integrated, heterogeneous datasets from chemistry and biology. We previously developed an RDF-based resource called Chem2Bio2RDF that enabled querying of such data using the SPARQL query language. Whilst this work has proved useful in its own right as one of the first major resources in these disciplines, its utility could be greatly improved by the application of an ontology for annotation of the nodes and edges in the RDF graph, enabling a much richer range of semantic queries to be issued. We developed a generalized chemogenomics and systems chemical biology OWL ontology called Chem2Bio2OWL that describes the semantics of chemical compounds, drugs, protein targets, pathways, genes, diseases and side-effects, and the relationships between them. The ontology also includes data provenance. We used it to annotate our Chem2Bio2RDF dataset, making it a rich semantic resource. Through a series of scientific case studies we demonstrate how this (i) simplifies the process of building SPARQL queries, (ii) enables useful new kinds of queries on the data and (iii) makes possible intelligent reasoning and semantic graph mining in chemogenomics and systems chemical biology. Chem2Bio2OWL is available at http://chem2bio2rdf.org/owl. The document is available at http://chem2bio2owl.wikispaces.com.

  18. DeAnnIso: a tool for online detection and annotation of isomiRs from small RNA sequencing data.

    Science.gov (United States)

    Zhang, Yuanwei; Zang, Qiguang; Zhang, Huan; Ban, Rongjun; Yang, Yifan; Iqbal, Furhan; Li, Ao; Shi, Qinghua

    2016-07-08

    Small RNA (sRNA) Sequencing technology has revealed that microRNAs (miRNAs) are capable of exhibiting frequent variations from their canonical sequences, generating multiple variants: the isoforms of miRNAs (isomiRs). However, integrated tool to precisely detect and systematically annotate isomiRs from sRNA sequencing data is still in great demand. Here, we present an online tool, DeAnnIso (Detection and Annotation of IsomiRs from sRNA sequencing data). DeAnnIso can detect all the isomiRs in an uploaded sample, and can extract the differentially expressing isomiRs from paired or multiple samples. Once the isomiRs detection is accomplished, detailed annotation information, including isomiRs expression, isomiRs classification, SNPs in miRNAs and tissue specific isomiR expression are provided to users. Furthermore, DeAnnIso provides a comprehensive module of target analysis and enrichment analysis for the selected isomiRs. Taken together, DeAnnIso is convenient for users to screen for isomiRs of their interest and useful for further functional studies. The server is implemented in PHP + Perl + R and available to all users for free at: http://mcg.ustc.edu.cn/bsc/deanniso/ and http://mcg2.ustc.edu.cn/bsc/deanniso/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. OAHG: an integrated resource for annotating human genes with multi-level ontologies.

    Science.gov (United States)

    Cheng, Liang; Sun, Jie; Xu, Wanying; Dong, Lixiang; Hu, Yang; Zhou, Meng

    2016-10-05

    OAHG, an integrated resource, aims to establish a comprehensive functional annotation resource for human protein-coding genes (PCGs), miRNAs, and lncRNAs by multi-level ontologies involving Gene Ontology (GO), Disease Ontology (DO), and Human Phenotype Ontology (HPO). Many previous studies have focused on inferring putative properties and biological functions of PCGs and non-coding RNA genes from different perspectives. During the past several decades, a few of databases have been designed to annotate the functions of PCGs, miRNAs, and lncRNAs, respectively. A part of functional descriptions in these databases were mapped to standardize terminologies, such as GO, which could be helpful to do further analysis. Despite these developments, there is no comprehensive resource recording the function of these three important types of genes. The current version of OAHG, release 1.0 (Jun 2016), integrates three ontologies involving GO, DO, and HPO, six gene functional databases and two interaction databases. Currently, OAHG contains 1,434,694 entries involving 16,929 PCGs, 637 miRNAs, 193 lncRNAs, and 24,894 terms of ontologies. During the performance evaluation, OAHG shows the consistencies with existing gene interactions and the structure of ontology. For example, terms with more similar structure could be associated with more associated genes (Pearson correlation γ 2  = 0.2428, p < 2.2e-16).

  20. Snap: an integrated SNP annotation platform

    DEFF Research Database (Denmark)

    Li, Shengting; Ma, Lijia; Li, Heng

    2007-01-01

    Snap (Single Nucleotide Polymorphism Annotation Platform) is a server designed to comprehensively analyze single genes and relationships between genes basing on SNPs in the human genome. The aim of the platform is to facilitate the study of SNP finding and analysis within the framework of medical...

  1. Improving integrative searching of systems chemical biology data using semantic annotation

    Directory of Open Access Journals (Sweden)

    Chen Bin

    2012-03-01

    Full Text Available Abstract Background Systems chemical biology and chemogenomics are considered critical, integrative disciplines in modern biomedical research, but require data mining of large, integrated, heterogeneous datasets from chemistry and biology. We previously developed an RDF-based resource called Chem2Bio2RDF that enabled querying of such data using the SPARQL query language. Whilst this work has proved useful in its own right as one of the first major resources in these disciplines, its utility could be greatly improved by the application of an ontology for annotation of the nodes and edges in the RDF graph, enabling a much richer range of semantic queries to be issued. Results We developed a generalized chemogenomics and systems chemical biology OWL ontology called Chem2Bio2OWL that describes the semantics of chemical compounds, drugs, protein targets, pathways, genes, diseases and side-effects, and the relationships between them. The ontology also includes data provenance. We used it to annotate our Chem2Bio2RDF dataset, making it a rich semantic resource. Through a series of scientific case studies we demonstrate how this (i simplifies the process of building SPARQL queries, (ii enables useful new kinds of queries on the data and (iii makes possible intelligent reasoning and semantic graph mining in chemogenomics and systems chemical biology. Availability Chem2Bio2OWL is available at http://chem2bio2rdf.org/owl. The document is available at http://chem2bio2owl.wikispaces.com.

  2. Current and future trends in marine image annotation software

    Science.gov (United States)

    Gomes-Pereira, Jose Nuno; Auger, Vincent; Beisiegel, Kolja; Benjamin, Robert; Bergmann, Melanie; Bowden, David; Buhl-Mortensen, Pal; De Leo, Fabio C.; Dionísio, Gisela; Durden, Jennifer M.; Edwards, Luke; Friedman, Ariell; Greinert, Jens; Jacobsen-Stout, Nancy; Lerner, Steve; Leslie, Murray; Nattkemper, Tim W.; Sameoto, Jessica A.; Schoening, Timm; Schouten, Ronald; Seager, James; Singh, Hanumant; Soubigou, Olivier; Tojeira, Inês; van den Beld, Inge; Dias, Frederico; Tempera, Fernando; Santos, Ricardo S.

    2016-12-01

    Given the need to describe, analyze and index large quantities of marine imagery data for exploration and monitoring activities, a range of specialized image annotation tools have been developed worldwide. Image annotation - the process of transposing objects or events represented in a video or still image to the semantic level, may involve human interactions and computer-assisted solutions. Marine image annotation software (MIAS) have enabled over 500 publications to date. We review the functioning, application trends and developments, by comparing general and advanced features of 23 different tools utilized in underwater image analysis. MIAS requiring human input are basically a graphical user interface, with a video player or image browser that recognizes a specific time code or image code, allowing to log events in a time-stamped (and/or geo-referenced) manner. MIAS differ from similar software by the capability of integrating data associated to video collection, the most simple being the position coordinates of the video recording platform. MIAS have three main characteristics: annotating events in real time, posteriorly to annotation and interact with a database. These range from simple annotation interfaces, to full onboard data management systems, with a variety of toolboxes. Advanced packages allow to input and display data from multiple sensors or multiple annotators via intranet or internet. Posterior human-mediated annotation often include tools for data display and image analysis, e.g. length, area, image segmentation, point count; and in a few cases the possibility of browsing and editing previous dive logs or to analyze the annotations. The interaction with a database allows the automatic integration of annotations from different surveys, repeated annotation and collaborative annotation of shared datasets, browsing and querying of data. Progress in the field of automated annotation is mostly in post processing, for stable platforms or still images

  3. A semi-automatic annotation tool for cooking video

    Science.gov (United States)

    Bianco, Simone; Ciocca, Gianluigi; Napoletano, Paolo; Schettini, Raimondo; Margherita, Roberto; Marini, Gianluca; Gianforme, Giorgio; Pantaleo, Giuseppe

    2013-03-01

    In order to create a cooking assistant application to guide the users in the preparation of the dishes relevant to their profile diets and food preferences, it is necessary to accurately annotate the video recipes, identifying and tracking the foods of the cook. These videos present particular annotation challenges such as frequent occlusions, food appearance changes, etc. Manually annotate the videos is a time-consuming, tedious and error-prone task. Fully automatic tools that integrate computer vision algorithms to extract and identify the elements of interest are not error free, and false positive and false negative detections need to be corrected in a post-processing stage. We present an interactive, semi-automatic tool for the annotation of cooking videos that integrates computer vision techniques under the supervision of the user. The annotation accuracy is increased with respect to completely automatic tools and the human effort is reduced with respect to completely manual ones. The performance and usability of the proposed tool are evaluated on the basis of the time and effort required to annotate the same video sequences.

  4. Annotating temporal information in clinical narratives.

    Science.gov (United States)

    Sun, Weiyi; Rumshisky, Anna; Uzuner, Ozlem

    2013-12-01

    Temporal information in clinical narratives plays an important role in patients' diagnosis, treatment and prognosis. In order to represent narrative information accurately, medical natural language processing (MLP) systems need to correctly identify and interpret temporal information. To promote research in this area, the Informatics for Integrating Biology and the Bedside (i2b2) project developed a temporally annotated corpus of clinical narratives. This corpus contains 310 de-identified discharge summaries, with annotations of clinical events, temporal expressions and temporal relations. This paper describes the process followed for the development of this corpus and discusses annotation guideline development, annotation methodology, and corpus quality. Copyright © 2013 Elsevier Inc. All rights reserved.

  5. BOWiki: an ontology-based wiki for annotation of data and integration of knowledge in biology

    Directory of Open Access Journals (Sweden)

    Gregorio Sergio E

    2009-05-01

    Full Text Available Abstract Motivation Ontology development and the annotation of biological data using ontologies are time-consuming exercises that currently require input from expert curators. Open, collaborative platforms for biological data annotation enable the wider scientific community to become involved in developing and maintaining such resources. However, this openness raises concerns regarding the quality and correctness of the information added to these knowledge bases. The combination of a collaborative web-based platform with logic-based approaches and Semantic Web technology can be used to address some of these challenges and concerns. Results We have developed the BOWiki, a web-based system that includes a biological core ontology. The core ontology provides background knowledge about biological types and relations. Against this background, an automated reasoner assesses the consistency of new information added to the knowledge base. The system provides a platform for research communities to integrate information and annotate data collaboratively. Availability The BOWiki and supplementary material is available at http://www.bowiki.net/. The source code is available under the GNU GPL from http://onto.eva.mpg.de/trac/BoWiki.

  6. Ontological Annotation with WordNet

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob; Hohimer, Ryan E.; White, Amanda M.

    2006-06-06

    Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

  7. miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments.

    Science.gov (United States)

    Hackenberg, Michael; Sturm, Martin; Langenberger, David; Falcón-Pérez, Juan Manuel; Aransay, Ana M

    2009-07-01

    Next-generation sequencing allows now the sequencing of small RNA molecules and the estimation of their expression levels. Consequently, there will be a high demand of bioinformatics tools to cope with the several gigabytes of sequence data generated in each single deep-sequencing experiment. Given this scene, we developed miRanalyzer, a web server tool for the analysis of deep-sequencing experiments for small RNAs. The web server tool requires a simple input file containing a list of unique reads and its copy numbers (expression levels). Using these data, miRanalyzer (i) detects all known microRNA sequences annotated in miRBase, (ii) finds all perfect matches against other libraries of transcribed sequences and (iii) predicts new microRNAs. The prediction of new microRNAs is an especially important point as there are many species with very few known microRNAs. Therefore, we implemented a highly accurate machine learning algorithm for the prediction of new microRNAs that reaches AUC values of 97.9% and recall values of up to 75% on unseen data. The web tool summarizes all the described steps in a single output page, which provides a comprehensive overview of the analysis, adding links to more detailed output pages for each analysis module. miRanalyzer is available at http://web.bioinformatics.cicbiogune.es/microRNA/.

  8. Pipeline to upgrade the genome annotations

    Directory of Open Access Journals (Sweden)

    Lijin K. Gopi

    2017-12-01

    Full Text Available Current era of functional genomics is enriched with good quality draft genomes and annotations for many thousands of species and varieties with the support of the advancements in the next generation sequencing technologies (NGS. Around 25,250 genomes, of the organisms from various kingdoms, are submitted in the NCBI genome resource till date. Each of these genomes was annotated using various tools and knowledge-bases that were available during the period of the annotation. It is obvious that these annotations will be improved if the same genome is annotated using improved tools and knowledge-bases. Here we present a new genome annotation pipeline, strengthened with various tools and knowledge-bases that are capable of producing better quality annotations from the consensus of the predictions from different tools. This resource also perform various additional annotations, apart from the usual gene predictions and functional annotations, which involve SSRs, novel repeats, paralogs, proteins with transmembrane helices, signal peptides etc. This new annotation resource is trained to evaluate and integrate all the predictions together to resolve the overlaps and ambiguities of the boundaries. One of the important highlights of this resource is the capability of predicting the phylogenetic relations of the repeats using the evolutionary trace analysis and orthologous gene clusters. We also present a case study, of the pipeline, in which we upgrade the genome annotation of Nelumbo nucifera (sacred lotus. It is demonstrated that this resource is capable of producing an improved annotation for a better understanding of the biology of various organisms.

  9. MimoSA: a system for minimotif annotation

    Directory of Open Access Journals (Sweden)

    Kundeti Vamsi

    2010-06-01

    Full Text Available Abstract Background Minimotifs are short peptide sequences within one protein, which are recognized by other proteins or molecules. While there are now several minimotif databases, they are incomplete. There are reports of many minimotifs in the primary literature, which have yet to be annotated, while entirely novel minimotifs continue to be published on a weekly basis. Our recently proposed function and sequence syntax for minimotifs enables us to build a general tool that will facilitate structured annotation and management of minimotif data from the biomedical literature. Results We have built the MimoSA application for minimotif annotation. The application supports management of the Minimotif Miner database, literature tracking, and annotation of new minimotifs. MimoSA enables the visualization, organization, selection and editing functions of minimotifs and their attributes in the MnM database. For the literature components, Mimosa provides paper status tracking and scoring of papers for annotation through a freely available machine learning approach, which is based on word correlation. The paper scoring algorithm is also available as a separate program, TextMine. Form-driven annotation of minimotif attributes enables entry of new minimotifs into the MnM database. Several supporting features increase the efficiency of annotation. The layered architecture of MimoSA allows for extensibility by separating the functions of paper scoring, minimotif visualization, and database management. MimoSA is readily adaptable to other annotation efforts that manually curate literature into a MySQL database. Conclusions MimoSA is an extensible application that facilitates minimotif annotation and integrates with the Minimotif Miner database. We have built MimoSA as an application that integrates dynamic abstract scoring with a high performance relational model of minimotif syntax. MimoSA's TextMine, an efficient paper-scoring algorithm, can be used to

  10. WormBase: Annotating many nematode genomes.

    Science.gov (United States)

    Howe, Kevin; Davis, Paul; Paulini, Michael; Tuli, Mary Ann; Williams, Gary; Yook, Karen; Durbin, Richard; Kersey, Paul; Sternberg, Paul W

    2012-01-01

    WormBase (www.wormbase.org) has been serving the scientific community for over 11 years as the central repository for genomic and genetic information for the soil nematode Caenorhabditis elegans. The resource has evolved from its beginnings as a database housing the genomic sequence and genetic and physical maps of a single species, and now represents the breadth and diversity of nematode research, currently serving genome sequence and annotation for around 20 nematodes. In this article, we focus on WormBase's role of genome sequence annotation, describing how we annotate and integrate data from a growing collection of nematode species and strains. We also review our approaches to sequence curation, and discuss the impact on annotation quality of large functional genomics projects such as modENCODE.

  11. SeedVicious: Analysis of microRNA target and near-target sites.

    Science.gov (United States)

    Marco, Antonio

    2018-01-01

    Here I describe seedVicious, a versatile microRNA target site prediction software that can be easily fitted into annotation pipelines and run over custom datasets. SeedVicious finds microRNA canonical sites plus other, less efficient, target sites. Among other novel features, seedVicious can compute evolutionary gains/losses of target sites using maximum parsimony, and also detect near-target sites, which have one nucleotide different from a canonical site. Near-target sites are important to study population variation in microRNA regulation. Some analyses suggest that near-target sites may also be functional sites, although there is no conclusive evidence for that, and they may actually be target alleles segregating in a population. SeedVicious does not aim to outperform but to complement existing microRNA prediction tools. For instance, the precision of TargetScan is almost doubled (from 11% to ~20%) when we filter predictions by the distance between target sites using this program. Interestingly, two adjacent canonical target sites are more likely to be present in bona fide target transcripts than pairs of target sites at slightly longer distances. The software is written in Perl and runs on 64-bit Unix computers (Linux and MacOS X). Users with no computing experience can also run the program in a dedicated web-server by uploading custom data, or browse pre-computed predictions. SeedVicious and its associated web-server and database (SeedBank) are distributed under the GPL/GNU license.

  12. Isolation of microRNA targets using biotinylated synthetic microRNAs

    DEFF Research Database (Denmark)

    Ørom, Ulf Andersson; Lund, Anders H

    2007-01-01

    MicroRNAs are small regulatory RNAs found in multicellular organisms where they post-transcriptionally regulate gene expression. In animals, microRNAs bind mRNAs via incomplete base pairings making the identification of microRNA targets inherently difficult. Here, we present a detailed method...... for experimental identification of microRNA targets based on affinity purification of tagged microRNAs associated with their targets. Udgivelsesdato: 2007-Oct...

  13. Integration of hormonal signaling networks and mobile microRNAs is required for vascular patterning in Arabidopsis roots

    KAUST Repository

    Muraro, D.

    2013-12-31

    As multicellular organisms grow, positional information is continually needed to regulate the pattern in which cells are arranged. In the Arabidopsis root, most cell types are organized in a radially symmetric pattern; however, a symmetry-breaking event generates bisymmetric auxin and cytokinin signaling domains in the stele. Bidirectional cross-talk between the stele and the surrounding tissues involving a mobile transcription factor, SHORT ROOT (SHR), and mobile microRNA species also determines vascular pattern, but it is currently unclear how these signals integrate. We use a multicellular model to determine a minimal set of components necessary for maintaining a stable vascular pattern. Simulations perturbing the signaling network show that, in addition to the mutually inhibitory interaction between auxin and cytokinin, signaling through SHR, microRNA165/6, and PHABULOSA is required to maintain a stable bisymmetric pattern. We have verified this prediction by observing loss of bisymmetry in shr mutants. The model reveals the importance of several features of the network, namely the mutual degradation of microRNA165/6 and PHABULOSA and the existence of an additional negative regulator of cytokinin signaling. These components form a plausible mechanism capable of patterning vascular tissues in the absence of positional inputs provided by the transport of hormones from the shoot.

  14. Integration of hormonal signaling networks and mobile microRNAs is required for vascular patterning in Arabidopsis roots

    KAUST Repository

    Muraro, D.; Mellor, N.; Pound, M. P.; Help, H.; Lucas, M.; Chopard, J.; Byrne, H. M.; Godin, C.; Hodgman, T. C.; King, J. R.; Pridmore, T. P.; Helariutta, Y.; Bennett, M. J.; Bishopp, A.

    2013-01-01

    As multicellular organisms grow, positional information is continually needed to regulate the pattern in which cells are arranged. In the Arabidopsis root, most cell types are organized in a radially symmetric pattern; however, a symmetry-breaking event generates bisymmetric auxin and cytokinin signaling domains in the stele. Bidirectional cross-talk between the stele and the surrounding tissues involving a mobile transcription factor, SHORT ROOT (SHR), and mobile microRNA species also determines vascular pattern, but it is currently unclear how these signals integrate. We use a multicellular model to determine a minimal set of components necessary for maintaining a stable vascular pattern. Simulations perturbing the signaling network show that, in addition to the mutually inhibitory interaction between auxin and cytokinin, signaling through SHR, microRNA165/6, and PHABULOSA is required to maintain a stable bisymmetric pattern. We have verified this prediction by observing loss of bisymmetry in shr mutants. The model reveals the importance of several features of the network, namely the mutual degradation of microRNA165/6 and PHABULOSA and the existence of an additional negative regulator of cytokinin signaling. These components form a plausible mechanism capable of patterning vascular tissues in the absence of positional inputs provided by the transport of hormones from the shoot.

  15. Automating Ontological Annotation with WordNet

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob L.; Hohimer, Ryan E.; White, Amanda M.

    2006-01-22

    Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

  16. An open annotation ontology for science on web 3.0.

    Science.gov (United States)

    Ciccarese, Paolo; Ocana, Marco; Garcia Castro, Leyla Jael; Das, Sudeshna; Clark, Tim

    2011-05-17

    There is currently a gap between the rich and expressive collection of published biomedical ontologies, and the natural language expression of biomedical papers consumed on a daily basis by scientific researchers. The purpose of this paper is to provide an open, shareable structure for dynamic integration of biomedical domain ontologies with the scientific document, in the form of an Annotation Ontology (AO), thus closing this gap and enabling application of formal biomedical ontologies directly to the literature as it emerges. Initial requirements for AO were elicited by analysis of integration needs between biomedical web communities, and of needs for representing and integrating results of biomedical text mining. Analysis of strengths and weaknesses of previous efforts in this area was also performed. A series of increasingly refined annotation tools were then developed along with a metadata model in OWL, and deployed for feedback and additional requirements the ontology to users at a major pharmaceutical company and a major academic center. Further requirements and critiques of the model were also elicited through discussions with many colleagues and incorporated into the work. This paper presents Annotation Ontology (AO), an open ontology in OWL-DL for annotating scientific documents on the web. AO supports both human and algorithmic content annotation. It enables "stand-off" or independent metadata anchored to specific positions in a web document by any one of several methods. In AO, the document may be annotated but is not required to be under update control of the annotator. AO contains a provenance model to support versioning, and a set model for specifying groups and containers of annotation. AO is freely available under open source license at http://purl.org/ao/, and extensive documentation including screencasts is available on AO's Google Code page: http://code.google.com/p/annotation-ontology/ . The Annotation Ontology meets critical requirements for

  17. Annotation-based enrichment of Digital Objects using open-source frameworks

    Directory of Open Access Journals (Sweden)

    Marcus Emmanuel Barnes

    2017-07-01

    Full Text Available The W3C Web Annotation Data Model, Protocol, and Vocabulary unify approaches to annotations across the web, enabling their aggregation, discovery and persistence over time. In addition, new javascript libraries provide the ability for users to annotate multi-format content. In this paper, we describe how we have leveraged these developments to provide annotation features alongside Islandora’s existing preservation, access, and management capabilities. We also discuss our experience developing with the Web Annotation Model as an open web architecture standard, as well as our approach to integrating mature external annotation libraries. The resulting software (the Web Annotation Utility Module for Islandora accommodates annotation across multiple formats. This solution can be used in various digital scholarship contexts.

  18. IIS--Integrated Interactome System: a web-based platform for the annotation, analysis and visualization of protein-metabolite-gene-drug interactions by integrating a variety of data sources and tools.

    Science.gov (United States)

    Carazzolle, Marcelo Falsarella; de Carvalho, Lucas Miguel; Slepicka, Hugo Henrique; Vidal, Ramon Oliveira; Pereira, Gonçalo Amarante Guimarães; Kobarg, Jörg; Meirelles, Gabriela Vaz

    2014-01-01

    High-throughput screening of physical, genetic and chemical-genetic interactions brings important perspectives in the Systems Biology field, as the analysis of these interactions provides new insights into protein/gene function, cellular metabolic variations and the validation of therapeutic targets and drug design. However, such analysis depends on a pipeline connecting different tools that can automatically integrate data from diverse sources and result in a more comprehensive dataset that can be properly interpreted. We describe here the Integrated Interactome System (IIS), an integrative platform with a web-based interface for the annotation, analysis and visualization of the interaction profiles of proteins/genes, metabolites and drugs of interest. IIS works in four connected modules: (i) Submission module, which receives raw data derived from Sanger sequencing (e.g. two-hybrid system); (ii) Search module, which enables the user to search for the processed reads to be assembled into contigs/singlets, or for lists of proteins/genes, metabolites and drugs of interest, and add them to the project; (iii) Annotation module, which assigns annotations from several databases for the contigs/singlets or lists of proteins/genes, generating tables with automatic annotation that can be manually curated; and (iv) Interactome module, which maps the contigs/singlets or the uploaded lists to entries in our integrated database, building networks that gather novel identified interactions, protein and metabolite expression/concentration levels, subcellular localization and computed topological metrics, GO biological processes and KEGG pathways enrichment. This module generates a XGMML file that can be imported into Cytoscape or be visualized directly on the web. We have developed IIS by the integration of diverse databases following the need of appropriate tools for a systematic analysis of physical, genetic and chemical-genetic interactions. IIS was validated with yeast two

  19. Integrating UIMA annotators in a web-based text processing framework.

    Science.gov (United States)

    Chen, Xiang; Arnold, Corey W

    2013-01-01

    The Unstructured Information Management Architecture (UIMA) [1] framework is a growing platform for natural language processing (NLP) applications. However, such applications may be difficult for non-technical users deploy. This project presents a web-based framework that wraps UIMA-based annotator systems into a graphical user interface for researchers and clinicians, and a web service for developers. An annotator that extracts data elements from lung cancer radiology reports is presented to illustrate the use of the system. Annotation results from the web system can be exported to multiple formats for users to utilize in other aspects of their research and workflow. This project demonstrates the benefits of a lay-user interface for complex NLP applications. Efforts such as this can lead to increased interest and support for NLP work in the clinical domain.

  20. The integration of a metadata generation framework in a music annotation workflow

    OpenAIRE

    Corthaut, Nik; Lippens, Stefaan; Govaerts, Sten; Duval, Erik; Martens, Jean-Pierre

    2009-01-01

    In the MuziK project we try to automate the typically hard task of annotating music files manually. This annotation is used for music recommendation and for automated playlist creation. The music experts of Aristo Music (http://www.aristomusic.com) defined the data fields. High quality annotations are required since the results, playlists, are used in commercial live settings and the cost of a wrong selection is high [1].

  1. PCAS – a precomputed proteome annotation database resource

    Directory of Open Access Journals (Sweden)

    Luo Jingchu

    2003-11-01

    Full Text Available Abstract Background Many model proteomes or "complete" sets of proteins of given organisms are now publicly available. Much effort has been invested in computational annotation of those "draft" proteomes. Motif or domain based algorithms play a pivotal role in functional classification of proteins. Employing most available computational algorithms, mainly motif or domain recognition algorithms, we set up to develop an online proteome annotation system with integrated proteome annotation data to complement existing resources. Results We report here the development of PCAS (ProteinCentric Annotation System as an online resource of pre-computed proteome annotation data. We applied most available motif or domain databases and their analysis methods, including hmmpfam search of HMMs in Pfam, SMART and TIGRFAM, RPS-PSIBLAST search of PSSMs in CDD, pfscan of PROSITE patterns and profiles, as well as PSI-BLAST search of SUPERFAMILY PSSMs. In addition, signal peptide and TM are predicted using SignalP and TMHMM respectively. We mapped SUPERFAMILY and COGs to InterPro, so the motif or domain databases are integrated through InterPro. PCAS displays table summaries of pre-computed data and a graphical presentation of motifs or domains relative to the protein. As of now, PCAS contains human IPI, mouse IPI, and rat IPI, A. thaliana, C. elegans, D. melanogaster, S. cerevisiae, and S. pombe proteome. PCAS is available at http://pak.cbi.pku.edu.cn/proteome/gca.php Conclusion PCAS gives better annotation coverage for model proteomes by employing a wider collection of available algorithms. Besides presenting the most confident annotation data, PCAS also allows customized query so users can inspect statistically less significant boundary information as well. Therefore, besides providing general annotation information, PCAS could be used as a discovery platform. We plan to update PCAS twice a year. We will upgrade PCAS when new proteome annotation algorithms

  2. JGI Plant Genomics Gene Annotation Pipeline

    Energy Technology Data Exchange (ETDEWEB)

    Shu, Shengqiang; Rokhsar, Dan; Goodstein, David; Hayes, David; Mitros, Therese

    2014-07-14

    Plant genomes vary in size and are highly complex with a high amount of repeats, genome duplication and tandem duplication. Gene encodes a wealth of information useful in studying organism and it is critical to have high quality and stable gene annotation. Thanks to advancement of sequencing technology, many plant species genomes have been sequenced and transcriptomes are also sequenced. To use these vastly large amounts of sequence data to make gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward this aim with aid of a RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.

  3. NoGOA: predicting noisy GO annotations using evidences and sparse representation.

    Science.gov (United States)

    Yu, Guoxian; Lu, Chang; Wang, Jun

    2017-07-21

    Gene Ontology (GO) is a community effort to represent functional features of gene products. GO annotations (GOA) provide functional associations between GO terms and gene products. Due to resources limitation, only a small portion of annotations are manually checked by curators, and the others are electronically inferred. Although quality control techniques have been applied to ensure the quality of annotations, the community consistently report that there are still considerable noisy (or incorrect) annotations. Given the wide application of annotations, however, how to identify noisy annotations is an important but yet seldom studied open problem. We introduce a novel approach called NoGOA to predict noisy annotations. NoGOA applies sparse representation on the gene-term association matrix to reduce the impact of noisy annotations, and takes advantage of sparse representation coefficients to measure the semantic similarity between genes. Secondly, it preliminarily predicts noisy annotations of a gene based on aggregated votes from semantic neighborhood genes of that gene. Next, NoGOA estimates the ratio of noisy annotations for each evidence code based on direct annotations in GOA files archived on different periods, and then weights entries of the association matrix via estimated ratios and propagates weights to ancestors of direct annotations using GO hierarchy. Finally, it integrates evidence-weighted association matrix and aggregated votes to predict noisy annotations. Experiments on archived GOA files of six model species (H. sapiens, A. thaliana, S. cerevisiae, G. gallus, B. Taurus and M. musculus) demonstrate that NoGOA achieves significantly better results than other related methods and removing noisy annotations improves the performance of gene function prediction. The comparative study justifies the effectiveness of integrating evidence codes with sparse representation for predicting noisy GO annotations. Codes and datasets are available at http://mlda.swu.edu.cn/codes.php?name=NoGOA .

  4. Detection of plant microRNAs in honey.

    Directory of Open Access Journals (Sweden)

    Angelo Gismondi

    Full Text Available For the first time in the literature, our group has managed to demonstrate the existence of plant RNAs in honey samples. In particular, in our work, different RNA extraction procedures were performed in order to identify a purification method for nucleic acids from honey. Purity, stability and integrity of the RNA samples were evaluated by spectrophotometric, PCR and electrophoretic analyses. Among all honey RNAs, we specifically revealed the presence of both plastidial and nuclear plant transcripts: RuBisCO large subunit mRNA, maturase K messenger and 18S ribosomal RNA. Surprisingly, nine plant microRNAs (miR482b, miR156a, miR396c, miR171a, miR858, miR162a, miR159c, miR395a and miR2118a were also detected and quantified by qPCR. In this context, a comparison between microRNA content in plant samples (i.e. flowers, nectars and their derivative honeys was carried out. In addition, peculiar microRNA profiles were also identified in six different monofloral honeys. Finally, the same plant microRNAs were investigated in other plant food products: tea, cocoa and coffee. Since plant microRNAs introduced by diet have been recently recognized as being able to modulate the consumer's gene expression, our research suggests that honey's benefits for human health may be strongly correlated to the bioactivity of plant microRNAs contained in this matrix.

  5. Bridging the phenotypic and genetic data useful for integrated breeding through a data annotation using the Crop Ontology developed by the crop communities of practice

    Directory of Open Access Journals (Sweden)

    Rosemary eShrestha

    2012-08-01

    Full Text Available The Crop Ontology (CO of the Generation Challenge Program (GCP (http://cropontology.org/ is developed for the Integrated Breeding Platform (https://www.integratedbreeding.net/ by several centers of The Consultative Group on International Agricultural Research (CGIAR: Bioversity, CIMMYT, CIP, ICRISAT, IITA, and IRRI. Integrated breeding necessitates that breeders access genotypic and phenotypic data related to a given trait. The Crop Ontology provides validated trait names used by the crop communities of practice for harmonizing the annotation of phenotypic and genotypic data and thus supporting data accessibility and discovery through web queries. The trait information is completed by the description of the measurement methods and scales, and images. The trait dictionaries used to produce the Integrated Breeding (IB fieldbooks are synchronized with the Crop Ontology terms for an automatic annotation of the phenotypic data measured in the field. The IB fieldbook provides breeders with direct access to the CO to get additional descriptive information on the traits. Ontologies and trait dictionaries are online for cassava, chickpea, common bean, groundnut, maize, Musa, potato, rice, sorghum and wheat. Online curation and annotation tools facilitate (http://cropontology.org direct maintenance of the trait information and production of trait dictionaries by the crop communities. An important feature is the cross referencing of CO terms with the Crop database trait ID and with their synonyms in Plant Ontology and Trait Ontology. Web links between cross referenced terms in CO provide online access to data annotated with similar ontological terms, particularly the genetic data in Gramene (University of Cornell or the evaluation and climatic data in the Global Repository of evaluation trials of the Climate Change, Agriculture and Food Security programme (CCAFS. Cross-referencing and annotation will be further applied in the Integrated Breeding Platform.

  6. Network-based ranking methods for prediction of novel disease associated microRNAs.

    Science.gov (United States)

    Le, Duc-Hau

    2015-10-01

    Many studies have shown roles of microRNAs on human disease and a number of computational methods have been proposed to predict such associations by ranking candidate microRNAs according to their relevance to a disease. Among them, machine learning-based methods usually have a limitation in specifying non-disease microRNAs as negative training samples. Meanwhile, network-based methods are becoming dominant since they well exploit a "disease module" principle in microRNA functional similarity networks. Of which, random walk with restart (RWR) algorithm-based method is currently state-of-the-art. The use of this algorithm was inspired from its success in predicting disease gene because the "disease module" principle also exists in protein interaction networks. Besides, many algorithms designed for webpage ranking have been successfully applied in ranking disease candidate genes because web networks share topological properties with protein interaction networks. However, these algorithms have not yet been utilized for disease microRNA prediction. We constructed microRNA functional similarity networks based on shared targets of microRNAs, and then we integrated them with a microRNA functional synergistic network, which was recently identified. After analyzing topological properties of these networks, in addition to RWR, we assessed the performance of (i) PRINCE (PRIoritizatioN and Complex Elucidation), which was proposed for disease gene prediction; (ii) PageRank with Priors (PRP) and K-Step Markov (KSM), which were used for studying web networks; and (iii) a neighborhood-based algorithm. Analyses on topological properties showed that all microRNA functional similarity networks are small-worldness and scale-free. The performance of each algorithm was assessed based on average AUC values on 35 disease phenotypes and average rankings of newly discovered disease microRNAs. As a result, the performance on the integrated network was better than that on individual ones. In

  7. Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

    Science.gov (United States)

    Apweiler, R; Gateau, A; Contrino, S; Martin, M J; Junker, V; O'Donovan, C; Lang, F; Mitaritonna, N; Kappus, S; Bairoch, A

    1997-01-01

    SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.

  8. Psychomotor Battery Approaches to Performance Prediction and Evaluation in Hyperbaric, Thermal and Vibratory Environments: Annotated Bibliographies and Integrative Review

    Science.gov (United States)

    1980-10-01

    W77-Mar78 and Vibratory Environments: Annotated Biblia - 4.-EFRIGOO EOT*_1 graphies and Integrative Review. I. CONTRACT OR GRANT NUMSER(a) David J...Papers In the third phase of the effort, the final version of the three speciai-environrneni performance battery bibliographies was corriiled and the...performance at much lower pressu. (e.g. 3 to 4 ATA when nitrogen is involved). The following sections will integrate the available liter - ature on the effects

  9. Use of Annotations for Component and Framework Interoperability

    Science.gov (United States)

    David, O.; Lloyd, W.; Carlson, J.; Leavesley, G. H.; Geter, F.

    2009-12-01

    The popular programming languages Java and C# provide annotations, a form of meta-data construct. Software frameworks for web integration, web services, database access, and unit testing now take advantage of annotations to reduce the complexity of APIs and the quantity of integration code between the application and framework infrastructure. Adopting annotation features in frameworks has been observed to lead to cleaner and leaner application code. The USDA Object Modeling System (OMS) version 3.0 fully embraces the annotation approach and additionally defines a meta-data standard for components and models. In version 3.0 framework/model integration previously accomplished using API calls is now achieved using descriptive annotations. This enables the framework to provide additional functionality non-invasively such as implicit multithreading, and auto-documenting capabilities while achieving a significant reduction in the size of the model source code. Using a non-invasive methodology leads to models and modeling components with only minimal dependencies on the modeling framework. Since models and modeling components are not directly bound to framework by the use of specific APIs and/or data types they can more easily be reused both within the framework as well as outside of it. To study the effectiveness of an annotation based framework approach with other modeling frameworks, a framework-invasiveness study was conducted to evaluate the effects of framework design on model code quality. A monthly water balance model was implemented across several modeling frameworks and several software metrics were collected. The metrics selected were measures of non-invasive design methods for modeling frameworks from a software engineering perspective. It appears that the use of annotations positively impacts several software quality measures. In a next step, the PRMS model was implemented in OMS 3.0 and is currently being implemented for water supply forecasting in the

  10. DFAST and DAGA: web-based integrated genome annotation tools and resources.

    Science.gov (United States)

    Tanizawa, Yasuhiro; Fujisawa, Takatomo; Kaminuma, Eli; Nakamura, Yasukazu; Arita, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus , obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii , whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp.

  11. MicroRNAs in CAG trinucleotide repeat expansion disorders: an integrated review of the literature.

    Science.gov (United States)

    Dumitrescu, Laura; Popescu, Bogdan O

    2015-01-01

    MicroRNAs are small RNAs involved in gene silencing. They play important roles in transcriptional regulation and are selectively and abundantly expressed in the central nervous system. A considerable amount of the human genome is comprised of tandem repeating nucleotide streams. Several diseases are caused by above-threshold expansion of certain trinucleotide repeats occurring in a protein-coding or non-coding region. Though monogenic, CAG trinucleotide repeat expansion disorders have a complex pathogenesis, various combinations of multiple coexisting pathways resulting in one common final consequence: selective neurodegeneration. Mutant protein and mutant transcript gain of toxic function are considered to be the core pathogenic mechanisms. The profile of microRNAs in CAG trinucleotide repeat disorders is scarcely described, however microRNA dysregulation has been identified in these diseases and microRNA-related intereference with gene expression is considered to be involved in their pathogenesis. Better understanding of microRNAs functions and means of manipulation promises to offer further insights into the pathogenic pathways of CAG repeat expansion disorders, to point out new potential targets for drug intervention and to provide some of the much needed etiopathogenic therapeutic agents. A number of disease-modifying microRNA silencing strategies are under development, but several implementation impediments still have to be resolved. CAG targeting seems feasible and efficient in animal models and is an appealing approach for clinical practice. Preliminary human trials are just beginning.

  12. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

    Science.gov (United States)

    2012-01-01

    Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas. PMID:23256920

  13. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

    Directory of Open Access Journals (Sweden)

    Liu Chang

    2012-12-01

    Full Text Available Abstract Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas.

  14. A framework for annotating human genome in disease context.

    Science.gov (United States)

    Xu, Wei; Wang, Huisong; Cheng, Wenqing; Fu, Dong; Xia, Tian; Kibbe, Warren A; Lin, Simon M

    2012-01-01

    Identification of gene-disease association is crucial to understanding disease mechanism. A rapid increase in biomedical literatures, led by advances of genome-scale technologies, poses challenge for manually-curated-based annotation databases to characterize gene-disease associations effectively and timely. We propose an automatic method-The Disease Ontology Annotation Framework (DOAF) to provide a comprehensive annotation of the human genome using the computable Disease Ontology (DO), the NCBO Annotator service and NCBI Gene Reference Into Function (GeneRIF). DOAF can keep the resulting knowledgebase current by periodically executing automatic pipeline to re-annotate the human genome using the latest DO and GeneRIF releases at any frequency such as daily or monthly. Further, DOAF provides a computable and programmable environment which enables large-scale and integrative analysis by working with external analytic software or online service platforms. A user-friendly web interface (doa.nubic.northwestern.edu) is implemented to allow users to efficiently query, download, and view disease annotations and the underlying evidences.

  15. dbSMR: a novel resource of genome-wide SNPs affecting microRNA mediated regulation

    Directory of Open Access Journals (Sweden)

    Hariharan Manoj

    2009-04-01

    Full Text Available Abstract Background MicroRNAs (miRNAs regulate several biological processes through post-transcriptional gene silencing. The efficiency of binding of miRNAs to target transcripts depends on the sequence as well as intramolecular structure of the transcript. Single Nucleotide Polymorphisms (SNPs can contribute to alterations in the structure of regions flanking them, thereby influencing the accessibility for miRNA binding. Description The entire human genome was analyzed for SNPs in and around predicted miRNA target sites. Polymorphisms within 200 nucleotides that could alter the intramolecular structure at the target site, thereby altering regulation were annotated. Collated information was ported in a MySQL database with a user-friendly interface accessible through the URL: http://miracle.igib.res.in/dbSMR. Conclusion The database has a user-friendly interface where the information can be queried using either the gene name, microRNA name, polymorphism ID or transcript ID. Combination queries using 'AND' or 'OR' is also possible along with specifying the degree of change of intramolecular bonding with and without the polymorphism. Such a resource would enable researchers address questions like the role of regulatory SNPs in the 3' UTRs and population specific regulatory modulations in the context of microRNA targets.

  16. Managing and Querying Image Annotation and Markup in XML.

    Science.gov (United States)

    Wang, Fusheng; Pan, Tony; Sharma, Ashish; Saltz, Joel

    2010-01-01

    Proprietary approaches for representing annotations and image markup are serious barriers for researchers to share image data and knowledge. The Annotation and Image Markup (AIM) project is developing a standard based information model for image annotation and markup in health care and clinical trial environments. The complex hierarchical structures of AIM data model pose new challenges for managing such data in terms of performance and support of complex queries. In this paper, we present our work on managing AIM data through a native XML approach, and supporting complex image and annotation queries through native extension of XQuery language. Through integration with xService, AIM databases can now be conveniently shared through caGrid.

  17. Managing and Querying Image Annotation and Markup in XML

    Science.gov (United States)

    Wang, Fusheng; Pan, Tony; Sharma, Ashish; Saltz, Joel

    2010-01-01

    Proprietary approaches for representing annotations and image markup are serious barriers for researchers to share image data and knowledge. The Annotation and Image Markup (AIM) project is developing a standard based information model for image annotation and markup in health care and clinical trial environments. The complex hierarchical structures of AIM data model pose new challenges for managing such data in terms of performance and support of complex queries. In this paper, we present our work on managing AIM data through a native XML approach, and supporting complex image and annotation queries through native extension of XQuery language. Through integration with xService, AIM databases can now be conveniently shared through caGrid. PMID:21218167

  18. The Eimeria Transcript DB: an integrated resource for annotated transcripts of protozoan parasites of the genus Eimeria

    Science.gov (United States)

    Rangel, Luiz Thibério; Novaes, Jeniffer; Durham, Alan M.; Madeira, Alda Maria B. N.; Gruber, Arthur

    2013-01-01

    Parasites of the genus Eimeria infect a wide range of vertebrate hosts, including chickens. We have recently reported a comparative analysis of the transcriptomes of Eimeria acervulina, Eimeria maxima and Eimeria tenella, integrating ORESTES data produced by our group and publicly available Expressed Sequence Tags (ESTs). All cDNA reads have been assembled, and the reconstructed transcripts have been submitted to a comprehensive functional annotation pipeline. Additional studies included orthology assignment across apicomplexan parasites and clustering analyses of gene expression profiles among different developmental stages of the parasites. To make all this body of information publicly available, we constructed the Eimeria Transcript Database (EimeriaTDB), a web repository that provides access to sequence data, annotation and comparative analyses. Here, we describe the web interface, available sequence data sets and query tools implemented on the site. The main goal of this work is to offer a public repository of sequence and functional annotation data of reconstructed transcripts of parasites of the genus Eimeria. We believe that EimeriaTDB will represent a valuable and complementary resource for the Eimeria scientific community and for those researchers interested in comparative genomics of apicomplexan parasites. Database URL: http://www.coccidia.icb.usp.br/eimeriatdb/ PMID:23411718

  19. Bridging the phenotypic and genetic data useful for integrated breeding through a data annotation using the Crop Ontology developed by the crop communities of practice

    Science.gov (United States)

    Shrestha, Rosemary; Matteis, Luca; Skofic, Milko; Portugal, Arllet; McLaren, Graham; Hyman, Glenn; Arnaud, Elizabeth

    2012-01-01

    The Crop Ontology (CO) of the Generation Challenge Program (GCP) (http://cropontology.org/) is developed for the Integrated Breeding Platform (IBP) (http://www.integratedbreeding.net/) by several centers of The Consultative Group on International Agricultural Research (CGIAR): bioversity, CIMMYT, CIP, ICRISAT, IITA, and IRRI. Integrated breeding necessitates that breeders access genotypic and phenotypic data related to a given trait. The CO provides validated trait names used by the crop communities of practice (CoP) for harmonizing the annotation of phenotypic and genotypic data and thus supporting data accessibility and discovery through web queries. The trait information is completed by the description of the measurement methods and scales, and images. The trait dictionaries used to produce the Integrated Breeding (IB) fieldbooks are synchronized with the CO terms for an automatic annotation of the phenotypic data measured in the field. The IB fieldbook provides breeders with direct access to the CO to get additional descriptive information on the traits. Ontologies and trait dictionaries are online for cassava, chickpea, common bean, groundnut, maize, Musa, potato, rice, sorghum, and wheat. Online curation and annotation tools facilitate (http://cropontology.org) direct maintenance of the trait information and production of trait dictionaries by the crop communities. An important feature is the cross referencing of CO terms with the Crop database trait ID and with their synonyms in Plant Ontology (PO) and Trait Ontology (TO). Web links between cross referenced terms in CO provide online access to data annotated with similar ontological terms, particularly the genetic data in Gramene (University of Cornell) or the evaluation and climatic data in the Global Repository of evaluation trials of the Climate Change, Agriculture and Food Security programme (CCAFS). Cross-referencing and annotation will be further applied in the IBP. PMID:22934074

  20. Gene Ontology annotation of the rice blast fungus, Magnaporthe oryzae

    Directory of Open Access Journals (Sweden)

    Deng Jixin

    2009-02-01

    Full Text Available Abstract Background Magnaporthe oryzae, the causal agent of blast disease of rice, is the most destructive disease of rice worldwide. The genome of this fungal pathogen has been sequenced and an automated annotation has recently been updated to Version 6 http://www.broad.mit.edu/annotation/genome/magnaporthe_grisea/MultiDownloads.html. However, a comprehensive manual curation remains to be performed. Gene Ontology (GO annotation is a valuable means of assigning functional information using standardized vocabulary. We report an overview of the GO annotation for Version 5 of M. oryzae genome assembly. Methods A similarity-based (i.e., computational GO annotation with manual review was conducted, which was then integrated with a literature-based GO annotation with computational assistance. For similarity-based GO annotation a stringent reciprocal best hits method was used to identify similarity between predicted proteins of M. oryzae and GO proteins from multiple organisms with published associations to GO terms. Significant alignment pairs were manually reviewed. Functional assignments were further cross-validated with manually reviewed data, conserved domains, or data determined by wet lab experiments. Additionally, biological appropriateness of the functional assignments was manually checked. Results In total, 6,286 proteins received GO term assignment via the homology-based annotation, including 2,870 hypothetical proteins. Literature-based experimental evidence, such as microarray, MPSS, T-DNA insertion mutation, or gene knockout mutation, resulted in 2,810 proteins being annotated with GO terms. Of these, 1,673 proteins were annotated with new terms developed for Plant-Associated Microbe Gene Ontology (PAMGO. In addition, 67 experiment-determined secreted proteins were annotated with PAMGO terms. Integration of the two data sets resulted in 7,412 proteins (57% being annotated with 1,957 distinct and specific GO terms. Unannotated proteins

  1. Estimating the annotation error rate of curated GO database sequence annotations

    Directory of Open Access Journals (Sweden)

    Brown Alfred L

    2007-05-01

    Full Text Available Abstract Background Annotations that describe the function of sequences are enormously important to researchers during laboratory investigations and when making computational inferences. However, there has been little investigation into the data quality of sequence function annotations. Here we have developed a new method of estimating the error rate of curated sequence annotations, and applied this to the Gene Ontology (GO sequence database (GOSeqLite. This method involved artificially adding errors to sequence annotations at known rates, and used regression to model the impact on the precision of annotations based on BLAST matched sequences. Results We estimated the error rate of curated GO sequence annotations in the GOSeqLite database (March 2006 at between 28% and 30%. Annotations made without use of sequence similarity based methods (non-ISS had an estimated error rate of between 13% and 18%. Annotations made with the use of sequence similarity methodology (ISS had an estimated error rate of 49%. Conclusion While the overall error rate is reasonably low, it would be prudent to treat all ISS annotations with caution. Electronic annotators that use ISS annotations as the basis of predictions are likely to have higher false prediction rates, and for this reason designers of these systems should consider avoiding ISS annotations where possible. Electronic annotators that use ISS annotations to make predictions should be viewed sceptically. We recommend that curators thoroughly review ISS annotations before accepting them as valid. Overall, users of curated sequence annotations from the GO database should feel assured that they are using a comparatively high quality source of information.

  2. Annotated bibliography

    International Nuclear Information System (INIS)

    1997-08-01

    Under a cooperative agreement with the U.S. Department of Energy's Office of Science and Technology, Waste Policy Institute (WPI) is conducting a five-year research project to develop a research-based approach for integrating communication products in stakeholder involvement related to innovative technology. As part of the research, WPI developed this annotated bibliography which contains almost 100 citations of articles/books/resources involving topics related to communication and public involvement aspects of deploying innovative cleanup technology. To compile the bibliography, WPI performed on-line literature searches (e.g., Dialog, International Association of Business Communicators Public Relations Society of America, Chemical Manufacturers Association, etc.), consulted past years proceedings of major environmental waste cleanup conferences (e.g., Waste Management), networked with professional colleagues and DOE sites to gather reports or case studies, and received input during the August 1996 Research Design Team meeting held to discuss the project's research methodology. Articles were selected for annotation based upon their perceived usefulness to the broad range of public involvement and communication practitioners

  3. Computational Characterization of Exogenous MicroRNAs that Can Be Transferred into Human Circulation.

    Directory of Open Access Journals (Sweden)

    Jiang Shu

    Full Text Available MicroRNAs have been long considered synthesized endogenously until very recent discoveries showing that human can absorb dietary microRNAs from animal and plant origins while the mechanism remains unknown. Compelling evidences of microRNAs from rice, milk, and honeysuckle transported to human blood and tissues have created a high volume of interests in the fundamental questions that which and how exogenous microRNAs can be transferred into human circulation and possibly exert functions in humans. Here we present an integrated genomics and computational analysis to study the potential deciding features of transportable microRNAs. Specifically, we analyzed all publicly available microRNAs, a total of 34,612 from 194 species, with 1,102 features derived from the microRNA sequence and structure. Through in-depth bioinformatics analysis, 8 groups of discriminative features have been used to characterize human circulating microRNAs and infer the likelihood that a microRNA will get transferred into human circulation. For example, 345 dietary microRNAs have been predicted as highly transportable candidates where 117 of them have identical sequences with their homologs in human and 73 are known to be associated with exosomes. Through a milk feeding experiment, we have validated 9 cow-milk microRNAs in human plasma using microRNA-sequencing analysis, including the top ranked microRNAs such as bta-miR-487b, miR-181b, and miR-421. The implications in health-related processes have been illustrated in the functional analysis. This work demonstrates the data-driven computational analysis is highly promising to study novel molecular characteristics of transportable microRNAs while bypassing the complex mechanistic details.

  4. Computational Characterization of Exogenous MicroRNAs that Can Be Transferred into Human Circulation

    Science.gov (United States)

    Shu, Jiang; Chiang, Kevin; Zempleni, Janos; Cui, Juan

    2015-01-01

    MicroRNAs have been long considered synthesized endogenously until very recent discoveries showing that human can absorb dietary microRNAs from animal and plant origins while the mechanism remains unknown. Compelling evidences of microRNAs from rice, milk, and honeysuckle transported to human blood and tissues have created a high volume of interests in the fundamental questions that which and how exogenous microRNAs can be transferred into human circulation and possibly exert functions in humans. Here we present an integrated genomics and computational analysis to study the potential deciding features of transportable microRNAs. Specifically, we analyzed all publicly available microRNAs, a total of 34,612 from 194 species, with 1,102 features derived from the microRNA sequence and structure. Through in-depth bioinformatics analysis, 8 groups of discriminative features have been used to characterize human circulating microRNAs and infer the likelihood that a microRNA will get transferred into human circulation. For example, 345 dietary microRNAs have been predicted as highly transportable candidates where 117 of them have identical sequences with their homologs in human and 73 are known to be associated with exosomes. Through a milk feeding experiment, we have validated 9 cow-milk microRNAs in human plasma using microRNA-sequencing analysis, including the top ranked microRNAs such as bta-miR-487b, miR-181b, and miR-421. The implications in health-related processes have been illustrated in the functional analysis. This work demonstrates the data-driven computational analysis is highly promising to study novel molecular characteristics of transportable microRNAs while bypassing the complex mechanistic details. PMID:26528912

  5. Genome-wide annotation of porcine microRNA genes and transcriptome profiling during Actinobacillus infection

    DEFF Research Database (Denmark)

    Nielsen, Mathilde

    MicroRNAs are small single stranded non-coding RNA molecules which contributes to the regulation of gene expression by primarily binding to the 3´end of protein coding mRNA, hereby inhibiting the translation process or promting degradation of the mRNA. The main focus of this PhD project was to ex......MicroRNAs are small single stranded non-coding RNA molecules which contributes to the regulation of gene expression by primarily binding to the 3´end of protein coding mRNA, hereby inhibiting the translation process or promting degradation of the mRNA. The main focus of this PhD project...

  6. FIGENIX: Intelligent automation of genomic annotation: expertise integration in a new software platform

    Directory of Open Access Journals (Sweden)

    Pontarotti Pierre

    2005-08-01

    Full Text Available Abstract Background Two of the main objectives of the genomic and post-genomic era are to structurally and functionally annotate genomes which consists of detecting genes' position and structure, and inferring their function (as well as of other features of genomes. Structural and functional annotation both require the complex chaining of numerous different software, algorithms and methods under the supervision of a biologist. The automation of these pipelines is necessary to manage huge amounts of data released by sequencing projects. Several pipelines already automate some of these complex chaining but still necessitate an important contribution of biologists for supervising and controlling the results at various steps. Results Here we propose an innovative automated platform, FIGENIX, which includes an expert system capable to substitute to human expertise at several key steps. FIGENIX currently automates complex pipelines of structural and functional annotation under the supervision of the expert system (which allows for example to make key decisions, check intermediate results or refine the dataset. The quality of the results produced by FIGENIX is comparable to those obtained by expert biologists with a drastic gain in terms of time costs and avoidance of errors due to the human manipulation of data. Conclusion The core engine and expert system of the FIGENIX platform currently handle complex annotation processes of broad interest for the genomic community. They could be easily adapted to new, or more specialized pipelines, such as for example the annotation of miRNAs, the classification of complex multigenic families, annotation of regulatory elements and other genomic features of interest.

  7. ACID: annotation of cassette and integron data

    Directory of Open Access Journals (Sweden)

    Stokes Harold W

    2009-04-01

    Full Text Available Abstract Background Although integrons and their associated gene cassettes are present in ~10% of bacteria and can represent up to 3% of the genome in which they are found, very few have been properly identified and annotated in public databases. These genetic elements have been overlooked in comparison to other vectors that facilitate lateral gene transfer between microorganisms. Description By automating the identification of integron integrase genes and of the non-coding cassette-associated attC recombination sites, we were able to assemble a database containing all publicly available sequence information regarding these genetic elements. Specialists manually curated the database and this information was used to improve the automated detection and annotation of integrons and their encoded gene cassettes. ACID (annotation of cassette and integron data can be searched using a range of queries and the data can be downloaded in a number of formats. Users can readily annotate their own data and integrate it into ACID using the tools provided. Conclusion ACID is a community resource providing easy access to annotations of integrons and making tools available to detect them in novel sequence data. ACID also hosts a forum to prompt integron-related discussion, which can hopefully lead to a more universal definition of this genetic element.

  8. Combinatorial microRNA target predictions

    DEFF Research Database (Denmark)

    Krek, Azra; Grün, Dominic; Poy, Matthew N.

    2005-01-01

    MicroRNAs are small noncoding RNAs that recognize and bind to partially complementary sites in the 3' untranslated regions of target genes in animals and, by unknown mechanisms, regulate protein production of the target transcript1, 2, 3. Different combinations of microRNAs are expressed...... in different cell types and may coordinately regulate cell-specific target genes. Here, we present PicTar, a computational method for identifying common targets of microRNAs. Statistical tests using genome-wide alignments of eight vertebrate genomes, PicTar's ability to specifically recover published micro......RNA targets, and experimental validation of seven predicted targets suggest that PicTar has an excellent success rate in predicting targets for single microRNAs and for combinations of microRNAs. We find that vertebrate microRNAs target, on average, roughly 200 transcripts each. Furthermore, our results...

  9. A Linked Data-Based Collaborative Annotation System for Increasing Learning Achievements

    Science.gov (United States)

    Zarzour, Hafed; Sellami, Mokhtar

    2017-01-01

    With the emergence of the Web 2.0, collaborative annotation practices have become more mature in the field of learning. In this context, several recent studies have shown the powerful effects of the integration of annotation mechanism in learning process. However, most of these studies provide poor support for semantically structured resources,…

  10. MicroRNAs as potential biomarkers in adrenocortical cancer: progress and challenges

    Directory of Open Access Journals (Sweden)

    Nadia eCHERRADI

    2016-01-01

    Full Text Available Adrenocortical carcinoma is a rare malignancy with poor prognosis and limited therapeutic options. Over the last decade, pan-genomic analyses of genetic and epigenetic alterations and genome-wide expression profile studies allowed major advances in the understanding of the molecular genetics of adrenocortical carcinoma. Besides the well-known dysfunctional molecular pathways in adrenocortical tumors such as the IGF2 pathway, the Wnt pathway and TP53, high-throughput technologies enabled a more comprehensive genomic characterization of adrenocortical cancer. Integration of expression profile data with exome sequencing, SNP array analysis, methylation and microRNA profiling led to the identification of subgroups of malignant tumors with distinct molecular alterations and clinical outcomes. MicroRNAs post-transcriptionally silence their target gene expression either by degrading mRNA or by inhibiting translation. Although our knowledge of the contribution of deregulated microRNAs to the pathogenesis of adrenocortical carcinoma is still in its infancy, recent studies support their relevance in gene expression alterations in these tumors. Some microRNAs have been shown to carry potential diagnostic and prognostic values while others may be good candidates for therapeutic interventions. With the emergence of disease-specific blood-borne microRNAs signatures, analyses of small cohorts of patients with adrenocortical carcinoma suggest that circulating microRNAs represent promising non-invasive biomarkers of malignancy or recurrence. However, some technical challenges still remain, and most of the microRNAs reported in the literature have not yet been validated in sufficiently powered and longitudinal studies. In this review, we discuss the current knowledge regarding the deregulation of tumor-associated and circulating microRNAs in adrenocortical carcinoma patients, while emphasizing their potential significance in adrenocortical carcinoma pathogenic

  11. Model and Interoperability using Meta Data Annotations

    Science.gov (United States)

    David, O.

    2011-12-01

    Software frameworks and architectures are in need for meta data to efficiently support model integration. Modelers have to know the context of a model, often stepping into modeling semantics and auxiliary information usually not provided in a concise structure and universal format, consumable by a range of (modeling) tools. XML often seems the obvious solution for capturing meta data, but its wide adoption to facilitate model interoperability is limited by XML schema fragmentation, complexity, and verbosity outside of a data-automation process. Ontologies seem to overcome those shortcomings, however the practical significance of their use remains to be demonstrated. OMS version 3 took a different approach for meta data representation. The fundamental building block of a modular model in OMS is a software component representing a single physical process, calibration method, or data access approach. Here, programing language features known as Annotations or Attributes were adopted. Within other (non-modeling) frameworks it has been observed that annotations lead to cleaner and leaner application code. Framework-supported model integration, traditionally accomplished using Application Programming Interfaces (API) calls is now achieved using descriptive code annotations. Fully annotated components for various hydrological and Ag-system models now provide information directly for (i) model assembly and building, (ii) data flow analysis for implicit multi-threading or visualization, (iii) automated and comprehensive model documentation of component dependencies, physical data properties, (iv) automated model and component testing, calibration, and optimization, and (v) automated audit-traceability to account for all model resources leading to a particular simulation result. Such a non-invasive methodology leads to models and modeling components with only minimal dependencies on the modeling framework but a strong reference to its originating code. Since models and

  12. Software for computing and annotating genomic ranges.

    Science.gov (United States)

    Lawrence, Michael; Huber, Wolfgang; Pagès, Hervé; Aboyoun, Patrick; Carlson, Marc; Gentleman, Robert; Morgan, Martin T; Carey, Vincent J

    2013-01-01

    We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.

  13. LeARN: a platform for detecting, clustering and annotating non-coding RNAs

    Directory of Open Access Journals (Sweden)

    Schiex Thomas

    2008-01-01

    Full Text Available Abstract Background In the last decade, sequencing projects have led to the development of a number of annotation systems dedicated to the structural and functional annotation of protein-coding genes. These annotation systems manage the annotation of the non-protein coding genes (ncRNAs in a very crude way, allowing neither the edition of the secondary structures nor the clustering of ncRNA genes into families which are crucial for appropriate annotation of these molecules. Results LeARN is a flexible software package which handles the complete process of ncRNA annotation by integrating the layers of automatic detection and human curation. Conclusion This software provides the infrastructure to deal properly with ncRNAs in the framework of any annotation project. It fills the gap between existing prediction software, that detect independent ncRNA occurrences, and public ncRNA repositories, that do not offer the flexibility and interactivity required for annotation projects. The software is freely available from the download section of the website http://bioinfo.genopole-toulouse.prd.fr/LeARN

  14. Lynx web services for annotations and systems analysis of multi-gene disorders.

    Science.gov (United States)

    Sulakhe, Dinanath; Taylor, Andrew; Balasubramanian, Sandhya; Feng, Bo; Xie, Bingqing; Börnigen, Daniela; Dave, Utpal J; Foster, Ian T; Gilliam, T Conrad; Maltsev, Natalia

    2014-07-01

    Lynx is a web-based integrated systems biology platform that supports annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Lynx has integrated multiple classes of biomedical data (genomic, proteomic, pathways, phenotypic, toxicogenomic, contextual and others) from various public databases as well as manually curated data from our group and collaborators (LynxKB). Lynx provides tools for gene list enrichment analysis using multiple functional annotations and network-based gene prioritization. Lynx provides access to the integrated database and the analytical tools via REST based Web Services (http://lynx.ci.uchicago.edu/webservices.html). This comprises data retrieval services for specific functional annotations, services to search across the complete LynxKB (powered by Lucene), and services to access the analytical tools built within the Lynx platform. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics.

    Science.gov (United States)

    Chepelev, Leonid L; Riazanov, Alexandre; Kouznetsov, Alexandre; Low, Hong Sang; Dumontier, Michel; Baker, Christopher J O

    2011-07-26

    The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI) framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of our integrative methodology in the context of

  16. Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics

    Directory of Open Access Journals (Sweden)

    Dumontier Michel

    2011-07-01

    Full Text Available Abstract Background The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. Results As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of

  17. SAS- Semantic Annotation Service for Geoscience resources on the web

    Science.gov (United States)

    Elag, M.; Kumar, P.; Marini, L.; Li, R.; Jiang, P.

    2015-12-01

    There is a growing need for increased integration across the data and model resources that are disseminated on the web to advance their reuse across different earth science applications. Meaningful reuse of resources requires semantic metadata to realize the semantic web vision for allowing pragmatic linkage and integration among resources. Semantic metadata associates standard metadata with resources to turn them into semantically-enabled resources on the web. However, the lack of a common standardized metadata framework as well as the uncoordinated use of metadata fields across different geo-information systems, has led to a situation in which standards and related Standard Names abound. To address this need, we have designed SAS to provide a bridge between the core ontologies required to annotate resources and information systems in order to enable queries and analysis over annotation from a single environment (web). SAS is one of the services that are provided by the Geosematnic framework, which is a decentralized semantic framework to support the integration between models and data and allow semantically heterogeneous to interact with minimum human intervention. Here we present the design of SAS and demonstrate its application for annotating data and models. First we describe how predicates and their attributes are extracted from standards and ingested in the knowledge-base of the Geosemantic framework. Then we illustrate the application of SAS in annotating data managed by SEAD and annotating simulation models that have web interface. SAS is a step in a broader approach to raise the quality of geoscience data and models that are published on the web and allow users to better search, access, and use of the existing resources based on standard vocabularies that are encoded and published using semantic technologies.

  18. Software for computing and annotating genomic ranges.

    Directory of Open Access Journals (Sweden)

    Michael Lawrence

    Full Text Available We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.

  19. AnnoLnc: a web server for systematically annotating novel human lncRNAs.

    Science.gov (United States)

    Hou, Mei; Tang, Xing; Tian, Feng; Shi, Fangyuan; Liu, Fenglin; Gao, Ge

    2016-11-16

    Long noncoding RNAs (lncRNAs) have been shown to play essential roles in almost every important biological process through multiple mechanisms. Although the repertoire of human lncRNAs has rapidly expanded, their biological function and regulation remain largely elusive, calling for a systematic and integrative annotation tool. Here we present AnnoLnc ( http://annolnc.cbi.pku.edu.cn ), a one-stop portal for systematically annotating novel human lncRNAs. Based on more than 700 data sources and various tool chains, AnnoLnc enables a systematic annotation covering genomic location, secondary structure, expression patterns, transcriptional regulation, miRNA interaction, protein interaction, genetic association and evolution. An intuitive web interface is available for interactive analysis through both desktops and mobile devices, and programmers can further integrate AnnoLnc into their pipeline through standard JSON-based Web Service APIs. To the best of our knowledge, AnnoLnc is the only web server to provide on-the-fly and systematic annotation for newly identified human lncRNAs. Compared with similar tools, the annotation generated by AnnoLnc covers a much wider spectrum with intuitive visualization. Case studies demonstrate the power of AnnoLnc in not only rediscovering known functions of human lncRNAs but also inspiring novel hypotheses.

  20. Phenex: ontological annotation of phenotypic diversity.

    Directory of Open Access Journals (Sweden)

    James P Balhoff

    2010-05-01

    Full Text Available Phenotypic differences among species have long been systematically itemized and described by biologists in the process of investigating phylogenetic relationships and trait evolution. Traditionally, these descriptions have been expressed in natural language within the context of individual journal publications or monographs. As such, this rich store of phenotype data has been largely unavailable for statistical and computational comparisons across studies or integration with other biological knowledge.Here we describe Phenex, a platform-independent desktop application designed to facilitate efficient and consistent annotation of phenotypic similarities and differences using Entity-Quality syntax, drawing on terms from community ontologies for anatomical entities, phenotypic qualities, and taxonomic names. Phenex can be configured to load only those ontologies pertinent to a taxonomic group of interest. The graphical user interface was optimized for evolutionary biologists accustomed to working with lists of taxa, characters, character states, and character-by-taxon matrices.Annotation of phenotypic data using ontologies and globally unique taxonomic identifiers will allow biologists to integrate phenotypic data from different organisms and studies, leveraging decades of work in systematics and comparative morphology.

  1. Phenex: ontological annotation of phenotypic diversity.

    Science.gov (United States)

    Balhoff, James P; Dahdul, Wasila M; Kothari, Cartik R; Lapp, Hilmar; Lundberg, John G; Mabee, Paula; Midford, Peter E; Westerfield, Monte; Vision, Todd J

    2010-05-05

    Phenotypic differences among species have long been systematically itemized and described by biologists in the process of investigating phylogenetic relationships and trait evolution. Traditionally, these descriptions have been expressed in natural language within the context of individual journal publications or monographs. As such, this rich store of phenotype data has been largely unavailable for statistical and computational comparisons across studies or integration with other biological knowledge. Here we describe Phenex, a platform-independent desktop application designed to facilitate efficient and consistent annotation of phenotypic similarities and differences using Entity-Quality syntax, drawing on terms from community ontologies for anatomical entities, phenotypic qualities, and taxonomic names. Phenex can be configured to load only those ontologies pertinent to a taxonomic group of interest. The graphical user interface was optimized for evolutionary biologists accustomed to working with lists of taxa, characters, character states, and character-by-taxon matrices. Annotation of phenotypic data using ontologies and globally unique taxonomic identifiers will allow biologists to integrate phenotypic data from different organisms and studies, leveraging decades of work in systematics and comparative morphology.

  2. Combined evidence annotation of transposable elements in genome sequences.

    Directory of Open Access Journals (Sweden)

    Hadi Quesneville

    2005-07-01

    Full Text Available Transposable elements (TEs are mobile, repetitive sequences that make up significant fractions of metazoan genomes. Despite their near ubiquity and importance in genome and chromosome biology, most efforts to annotate TEs in genome sequences rely on the results of a single computational program, RepeatMasker. In contrast, recent advances in gene annotation indicate that high-quality gene models can be produced from combining multiple independent sources of computational evidence. To elevate the quality of TE annotations to a level comparable to that of gene models, we have developed a combined evidence-model TE annotation pipeline, analogous to systems used for gene annotation, by integrating results from multiple homology-based and de novo TE identification methods. As proof of principle, we have annotated "TE models" in Drosophila melanogaster Release 4 genomic sequences using the combined computational evidence derived from RepeatMasker, BLASTER, TBLASTX, all-by-all BLASTN, RECON, TE-HMM and the previous Release 3.1 annotation. Our system is designed for use with the Apollo genome annotation tool, allowing automatic results to be curated manually to produce reliable annotations. The euchromatic TE fraction of D. melanogaster is now estimated at 5.3% (cf. 3.86% in Release 3.1, and we found a substantially higher number of TEs (n = 6,013 than previously identified (n = 1,572. Most of the new TEs derive from small fragments of a few hundred nucleotides long and highly abundant families not previously annotated (e.g., INE-1. We also estimated that 518 TE copies (8.6% are inserted into at least one other TE, forming a nest of elements. The pipeline allows rapid and thorough annotation of even the most complex TE models, including highly deleted and/or nested elements such as those often found in heterochromatic sequences. Our pipeline can be easily adapted to other genome sequences, such as those of the D. melanogaster heterochromatin or other

  3. PANDA: pathway and annotation explorer for visualizing and interpreting gene-centric data.

    Science.gov (United States)

    Hart, Steven N; Moore, Raymond M; Zimmermann, Michael T; Oliver, Gavin R; Egan, Jan B; Bryce, Alan H; Kocher, Jean-Pierre A

    2015-01-01

    Objective. Bringing together genomics, transcriptomics, proteomics, and other -omics technologies is an important step towards developing highly personalized medicine. However, instrumentation has advances far beyond expectations and now we are able to generate data faster than it can be interpreted. Materials and Methods. We have developed PANDA (Pathway AND Annotation) Explorer, a visualization tool that integrates gene-level annotation in the context of biological pathways to help interpret complex data from disparate sources. PANDA is a web-based application that displays data in the context of well-studied pathways like KEGG, BioCarta, and PharmGKB. PANDA represents data/annotations as icons in the graph while maintaining the other data elements (i.e., other columns for the table of annotations). Custom pathways from underrepresented diseases can be imported when existing data sources are inadequate. PANDA also allows sharing annotations among collaborators. Results. In our first use case, we show how easy it is to view supplemental data from a manuscript in the context of a user's own data. Another use-case is provided describing how PANDA was leveraged to design a treatment strategy from the somatic variants found in the tumor of a patient with metastatic sarcomatoid renal cell carcinoma. Conclusion. PANDA facilitates the interpretation of gene-centric annotations by visually integrating this information with context of biological pathways. The application can be downloaded or used directly from our website: http://bioinformaticstools.mayo.edu/research/panda-viewer/.

  4. Smoking-related microRNAs and mRNAs in human peripheral blood mononuclear cells

    International Nuclear Information System (INIS)

    Su, Ming-Wei; Yu, Sung-Liang; Lin, Wen-Chang; Tsai, Ching-Hui; Chen, Po-Hua; Lee, Yungling Leo

    2016-01-01

    Teenager smoking is of great importance in public health. Functional roles of microRNAs have been documented in smoke-induced gene expression changes, but comprehensive mechanisms of microRNA-mRNA regulation and benefits remained poorly understood. We conducted the Teenager Smoking Reduction Trial (TSRT) to investigate the causal association between active smoking reduction and whole-genome microRNA and mRNA expression changes in human peripheral blood mononuclear cells (PBMC). A total of 12 teenagers with a substantial reduction in smoke quantity and a decrease in urine cotinine/creatinine ratio were enrolled in genomic analyses. In Gene Set Enrichment Analysis (GSEA) and Ingenuity Pathway Analysis (IPA), differentially expressed genes altered by smoke reduction were mainly associated with glucocorticoid receptor signaling pathway. The integrative analysis of microRNA and mRNA found eleven differentially expressed microRNAs negatively correlated with predicted target genes. CD83 molecule regulated by miR-4498 in human PBMC, was critical for the canonical pathway of communication between innate and adaptive immune cells. Our data demonstrated that microRNAs could regulate immune responses in human PBMC after habitual smokers quit smoking and support the potential translational value of microRNAs in regulating disease-relevant gene expression caused by tobacco smoke. - Highlights: • We conducted a smoke reduction trial program and investigated the causal relationship between smoke and gene regulation. • MicroRNA and mRNA expression changes were examined in human PBMC. • MicroRNAs are important in regulating disease-causal genes after tobacco smoke reduction.

  5. Smoking-related microRNAs and mRNAs in human peripheral blood mononuclear cells

    Energy Technology Data Exchange (ETDEWEB)

    Su, Ming-Wei [Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan (China); Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan (China); Yu, Sung-Liang [Department of Clinical Laboratory Sciences and Medical Biotechnology, College of Medicine, National Taiwan University, Taipei, Taiwan (China); Lin, Wen-Chang [Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan (China); Tsai, Ching-Hui [Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan (China); Chen, Po-Hua [School of Medicine, National Taiwan University, Taipei, Taiwan (China); Lee, Yungling Leo, E-mail: leolee@ntu.edu.tw [Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan (China); Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan (China)

    2016-08-15

    Teenager smoking is of great importance in public health. Functional roles of microRNAs have been documented in smoke-induced gene expression changes, but comprehensive mechanisms of microRNA-mRNA regulation and benefits remained poorly understood. We conducted the Teenager Smoking Reduction Trial (TSRT) to investigate the causal association between active smoking reduction and whole-genome microRNA and mRNA expression changes in human peripheral blood mononuclear cells (PBMC). A total of 12 teenagers with a substantial reduction in smoke quantity and a decrease in urine cotinine/creatinine ratio were enrolled in genomic analyses. In Gene Set Enrichment Analysis (GSEA) and Ingenuity Pathway Analysis (IPA), differentially expressed genes altered by smoke reduction were mainly associated with glucocorticoid receptor signaling pathway. The integrative analysis of microRNA and mRNA found eleven differentially expressed microRNAs negatively correlated with predicted target genes. CD83 molecule regulated by miR-4498 in human PBMC, was critical for the canonical pathway of communication between innate and adaptive immune cells. Our data demonstrated that microRNAs could regulate immune responses in human PBMC after habitual smokers quit smoking and support the potential translational value of microRNAs in regulating disease-relevant gene expression caused by tobacco smoke. - Highlights: • We conducted a smoke reduction trial program and investigated the causal relationship between smoke and gene regulation. • MicroRNA and mRNA expression changes were examined in human PBMC. • MicroRNAs are important in regulating disease-causal genes after tobacco smoke reduction.

  6. Virus-Clip: a fast and memory-efficient viral integration site detection tool at single-base resolution with annotation capability.

    Science.gov (United States)

    Ho, Daniel W H; Sze, Karen M F; Ng, Irene O L

    2015-08-28

    Viral integration into the human genome upon infection is an important risk factor for various human malignancies. We developed viral integration site detection tool called Virus-Clip, which makes use of information extracted from soft-clipped sequencing reads to identify exact positions of human and virus breakpoints of integration events. With initial read alignment to virus reference genome and streamlined procedures, Virus-Clip delivers a simple, fast and memory-efficient solution to viral integration site detection. Moreover, it can also automatically annotate the integration events with the corresponding affected human genes. Virus-Clip has been verified using whole-transcriptome sequencing data and its detection was validated to have satisfactory sensitivity and specificity. Marked advancement in performance was detected, compared to existing tools. It is applicable to versatile types of data including whole-genome sequencing, whole-transcriptome sequencing, and targeted sequencing. Virus-Clip is available at http://web.hku.hk/~dwhho/Virus-Clip.zip.

  7. Human microRNA target analysis and gene ontology clustering by GOmir, a novel stand-alone application.

    Science.gov (United States)

    Roubelakis, Maria G; Zotos, Pantelis; Papachristoudis, Georgios; Michalopoulos, Ioannis; Pappa, Kalliopi I; Anagnou, Nicholas P; Kossida, Sophia

    2009-06-16

    microRNAs (miRNAs) are single-stranded RNA molecules of about 20-23 nucleotides length found in a wide variety of organisms. miRNAs regulate gene expression, by interacting with target mRNAs at specific sites in order to induce cleavage of the message or inhibit translation. Predicting or verifying mRNA targets of specific miRNAs is a difficult process of great importance. GOmir is a novel stand-alone application consisting of two separate tools: JTarget and TAGGO. JTarget integrates miRNA target prediction and functional analysis by combining the predicted target genes from TargetScan, miRanda, RNAhybrid and PicTar computational tools as well as the experimentally supported targets from TarBase and also providing a full gene description and functional analysis for each target gene. On the other hand, TAGGO application is designed to automatically group gene ontology annotations, taking advantage of the Gene Ontology (GO), in order to extract the main attributes of sets of proteins. GOmir represents a new tool incorporating two separate Java applications integrated into one stand-alone Java application. GOmir (by using up to five different databases) introduces miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering. Additionally, a reverse search initiated by a potential target can also be conducted. GOmir can freely be downloaded BRFAA.

  8. OmniSearch: a semantic search system based on the Ontology for MIcroRNA Target (OMIT) for microRNA-target gene interaction data.

    Science.gov (United States)

    Huang, Jingshan; Gutierrez, Fernando; Strachan, Harrison J; Dou, Dejing; Huang, Weili; Smith, Barry; Blake, Judith A; Eilbeck, Karen; Natale, Darren A; Lin, Yu; Wu, Bin; Silva, Nisansa de; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming; Ruttenberg, Alan

    2016-01-01

    As a special class of non-coding RNAs (ncRNAs), microRNAs (miRNAs) perform important roles in numerous biological and pathological processes. The realization of miRNA functions depends largely on how miRNAs regulate specific target genes. It is therefore critical to identify, analyze, and cross-reference miRNA-target interactions to better explore and delineate miRNA functions. Semantic technologies can help in this regard. We previously developed a miRNA domain-specific application ontology, Ontology for MIcroRNA Target (OMIT), whose goal was to serve as a foundation for semantic annotation, data integration, and semantic search in the miRNA field. In this paper we describe our continuing effort to develop the OMIT, and demonstrate its use within a semantic search system, OmniSearch, designed to facilitate knowledge capture of miRNA-target interaction data. Important changes in the current version OMIT are summarized as: (1) following a modularized ontology design (with 2559 terms imported from the NCRO ontology); (2) encoding all 1884 human miRNAs (vs. 300 in previous versions); and (3) setting up a GitHub project site along with an issue tracker for more effective community collaboration on the ontology development. The OMIT ontology is free and open to all users, accessible at: http://purl.obolibrary.org/obo/omit.owl. The OmniSearch system is also free and open to all users, accessible at: http://omnisearch.soc.southalabama.edu/index.php/Software.

  9. Emerging roles of microRNAs as molecular switches in the integrated circuit of the cancer cell

    Science.gov (United States)

    Sotiropoulou, Georgia; Pampalakis, Georgios; Lianidou, Evi; Mourelatos, Zissimos

    2009-01-01

    Transformation of normal cells into malignant tumors requires the acquisition of six hallmark traits, e.g., self-sufficiency in growth signals, insensitivity to antigrowth signals and self-renewal, evasion of apoptosis, limitless replication potential, angiogenesis, invasion, and metastasis, which are common to all cancers (Hanahan and Weinberg 2000). These new cellular traits evolve from defects in major regulatory microcircuits that are fundamental for normal homeostasis. The discovery of microRNAs (miRNAs) as a new class of small non-protein-coding RNAs that control gene expression post-transcriptionally by binding to various mRNA targets suggests that these tiny RNA molecules likely act as molecular switches in the extensive regulatory web that involves thousands of transcripts. Most importantly, accumulating evidence suggests that numerous microRNAs are aberrantly expressed in human cancers. In this review, we discuss the emergent roles of microRNAs as switches that function to turn on/off known cellular microcircuits. We outline recent compelling evidence that deregulated microRNA-mediated control of cellular microcircuits cooperates with other well-established regulatory mechanisms to confer the hallmark traits of the cancer cell. Furthermore, these exciting insights into aberrant microRNA control in cancer-associated circuits may be exploited for cancer therapies that will target deregulated miRNA switches. PMID:19561119

  10. High-performance web services for querying gene and variant annotation.

    Science.gov (United States)

    Xin, Jiwen; Mark, Adam; Afrasiabi, Cyrus; Tsueng, Ginger; Juchler, Moritz; Gopal, Nikhil; Stupp, Gregory S; Putman, Timothy E; Ainscough, Benjamin J; Griffith, Obi L; Torkamani, Ali; Whetzel, Patricia L; Mungall, Christopher J; Mooney, Sean D; Su, Andrew I; Wu, Chunlei

    2016-05-06

    Efficient tools for data management and integration are essential for many aspects of high-throughput biology. In particular, annotations of genes and human genetic variants are commonly used but highly fragmented across many resources. Here, we describe MyGene.info and MyVariant.info, high-performance web services for querying gene and variant annotation information. These web services are currently accessed more than three million times permonth. They also demonstrate a generalizable cloud-based model for organizing and querying biological annotation information. MyGene.info and MyVariant.info are provided as high-performance web services, accessible at http://mygene.info and http://myvariant.info . Both are offered free of charge to the research community.

  11. OP17MICRORNA PROFILING USING SMALL RNA-SEQ IN PAEDIATRIC LOW GRADE GLIOMAS

    Science.gov (United States)

    Jeyapalan, Jennie N.; Jones, Tania A.; Tatevossian, Ruth G.; Qaddoumi, Ibrahim; Ellison, David W.; Sheer, Denise

    2014-01-01

    INTRODUCTION: MicroRNAs regulate gene expression by targeting mRNAs for translational repression or degradation at the post-transcriptional level. In paediatric low-grade gliomas a few key genetic mutations have been identified, including BRAF fusions, FGFR1 duplications and MYB rearrangements. Our aim in the current study is to profile aberrant microRNA expression in paediatric low-grade gliomas and determine the role of epigenetic changes in the aetiology and behaviour of these tumours. METHOD: MicroRNA profiling of tumour samples (6 pilocytic, 2 diffuse, 2 pilomyxoid astrocytomas) and normal brain controls (4 adult normal brain samples and a primary glial progenitor cell-line) was performed using small RNA sequencing. Bioinformatic analysis included sequence alignment, analysis of the number of reads (CPM, counts per million) and differential expression. RESULTS: Sequence alignment identified 695 microRNAs, whose expression was compared in tumours v. normal brain. PCA and hierarchical clustering showed separate groups for tumours and normal brain. Computational analysis identified approximately 400 differentially expressed microRNAs in the tumours compared to matched location controls. Our findings will then be validated and integrated with extensive genetic and epigenetic information we have previously obtained for the full tumour cohort. CONCLUSION: We have identified microRNAs that are differentially expressed in paediatric low-grade gliomas. As microRNAs are known to target genes involved in the initiation and progression of cancer, they provide critical information on tumour pathogenesis and are an important class of biomarkers.

  12. MicroRNA expression profile in head and neck cancer: HOX-cluster embedded microRNA-196a and microRNA-10b dysregulation implicated in cell proliferation

    International Nuclear Information System (INIS)

    Severino, Patricia; Mathor, Monica Beatriz; Nunes, Fabio Daumas; Ragoussis, Jiannis; Tajara, Eloiza Helena; Brüggemann, Holger; Andreghetto, Flavia Maziero; Camps, Carme; Klingbeil, Maria de Fatima Garrido; Pereira, Welbert Oliveira de; Soares, Renata Machado; Moyses, Raquel; Wünsch-Filho, Victor

    2013-01-01

    Current evidence implicates aberrant microRNA expression patterns in human malignancies; measurement of microRNA expression may have diagnostic and prognostic applications. Roles for microRNAs in head and neck squamous cell carcinomas (HNSCC) are largely unknown. HNSCC, a smoking-related cancer, is one of the most common malignancies worldwide but reliable diagnostic and prognostic markers have not been discovered so far. Some studies have evaluated the potential use of microRNA as biomarkers with clinical application in HNSCC. MicroRNA expression profile of oral squamous cell carcinoma samples was determined by means of DNA microarrays. We also performed gain-of-function assays for two differentially expressed microRNA using two squamous cell carcinoma cell lines and normal oral keratinocytes. The effect of the over-expression of these molecules was evaluated by means of global gene expression profiling and cell proliferation assessment. Altered microRNA expression was detected for a total of 72 microRNAs. Among these we found well studied molecules, such as the miR-17-92 cluster, comprising potent oncogenic microRNA, and miR-34, recently found to interact with p53. HOX-cluster embedded miR-196a/b and miR-10b were up- and down-regulated, respectively, in tumor samples. Since validated HOX gene targets for these microRNAs are not consistently deregulated in HNSCC, we performed gain-of-function experiments, in an attempt to outline their possible role. Our results suggest that both molecules interfere in cell proliferation through distinct processes, possibly targeting a small set of genes involved in cell cycle progression. Functional data on miRNAs in HNSCC is still scarce. Our data corroborate current literature and brings new insights into the role of microRNAs in HNSCC. We also show that miR-196a and miR-10b, not previously associated with HNSCC, may play an oncogenic role in this disease through the deregulation of cell proliferation. The study of microRNA

  13. MicroRNAs as regulatory elements in psoriasis

    Directory of Open Access Journals (Sweden)

    Liu Yuan

    2016-01-01

    Full Text Available Psoriasis is a chronic, autoimmune, and complex genetic disorder that affects 23% of the European population. The symptoms of Psoriatic skin are inflammation, raised and scaly lesions. microRNA, which is short, nonprotein-coding, regulatory RNAs, plays critical roles in psoriasis. microRNA participates in nearly all biological processes, such as cell differentiation, development and metabolism. Recent researches reveal that multitudinous novel microRNAs have been identified in skin. Some of these substantial novel microRNAs play as a class of posttranscriptional gene regulator in skin disease, such as psoriasis. In order to insight into microRNAs biological functions and verify microRNAs biomarker, we review diverse references about characterization, profiling and subtype of microRNAs. Here we will share our opinions about how and which microRNAs are as regulatory in psoriasis.

  14. The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4).

    Science.gov (United States)

    Huntemann, Marcel; Ivanova, Natalia N; Mavromatis, Konstantinos; Tripp, H James; Paez-Espino, David; Palaniappan, Krishnaveni; Szeto, Ernest; Pillay, Manoj; Chen, I-Min A; Pati, Amrita; Nielsen, Torben; Markowitz, Victor M; Kyrpides, Nikos C

    2015-01-01

    The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.

  15. Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop.

    Science.gov (United States)

    Brister, James Rodney; Bao, Yiming; Kuiken, Carla; Lefkowitz, Elliot J; Le Mercier, Philippe; Leplae, Raphael; Madupu, Ramana; Scheuermann, Richard H; Schobel, Seth; Seto, Donald; Shrivastava, Susmita; Sterk, Peter; Zeng, Qiandong; Klimke, William; Tatusova, Tatiana

    2010-10-01

    Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world's biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.

  16. Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop

    Directory of Open Access Journals (Sweden)

    Qiandong Zeng

    2010-10-01

    Full Text Available Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world’s biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.

  17. Desiderata for ontologies to be used in semantic annotation of biomedical documents.

    Science.gov (United States)

    Bada, Michael; Hunter, Lawrence

    2011-02-01

    A wealth of knowledge valuable to the translational research scientist is contained within the vast biomedical literature, but this knowledge is typically in the form of natural language. Sophisticated natural-language-processing systems are needed to translate text into unambiguous formal representations grounded in high-quality consensus ontologies, and these systems in turn rely on gold-standard corpora of annotated documents for training and testing. To this end, we are constructing the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-text biomedical journal articles that are being manually annotated with the entire sets of terms from select vocabularies, predominantly from the Open Biomedical Ontologies (OBO) library. Our efforts in building this corpus has illuminated infelicities of these ontologies with respect to the semantic annotation of biomedical documents, and we propose desiderata whose implementation could substantially improve their utility in this task; these include the integration of overlapping terms across OBOs, the resolution of OBO-specific ambiguities, the integration of the BFO with the OBOs and the use of mid-level ontologies, the inclusion of noncanonical instances, and the expansion of relations and realizable entities. Copyright © 2010 Elsevier Inc. All rights reserved.

  18. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation

    DEFF Research Database (Denmark)

    Zhang, Han; Yohe, Tanner; Huang, Le

    2018-01-01

    of plant and plant-associated microbial genomes and metagenomes being sequenced, there is an urgent need of automatic tools for genomic data mining of CAZymes. We developed the dbCAN web server in 2012 to provide a public service for automated CAZyme annotation for newly sequenced genomes. Here, dbCAN2...... (http://cys.bios.niu.edu/dbCAN2) is presented as an updated meta server, which integrates three state-of-the-art tools for CAZome (all CAZymes of a genome) annotation: (i) HMMER search against the dbCAN HMM (hidden Markov model) database; (ii) DIAMOND search against the CAZy pre-annotated CAZyme...

  19. MirZ: an integrated microRNA expression atlas and target prediction resource.

    Science.gov (United States)

    Hausser, Jean; Berninger, Philipp; Rodak, Christoph; Jantscher, Yvonne; Wirth, Stefan; Zavolan, Mihaela

    2009-07-01

    MicroRNAs (miRNAs) are short RNAs that act as guides for the degradation and translational repression of protein-coding mRNAs. A large body of work showed that miRNAs are involved in the regulation of a broad range of biological functions, from development to cardiac and immune system function, to metabolism, to cancer. For most of the over 500 miRNAs that are encoded in the human genome the functions still remain to be uncovered. Identifying miRNAs whose expression changes between cell types or between normal and pathological conditions is an important step towards characterizing their function as is the prediction of mRNAs that could be targeted by these miRNAs. To provide the community the possibility of exploring interactively miRNA expression patterns and the candidate targets of miRNAs in an integrated environment, we developed the MirZ web server, which is accessible at www.mirz.unibas.ch. The server provides experimental and computational biologists with statistical analysis and data mining tools operating on up-to-date databases of sequencing-based miRNA expression profiles and of predicted miRNA target sites in species ranging from Caenorhabditis elegans to Homo sapiens.

  20. MicroRNA signature of the human developing pancreas

    Directory of Open Access Journals (Sweden)

    Correa-Medina Mayrin

    2010-09-01

    Full Text Available Abstract Background MicroRNAs are non-coding RNAs that regulate gene expression including differentiation and development by either inhibiting translation or inducing target degradation. The aim of this study is to determine the microRNA expression signature during human pancreatic development and to identify potential microRNA gene targets calculating correlations between the signature microRNAs and their corresponding mRNA targets, predicted by bioinformatics, in genome-wide RNA microarray study. Results The microRNA signature of human fetal pancreatic samples 10-22 weeks of gestational age (wga, was obtained by PCR-based high throughput screening with Taqman Low Density Arrays. This method led to identification of 212 microRNAs. The microRNAs were classified in 3 groups: Group number I contains 4 microRNAs with the increasing profile; II, 35 microRNAs with decreasing profile and III with 173 microRNAs, which remain unchanged. We calculated Pearson correlations between the expression profile of microRNAs and target mRNAs, predicted by TargetScan 5.1 and miRBase altgorithms, using genome-wide mRNA expression data. Group I correlated with the decreasing expression of 142 target mRNAs and Group II with the increasing expression of 876 target mRNAs. Most microRNAs correlate with multiple targets, just as mRNAs are targeted by multiple microRNAs. Among the identified targets are the genes and transcription factors known to play an essential role in pancreatic development. Conclusions We have determined specific groups of microRNAs in human fetal pancreas that change the degree of their expression throughout the development. A negative correlative analysis suggests an intertwined network of microRNAs and mRNAs collaborating with each other. This study provides information leading to potential two-way level of combinatorial control regulating gene expression through microRNAs targeting multiple mRNAs and, conversely, target mRNAs regulated in

  1. Semi-Semantic Annotation: A guideline for the URDU.KON-TB treebank POS annotation

    Directory of Open Access Journals (Sweden)

    Qaiser ABBAS

    2016-12-01

    Full Text Available This work elaborates the semi-semantic part of speech annotation guidelines for the URDU.KON-TB treebank: an annotated corpus. A hierarchical annotation scheme was designed to label the part of speech and then applied on the corpus. This raw corpus was collected from the Urdu Wikipedia and the Jang newspaper and then annotated with the proposed semi-semantic part of speech labels. The corpus contains text of local & international news, social stories, sports, culture, finance, religion, traveling, etc. This exercise finally contributed a part of speech annotation to the URDU.KON-TB treebank. Twenty-two main part of speech categories are divided into subcategories, which conclude the morphological, and semantical information encoded in it. This article reports the annotation guidelines in major; however, it also briefs the development of the URDU.KON-TB treebank, which includes the raw corpus collection, designing & employment of annotation scheme and finally, its statistical evaluation and results. The guidelines presented as follows, will be useful for linguistic community to annotate the sentences not only for the national language Urdu but for the other indigenous languages like Punjab, Sindhi, Pashto, etc., as well.

  2. Expression of MicroRNA-146a and MicroRNA-155 in Placental Villi in Early- and Late-Onset Preeclampsia.

    Science.gov (United States)

    Nizyaeva, N V; Kulikova, G V; Nagovitsyna, M N; Kan, N E; Prozorovskaya, K N; Shchegolev, A I; Sukhikh, G T

    2017-07-01

    We studied the expression of microRNA-146a and microRNA-155 in placental villi from 18 women (26-39 weeks of gestation) of reproductive age with early- or late-onset preeclampsia. The reference group consisted of women with physiological pregnancy and full-term gestation and with preterm birth after caesarian section on gestation week 26-31. MicroRNA-146a and microRNA-155 were detected by in situ hybridization with digoxigenin on paraffin sections. It was found that the expression of microRNA-146a in both syncytiotrophoblast of the intermediate villi and syncytial knots was lower at late-onset preeclampsia than at physiologic pregnancy of full-term period (p=0.037 and p=0.001 respectively). The expression of microRNA-155 in syncytiotrophoblast of intermediate placental villi in early-onset preeclampsia was higher than in group with preterm delivery (p=0.003). However, in syncytiotrophoblast of intermediate villi and in syncytial knots, the expression of microRNA-155 was lower at late-onset preeclampsia in comparison with full-term physiological pregnancy (p=0.005). In addition, the expression of microRNA-146a and microRNA-155 did not increase in the later terms in preeclampsia, while in the reference groups demonstrating gradual increase in the expression of these markers with increasing gestational age. Expression microRNA-146a and microRNA-155 little differed in early- and late-onset preeclampsia. These findings suggest that different variants of preeclampsia are probably characterized by common pathogenetic pathways. Damaged trophoblast cannot maintain of microRNAs synthesis at the required level, which determines the formation of a vicious circle in preeclampsia and further progression of the disease.

  3. Functions of microRNA in response to cocaine stimulation.

    Science.gov (United States)

    Xu, L-F; Wang, J; Lv, F B; Song, Q

    2013-12-04

    MicroRNAs (miRNAs) are a type of non-protein-coding single-stranded RNA, which are typically 20-25 nt in length. miRNAs play important roles in various biological processes, including development, cell proliferation, differentiation, and apoptosis. We aimed to detect the miRNA response to cocaine stimulations and their target genes. Using the miRNA expression data GSE21901 downloaded from the Gene Expression Omnibus database, we screened out the differentially expressed miRNA after short-term (1 h) and longer-term (6 h) cocaine stimulations based on the fold change >1.2. Target genes of differentially expressed miRNAs were retrieved from TargetScan database with the context score -0.3. Functional annotation enrichment analysis was performed for all the target genes with DAVID. A total of 121 differentially expressed miRNAs between the 1-h treatment and the control samples, 58 between the 6-h treatment and the control samples, and 69 between the 1-h and the 6-h treatment samples. Among them, miR-212 results of particular interest, since its expression level was constantly elevated responding to cocaine treatment. After functional and pathway annotations of target genes, we proved that miR-212 was a critical element in cocaine-addiction, because of its involvement in regulating several important cell cycle events. The results may pave the way for further understanding the regulatory mechanisms of cocaine-response in human bodies.

  4. Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator

    Science.gov (United States)

    Seyed, P.; Chastain, K.; McGuinness, D. L.

    2013-12-01

    library of vocabularies to assist the user in locating terms to describe observed entities, their properties, and relationships. The Annotator leverages vocabulary definitions of these concepts to guide the user in describing data in a logically consistent manner. The vocabularies made available through the Annotator are open, as is the Annotator itself. We have taken a step towards making semantic annotation/translation of data more accessible. Our vision for the Annotator is as a tool that can be integrated into a semantic data 'workbench' environment, which would allow semantic annotation of a variety of data formats, using standard vocabularies. These vocabularies involved enable search for similar datasets, and integration with any semantically-enabled applications for analysis and visualization.

  5. BLAST-based structural annotation of protein residues using Protein Data Bank.

    Science.gov (United States)

    Singh, Harinder; Raghava, Gajendra P S

    2016-01-25

    In the era of next-generation sequencing where thousands of genomes have been already sequenced; size of protein databases is growing with exponential rate. Structural annotation of these proteins is one of the biggest challenges for the computational biologist. Although, it is easy to perform BLAST search against Protein Data Bank (PDB) but it is difficult for a biologist to annotate protein residues from BLAST search. A web-server StarPDB has been developed for structural annotation of a protein based on its similarity with known protein structures. It uses standard BLAST software for performing similarity search of a query protein against protein structures in PDB. This server integrates wide range modules for assigning different types of annotation that includes, Secondary-structure, Accessible surface area, Tight-turns, DNA-RNA and Ligand modules. Secondary structure module allows users to predict regular secondary structure states to each residue in a protein. Accessible surface area predict the exposed or buried residues in a protein. Tight-turns module is designed to predict tight turns like beta-turns in a protein. DNA-RNA module developed for predicting DNA and RNA interacting residues in a protein. Similarly, Ligand module of server allows one to predicted ligands, metal and nucleotides ligand interacting residues in a protein. In summary, this manuscript presents a web server for comprehensive annotation of a protein based on similarity search. It integrates number of visualization tools that facilitate users to understand structure and function of protein residues. This web server is available freely for scientific community from URL http://crdd.osdd.net/raghava/starpdb .

  6. BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments.

    Science.gov (United States)

    López-Fernández, H; Reboiro-Jato, M; Glez-Peña, D; Aparicio, F; Gachet, D; Buenaga, M; Fdez-Riverola, F

    2013-07-01

    Automatic term annotation from biomedical documents and external information linking are becoming a necessary prerequisite in modern computer-aided medical learning systems. In this context, this paper presents BioAnnote, a flexible and extensible open-source platform for automatically annotating biomedical resources. Apart from other valuable features, the software platform includes (i) a rich client enabling users to annotate multiple documents in a user friendly environment, (ii) an extensible and embeddable annotation meta-server allowing for the annotation of documents with local or remote vocabularies and (iii) a simple client/server protocol which facilitates the use of our meta-server from any other third-party application. In addition, BioAnnote implements a powerful scripting engine able to perform advanced batch annotations. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  7. Chado controller: advanced annotation management with a community annotation system.

    Science.gov (United States)

    Guignon, Valentin; Droc, Gaëtan; Alaux, Michael; Baurens, Franc-Christophe; Garsmeur, Olivier; Poiron, Claire; Carver, Tim; Rouard, Mathieu; Bocs, Stéphanie

    2012-04-01

    We developed a controller that is compliant with the Chado database schema, GBrowse and genome annotation-editing tools such as Artemis and Apollo. It enables the management of public and private data, monitors manual annotation (with controlled vocabularies, structural and functional annotation controls) and stores versions of annotation for all modified features. The Chado controller uses PostgreSQL and Perl. The Chado Controller package is available for download at http://www.gnpannot.org/content/chado-controller and runs on any Unix-like operating system, and documentation is available at http://www.gnpannot.org/content/chado-controller-doc The system can be tested using the GNPAnnot Sandbox at http://www.gnpannot.org/content/gnpannot-sandbox-form valentin.guignon@cirad.fr; stephanie.sidibe-bocs@cirad.fr Supplementary data are available at Bioinformatics online.

  8. Micro-RNAs

    DEFF Research Database (Denmark)

    Taipaleenmäki, H.; Hokland, L. B.; Chen, Li

    2012-01-01

    Osteoblast differentiation and bone formation (osteogenesis) are regulated by transcriptional and post-transcriptional mechanisms. Recently, a novel class of regulatory factors termed microRNAs has been identified as playing an important role in the regulation of many aspects of osteoblast biology...... including proliferation, differentiation, metabolism and apoptosis. Also, preliminary data from animal disease models suggest that targeting miRNAs in bone can be a novel approach to increase bone mass. This review highlights the current knowledge of microRNA biology and their role in bone formation...

  9. MicroRNA expression characterizes oligometastasis(es).

    Science.gov (United States)

    Lussier, Yves A; Xing, H Rosie; Salama, Joseph K; Khodarev, Nikolai N; Huang, Yong; Zhang, Qingbei; Khan, Sajid A; Yang, Xinan; Hasselle, Michael D; Darga, Thomas E; Malik, Renuka; Fan, Hanli; Perakis, Samantha; Filippo, Matthew; Corbin, Kimberly; Lee, Younghee; Posner, Mitchell C; Chmura, Steven J; Hellman, Samuel; Weichselbaum, Ralph R

    2011-01-01

    Cancer staging and treatment presumes a division into localized or metastatic disease. We proposed an intermediate state defined by ≤ 5 cumulative metastasis(es), termed oligometastases. In contrast to widespread polymetastases, oligometastatic patients may benefit from metastasis-directed local treatments. However, many patients who initially present with oligometastases progress to polymetastases. Predictors of progression could improve patient selection for metastasis-directed therapy. Here, we identified patterns of microRNA expression of tumor samples from oligometastatic patients treated with high-dose radiotherapy. Patients who failed to develop polymetastases are characterized by unique prioritized features of a microRNA classifier that includes the microRNA-200 family. We created an oligometastatic-polymetastatic xenograft model in which the patient-derived microRNAs discriminated between the two metastatic outcomes. MicroRNA-200c enhancement in an oligometastatic cell line resulted in polymetastatic progression. These results demonstrate a biological basis for oligometastases and a potential for using microRNA expression to identify patients most likely to remain oligometastatic after metastasis-directed treatment.

  10. Differentially Expressed microRNAs and Target Genes Associated with Plastic Internode Elongation in Alternanthera philoxeroides in Contrasting Hydrological Habitats

    Directory of Open Access Journals (Sweden)

    Gengyun Li

    2017-12-01

    Full Text Available Phenotypic plasticity is crucial for plants to survive in changing environments. Discovering microRNAs, identifying their targets and further inferring microRNA functions in mediating plastic developmental responses to environmental changes have been a critical strategy for understanding the underlying molecular mechanisms of phenotypic plasticity. In this study, the dynamic expression patterns of microRNAs under contrasting hydrological habitats in the amphibious species Alternanthera philoxeroides were identified by time course expression profiling using high-throughput sequencing technology. A total of 128 known and 18 novel microRNAs were found to be differentially expressed under contrasting hydrological habitats. The microRNA:mRNA pairs potentially associated with plastic internode elongation were identified by integrative analysis of microRNA and mRNA expression profiles, and were validated by qRT-PCR and 5′ RLM-RACE. The results showed that both the universal microRNAs conserved across different plants and the unique microRNAs novelly identified in A. philoxeroides were involved in the responses to varied water regimes. The results also showed that most of the differentially expressed microRNAs were transiently up-/down-regulated at certain time points during the treatments. The fine-scale temporal changes in microRNA expression highlighted the importance of time-series sampling in identifying stress-responsive microRNAs and analyzing their role in stress response/tolerance.

  11. miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases.

    Science.gov (United States)

    Gupta, Samir; Ross, Karen E; Tudor, Catalina O; Wu, Cathy H; Schmidt, Carl J; Vijay-Shanker, K

    2016-04-29

    include sentences with a wide range of microRNA-disease information that may be of interest to biomedical researchers, miRiaD also performed very well with a F-score of 89.4. The informativeness ranking of sentences was evaluated in terms of nDCG (0.977) and correlation metrics (0.678-0.727) when compared to an annotator's ranked list. miRiaD, a high performance system that can capture a wide variety of microRNA-disease related information, extends beyond the scope of existing microRNA-disease resources. It can be incorporated into manual curation pipelines and serve as a resource for biomedical researchers interested in the role of microRNAs in disease. In our ongoing work we are developing an improved miRiaD web interface that will facilitate complex queries about microRNA-disease relationships, such as "In what diseases does microRNA regulation of apoptosis play a role?" or "Is there overlap in the sets of genes targeted by microRNAs in different types of dementia?"."

  12. MicroRNA involvement in glioblastoma pathogenesis

    International Nuclear Information System (INIS)

    Novakova, Jana; Slaby, Ondrej; Vyzula, Rostislav; Michalek, Jaroslav

    2009-01-01

    MicroRNAs are endogenously expressed regulatory noncoding RNAs. Altered expression levels of several microRNAs have been observed in glioblastomas. Functions and direct mRNA targets for these microRNAs have been relatively well studied over the last years. According to these data, it is now evident, that impairment of microRNA regulatory network is one of the key mechanisms in glioblastoma pathogenesis. MicroRNA deregulation is involved in processes such as cell proliferation, apoptosis, cell cycle regulation, invasion, glioma stem cell behavior, and angiogenesis. In this review, we summarize the current knowledge of miRNA functions in glioblastoma with an emphasis on its significance in glioblastoma oncogenic signaling and its potential to serve as a disease biomarker and a novel therapeutic target in oncology.

  13. FALDO: a semantic standard for describing the location of nucleotide and protein feature annotation.

    Science.gov (United States)

    Bolleman, Jerven T; Mungall, Christopher J; Strozzi, Francesco; Baran, Joachim; Dumontier, Michel; Bonnal, Raoul J P; Buels, Robert; Hoehndorf, Robert; Fujisawa, Takatomo; Katayama, Toshiaki; Cock, Peter J A

    2016-06-13

    Nucleotide and protein sequence feature annotations are essential to understand biology on the genomic, transcriptomic, and proteomic level. Using Semantic Web technologies to query biological annotations, there was no standard that described this potentially complex location information as subject-predicate-object triples. We have developed an ontology, the Feature Annotation Location Description Ontology (FALDO), to describe the positions of annotated features on linear and circular sequences. FALDO can be used to describe nucleotide features in sequence records, protein annotations, and glycan binding sites, among other features in coordinate systems of the aforementioned "omics" areas. Using the same data format to represent sequence positions that are independent of file formats allows us to integrate sequence data from multiple sources and data types. The genome browser JBrowse is used to demonstrate accessing multiple SPARQL endpoints to display genomic feature annotations, as well as protein annotations from UniProt mapped to genomic locations. Our ontology allows users to uniformly describe - and potentially merge - sequence annotations from multiple sources. Data sources using FALDO can prospectively be retrieved using federalised SPARQL queries against public SPARQL endpoints and/or local private triple stores.

  14. ChIPBase: a database for decoding the transcriptional regulation of long non-coding RNA and microRNA genes from ChIP-Seq data.

    Science.gov (United States)

    Yang, Jian-Hua; Li, Jun-Hao; Jiang, Shan; Zhou, Hui; Qu, Liang-Hu

    2013-01-01

    Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) represent two classes of important non-coding RNAs in eukaryotes. Although these non-coding RNAs have been implicated in organismal development and in various human diseases, surprisingly little is known about their transcriptional regulation. Recent advances in chromatin immunoprecipitation with next-generation DNA sequencing (ChIP-Seq) have provided methods of detecting transcription factor binding sites (TFBSs) with unprecedented sensitivity. In this study, we describe ChIPBase (http://deepbase.sysu.edu.cn/chipbase/), a novel database that we have developed to facilitate the comprehensive annotation and discovery of transcription factor binding maps and transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. The current release of ChIPBase includes high-throughput sequencing data that were generated by 543 ChIP-Seq experiments in diverse tissues and cell lines from six organisms. By analysing millions of TFBSs, we identified tens of thousands of TF-lncRNA and TF-miRNA regulatory relationships. Furthermore, two web-based servers were developed to annotate and discover transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. In addition, we developed two genome browsers, deepView and genomeView, to provide integrated views of multidimensional data. Moreover, our web implementation supports diverse query types and the exploration of TFs, lncRNAs, miRNAs, gene ontologies and pathways.

  15. MicroRNA expression characterizes oligometastasis(es.

    Directory of Open Access Journals (Sweden)

    Yves A Lussier

    Full Text Available Cancer staging and treatment presumes a division into localized or metastatic disease. We proposed an intermediate state defined by ≤ 5 cumulative metastasis(es, termed oligometastases. In contrast to widespread polymetastases, oligometastatic patients may benefit from metastasis-directed local treatments. However, many patients who initially present with oligometastases progress to polymetastases. Predictors of progression could improve patient selection for metastasis-directed therapy.Here, we identified patterns of microRNA expression of tumor samples from oligometastatic patients treated with high-dose radiotherapy.Patients who failed to develop polymetastases are characterized by unique prioritized features of a microRNA classifier that includes the microRNA-200 family. We created an oligometastatic-polymetastatic xenograft model in which the patient-derived microRNAs discriminated between the two metastatic outcomes. MicroRNA-200c enhancement in an oligometastatic cell line resulted in polymetastatic progression.These results demonstrate a biological basis for oligometastases and a potential for using microRNA expression to identify patients most likely to remain oligometastatic after metastasis-directed treatment.

  16. MicroScope: a platform for microbial genome annotation and comparative genomics.

    Science.gov (United States)

    Vallenet, D; Engelen, S; Mornico, D; Cruveiller, S; Fleury, L; Lajus, A; Rouy, Z; Roche, D; Salvignol, G; Scarpelli, C; Médigue, C

    2009-01-01

    The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope's rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of

  17. EST-PAC a web package for EST annotation and protein sequence prediction

    Directory of Open Access Journals (Sweden)

    Strahm Yvan

    2006-10-01

    Full Text Available Abstract With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1 searching local or remote biological databases for sequence similarities using Blast services, 2 predicting protein coding sequence from EST data and, 3 annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics.

  18. MicroRNAs in Prostate Cancer

    Science.gov (United States)

    2008-11-01

    lymphoma. Genes Chromosom. Cancer 39:167–69 131. O’Connell RM, Taganov KD, Boldin MP, Cheng G, Baltimore D. 2007. MicroRNA-155 is induced during the...carcinoma. J. Virol. 81:1033–36 155. Xi Y, Nakajima G, Gavin E, Morris CG, Kudo K, et al. 2007. Systematic analysis of microRNA expression of RNA extracted ...diversity. miRNAs were extracted from the unique sequences by searching against miRNA database (miRbase release 10.0; http://microrna.sanger.ac.uk

  19. An integrated one-chip-sensor system for microRNA quantitative analysis based on digital droplet polymerase chain reaction

    Science.gov (United States)

    Tsukuda, Masahiko; Wiederkehr, Rodrigo Sergio; Cai, Qing; Majeed, Bivragh; Fiorini, Paolo; Stakenborg, Tim; Matsuno, Toshinobu

    2016-04-01

    A silicon microfluidic chip was developed for microRNA (miRNA) quantitative analysis. It performs sequentially reverse transcription and polymerase chain reaction in a digital droplet format. Individual processes take place on different cavities, and reagent and sample mixing is carried out on a chip, prior to entering each compartment. The droplets are generated on a T-junction channel before the polymerase chain reaction step. Also, a miniaturized fluorescence detector was developed, based on an optical pick-up head of digital versatile disc (DVD) and a micro-photomultiplier tube. The chip integrated in the detection system was tested using synthetic miRNA with known concentrations, ranging from 300 to 3,000 templates/µL. Results proved the functionality of the system.

  20. MicroRNAs in right ventricular remodelling.

    Science.gov (United States)

    Batkai, Sandor; Bär, Christian; Thum, Thomas

    2017-10-01

    Right ventricular (RV) remodelling is a lesser understood process of the chronic, progressive transformation of the RV structure leading to reduced functional capacity and subsequent failure. Besides conditions concerning whole hearts, some pathology selectively affects the RV, leading to a distinct RV-specific clinical phenotype. MicroRNAs have been identified as key regulators of biological processes that drive the progression of chronic diseases. The role of microRNAs in diseases affecting the left ventricle has been studied for many years, however there is still limited information on microRNAs specific to diseases in the right ventricle. Here, we review recently described details on the expression, regulation, and function of microRNAs in the pathological remodelling of the right heart. Recently identified strategies using microRNAs as pharmacological targets or biomarkers will be highlighted. Increasing knowledge of pathogenic microRNAs will finally help improve our understanding of underlying distinct mechanisms and help utilize novel targets or biomarkers to develop treatments for patients suffering from right heart diseases. Published on behalf of the European Society of Cardiology. All rights reserved. © The Author 2017. For permissions, please email: journals.permissions@oup.com.

  1. Chronological changes in microRNA expression in the developing human brain.

    Directory of Open Access Journals (Sweden)

    Michael P Moreau

    Full Text Available MicroRNAs (miRNAs are endogenously expressed noncoding RNA molecules that are believed to regulate multiple neurobiological processes. Expression studies have revealed distinct temporal expression patterns in the developing rodent and porcine brain, but comprehensive profiling in the developing human brain has not been previously reported.We performed microarray and TaqMan-based expression analysis of all annotated mature miRNAs (miRBase 10.0 as well as 373 novel, predicted miRNAs. Expression levels were measured in 48 post-mortem brain tissue samples, representing gestational ages 14-24 weeks, as well as early postnatal and adult time points.Expression levels of 312 miRNAs changed significantly between at least two of the broad age categories, defined as fetal, young, and adult.We have constructed a miRNA expression atlas of the developing human brain, and we propose a classification scheme to guide future studies of neurobiological function.

  2. MicroRNA function in Drosophila melanogaster.

    Science.gov (United States)

    Carthew, Richard W; Agbu, Pamela; Giri, Ritika

    2017-05-01

    Over the last decade, microRNAs have emerged as critical regulators in the expression and function of animal genomes. This review article discusses the relationship between microRNA-mediated regulation and the biology of the fruit fly Drosophila melanogaster. We focus on the roles that microRNAs play in tissue growth, germ cell development, hormone action, and the development and activity of the central nervous system. We also discuss the ways in which microRNAs affect robustness. Many gene regulatory networks are robust; they are relatively insensitive to the precise values of reaction constants and concentrations of molecules acting within the networks. MicroRNAs involved in robustness appear to be nonessential under uniform conditions used in conventional laboratory experiments. However, the robust functions of microRNAs can be revealed when environmental or genetic variation otherwise has an impact on developmental outcomes. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud.

    Science.gov (United States)

    Duvick, Jon; Standage, Daniel S; Merchant, Nirav; Brendel, Volker P

    2016-04-01

    Genome-wide annotation of gene structure requires the integration of numerous computational steps. Currently, annotation is arguably best accomplished through collaboration of bioinformatics and domain experts, with broad community involvement. However, such a collaborative approach is not scalable at today's pace of sequence generation. To address this problem, we developed the xGDBvm software, which uses an intuitive graphical user interface to access a number of common genome analysis and gene structure tools, preconfigured in a self-contained virtual machine image. Once their virtual machine instance is deployed through iPlant's Atmosphere cloud services, users access the xGDBvm workflow via a unified Web interface to manage inputs, set program parameters, configure links to high-performance computing (HPC) resources, view and manage output, apply analysis and editing tools, or access contextual help. The xGDBvm workflow will mask the genome, compute spliced alignments from transcript and/or protein inputs (locally or on a remote HPC cluster), predict gene structures and gene structure quality, and display output in a public or private genome browser complete with accessory tools. Problematic gene predictions are flagged and can be reannotated using the integrated yrGATE annotation tool. xGDBvm can also be configured to append or replace existing data or load precomputed data. Multiple genomes can be annotated and displayed, and outputs can be archived for sharing or backup. xGDBvm can be adapted to a variety of use cases including de novo genome annotation, reannotation, comparison of different annotations, and training or teaching. © 2016 American Society of Plant Biologists. All rights reserved.

  4. MixtureTree annotator: a program for automatic colorization and visual annotation of MixtureTree.

    Directory of Open Access Journals (Sweden)

    Shu-Chuan Chen

    Full Text Available The MixtureTree Annotator, written in JAVA, allows the user to automatically color any phylogenetic tree in Newick format generated from any phylogeny reconstruction program and output the Nexus file. By providing the ability to automatically color the tree by sequence name, the MixtureTree Annotator provides a unique advantage over any other programs which perform a similar function. In addition, the MixtureTree Annotator is the only package that can efficiently annotate the output produced by MixtureTree with mutation information and coalescent time information. In order to visualize the resulting output file, a modified version of FigTree is used. Certain popular methods, which lack good built-in visualization tools, for example, MEGA, Mesquite, PHY-FI, TreeView, treeGraph and Geneious, may give results with human errors due to either manually adding colors to each node or with other limitations, for example only using color based on a number, such as branch length, or by taxonomy. In addition to allowing the user to automatically color any given Newick tree by sequence name, the MixtureTree Annotator is the only method that allows the user to automatically annotate the resulting tree created by the MixtureTree program. The MixtureTree Annotator is fast and easy-to-use, while still allowing the user full control over the coloring and annotating process.

  5. microRNAs in mycobacterial disease: friend or foe?

    Directory of Open Access Journals (Sweden)

    Manali D Mehta

    2014-07-01

    Full Text Available As the role of microRNA in all aspects of biology continues to be unraveled, the interplay between microRNAs and human disease is becoming clearer. It should come of no surprise that microRNAs play a major part in the outcome of infectious diseases, since early work has implicated microRNAs as regulators of the immune response. Here, we provide a review on how microRNAs influence the course of mycobacterial infections, which cause two of humanity’s most ancient infectious diseases: tuberculosis and leprosy. Evidence derived from profiling and functional experiments suggests that regulation of specific microRNAs during infection can either enhance the immune response or facilitate pathogen immune evasion. Now, it remains to be seen if the manipulation of host cell microRNA profiles can be an opportunity for therapeutic intervention for these difficult-to-treat diseases.

  6. VISPA2: a scalable pipeline for high-throughput identification and annotation of vector integration sites.

    Science.gov (United States)

    Spinozzi, Giulio; Calabria, Andrea; Brasca, Stefano; Beretta, Stefano; Merelli, Ivan; Milanesi, Luciano; Montini, Eugenio

    2017-11-25

    Bioinformatics tools designed to identify lentiviral or retroviral vector insertion sites in the genome of host cells are used to address the safety and long-term efficacy of hematopoietic stem cell gene therapy applications and to study the clonal dynamics of hematopoietic reconstitution. The increasing number of gene therapy clinical trials combined with the increasing amount of Next Generation Sequencing data, aimed at identifying integration sites, require both highly accurate and efficient computational software able to correctly process "big data" in a reasonable computational time. Here we present VISPA2 (Vector Integration Site Parallel Analysis, version 2), the latest optimized computational pipeline for integration site identification and analysis with the following features: (1) the sequence analysis for the integration site processing is fully compliant with paired-end reads and includes a sequence quality filter before and after the alignment on the target genome; (2) an heuristic algorithm to reduce false positive integration sites at nucleotide level to reduce the impact of Polymerase Chain Reaction or trimming/alignment artifacts; (3) a classification and annotation module for integration sites; (4) a user friendly web interface as researcher front-end to perform integration site analyses without computational skills; (5) the time speedup of all steps through parallelization (Hadoop free). We tested VISPA2 performances using simulated and real datasets of lentiviral vector integration sites, previously obtained from patients enrolled in a hematopoietic stem cell gene therapy clinical trial and compared the results with other preexisting tools for integration site analysis. On the computational side, VISPA2 showed a > 6-fold speedup and improved precision and recall metrics (1 and 0.97 respectively) compared to previously developed computational pipelines. These performances indicate that VISPA2 is a fast, reliable and user-friendly tool for

  7. Cross disease analysis of co-functional microRNA pairs on a reconstructed network of disease-gene-microRNA tripartite.

    Science.gov (United States)

    Peng, Hui; Lan, Chaowang; Zheng, Yi; Hutvagner, Gyorgy; Tao, Dacheng; Li, Jinyan

    2017-03-24

    MicroRNAs always function cooperatively in their regulation of gene expression. Dysfunctions of these co-functional microRNAs can play significant roles in disease development. We are interested in those multi-disease associated co-functional microRNAs that regulate their common dysfunctional target genes cooperatively in the development of multiple diseases. The research is potentially useful for human disease studies at the transcriptional level and for the study of multi-purpose microRNA therapeutics. We designed a computational method to detect multi-disease associated co-functional microRNA pairs and conducted cross disease analysis on a reconstructed disease-gene-microRNA (DGR) tripartite network. The construction of the DGR tripartite network is by the integration of newly predicted disease-microRNA associations with those relationships of diseases, microRNAs and genes maintained by existing databases. The prediction method uses a set of reliable negative samples of disease-microRNA association and a pre-computed kernel matrix instead of kernel functions. From this reconstructed DGR tripartite network, multi-disease associated co-functional microRNA pairs are detected together with their common dysfunctional target genes and ranked by a novel scoring method. We also conducted proof-of-concept case studies on cancer-related co-functional microRNA pairs as well as on non-cancer disease-related microRNA pairs. With the prioritization of the co-functional microRNAs that relate to a series of diseases, we found that the co-function phenomenon is not unusual. We also confirmed that the regulation of the microRNAs for the development of cancers is more complex and have more unique properties than those of non-cancer diseases.

  8. Mason: a JavaScript web site widget for visualizing and comparing annotated features in nucleotide or protein sequences.

    Science.gov (United States)

    Jaschob, Daniel; Davis, Trisha N; Riffle, Michael

    2015-03-07

    Sequence feature annotations (e.g., protein domain boundaries, binding sites, and secondary structure predictions) are an essential part of biological research. Annotations are widely used by scientists during research and experimental design, and are frequently the result of biological studies. A generalized and simple means of disseminating and visualizing these data via the web would be of value to the research community. Mason is a web site widget designed to visualize and compare annotated features of one or more nucleotide or protein sequence. Annotated features may be of virtually any type, ranging from annotating transcription binding sites or exons and introns in DNA to secondary structure or domain boundaries in proteins. Mason is simple to use and easy to integrate into web sites. Mason has a highly dynamic and configurable interface supporting multiple sets of annotations per sequence, overlapping regions, customization of interface and user-driven events (e.g., clicks and text to appear for tooltips). It is written purely in JavaScript and SVG, requiring no 3(rd) party plugins or browser customization. Mason is a solution for dissemination of sequence annotation data on the web. It is highly flexible, customizable, simple to use, and is designed to be easily integrated into web sites. Mason is open source and freely available at https://github.com/yeastrc/mason.

  9. Evaluating Hierarchical Structure in Music Annotations.

    Science.gov (United States)

    McFee, Brian; Nieto, Oriol; Farbood, Morwaread M; Bello, Juan Pablo

    2017-01-01

    Music exhibits structure at multiple scales, ranging from motifs to large-scale functional components. When inferring the structure of a piece, different listeners may attend to different temporal scales, which can result in disagreements when they describe the same piece. In the field of music informatics research (MIR), it is common to use corpora annotated with structural boundaries at different levels. By quantifying disagreements between multiple annotators, previous research has yielded several insights relevant to the study of music cognition. First, annotators tend to agree when structural boundaries are ambiguous. Second, this ambiguity seems to depend on musical features, time scale, and genre. Furthermore, it is possible to tune current annotation evaluation metrics to better align with these perceptual differences. However, previous work has not directly analyzed the effects of hierarchical structure because the existing methods for comparing structural annotations are designed for "flat" descriptions, and do not readily generalize to hierarchical annotations. In this paper, we extend and generalize previous work on the evaluation of hierarchical descriptions of musical structure. We derive an evaluation metric which can compare hierarchical annotations holistically across multiple levels. sing this metric, we investigate inter-annotator agreement on the multilevel annotations of two different music corpora, investigate the influence of acoustic properties on hierarchical annotations, and evaluate existing hierarchical segmentation algorithms against the distribution of inter-annotator agreement.

  10. Evaluating Hierarchical Structure in Music Annotations

    Directory of Open Access Journals (Sweden)

    Brian McFee

    2017-08-01

    Full Text Available Music exhibits structure at multiple scales, ranging from motifs to large-scale functional components. When inferring the structure of a piece, different listeners may attend to different temporal scales, which can result in disagreements when they describe the same piece. In the field of music informatics research (MIR, it is common to use corpora annotated with structural boundaries at different levels. By quantifying disagreements between multiple annotators, previous research has yielded several insights relevant to the study of music cognition. First, annotators tend to agree when structural boundaries are ambiguous. Second, this ambiguity seems to depend on musical features, time scale, and genre. Furthermore, it is possible to tune current annotation evaluation metrics to better align with these perceptual differences. However, previous work has not directly analyzed the effects of hierarchical structure because the existing methods for comparing structural annotations are designed for “flat” descriptions, and do not readily generalize to hierarchical annotations. In this paper, we extend and generalize previous work on the evaluation of hierarchical descriptions of musical structure. We derive an evaluation metric which can compare hierarchical annotations holistically across multiple levels. sing this metric, we investigate inter-annotator agreement on the multilevel annotations of two different music corpora, investigate the influence of acoustic properties on hierarchical annotations, and evaluate existing hierarchical segmentation algorithms against the distribution of inter-annotator agreement.

  11. Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees.

    Science.gov (United States)

    Wang, Qinghua; Arighi, Cecilia N; King, Benjamin L; Polson, Shawn W; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F; Page, Shallee T; Rendino, Marc Farnum; Thomas, William Kelley; Udwary, Daniel W; Wu, Cathy H

    2012-01-01

    Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome.

  12. Community annotation and bioinformatics workforce development in concert—Little Skate Genome Annotation Workshops and Jamborees

    Science.gov (United States)

    Wang, Qinghua; Arighi, Cecilia N.; King, Benjamin L.; Polson, Shawn W.; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F.; Page, Shallee T.; Farnum Rendino, Marc; Thomas, William Kelley; Udwary, Daniel W.; Wu, Cathy H.

    2012-01-01

    Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome. PMID:22434832

  13. Reasoning with Annotations of Texts

    OpenAIRE

    Ma , Yue; Lévy , François; Ghimire , Sudeep

    2011-01-01

    International audience; Linguistic and semantic annotations are important features for text-based applications. However, achieving and maintaining a good quality of a set of annotations is known to be a complex task. Many ad hoc approaches have been developed to produce various types of annotations, while comparing those annotations to improve their quality is still rare. In this paper, we propose a framework in which both linguistic and domain information can cooperate to reason with annotat...

  14. Enhanced annotations and features for comparing thousands of Pseudomonas genomes in the Pseudomonas genome database.

    Science.gov (United States)

    Winsor, Geoffrey L; Griffiths, Emma J; Lo, Raymond; Dhillon, Bhavjinder K; Shay, Julie A; Brinkman, Fiona S L

    2016-01-04

    The Pseudomonas Genome Database (http://www.pseudomonas.com) is well known for the application of community-based annotation approaches for producing a high-quality Pseudomonas aeruginosa PAO1 genome annotation, and facilitating whole-genome comparative analyses with other Pseudomonas strains. To aid analysis of potentially thousands of complete and draft genome assemblies, this database and analysis platform was upgraded to integrate curated genome annotations and isolate metadata with enhanced tools for larger scale comparative analysis and visualization. Manually curated gene annotations are supplemented with improved computational analyses that help identify putative drug targets and vaccine candidates or assist with evolutionary studies by identifying orthologs, pathogen-associated genes and genomic islands. The database schema has been updated to integrate isolate metadata that will facilitate more powerful analysis of genomes across datasets in the future. We continue to place an emphasis on providing high-quality updates to gene annotations through regular review of the scientific literature and using community-based approaches including a major new Pseudomonas community initiative for the assignment of high-quality gene ontology terms to genes. As we further expand from thousands of genomes, we plan to provide enhancements that will aid data visualization and analysis arising from whole-genome comparative studies including more pan-genome and population-based approaches. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. MicroRNAs associated with exercise and diet: a systematic review.

    Science.gov (United States)

    Flowers, Elena; Won, Gloria Y; Fukuoka, Yoshimi

    2015-01-01

    MicroRNAs are posttranscriptional regulators of gene expression. MicroRNAs reflect individual biologic adaptation to exposures in the environment. As such, measurement of circulating microRNAs presents an opportunity to evaluate biologic changes associated with behavioral interventions (i.e., exercise, diet) for weight loss. The aim of this study was to perform a systematic review of the literature to summarize what is known about circulating microRNAs associated with exercise, diet, and weight loss. We performed a systematic review of three scientific databases. We included studies reporting on circulating microRNAs associated with exercise, diet, and weight loss in humans. Of 1,219 studies identified in our comprehensive database search, 14 were selected for inclusion. Twelve reported on microRNAs associated with exercise, and two reported on microRNAs associated with diet and weight loss. The majority of studies used a quasiexperimental, cross-sectional design. There were numerous differences in the type and intensity of exercise and dietary interventions, the biologic source of microRNAs, and the methodological approaches used quantitate microRNAs. Data from several studies support an association between circulating microRNAs and exercise. The evidence for an association between circulating microRNAs and diet is weaker because of a small number of studies. Additional research is needed to validate previous observations using methodologically rigorous approaches to microRNA quantitation to determine the specific circulating microRNA signatures associated with behavioral approaches to weight loss. Future directions include longitudinal studies to determine if circulating microRNAs are predictive of response to behavioral interventions. Copyright © 2015 the American Physiological Society.

  16. xGDBvm: A Web GUI-Driven Workflow for Annotating Eukaryotic Genomes in the Cloud[OPEN

    Science.gov (United States)

    Merchant, Nirav

    2016-01-01

    Genome-wide annotation of gene structure requires the integration of numerous computational steps. Currently, annotation is arguably best accomplished through collaboration of bioinformatics and domain experts, with broad community involvement. However, such a collaborative approach is not scalable at today’s pace of sequence generation. To address this problem, we developed the xGDBvm software, which uses an intuitive graphical user interface to access a number of common genome analysis and gene structure tools, preconfigured in a self-contained virtual machine image. Once their virtual machine instance is deployed through iPlant’s Atmosphere cloud services, users access the xGDBvm workflow via a unified Web interface to manage inputs, set program parameters, configure links to high-performance computing (HPC) resources, view and manage output, apply analysis and editing tools, or access contextual help. The xGDBvm workflow will mask the genome, compute spliced alignments from transcript and/or protein inputs (locally or on a remote HPC cluster), predict gene structures and gene structure quality, and display output in a public or private genome browser complete with accessory tools. Problematic gene predictions are flagged and can be reannotated using the integrated yrGATE annotation tool. xGDBvm can also be configured to append or replace existing data or load precomputed data. Multiple genomes can be annotated and displayed, and outputs can be archived for sharing or backup. xGDBvm can be adapted to a variety of use cases including de novo genome annotation, reannotation, comparison of different annotations, and training or teaching. PMID:27020957

  17. TAM 2.0: tool for MicroRNA set analysis.

    Science.gov (United States)

    Li, Jianwei; Han, Xiaofen; Wan, Yanping; Zhang, Shan; Zhao, Yingshu; Fan, Rui; Cui, Qinghua; Zhou, Yuan

    2018-06-06

    With the rapid accumulation of high-throughput microRNA (miRNA) expression profile, the up-to-date resource for analyzing the functional and disease associations of miRNAs is increasingly demanded. We here describe the updated server TAM 2.0 for miRNA set enrichment analysis. Through manual curation of over 9000 papers, a more than two-fold growth of reference miRNA sets has been achieved in comparison with previous TAM, which covers 9945 and 1584 newly collected miRNA-disease and miRNA-function associations, respectively. Moreover, TAM 2.0 allows users not only to test the functional and disease annotations of miRNAs by overrepresentation analysis, but also to compare the input de-regulated miRNAs with those de-regulated in other disease conditions via correlation analysis. Finally, the functions for miRNA set query and result visualization are also enabled in the TAM 2.0 server to facilitate the community. The TAM 2.0 web server is freely accessible at http://www.scse.hebut.edu.cn/tam/ or http://www.lirmed.com/tam2/.

  18. Regulation of Corticosteroidogenic Genes by MicroRNAs

    Directory of Open Access Journals (Sweden)

    Stacy Robertson

    2017-01-01

    Full Text Available The loss of normal regulation of corticosteroid secretion is important in the development of cardiovascular disease. We previously showed that microRNAs regulate the terminal stages of corticosteroid biosynthesis. Here, we assess microRNA regulation across the whole corticosteroid pathway. Knockdown of microRNA using Dicer1 siRNA in H295R adrenocortical cells increased levels of CYP11A1, CYP21A1, and CYP17A1 mRNA and the secretion of cortisol, corticosterone, 11-deoxycorticosterone, 18-hydroxycorticosterone, and aldosterone. Bioinformatic analysis of genes involved in corticosteroid biosynthesis or metabolism identified many putative microRNA-binding sites, and some were selected for further study. Manipulation of individual microRNA levels demonstrated a direct effect of miR-125a-5p and miR-125b-5p on CYP11B2 and of miR-320a-3p levels on CYP11A1 and CYP17A1 mRNA. Finally, comparison of microRNA expression profiles from human aldosterone-producing adenoma and normal adrenal tissue showed levels of various microRNAs, including miR-125a-5p to be significantly different. This study demonstrates that corticosteroidogenesis is regulated at multiple points by several microRNAs and that certain of these microRNAs are differentially expressed in tumorous adrenal tissue, which may contribute to dysregulation of corticosteroid secretion. These findings provide new insights into the regulation of corticosteroid production and have implications for understanding the pathology of disease states where abnormal hormone secretion is a feature.

  19. MicroRNAs in sensorineural diseases of the ear

    Directory of Open Access Journals (Sweden)

    Kathy eUshakov

    2013-12-01

    Full Text Available Non-coding microRNAs have a fundamental role in gene regulation and expression in almost every multicellular organism. Only discovered in the last decade, microRNAs are already known to play a leading role in many aspects of disease. In the vertebrate inner ear, microRNAs are essential for controlling development and survival of hair cells. Moreover, dysregulation of microRNAs has been implicated in sensorineural hearing impairment, as well as in other ear diseases such as cholesteatomas, vestibular schwannomas and otitis media. Due to the inaccessibility of the ear in humans, animal models have provided the optimal tools to study microRNA expression and function, in particular mice and zebrafish. A major focus of current research has been to discover the targets of the microRNAs expressed in the inner ear, in order to determine the regulatory pathways of the auditory and vestibular systems. The potential for microRNA manipulation in development of therapeutic tools for hearing impairment is as yet unexplored, paving the way for future work in the field.

  20. A Resource of Quantitative Functional Annotation for Homo sapiens Genes.

    Science.gov (United States)

    Taşan, Murat; Drabkin, Harold J; Beaver, John E; Chua, Hon Nian; Dunham, Julie; Tian, Weidong; Blake, Judith A; Roth, Frederick P

    2012-02-01

    The body of human genomic and proteomic evidence continues to grow at ever-increasing rates, while annotation efforts struggle to keep pace. A surprisingly small fraction of human genes have clear, documented associations with specific functions, and new functions continue to be found for characterized genes. Here we assembled an integrated collection of diverse genomic and proteomic data for 21,341 human genes and make quantitative associations of each to 4333 Gene Ontology terms. We combined guilt-by-profiling and guilt-by-association approaches to exploit features unique to the data types. Performance was evaluated by cross-validation, prospective validation, and by manual evaluation with the biological literature. Functional-linkage networks were also constructed, and their utility was demonstrated by identifying candidate genes related to a glioma FLN using a seed network from genome-wide association studies. Our annotations are presented-alongside existing validated annotations-in a publicly accessible and searchable web interface.

  1. MicroRNAs, epigenetics and disease

    DEFF Research Database (Denmark)

    Silahtaroglu, Asli; Stenvang, Jan

    2010-01-01

    Epigenetics is defined as the heritable chances that affect gene expression without changing the DNA sequence. Epigenetic regulation of gene expression can be through different mechanisms such as DNA methylation, histone modifications and nucleosome positioning. MicroRNAs are short RNA molecules...... which do not code for a protein but have a role in post-transcriptional silencing of multiple target genes by binding to their 3' UTRs (untranslated regions). Both epigenetic mechanisms, such as DNA methylation and histone modifications, and the microRNAs are crucial for normal differentiation...... diseases. In the present chapter we will mainly focus on microRNAs and methylation and their implications in human disease, mainly in cancer....

  2. Semantic annotation of consumer health questions.

    Science.gov (United States)

    Kilicoglu, Halil; Ben Abacha, Asma; Mrabet, Yassine; Shooshan, Sonya E; Rodriguez, Laritza; Masterton, Kate; Demner-Fushman, Dina

    2018-02-06

    Consumers increasingly use online resources for their health information needs. While current search engines can address these needs to some extent, they generally do not take into account that most health information needs are complex and can only fully be expressed in natural language. Consumer health question answering (QA) systems aim to fill this gap. A major challenge in developing consumer health QA systems is extracting relevant semantic content from the natural language questions (question understanding). To develop effective question understanding tools, question corpora semantically annotated for relevant question elements are needed. In this paper, we present a two-part consumer health question corpus annotated with several semantic categories: named entities, question triggers/types, question frames, and question topic. The first part (CHQA-email) consists of relatively long email requests received by the U.S. National Library of Medicine (NLM) customer service, while the second part (CHQA-web) consists of shorter questions posed to MedlinePlus search engine as queries. Each question has been annotated by two annotators. The annotation methodology is largely the same between the two parts of the corpus; however, we also explain and justify the differences between them. Additionally, we provide information about corpus characteristics, inter-annotator agreement, and our attempts to measure annotation confidence in the absence of adjudication of annotations. The resulting corpus consists of 2614 questions (CHQA-email: 1740, CHQA-web: 874). Problems are the most frequent named entities, while treatment and general information questions are the most common question types. Inter-annotator agreement was generally modest: question types and topics yielded highest agreement, while the agreement for more complex frame annotations was lower. Agreement in CHQA-web was consistently higher than that in CHQA-email. Pairwise inter-annotator agreement proved most

  3. Predicting word sense annotation agreement

    DEFF Research Database (Denmark)

    Martinez Alonso, Hector; Johannsen, Anders Trærup; Lopez de Lacalle, Oier

    2015-01-01

    High agreement is a common objective when annotating data for word senses. However, a number of factors make perfect agreement impossible, e.g. the limitations of the sense inventories, the difficulty of the examples or the interpretation preferences of the annotations. Estimating potential...... agreement is thus a relevant task to supplement the evaluation of sense annotations. In this article we propose two methods to predict agreement on word-annotation instances. We experiment with a continuous representation and a three-way discretization of observed agreement. In spite of the difficulty...

  4. Alignment-Annotator web server: rendering and annotating sequence alignments.

    Science.gov (United States)

    Gille, Christoph; Fähling, Michael; Weyand, Birgit; Wieland, Thomas; Gille, Andreas

    2014-07-01

    Alignment-Annotator is a novel web service designed to generate interactive views of annotated nucleotide and amino acid sequence alignments (i) de novo and (ii) embedded in other software. All computations are performed at server side. Interactivity is implemented in HTML5, a language native to web browsers. The alignment is initially displayed using default settings and can be modified with the graphical user interfaces. For example, individual sequences can be reordered or deleted using drag and drop, amino acid color code schemes can be applied and annotations can be added. Annotations can be made manually or imported (BioDAS servers, the UniProt, the Catalytic Site Atlas and the PDB). Some edits take immediate effect while others require server interaction and may take a few seconds to execute. The final alignment document can be downloaded as a zip-archive containing the HTML files. Because of the use of HTML the resulting interactive alignment can be viewed on any platform including Windows, Mac OS X, Linux, Android and iOS in any standard web browser. Importantly, no plugins nor Java are required and therefore Alignment-Anotator represents the first interactive browser-based alignment visualization. http://www.bioinformatics.org/strap/aa/ and http://strap.charite.de/aa/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Annotating gene sets by mining large literature collections with protein networks.

    Science.gov (United States)

    Wang, Sheng; Ma, Jianzhu; Yu, Michael Ku; Zheng, Fan; Huang, Edward W; Han, Jiawei; Peng, Jian; Ideker, Trey

    2018-01-01

    Analysis of patient genomes and transcriptomes routinely recognizes new gene sets associated with human disease. Here we present an integrative natural language processing system which infers common functions for a gene set through automatic mining of the scientific literature with biological networks. This system links genes with associated literature phrases and combines these links with protein interactions in a single heterogeneous network. Multiscale functional annotations are inferred based on network distances between phrases and genes and then visualized as an ontology of biological concepts. To evaluate this system, we predict functions for gene sets representing known pathways and find that our approach achieves substantial improvement over the conventional text-mining baseline method. Moreover, our system discovers novel annotations for gene sets or pathways without previously known functions. Two case studies demonstrate how the system is used in discovery of new cancer-related pathways with ontological annotations.

  6. Profile of cerebrospinal microRNAs in fibromyalgia.

    Directory of Open Access Journals (Sweden)

    Jan L Bjersing

    Full Text Available Fibromyalgia (FM is characterized by chronic pain and reduced pain threshold. The pathophysiology involves disturbed neuroendocrine function, including impaired function of the growth hormone/insulin-like growth factor-1 axis. Recently, microRNAs have been shown to be important regulatory factors in a number of diseases. The aim of this study was to try to identify cerebrospinal microRNAs with expression specific for FM and to determine their correlation to pain and fatigue.The genome-wide profile of microRNAs in cerebrospinal fluid was assessed in ten women with FM and eight healthy controls using real-time quantitative PCR. Pain thresholds were examined by algometry. Levels of pain (FIQ pain were rated on a 0-100 mm scale (fibromyalgia impact questionnaire, FIQ. Levels of fatigue (FIQ fatigue were rated on a 0-100 mm scale using FIQ and by multidimensional fatigue inventory (MFI-20 general fatigue (MFIGF.Expression levels of nine microRNAs were significantly lower in patients with FM patients compared to healthy controls. The microRNAs identified were miR-21-5p, miR-145-5p, miR-29a-3p, miR-99b-5p, miR-125b-5p, miR-23a-3p, 23b-3p, miR-195-5p, miR-223-3p. The identified microRNAs with significantly lower expression in FM were assessed with regard to pain and fatigue. miR-145-5p correlated positively with FIQ pain (r=0.709, p=0.022, n=10 and with FIQ fatigue (r=0.687, p=0.028, n=10.To our knowledge, this is the first study to show a disease-specific pattern of cerebrospinal microRNAs in FM. We have identified nine microRNAs in cerebrospinal fluid that differed between FM patients and healthy controls. One of the identified microRNAs, miR-145 was associated with the cardinal symptoms of FM, pain and fatigue.

  7. Profile of cerebrospinal microRNAs in fibromyalgia.

    Science.gov (United States)

    Bjersing, Jan L; Lundborg, Christopher; Bokarewa, Maria I; Mannerkorpi, Kaisa

    2013-01-01

    Fibromyalgia (FM) is characterized by chronic pain and reduced pain threshold. The pathophysiology involves disturbed neuroendocrine function, including impaired function of the growth hormone/insulin-like growth factor-1 axis. Recently, microRNAs have been shown to be important regulatory factors in a number of diseases. The aim of this study was to try to identify cerebrospinal microRNAs with expression specific for FM and to determine their correlation to pain and fatigue. The genome-wide profile of microRNAs in cerebrospinal fluid was assessed in ten women with FM and eight healthy controls using real-time quantitative PCR. Pain thresholds were examined by algometry. Levels of pain (FIQ pain) were rated on a 0-100 mm scale (fibromyalgia impact questionnaire, FIQ). Levels of fatigue (FIQ fatigue) were rated on a 0-100 mm scale using FIQ and by multidimensional fatigue inventory (MFI-20) general fatigue (MFIGF). Expression levels of nine microRNAs were significantly lower in patients with FM patients compared to healthy controls. The microRNAs identified were miR-21-5p, miR-145-5p, miR-29a-3p, miR-99b-5p, miR-125b-5p, miR-23a-3p, 23b-3p, miR-195-5p, miR-223-3p. The identified microRNAs with significantly lower expression in FM were assessed with regard to pain and fatigue. miR-145-5p correlated positively with FIQ pain (r=0.709, p=0.022, n=10) and with FIQ fatigue (r=0.687, p=0.028, n=10). To our knowledge, this is the first study to show a disease-specific pattern of cerebrospinal microRNAs in FM. We have identified nine microRNAs in cerebrospinal fluid that differed between FM patients and healthy controls. One of the identified microRNAs, miR-145 was associated with the cardinal symptoms of FM, pain and fatigue.

  8. Objective-guided image annotation.

    Science.gov (United States)

    Mao, Qi; Tsang, Ivor Wai-Hung; Gao, Shenghua

    2013-04-01

    Automatic image annotation, which is usually formulated as a multi-label classification problem, is one of the major tools used to enhance the semantic understanding of web images. Many multimedia applications (e.g., tag-based image retrieval) can greatly benefit from image annotation. However, the insufficient performance of image annotation methods prevents these applications from being practical. On the other hand, specific measures are usually designed to evaluate how well one annotation method performs for a specific objective or application, but most image annotation methods do not consider optimization of these measures, so that they are inevitably trapped into suboptimal performance of these objective-specific measures. To address this issue, we first summarize a variety of objective-guided performance measures under a unified representation. Our analysis reveals that macro-averaging measures are very sensitive to infrequent keywords, and hamming measure is easily affected by skewed distributions. We then propose a unified multi-label learning framework, which directly optimizes a variety of objective-specific measures of multi-label learning tasks. Specifically, we first present a multilayer hierarchical structure of learning hypotheses for multi-label problems based on which a variety of loss functions with respect to objective-guided measures are defined. And then, we formulate these loss functions as relaxed surrogate functions and optimize them by structural SVMs. According to the analysis of various measures and the high time complexity of optimizing micro-averaging measures, in this paper, we focus on example-based measures that are tailor-made for image annotation tasks but are seldom explored in the literature. Experiments show consistency with the formal analysis on two widely used multi-label datasets, and demonstrate the superior performance of our proposed method over state-of-the-art baseline methods in terms of example-based measures on four

  9. Systematic tissue-specific functional annotation of the human genome highlights immune-related DNA elements for late-onset Alzheimer's disease.

    Directory of Open Access Journals (Sweden)

    Qiongshi Lu

    2017-07-01

    Full Text Available Continuing efforts from large international consortia have made genome-wide epigenomic and transcriptomic annotation data publicly available for a variety of cell and tissue types. However, synthesis of these datasets into effective summary metrics to characterize the functional non-coding genome remains a challenge. Here, we present GenoSkyline-Plus, an extension of our previous work through integration of an expanded set of epigenomic and transcriptomic annotations to produce high-resolution, single tissue annotations. After validating our annotations with a catalog of tissue-specific non-coding elements previously identified in the literature, we apply our method using data from 127 different cell and tissue types to present an atlas of heritability enrichment across 45 different GWAS traits. We show that broader organ system categories (e.g. immune system increase statistical power in identifying biologically relevant tissue types for complex diseases while annotations of individual cell types (e.g. monocytes or B-cells provide deeper insights into disease etiology. Additionally, we use our GenoSkyline-Plus annotations in an in-depth case study of late-onset Alzheimer's disease (LOAD. Our analyses suggest a strong connection between LOAD heritability and genetic variants contained in regions of the genome functional in monocytes. Furthermore, we show that LOAD shares a similar localization of SNPs to monocyte-functional regions with Parkinson's disease. Overall, we demonstrate that integrated genome annotations at the single tissue level provide a valuable tool for understanding the etiology of complex human diseases. Our GenoSkyline-Plus annotations are freely available at http://genocanyon.med.yale.edu/GenoSkyline.

  10. Systematic tissue-specific functional annotation of the human genome highlights immune-related DNA elements for late-onset Alzheimer's disease.

    Science.gov (United States)

    Lu, Qiongshi; Powles, Ryan L; Abdallah, Sarah; Ou, Derek; Wang, Qian; Hu, Yiming; Lu, Yisi; Liu, Wei; Li, Boyang; Mukherjee, Shubhabrata; Crane, Paul K; Zhao, Hongyu

    2017-07-01

    Continuing efforts from large international consortia have made genome-wide epigenomic and transcriptomic annotation data publicly available for a variety of cell and tissue types. However, synthesis of these datasets into effective summary metrics to characterize the functional non-coding genome remains a challenge. Here, we present GenoSkyline-Plus, an extension of our previous work through integration of an expanded set of epigenomic and transcriptomic annotations to produce high-resolution, single tissue annotations. After validating our annotations with a catalog of tissue-specific non-coding elements previously identified in the literature, we apply our method using data from 127 different cell and tissue types to present an atlas of heritability enrichment across 45 different GWAS traits. We show that broader organ system categories (e.g. immune system) increase statistical power in identifying biologically relevant tissue types for complex diseases while annotations of individual cell types (e.g. monocytes or B-cells) provide deeper insights into disease etiology. Additionally, we use our GenoSkyline-Plus annotations in an in-depth case study of late-onset Alzheimer's disease (LOAD). Our analyses suggest a strong connection between LOAD heritability and genetic variants contained in regions of the genome functional in monocytes. Furthermore, we show that LOAD shares a similar localization of SNPs to monocyte-functional regions with Parkinson's disease. Overall, we demonstrate that integrated genome annotations at the single tissue level provide a valuable tool for understanding the etiology of complex human diseases. Our GenoSkyline-Plus annotations are freely available at http://genocanyon.med.yale.edu/GenoSkyline.

  11. The Emerging Role of MicroRNA-155 in Cardiovascular Diseases

    Directory of Open Access Journals (Sweden)

    Richard Y. Cao

    2016-01-01

    Full Text Available MicroRNAs have been demonstrated to be involved in human diseases, including cardiovascular diseases. Growing evidences suggest that microRNA-155, a typical multifunctional microRNA, plays a crucial role in hematopoietic lineage differentiation, immunity, inflammation, viral infections, and vascular remodeling, which is linked to cardiovascular diseases such as coronary artery disease, abdominal aortic aneurysm, heart failure, and diabetic heart disease. The effects of microRNA-155 in different cell types through different target genes result in different mechanisms in diseases. MicroRNA-155 has been intensively studied in atherosclerosis and coronary artery disease. Contradictory results of microRNA-155 either promoting or preventing the pathophysiological process of atherosclerosis illustrate the complexity of this pleiotropic molecule. Therefore, more comprehensive studies of the underlying mechanisms of microRNA-155 involvement in cardiovascular diseases are required. Furthermore, a recent clinical trial of Miravirsen targeting microRNA-122 sheds light on exploiting microRNA-155 as a novel target to develop effective therapeutic strategies for cardiovascular diseases in the near future.

  12. HBVRegDB: Annotation, comparison, detection and visualization of regulatory elements in hepatitis B virus sequences

    Directory of Open Access Journals (Sweden)

    Firth Andrew E

    2007-12-01

    Full Text Available Abstract Background The many Hepadnaviridae sequences available have widely varied functional annotation. The genomes are very compact (~3.2 kb but contain multiple layers of functional regulatory elements in addition to coding regions. Key regions are subject to purifying selection, as mutations in these regions will produce non-functional viruses. Results These genomic sequences have been organized into a structured database to facilitate research at the molecular level. HBVRegDB is a comparative genomic analysis tool with an integrated underlying sequence database. The database contains genomic sequence data from representative viruses. In addition to INSDC and RefSeq annotation, HBVRegDB also contains expert and systematically calculated annotations (e.g. promoters and comparative genome analysis results (e.g. blastn, tblastx. It also contains analyses based on curated HBV alignments. Information about conserved regions – including primary conservation (e.g. CDS-Plotcon and RNA secondary structure predictions (e.g. Alidot – is integrated into the database. A large amount of data is graphically presented using the GBrowse (Generic Genome Browser adapted for analysis of viral genomes. Flexible query access is provided based on any annotated genomic feature. Novel regulatory motifs can be found by analysing the annotated sequences. Conclusion HBVRegDB serves as a knowledge database and as a comparative genomic analysis tool for molecular biologists investigating HBV. It is publicly available and complementary to other viral and HBV focused datasets and tools http://hbvregdb.otago.ac.nz. The availability of multiple and highly annotated sequences of viral genomes in one database combined with comparative analysis tools facilitates detection of novel genomic elements.

  13. MicroRNA from tuberculosis RNA: A bioinformatics study

    OpenAIRE

    Wiwanitkit, Somsri; Wiwanitkit, Viroj

    2012-01-01

    The role of microRNA in the pathogenesis of pulmonary tuberculosis is the interesting topic in chest medicine at present. Recently, it was proposed that the microRNA can be a useful biomarker for monitoring of pulmonary tuberculosis and might be the important part in pathogenesis of disease. Here, the authors perform a bioinformatics study to assess the microRNA within known tuberculosis RNA. The microRNA part can be detected and this can be important key information in further study of the p...

  14. Altered spinal microRNA-146a and the microRNA-183 cluster contribute to osteoarthritic pain in knee joints.

    Science.gov (United States)

    Li, Xin; Kroin, Jeffrey S; Kc, Ranjan; Gibson, Gary; Chen, Di; Corbett, Grant T; Pahan, Kalipada; Fayyaz, Sana; Kim, Jae-Sung; van Wijnen, Andre J; Suh, Joon; Kim, Su-Gwan; Im, Hee-Jeong

    2013-12-01

    The objective of this study was to examine whether altered expression of microRNAs in central nervous system components is pathologically linked to chronic knee joint pain in osteoarthritis. A surgical animal model for knee joint OA was generated by medial meniscus transection in rats followed by behavioral pain tests. Relationships between pathological changes in knee joint and development of chronic joint pain were examined by histology and imaging analyses. Alterations in microRNAs associated with OA-evoked pain sensation were determined in bilateral lumbar dorsal root ganglia (DRG) and the spinal dorsal horn by microRNA array followed by individual microRNA analyses. Gain- and loss-of-function studies of selected microRNAs (miR-146a and miR-183 cluster) were conducted to identify target pain mediators regulated by these selective microRNAs in glial cells. The ipsilateral hind leg displayed significantly increased hyperalgesia after 4 weeks of surgery, and sensitivity was sustained for the remainder of the 8-week experimental period (F = 341, p pain was correlated with pathological changes in the knee joints as assessed by histological and imaging analyses. MicroRNA analyses showed that miR-146a and the miR-183 cluster were markedly reduced in the sensory neurons in DRG (L4/L5) and spinal cord from animals experiencing knee joint OA pain. The downregulation of miR-146a and/or the miR-183 cluster in the central compartments (DRG and spinal cord) are closely associated with the upregulation of inflammatory pain mediators. The corroboration between decreases in these signature microRNAs and their specific target pain mediators were further confirmed by gain- and loss-of-function analyses in glia, the major cellular component of the central nervous system (CNS). MicroRNA therapy using miR-146a and the miR-183 cluster could be powerful therapeutic intervention for OA in alleviating joint pain and concomitantly regenerating peripheral knee joint cartilage. © 2013

  15. ChemiRs: a web application for microRNAs and chemicals.

    Science.gov (United States)

    Su, Emily Chia-Yu; Chen, Yu-Sing; Tien, Yun-Cheng; Liu, Jeff; Ho, Bing-Ching; Yu, Sung-Liang; Singh, Sher

    2016-04-18

    MicroRNAs (miRNAs) are about 22 nucleotides, non-coding RNAs that affect various cellular functions, and play a regulatory role in different organisms including human. Until now, more than 2500 mature miRNAs in human have been discovered and registered, but still lack of information or algorithms to reveal the relations among miRNAs, environmental chemicals and human health. Chemicals in environment affect our health and daily life, and some of them can lead to diseases by inferring biological pathways. We develop a creditable online web server, ChemiRs, for predicting interactions and relations among miRNAs, chemicals and pathways. The database not only compares gene lists affected by chemicals and miRNAs, but also incorporates curated pathways to identify possible interactions. Here, we manually retrieved associations of miRNAs and chemicals from biomedical literature. We developed an online system, ChemiRs, which contains miRNAs, diseases, Medical Subject Heading (MeSH) terms, chemicals, genes, pathways and PubMed IDs. We connected each miRNA to miRBase, and every current gene symbol to HUGO Gene Nomenclature Committee (HGNC) for genome annotation. Human pathway information is also provided from KEGG and REACTOME databases. Information about Gene Ontology (GO) is queried from GO Online SQL Environment (GOOSE). With a user-friendly interface, the web application is easy to use. Multiple query results can be easily integrated and exported as report documents in PDF format. Association analysis of miRNAs and chemicals can help us understand the pathogenesis of chemical components. ChemiRs is freely available for public use at http://omics.biol.ntnu.edu.tw/ChemiRs .

  16. Accurate microRNA target prediction correlates with protein repression levels

    Directory of Open Access Journals (Sweden)

    Simossis Victor A

    2009-09-01

    Full Text Available Abstract Background MicroRNAs are small endogenously expressed non-coding RNA molecules that regulate target gene expression through translation repression or messenger RNA degradation. MicroRNA regulation is performed through pairing of the microRNA to sites in the messenger RNA of protein coding genes. Since experimental identification of miRNA target genes poses difficulties, computational microRNA target prediction is one of the key means in deciphering the role of microRNAs in development and disease. Results DIANA-microT 3.0 is an algorithm for microRNA target prediction which is based on several parameters calculated individually for each microRNA and combines conserved and non-conserved microRNA recognition elements into a final prediction score, which correlates with protein production fold change. Specifically, for each predicted interaction the program reports a signal to noise ratio and a precision score which can be used as an indication of the false positive rate of the prediction. Conclusion Recently, several computational target prediction programs were benchmarked based on a set of microRNA target genes identified by the pSILAC method. In this assessment DIANA-microT 3.0 was found to achieve the highest precision among the most widely used microRNA target prediction programs reaching approximately 66%. The DIANA-microT 3.0 prediction results are available online in a user friendly web server at http://www.microrna.gr/microT

  17. Sensitive and long-term monitoring of intracellular microRNAs using a non-integrating cytoplasmic RNA vector.

    Science.gov (United States)

    Sano, Masayuki; Ohtaka, Manami; Iijima, Minoru; Nakasu, Asako; Kato, Yoshio; Nakanishi, Mahito

    2017-10-04

    MicroRNAs (miRNAs) are small noncoding RNAs that modulate gene expression at the post-transcriptional level. Different types of cells express unique sets of miRNAs that can be exploited as potential molecular markers to identify specific cell types. Among the variety of miRNA detection methods, a fluorescence-based imaging system that utilises a fluorescent-reporter gene regulated by a target miRNA offers a major advantage for long-term tracking of the miRNA in living cells. In this study, we developed a novel fluorescence-based miRNA-monitoring system using a non-integrating cytoplasmic RNA vector based on a replication-defective and persistent Sendai virus (SeVdp). Because SeVdp vectors robustly and stably express transgenes, this system enabled sensitive monitoring of miRNAs by fluorescence microscopy. By applying this system for cellular reprogramming, we found that miR-124, but not miR-9, was significantly upregulated during direct neuronal conversion. Additionally, we were able to isolate integration-free human induced pluripotent stem cells by long-term tracking of let-7 expression. Notably, this system was easily expandable to allow detection of multiple miRNAs separately and simultaneously. Our findings provide insight into a powerful tool for evaluating miRNA expression during the cellular reprogramming process and for isolating reprogrammed cells potentially useful for medical applications.

  18. Role of microRNAs in sepsis.

    Science.gov (United States)

    Kingsley, S Manoj Kumar; Bhat, B Vishnu

    2017-07-01

    MicroRNAs have been found to be of high significance in the regulation of various genes and processes in the body. Sepsis is a serious clinical problem which arises due to the excessive host inflammatory response to infection. The non-specific clinical features and delayed diagnosis of sepsis has been a matter of concern for long time. MicroRNAs could enable better diagnosis of sepsis and help in the identification of the various stages of sepsis. Improved diagnosis may enable quicker and more effective treatment measures. The initial acute and transient phase of sepsis involves excessive secretion of pro-inflammatory cytokines which causes severe damage. MicroRNAs negatively regulate the toll-like receptor signaling pathway and regulate the production of inflammatory cytokines during sepsis. Likewise, microRNAs have shown to regulate the vascular barrier and endothelial function in sepsis. They are also involved in the regulation of the apoptosis, immunosuppression, and organ dysfunction in later stages of sepsis. Their importance at various levels of the pathophysiology of sepsis has been discussed along with the challenges and future perspectives. MicroRNAs could be key players in the diagnosis and staging of sepsis. Their regulation at various stages of sepsis suggests that they may have an important role in altering the outcome associated with sepsis.

  19. Using Microbial Genome Annotation as a Foundation for Collaborative Student Research

    Science.gov (United States)

    Reed, Kelynne E.; Richardson, John M.

    2013-01-01

    We used the Integrated Microbial Genomes Annotation Collaboration Toolkit as a framework to incorporate microbial genomics research into a microbiology and biochemistry course in a way that promoted student learning of bioinformatics and research skills and emphasized teamwork and collaboration as evidenced through multiple assessment mechanisms.…

  20. Concept annotation in the CRAFT corpus.

    Science.gov (United States)

    Bada, Michael; Eckert, Miriam; Evans, Donald; Garcia, Kristin; Shipley, Krista; Sitnikov, Dmitry; Baumgartner, William A; Cohen, K Bretonnel; Verspoor, Karin; Blake, Judith A; Hunter, Lawrence E

    2012-07-09

    Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml.

  1. Blood cell mRNAs and microRNAs: optimized protocols for extraction and preservation.

    Science.gov (United States)

    Eikmans, Michael; Rekers, Niels V; Anholts, Jacqueline D H; Heidt, Sebastiaan; Claas, Frans H J

    2013-03-14

    Assessing messenger RNA (mRNA) and microRNA levels in peripheral blood cells may complement conventional parameters in clinical practice. Working with small, precious samples requires optimal RNA yields and minimal RNA degradation. Several procedures for RNA extraction and complementary DNA (cDNA) synthesis were compared for their efficiency. The effect on RNA quality of freeze-thawing peripheral blood cells and storage in preserving reagents was investigated. In terms of RNA yield and convenience, quality quantitative polymerase chain reaction signals per nanogram of total RNA and using NucleoSpin and mirVana columns is preferable. The SuperScript III protocol results in the highest cDNA yields. During conventional procedures of storing peripheral blood cells at -180°C and thawing them thereafter, RNA integrity is maintained. TRIzol preserves RNA in cells stored at -20°C. Detection of mRNA levels significantly decreases in degraded RNA samples, whereas microRNA molecules remain relatively stable. When standardized to reference targets, mRNA transcripts and microRNAs can be reliably quantified in moderately degraded (quality index 4-7) and severely degraded (quality index <4) RNA samples, respectively. We describe a strategy for obtaining high-quality and quantity RNA from fresh and stored cells from blood. The results serve as a guideline for sensitive mRNA and microRNA expression assessment in clinical material.

  2. Systematic tissue-specific functional annotation of the human genome highlights immune-related DNA elements for late-onset Alzheimer’s disease

    Science.gov (United States)

    Abdallah, Sarah; Ou, Derek; Wang, Qian; Hu, Yiming; Lu, Yisi; Liu, Wei; Li, Boyang; Mukherjee, Shubhabrata; Crane, Paul K.; Zhao, Hongyu

    2017-01-01

    Continuing efforts from large international consortia have made genome-wide epigenomic and transcriptomic annotation data publicly available for a variety of cell and tissue types. However, synthesis of these datasets into effective summary metrics to characterize the functional non-coding genome remains a challenge. Here, we present GenoSkyline-Plus, an extension of our previous work through integration of an expanded set of epigenomic and transcriptomic annotations to produce high-resolution, single tissue annotations. After validating our annotations with a catalog of tissue-specific non-coding elements previously identified in the literature, we apply our method using data from 127 different cell and tissue types to present an atlas of heritability enrichment across 45 different GWAS traits. We show that broader organ system categories (e.g. immune system) increase statistical power in identifying biologically relevant tissue types for complex diseases while annotations of individual cell types (e.g. monocytes or B-cells) provide deeper insights into disease etiology. Additionally, we use our GenoSkyline-Plus annotations in an in-depth case study of late-onset Alzheimer’s disease (LOAD). Our analyses suggest a strong connection between LOAD heritability and genetic variants contained in regions of the genome functional in monocytes. Furthermore, we show that LOAD shares a similar localization of SNPs to monocyte-functional regions with Parkinson’s disease. Overall, we demonstrate that integrated genome annotations at the single tissue level provide a valuable tool for understanding the etiology of complex human diseases. Our GenoSkyline-Plus annotations are freely available at http://genocanyon.med.yale.edu/GenoSkyline. PMID:28742084

  3. Small Molecule, Big Prospects: MicroRNA in Pregnancy and Its Complications

    Directory of Open Access Journals (Sweden)

    Meng Cai

    2017-01-01

    Full Text Available MicroRNAs are small, noncoding RNA molecules that regulate target gene expression in the posttranscriptional level. Unlike siRNA, microRNAs are “fine-tuners” rather than “switches” in the regulation of gene expression; thus they play key roles in maintaining tissue homeostasis. The aberrant microRNA expression is implicated in the disease process. To date, numerous studies have demonstrated the regulatory roles of microRNAs in various pathophysiological conditions. In contrast, the study of microRNA in pregnancy and its associated complications, such as preeclampsia (PE, fetal growth restriction (FGR, and preterm labor, is a young field. Over the last decade, the knowledge of pregnancy-related microRNAs has increased and the molecular mechanisms by which microRNAs regulate pregnancy or its associated complications are emerging. In this review, we focus on the recent advances in the research of pregnancy-related microRNAs, especially their function in pregnancy-associated complications and the potential clinical applications. Here microRNAs that associate with pregnancy are classified as placenta-specific, placenta-associated, placenta-derived circulating, and uterine microRNA according to their localization and origin. MicroRNAs offer a great potential for developing diagnostic and therapeutic targets in pregnancy-related disorders.

  4. Essential Requirements for Digital Annotation Systems

    Directory of Open Access Journals (Sweden)

    ADRIANO, C. M.

    2012-06-01

    Full Text Available Digital annotation systems are usually based on partial scenarios and arbitrary requirements. Accidental and essential characteristics are usually mixed in non explicit models. Documents and annotations are linked together accidentally according to the current technology, allowing for the development of disposable prototypes, but not to the support of non-functional requirements such as extensibility, robustness and interactivity. In this paper we perform a careful analysis on the concept of annotation, studying the scenarios supported by digital annotation tools. We also derived essential requirements based on a classification of annotation systems applied to existing tools. The analysis performed and the proposed classification can be applied and extended to other type of collaborative systems.

  5. Making web annotations persistent over time

    Energy Technology Data Exchange (ETDEWEB)

    Sanderson, Robert [Los Alamos National Laboratory; Van De Sompel, Herbert [Los Alamos National Laboratory

    2010-01-01

    As Digital Libraries (DL) become more aligned with the web architecture, their functional components need to be fundamentally rethought in terms of URIs and HTTP. Annotation, a core scholarly activity enabled by many DL solutions, exhibits a clearly unacceptable characteristic when existing models are applied to the web: due to the representations of web resources changing over time, an annotation made about a web resource today may no longer be relevant to the representation that is served from that same resource tomorrow. We assume the existence of archived versions of resources, and combine the temporal features of the emerging Open Annotation data model with the capability offered by the Memento framework that allows seamless navigation from the URI of a resource to archived versions of that resource, and arrive at a solution that provides guarantees regarding the persistence of web annotations over time. More specifically, we provide theoretical solutions and proof-of-concept experimental evaluations for two problems: reconstructing an existing annotation so that the correct archived version is displayed for all resources involved in the annotation, and retrieving all annotations that involve a given archived version of a web resource.

  6. Semantic annotation in biomedicine: the current landscape.

    Science.gov (United States)

    Jovanović, Jelena; Bagheri, Ebrahim

    2017-09-22

    The abundance and unstructured nature of biomedical texts, be it clinical or research content, impose significant challenges for the effective and efficient use of information and knowledge stored in such texts. Annotation of biomedical documents with machine intelligible semantics facilitates advanced, semantics-based text management, curation, indexing, and search. This paper focuses on annotation of biomedical entity mentions with concepts from relevant biomedical knowledge bases such as UMLS. As a result, the meaning of those mentions is unambiguously and explicitly defined, and thus made readily available for automated processing. This process is widely known as semantic annotation, and the tools that perform it are known as semantic annotators.Over the last dozen years, the biomedical research community has invested significant efforts in the development of biomedical semantic annotation technology. Aiming to establish grounds for further developments in this area, we review a selected set of state of the art biomedical semantic annotators, focusing particularly on general purpose annotators, that is, semantic annotation tools that can be customized to work with texts from any area of biomedicine. We also examine potential directions for further improvements of today's annotators which could make them even more capable of meeting the needs of real-world applications. To motivate and encourage further developments in this area, along the suggested and/or related directions, we review existing and potential practical applications and benefits of semantic annotators.

  7. miRQuest: integration of tools on a Web server for microRNA research.

    Science.gov (United States)

    Aguiar, R R; Ambrosio, L A; Sepúlveda-Hermosilla, G; Maracaja-Coutinho, V; Paschoal, A R

    2016-03-28

    This report describes the miRQuest - a novel middleware available in a Web server that allows the end user to do the miRNA research in a user-friendly way. It is known that there are many prediction tools for microRNA (miRNA) identification that use different programming languages and methods to realize this task. It is difficult to understand each tool and apply it to diverse datasets and organisms available for miRNA analysis. miRQuest can easily be used by biologists and researchers with limited experience with bioinformatics. We built it using the middleware architecture on a Web platform for miRNA research that performs two main functions: i) integration of different miRNA prediction tools for miRNA identification in a user-friendly environment; and ii) comparison of these prediction tools. In both cases, the user provides sequences (in FASTA format) as an input set for the analysis and comparisons. All the tools were selected on the basis of a survey of the literature on the available tools for miRNA prediction. As results, three different cases of use of the tools are also described, where one is the miRNA identification analysis in 30 different species. Finally, miRQuest seems to be a novel and useful tool; and it is freely available for both benchmarking and miRNA identification at http://mirquest.integrativebioinformatics.me/.

  8. Differential expression analysis of balding and nonbalding dermal papilla microRNAs in male pattern baldness with a microRNA amplification profiling method.

    Science.gov (United States)

    Goodarzi, H R; Abbasi, A; Saffari, M; Fazelzadeh Haghighi, M; Tabei, M B; Noori Daloii, M R

    2012-05-01

      Male pattern baldness or androgenetic alopecia is a common disorder affecting almost 50% of men throughout their lifetime, with androgens and genetics having significant contributing aetiologies. In contrast to the positive regulatory effect of androgens on body hair growth, they are thought to alter scalp hair follicle behaviour pathophysiologically, leading to male pattern baldness. However, the exact mechanisms of this paradoxical action have not yet been elucidated. The role of microRNAs, a novel group of noncoding RNAs impacting almost every aspect of biology, health and human diseases, has been documented in hair follicle formation. In addition, their deregulation in cancer of the prostate, a target organ of androgens, has also been well established. To investigate the possible contribution of microRNAs in the pathophysiology of male pattern baldness. We initially screened microRNA expression profiles of balding and nonbalding hair follicle papillae with a sensitive microRNA cloning method, microRNA amplification profiling, and statistically analysed significant differentially expressed microRNAs in balding relative to nonbalding dermal papillae, with real-time polymerase chain reaction as a confirmatory method to quantify expression in eight individuals affected with the disorder.   We detected the significant upregulation of miR-221, miR-125b, miR-106a and miR-410 in balding papilla cells.   We found four microRNAs that could participate in the pathogenesis of male pattern baldness. Regarding the strong therapeutic potential of microRNAs and the easy accessibility of hair follicles for gene therapy, microRNAs are possible candidates for a new generation of revolutionary treatments. © 2011 The Authors. BJD © 2011 British Association of Dermatologists.

  9. Tamarix microRNA Profiling Reveals New Insight into Salt Tolerance

    Directory of Open Access Journals (Sweden)

    Jianwen Wang

    2018-04-01

    Full Text Available The halophyte tamarisk (Tamarix is extremely salt tolerant, making it an ideal material for salt tolerance-related studies. Although many salt-responsive genes of Tamarix were identified in previous studies, there are no reports on the role of post-transcriptional regulation in its salt tolerance. We constructed six small RNA libraries of Tamarix chinensis roots with NaCl treatments. High-throughput sequencing of the six libraries was performed and microRNA expression profiles were constructed. We investigated salt-responsive microRNAs to uncover the microRNA-mediated genes regulation. From these analyses, 251 conserved and 18 novel microRNA were identified from all small RNAs. From 191 differentially expressed microRNAs, 74 co-expressed microRNAs were identified as salt-responsive candidate microRNAs. The most enriched GO (gene ontology terms for the 157 genes targeted by differentially expressed microRNAs suggested that transcriptions factors were highly active. Two hub microRNAs (miR414, miR5658, which connected by several target genes into an organic microRNA regulatory network, appeared to be the key regulators of post-transcriptional salt-stress responses. As the first survey on the tamarisk small RNAome, this study improves the understanding of tamarisk salt-tolerance mechanisms and will contribute to the molecular-assisted resistance breeding.

  10. Regulation of cardiac microRNAs by serum response factor

    Directory of Open Access Journals (Sweden)

    Wei Jeanne Y

    2011-02-01

    Full Text Available Abstract Serum response factor (SRF regulates certain microRNAs that play a role in cardiac and skeletal muscle development. However, the role of SRF in the regulation of microRNA expression and microRNA biogenesis in cardiac hypertrophy has not been well established. In this report, we employed two distinct transgenic mouse models to study the impact of SRF on cardiac microRNA expression and microRNA biogenesis. Cardiac-specific overexpression of SRF (SRF-Tg led to altered expression of a number of microRNAs. Interestingly, downregulation of miR-1, miR-133a and upregulation of miR-21 occurred by 7 days of age in these mice, long before the onset of cardiac hypertrophy, suggesting that SRF overexpression impacted the expression of microRNAs which contribute to cardiac hypertrophy. Reducing cardiac SRF level using the antisense-SRF transgenic approach (Anti-SRF-Tg resulted in the expression of miR-1, miR-133a and miR-21 in the opposite direction. Furthermore, we observed that SRF regulates microRNA biogenesis, specifically the transcription of pri-microRNA, thereby affecting the mature microRNA level. The mir-21 promoter sequence is conserved among mouse, rat and human; one SRF binding site was found to be in the mir-21 proximal promoter region of all three species. The mir-21 gene is regulated by SRF and its cofactors, including myocardin and p49/Strap. Our study demonstrates that the downregulation of miR-1, miR-133a, and upregulation of miR-21 can be reversed by one single upstream regulator, SRF. These results may help to develop novel therapeutic interventions targeting microRNA biogenesis.

  11. OntoVIP: an ontology for the annotation of object models used for medical image simulation.

    Science.gov (United States)

    Gibaud, Bernard; Forestier, Germain; Benoit-Cattin, Hugues; Cervenansky, Frédéric; Clarysse, Patrick; Friboulet, Denis; Gaignard, Alban; Hugonnard, Patrick; Lartizien, Carole; Liebgott, Hervé; Montagnat, Johan; Tabary, Joachim; Glatard, Tristan

    2014-12-01

    This paper describes the creation of a comprehensive conceptualization of object models used in medical image simulation, suitable for major imaging modalities and simulators. The goal is to create an application ontology that can be used to annotate the models in a repository integrated in the Virtual Imaging Platform (VIP), to facilitate their sharing and reuse. Annotations make the anatomical, physiological and pathophysiological content of the object models explicit. In such an interdisciplinary context we chose to rely on a common integration framework provided by a foundational ontology, that facilitates the consistent integration of the various modules extracted from several existing ontologies, i.e. FMA, PATO, MPATH, RadLex and ChEBI. Emphasis is put on methodology for achieving this extraction and integration. The most salient aspects of the ontology are presented, especially the organization in model layers, as well as its use to browse and query the model repository. Copyright © 2014 Elsevier Inc. All rights reserved.

  12. PedAM: a database for Pediatric Disease Annotation and Medicine.

    Science.gov (United States)

    Jia, Jinmeng; An, Zhongxin; Ming, Yue; Guo, Yongli; Li, Wei; Li, Xin; Liang, Yunxiang; Guo, Dongming; Tai, Jun; Chen, Geng; Jin, Yaqiong; Liu, Zhimei; Ni, Xin; Shi, Tieliu

    2018-01-04

    There is a significant number of children around the world suffering from the consequence of the misdiagnosis and ineffective treatment for various diseases. To facilitate the precision medicine in pediatrics, a database namely the Pediatric Disease Annotations & Medicines (PedAM) has been built to standardize and classify pediatric diseases. The PedAM integrates both biomedical resources and clinical data from Electronic Medical Records to support the development of computational tools, by which enables robust data analysis and integration. It also uses disease-manifestation (D-M) integrated from existing biomedical ontologies as prior knowledge to automatically recognize text-mined, D-M-specific syntactic patterns from 774 514 full-text articles and 8 848 796 abstracts in MEDLINE. Additionally, disease connections based on phenotypes or genes can be visualized on the web page of PedAM. Currently, the PedAM contains standardized 8528 pediatric disease terms (4542 unique disease concepts and 3986 synonyms) with eight annotation fields for each disease, including definition synonyms, gene, symptom, cross-reference (Xref), human phenotypes and its corresponding phenotypes in the mouse. The database PedAM is freely accessible at http://www.unimd.org/pedam/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. A computational platform to maintain and migrate manual functional annotations for BioCyc databases.

    Science.gov (United States)

    Walsh, Jesse R; Sen, Taner Z; Dickerson, Julie A

    2014-10-12

    BioCyc databases are an important resource for information on biological pathways and genomic data. Such databases represent the accumulation of biological data, some of which has been manually curated from literature. An essential feature of these databases is the continuing data integration as new knowledge is discovered. As functional annotations are improved, scalable methods are needed for curators to manage annotations without detailed knowledge of the specific design of the BioCyc database. We have developed CycTools, a software tool which allows curators to maintain functional annotations in a model organism database. This tool builds on existing software to improve and simplify annotation data imports of user provided data into BioCyc databases. Additionally, CycTools automatically resolves synonyms and alternate identifiers contained within the database into the appropriate internal identifiers. Automating steps in the manual data entry process can improve curation efforts for major biological databases. The functionality of CycTools is demonstrated by transferring GO term annotations from MaizeCyc to matching proteins in CornCyc, both maize metabolic pathway databases available at MaizeGDB, and by creating strain specific databases for metabolic engineering.

  14. Physician evaluation and acceptance of remote transmission of CT, digital subtraction angiography, and US annotated images

    International Nuclear Information System (INIS)

    Haskin, M.E.; Robbins, C.; Kohn, M.; Laffey, P.A.; Haskin, P.H.; Teplick, J.G.; Teplick, S.K.; Peyster, R.G.

    1986-01-01

    The authors have found annotated images an effective way of communicating the results of imaging studies to referring physicians. Of particular value is the collation of representative images from several modalities. Previously, hard copy of this collation was sent to the referring physician as an integrated imaging report. Recently they developed a computer-based station that transmits annotated images to remote personal computer (PC) terminals via a telephone modem which requires 30 seconds to send each image. This annotated image report can be quickly accessed by the referring physician at the remote PC terminal The prototype system, utility, diagnostic fidelity, and potential of this remote system are described

  15. Isothermal circular-strand-displacement polymerization of DNA and microRNA in digital microfluidic devices.

    Science.gov (United States)

    Giuffrida, Maria Chiara; Zanoli, Laura Maria; D'Agata, Roberta; Finotti, Alessia; Gambari, Roberto; Spoto, Giuseppe

    2015-02-01

    Nucleic-acid amplification is a crucial step in nucleic-acid-sequence-detection assays. The use of digital microfluidic devices to miniaturize amplification techniques reduces the required sample volume and the analysis time and offers new possibilities for process automation and integration in a single device. The recently introduced droplet polymerase-chain-reaction (PCR) amplification methods require repeated cycles of two or three temperature-dependent steps during the amplification of the nucleic-acid target sequence. In contrast, low-temperature isothermal-amplification methods have no need for thermal cycling, thus requiring simplified microfluidic-device features. Here, the combined use of digital microfluidics and molecular-beacon (MB)-assisted isothermal circular-strand-displacement polymerization (ICSDP) to detect microRNA-210 sequences is described. MicroRNA-210 has been described as the most consistently and predominantly upregulated hypoxia-inducible factor. The nmol L(-1)-pmol L(-1) detection capabilities of the method were first tested by targeting single-stranded DNA sequences from the genetically modified Roundup Ready soybean. The ability of the droplet-ICSDP method to discriminate between full-matched, single-mismatched, and unrelated sequences was also investigated. The detection of a range of nmol L(-1)-pmol L(-1) microRNA-210 solutions compartmentalized in nanoliter-sized droplets was performed, establishing the ability of the method to detect as little as 10(-18) mol of microRNA target sequences compartmentalized in 20 nL droplets. The suitability of the method for biological samples was tested by detecting microRNA-210 from transfected K562 cells.

  16. Common features of microRNA target prediction tools

    Directory of Open Access Journals (Sweden)

    Sarah M. Peterson

    2014-02-01

    Full Text Available The human genome encodes for over 1800 microRNAs, which are short noncoding RNA molecules that function to regulate gene expression post-transcriptionally. Due to the potential for one microRNA to target multiple gene transcripts, microRNAs are recognized as a major mechanism to regulate gene expression and mRNA translation. Computational prediction of microRNA targets is a critical initial step in identifying microRNA:mRNA target interactions for experimental validation. The available tools for microRNA target prediction encompass a range of different computational approaches, from the modeling of physical interactions to the incorporation of machine learning. This review provides an overview of the major computational approaches to microRNA target prediction. Our discussion highlights three tools for their ease of use, reliance on relatively updated versions of miRBase, and range of capabilities, and these are DIANA-microT-CDS, miRanda-mirSVR, and TargetScan. In comparison across all microRNA target prediction tools, four main aspects of the microRNA:mRNA target interaction emerge as common features on which most target prediction is based: seed match, conservation, free energy, and site accessibility. This review explains these features and identifies how they are incorporated into currently available target prediction tools. MicroRNA target prediction is a dynamic field with increasing attention on development of new analysis tools. This review attempts to provide a comprehensive assessment of these tools in a manner that is accessible across disciplines. Understanding the basis of these prediction methodologies will aid in user selection of the appropriate tools and interpretation of the tool output.

  17. MicroRNA expression profiling of the porcine developing brain

    DEFF Research Database (Denmark)

    Podolska, Agnieszka; Kaczkowski, Bogumil; Busk, Peter Kamp

    2011-01-01

    MicroRNAs are small, non-coding RNA molecules that regulate gene expression at the post-transcriptional level and play an important role in the control of developmental and physiological processes. In particular, the developing brain contains an impressive diversity of microRNAs. Most micro...... and the growth curve when compared to humans. Considering these similarities, studies examining microRNA expression during porcine brain development could potentially be used to predict the expression profile and role of microRNAs in the human brain....

  18. Contributions to In Silico Genome Annotation

    KAUST Repository

    Kalkatawi, Manal M.

    2017-11-30

    Genome annotation is an important topic since it provides information for the foundation of downstream genomic and biological research. It is considered as a way of summarizing part of existing knowledge about the genomic characteristics of an organism. Annotating different regions of a genome sequence is known as structural annotation, while identifying functions of these regions is considered as a functional annotation. In silico approaches can facilitate both tasks that otherwise would be difficult and timeconsuming. This study contributes to genome annotation by introducing several novel bioinformatics methods, some based on machine learning (ML) approaches. First, we present Dragon PolyA Spotter (DPS), a method for accurate identification of the polyadenylation signals (PAS) within human genomic DNA sequences. For this, we derived a novel feature-set able to characterize properties of the genomic region surrounding the PAS, enabling development of high accuracy optimized ML predictive models. DPS considerably outperformed the state-of-the-art results. The second contribution concerns developing generic models for structural annotation, i.e., the recognition of different genomic signals and regions (GSR) within eukaryotic DNA. We developed DeepGSR, a systematic framework that facilitates generating ML models to predict GSR with high accuracy. To the best of our knowledge, no available generic and automated method exists for such task that could facilitate the studies of newly sequenced organisms. The prediction module of DeepGSR uses deep learning algorithms to derive highly abstract features that depend mainly on proper data representation and hyperparameters calibration. DeepGSR, which was evaluated on recognition of PAS and translation initiation sites (TIS) in different organisms, yields a simpler and more precise representation of the problem under study, compared to some other hand-tailored models, while producing high accuracy prediction results. Finally

  19. Active learning reduces annotation time for clinical concept extraction.

    Science.gov (United States)

    Kholghi, Mahnoosh; Sitbon, Laurianne; Zuccon, Guido; Nguyen, Anthony

    2017-10-01

    To investigate: (1) the annotation time savings by various active learning query strategies compared to supervised learning and a random sampling baseline, and (2) the benefits of active learning-assisted pre-annotations in accelerating the manual annotation process compared to de novo annotation. There are 73 and 120 discharge summary reports provided by Beth Israel institute in the train and test sets of the concept extraction task in the i2b2/VA 2010 challenge, respectively. The 73 reports were used in user study experiments for manual annotation. First, all sequences within the 73 reports were manually annotated from scratch. Next, active learning models were built to generate pre-annotations for the sequences selected by a query strategy. The annotation/reviewing time per sequence was recorded. The 120 test reports were used to measure the effectiveness of the active learning models. When annotating from scratch, active learning reduced the annotation time up to 35% and 28% compared to a fully supervised approach and a random sampling baseline, respectively. Reviewing active learning-assisted pre-annotations resulted in 20% further reduction of the annotation time when compared to de novo annotation. The number of concepts that require manual annotation is a good indicator of the annotation time for various active learning approaches as demonstrated by high correlation between time rate and concept annotation rate. Active learning has a key role in reducing the time required to manually annotate domain concepts from clinical free text, either when annotating from scratch or reviewing active learning-assisted pre-annotations. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. MicroRNA and cancer

    DEFF Research Database (Denmark)

    Jansson, Martin D; Lund, Anders H

    2012-01-01

    biological phenomena and pathologies. The best characterized non-coding RNA family consists in humans of about 1400 microRNAs for which abundant evidence have demonstrated fundamental importance in normal development, differentiation, growth control and in human diseases such as cancer. In this review, we...... summarize the current knowledge and concepts concerning the involvement of microRNAs in cancer, which have emerged from the study of cell culture and animal model systems, including the regulation of key cancer-related pathways, such as cell cycle control and the DNA damage response. Importantly, micro...

  1. Dehydration triggers differential microRNA expression in Xenopus laevis brain.

    Science.gov (United States)

    Luu, Bryan E; Storey, Kenneth B

    2015-11-15

    African clawed frogs, Xenopus laevis, although primarily aquatic, have a high tolerance for dehydration, being capable of withstanding the loss of up to 32-35% of total water body water. Recent studies have shown that microRNAs play a role in the response to dehydration by the liver, kidney and ventral skin of X. laevis. MicroRNAs act by modulating the expression of mRNA transcripts, thereby affecting diverse biochemical pathways. In this study, 43 microRNAs were assessed in frog brains comparing control and dehydrated (31.2±0.83% of total body water lost) conditions. MicroRNAs of interest were measured using a modified protocol which employs polyadenylation of microRNAs prior to reverse transcription and qPCR. Twelve microRNAs that showed a significant decrease in expression (to 41-77% of control levels) in brains from dehydrated frogs (xla-miR-15a, -150, -181a, -191, -211, -218, -219b, -30c, -30e, -31, -34a, and -34b) were identified. Genomic analysis showed that the sequences of these dehydration-responsive microRNAs were highly conserved as compared with the comparable microRNAs of mice (91-100%). Suppression of these microRNAs implies that translation of the mRNA transcripts under their control could be enhanced in response to dehydration. Bioinformatic analysis using the DIANA miRPath program (v.2.0) predicted the top two KEGG pathways that these microRNAs collectively regulate: 1. Axon guidance, and 2. Long-term potentiation. Previous studies indicated that suppression of these microRNAs promotes neuroprotective pathways by increasing the expression of brain-derived neurotrophic factor and activating anti-apoptotic pathways. This suggests that similar actions may be triggered in X. laevis brains as a protective response to dehydration. Crown Copyright © 2015. Published by Elsevier B.V. All rights reserved.

  2. Regulation of neutrophil senescence by microRNAs.

    Directory of Open Access Journals (Sweden)

    Jon R Ward

    2011-01-01

    Full Text Available Neutrophils are rapidly recruited to sites of tissue injury or infection, where they protect against invading pathogens. Neutrophil functions are limited by a process of neutrophil senescence, which renders the cells unable to respond to chemoattractants, carry out respiratory burst, or degranulate. In parallel, aged neutrophils also undergo spontaneous apoptosis, which can be delayed by factors such as GMCSF. This is then followed by their subsequent removal by phagocytic cells such as macrophages, thereby preventing unwanted inflammation and tissue damage. Neutrophils translate mRNA to make new proteins that are important in maintaining functional longevity. We therefore hypothesised that neutrophil functions and lifespan might be regulated by microRNAs expressed within human neutrophils. Total RNA from highly purified neutrophils was prepared and subjected to microarray analysis using the Agilent human miRNA microarray V3. We found human neutrophils expressed a selected repertoire of 148 microRNAs and that 6 of these were significantly upregulated after a period of 4 hours in culture, at a time when the contribution of apoptosis is negligible. A list of predicted targets for these 6 microRNAs was generated from http://mirecords.biolead.org and compared to mRNA species downregulated over time, revealing 83 genes targeted by at least 2 out of the 6 regulated microRNAs. Pathway analysis of genes containing binding sites for these microRNAs identified the following pathways: chemokine and cytokine signalling, Ras pathway, and regulation of the actin cytoskeleton. Our data suggest that microRNAs may play a role in the regulation of neutrophil senescence and further suggest that manipulation of microRNAs might represent an area of future therapeutic interest for the treatment of inflammatory disease.

  3. MicroRNAs: role and therapeutic targets in viral hepatitis

    NARCIS (Netherlands)

    van der Ree, Meike H.; de Bruijne, Joep; Kootstra, Neeltje A.; Jansen, Peter Lm; Reesink, Hendrik W.

    2014-01-01

    MicroRNAs regulate gene expression by binding to the 3'-untranslated region (UTR) of target messenger RNAs (mRNAs). The importance of microRNAs has been shown for several liver diseases, for example, viral hepatitis. MicroRNA-122 is highly abundant in the liver and is involved in the regulation of

  4. MicroRNA-939 governs vascular integrity and angiogenesis through targeting γ-catenin in endothelial cells

    International Nuclear Information System (INIS)

    Hou, Shiqiang; Fang, Ming; Zhu, Qian; Liu, Ying; Liu, Liang; Li, Xinming

    2017-01-01

    Coronary collateral circulation (CCC) functions as a natural bypass in the event of coronary obstruction, which markedly improves prognosis in patients with coronary artery disease (CAD). MicroRNAs (miRNAs) have been implicated in multiple physiological and pathological processes, including angiogenesis involved in CCC growth. The roles that miRNA-939 (miR-939) plays in angiogenesis remain largely unknown. We conducted this study to explore the expression of miR-939 in CAD patients and its role in angiogenesis. For the first time, our results indicated that the expression of circulating miR-939 was down-regulated in patients with sufficient CCC compared with patients with poor CCC. Overexpression of miR-939 in primary human umbilical vein endothelial cells (HUVECs) significantly inhibited the proliferation, adhesion and tube formation, but promoted the migration of cells. In contrast, miR-939 knockdown exerted reverse effects. We further identified that γ-catenin was a novel target of miR-939 by translational repression, which could rescue the effects of miR-939 in HUVECs. In summary, this study revealed that the expression of circulating miR-939 was down-regulated in CAD patients with sufficient CCC. MiR-939 abolished vascular integrity and repressed angiogenesis through directly targeting γ-catenin. It provided a potential biomarker and a therapeutic target for CAD. - Highlights: • Circulating miR-939 is decreased in sufficient coronary collateral circulation. • MiR-939 abolishes vascular integrity in endothelial cells. • MiR-939 represses angiogenesis. • γ-catenin is a novel target of miR-939.

  5. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

    Energy Technology Data Exchange (ETDEWEB)

    Brettin, Thomas; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Olsen, Gary J.; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D.; Shukla, Maulik; Thomason, James A.; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R.; Xia, Fangfang

    2015-02-10

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.

  6. RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes.

    Science.gov (United States)

    Brettin, Thomas; Davis, James J; Disz, Terry; Edwards, Robert A; Gerdes, Svetlana; Olsen, Gary J; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D; Shukla, Maulik; Thomason, James A; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R; Xia, Fangfang

    2015-02-10

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.

  7. A Flexible Object-of-Interest Annotation Framework for Online Video Portals

    Directory of Open Access Journals (Sweden)

    Robert Sorschag

    2012-02-01

    Full Text Available In this work, we address the use of object recognition techniques to annotate what is shown where in online video collections. These annotations are suitable to retrieve specific video scenes for object related text queries which is not possible with the manually generated metadata that is used by current portals. We are not the first to present object annotations that are generated with content-based analysis methods. However, the proposed framework possesses some outstanding features that offer good prospects for its application in real video portals. Firstly, it can be easily used as background module in any video environment. Secondly, it is not based on a fixed analysis chain but on an extensive recognition infrastructure that can be used with all kinds of visual features, matching and machine learning techniques. New recognition approaches can be integrated into this infrastructure with low development costs and a configuration of the used recognition approaches can be performed even on a running system. Thus, this framework might also benefit from future advances in computer vision. Thirdly, we present an automatic selection approach to support the use of different recognition strategies for different objects. Last but not least, visual analysis can be performed efficiently on distributed, multi-processor environments and a database schema is presented to store the resulting video annotations as well as the off-line generated low-level features in a compact form. We achieve promising results in an annotation case study and the instance search task of the TRECVID 2011 challenge.

  8. Systemic Planning: An Annotated Bibliography and Literature Guide. Exchange Bibliography No. 91.

    Science.gov (United States)

    Catanese, Anthony James

    Systemic planning is an operational approach to using scientific rigor and qualitative judgment in a complementary manner. It integrates rigorous techniques and methods from systems analysis, cybernetics, decision theory, and work programing. The annotated reference sources in this bibliography include those works that have been most influential…

  9. Supporting the annotation of chronic obstructive pulmonary disease (COPD) phenotypes with text mining workflows.

    Science.gov (United States)

    Fu, Xiao; Batista-Navarro, Riza; Rak, Rafal; Ananiadou, Sophia

    2015-01-01

    Chronic obstructive pulmonary disease (COPD) is a life-threatening lung disorder whose recent prevalence has led to an increasing burden on public healthcare. Phenotypic information in electronic clinical records is essential in providing suitable personalised treatment to patients with COPD. However, as phenotypes are often "hidden" within free text in clinical records, clinicians could benefit from text mining systems that facilitate their prompt recognition. This paper reports on a semi-automatic methodology for producing a corpus that can ultimately support the development of text mining tools that, in turn, will expedite the process of identifying groups of COPD patients. A corpus of 30 full-text papers was formed based on selection criteria informed by the expertise of COPD specialists. We developed an annotation scheme that is aimed at producing fine-grained, expressive and computable COPD annotations without burdening our curators with a highly complicated task. This was implemented in the Argo platform by means of a semi-automatic annotation workflow that integrates several text mining tools, including a graphical user interface for marking up documents. When evaluated using gold standard (i.e., manually validated) annotations, the semi-automatic workflow was shown to obtain a micro-averaged F-score of 45.70% (with relaxed matching). Utilising the gold standard data to train new concept recognisers, we demonstrated that our corpus, although still a work in progress, can foster the development of significantly better performing COPD phenotype extractors. We describe in this work the means by which we aim to eventually support the process of COPD phenotype curation, i.e., by the application of various text mining tools integrated into an annotation workflow. Although the corpus being described is still under development, our results thus far are encouraging and show great potential in stimulating the development of further automatic COPD phenotype extractors.

  10. Computer systems for annotation of single molecule fragments

    Science.gov (United States)

    Schwartz, David Charles; Severin, Jessica

    2016-07-19

    There are provided computer systems for visualizing and annotating single molecule images. Annotation systems in accordance with this disclosure allow a user to mark and annotate single molecules of interest and their restriction enzyme cut sites thereby determining the restriction fragments of single nucleic acid molecules. The markings and annotations may be automatically generated by the system in certain embodiments and they may be overlaid translucently onto the single molecule images. An image caching system may be implemented in the computer annotation systems to reduce image processing time. The annotation systems include one or more connectors connecting to one or more databases capable of storing single molecule data as well as other biomedical data. Such diverse array of data can be retrieved and used to validate the markings and annotations. The annotation systems may be implemented and deployed over a computer network. They may be ergonomically optimized to facilitate user interactions.

  11. Image annotation under X Windows

    Science.gov (United States)

    Pothier, Steven

    1991-08-01

    A mechanism for attaching graphic and overlay annotation to multiple bits/pixel imagery while providing levels of performance approaching that of native mode graphics systems is presented. This mechanism isolates programming complexity from the application programmer through software encapsulation under the X Window System. It ensures display accuracy throughout operations on the imagery and annotation including zooms, pans, and modifications of the annotation. Trade-offs that affect speed of display, consumption of memory, and system functionality are explored. The use of resource files to tune the display system is discussed. The mechanism makes use of an abstraction consisting of four parts; a graphics overlay, a dithered overlay, an image overly, and a physical display window. Data structures are maintained that retain the distinction between the four parts so that they can be modified independently, providing system flexibility. A unique technique for associating user color preferences with annotation is introduced. An interface that allows interactive modification of the mapping between image value and color is discussed. A procedure that provides for the colorization of imagery on 8-bit display systems using pixel dithering is explained. Finally, the application of annotation mechanisms to various applications is discussed.

  12. Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data.

    Science.gov (United States)

    He, Zihuai; Xu, Bin; Lee, Seunggeun; Ionita-Laza, Iuliana

    2017-09-07

    Substantial progress has been made in the functional annotation of genetic variation in the human genome. Integrative analysis that incorporates such functional annotations into sequencing studies can aid the discovery of disease-associated genetic variants, especially those with unknown function and located outside protein-coding regions. Direct incorporation of one functional annotation as weight in existing dispersion and burden tests can suffer substantial loss of power when the functional annotation is not predictive of the risk status of a variant. Here, we have developed unified tests that can utilize multiple functional annotations simultaneously for integrative association analysis with efficient computational techniques. We show that the proposed tests significantly improve power when variant risk status can be predicted by functional annotations. Importantly, when functional annotations are not predictive of risk status, the proposed tests incur only minimal loss of power in relation to existing dispersion and burden tests, and under certain circumstances they can even have improved power by learning a weight that better approximates the underlying disease model in a data-adaptive manner. The tests can be constructed with summary statistics of existing dispersion and burden tests for sequencing data, therefore allowing meta-analysis of multiple studies without sharing individual-level data. We applied the proposed tests to a meta-analysis of noncoding rare variants in Metabochip data on 12,281 individuals from eight studies for lipid traits. By incorporating the Eigen functional score, we detected significant associations between noncoding rare variants in SLC22A3 and low-density lipoprotein and total cholesterol, associations that are missed by standard dispersion and burden tests. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  13. New genes expressed in human brains: implications for annotating evolving genomes.

    Science.gov (United States)

    Zhang, Yong E; Landback, Patrick; Vibranovski, Maria; Long, Manyuan

    2012-11-01

    New genes have frequently formed and spread to fixation in a wide variety of organisms, constituting abundant sets of lineage-specific genes. It was recently reported that an excess of primate-specific and human-specific genes were upregulated in the brains of fetuses and infants, and especially in the prefrontal cortex, which is involved in cognition. These findings reveal the prevalent addition of new genetic components to the transcriptome of the human brain. More generally, these findings suggest that genomes are continually evolving in both sequence and content, eroding the conservation endowed by common ancestry. Despite increasing recognition of the importance of new genes, we highlight here that these genes are still seriously under-characterized in functional studies and that new gene annotation is inconsistent in current practice. We propose an integrative approach to annotate new genes, taking advantage of functional and evolutionary genomic methods. We finally discuss how the refinement of new gene annotation will be important for the detection of evolutionary forces governing new gene origination. Copyright © 2012 WILEY Periodicals, Inc.

  14. Two microRNA signatures for malignancy and immune infiltration predict overall survival in advanced epithelial ovarian cancer.

    Science.gov (United States)

    Korsunsky, Ilya; Parameswaran, Janaki; Shapira, Iuliana; Lovecchio, John; Menzin, Andrew; Whyte, Jill; Dos Santos, Lisa; Liang, Sharon; Bhuiya, Tawfiqul; Keogh, Mary; Khalili, Houman; Pond, Cassandra; Liew, Anthony; Shih, Andrew; Gregersen, Peter K; Lee, Annette T

    2017-10-01

    MicroRNAs have been established as key regulators of tumor gene expression and as prime biomarker candidates for clinical phenotypes in epithelial ovarian cancer (EOC). We analyzed the coexpression and regulatory structure of microRNAs and their co-localized gene targets in primary tumor tissue of 20 patients with advanced EOC in order to construct a regulatory signature for clinical prognosis. We performed an integrative analysis to identify two prognostic microRNA/mRNA coexpression modules, each enriched for consistent biological functions. One module, enriched for malignancy-related functions, was found to be upregulated in malignant versus benign samples. The second module, enriched for immune-related functions, was strongly correlated with imputed intratumoral immune infiltrates of T cells, natural killer cells, cytotoxic lymphocytes, and macrophages. We validated the prognostic relevance of the immunological module microRNAs in the publicly available The Cancer Genome Atlas data set. These findings provide novel functional roles for microRNAs in the progression of advanced EOC and possible prognostic signatures for survival. © American Federation for Medical Research (unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  15. microRNA expression in the neural retina: Focus on Müller glia.

    Science.gov (United States)

    Quintero, Heberto; Lamas, Mónica

    2018-03-01

    The neural retina hosts a unique specialized type of macroglial cell that not only preserves retinal homeostasis, function, and integrity but also may serve as a source of new neurons during regenerative processes: the Müller cell. Precise microRNA-driven mechanisms of gene regulation impel and direct the processes of Müller glia lineage acquisition from retinal progenitors during development, the triggering of their response to retinal degeneration and, in some cases, Müller cell reprogramming and regenerative events. In this review we survey the recent reports describing, through functional assays, the regulatory role of microRNAs in Müller cell physiology, differentiation potential, and retinal pathology. We discuss also the evidence based on expression analysis that points out the relevance of a Müller glia-specific microRNA signature that would orchestrate these processes. © 2017 Wiley Periodicals, Inc.

  16. Visual Interpretation with Three-Dimensional Annotations (VITA): three-dimensional image interpretation tool for radiological reporting.

    Science.gov (United States)

    Roy, Sharmili; Brown, Michael S; Shih, George L

    2014-02-01

    This paper introduces a software framework called Visual Interpretation with Three-Dimensional Annotations (VITA) that is able to automatically generate three-dimensional (3D) visual summaries based on radiological annotations made during routine exam reporting. VITA summaries are in the form of rotating 3D volumes where radiological annotations are highlighted to place important clinical observations into a 3D context. The rendered volume is produced as a Digital Imaging and Communications in Medicine (DICOM) object and is automatically added to the study for archival in Picture Archiving and Communication System (PACS). In addition, a video summary (e.g., MPEG4) can be generated for sharing with patients and for situations where DICOM viewers are not readily available to referring physicians. The current version of VITA is compatible with ClearCanvas; however, VITA can work with any PACS workstation that has a structured annotation implementation (e.g., Extendible Markup Language, Health Level 7, Annotation and Image Markup) and is able to seamlessly integrate into the existing reporting workflow. In a survey with referring physicians, the vast majority strongly agreed that 3D visual summaries improve the communication of the radiologists' reports and aid communication with patients.

  17. Motion lecture annotation system to learn Naginata performances

    Science.gov (United States)

    Kobayashi, Daisuke; Sakamoto, Ryota; Nomura, Yoshihiko

    2013-12-01

    This paper describes a learning assistant system using motion capture data and annotation to teach "Naginata-jutsu" (a skill to practice Japanese halberd) performance. There are some video annotation tools such as YouTube. However these video based tools have only single angle of view. Our approach that uses motion-captured data allows us to view any angle. A lecturer can write annotations related to parts of body. We have made a comparison of effectiveness between the annotation tool of YouTube and the proposed system. The experimental result showed that our system triggered more annotations than the annotation tool of YouTube.

  18. MicroRNA-21 Increases Proliferation and Cisplatin Sensitivity of Osteosarcoma-Derived Cells.

    Directory of Open Access Journals (Sweden)

    Vanita Vanas

    Full Text Available Osteosarcoma is the most common primary bone tumor and poor prognosis for osteosarcoma patients is mainly due to chemotherapy resistance. MicroRNAs are important to maintain pathophysiological mechanisms of cancer and influence cell sensitivity to chemotherapy. In this study, we tested the functions of microRNA-21 for malignant features as well as for drug resistance of osteosarcoma. We used Northern blot to measure microRNA-21 levels in osteosarcoma-derived cell lines. MicroRNA-21 activity was modulated by either expressing a sponge to decrease its activity in an osteosarcoma-derived cell line expressing high levels of microRNA-21 or by introducing pri-microRNA-21 in a cell line with low endogenous levels. Cell migration was determined in a scratch assay and cell proliferation was measured by performing growth curve analysis. Sensitivity of the cells towards chemotherapeutics was investigated by performing cell viability assays and calculating the IC50 values. While cell migration was unaffected by modulated microRNA-21 levels, microRNA-21 inhibition slowed proliferation and exogenously expressed microRNA-21 promoted this process. Modulated microRNA-21 activity failed to effect sensitivity of osteosarcoma-derived cell lines to doxorubicin or methotrexate. Contrarily, reduction of microRNA-21 activity resulted in enhanced resistance towards cisplatin while ectopic expression of microRNA-21 showed the opposite effect. Increased microRNA-21 levels repressed the expression of Sprouty2 and ectopic expression of Sprouty2 was able to largely rescue the observed effects of microRNA-21 in osteosarcoma. In summary, our data indicate that in osteosarcoma microRNA-21 expression is an important component for regulation of cell proliferation and for determining sensitivity to cisplatin.

  19. BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal M.

    2015-08-18

    Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON’s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/

  20. BEACON: automated tool for Bacterial GEnome Annotation ComparisON.

    Science.gov (United States)

    Kalkatawi, Manal; Alam, Intikhab; Bajic, Vladimir B

    2015-08-18

    Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON's utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27%, while the number of genes without any function assignment is reduced. We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .

  1. Friend or Foe: MicroRNAs in the p53 network.

    Science.gov (United States)

    Luo, Zhenghua; Cui, Ri; Tili, Esmerina; Croce, Carlo

    2018-04-10

    The critical tumor suppressor gene TP53 is either lost or mutated in more than half of human cancers. As an important transcriptional regulator, p53 modulates the expression of many microRNAs. While wild-type p53 uses microRNAs to suppress cancer development, microRNAs that are activated by gain-of-function mutant p53 confer oncogenic properties. On the other hand, the expression of p53 is tightly controlled by a fine-tune machinery including microRNAs. MicroRNAs can target the TP53 gene directly or other factors in the p53 network so that expression and function of either the wild-type or the mutant forms of p53 is downregulated. Therefore, depending on the wild-type or mutant p53 context, microRNAs contribute substantially to suppress or exacerbate tumor development. Copyright © 2018. Published by Elsevier B.V.

  2. The BioC-BioGRID corpus: full text articles annotated for curation of protein–protein and genetic interactions

    Science.gov (United States)

    Kim, Sun; Chatr-aryamontri, Andrew; Chang, Christie S.; Oughtred, Rose; Rust, Jennifer; Wilbur, W. John; Comeau, Donald C.; Dolinski, Kara; Tyers, Mike

    2017-01-01

    A great deal of information on the molecular genetics and biochemistry of model organisms has been reported in the scientific literature. However, this data is typically described in free text form and is not readily amenable to computational analyses. To this end, the BioGRID database systematically curates the biomedical literature for genetic and protein interaction data. This data is provided in a standardized computationally tractable format and includes structured annotation of experimental evidence. BioGRID curation necessarily involves substantial human effort by expert curators who must read each publication to extract the relevant information. Computational text-mining methods offer the potential to augment and accelerate manual curation. To facilitate the development of practical text-mining strategies, a new challenge was organized in BioCreative V for the BioC task, the collaborative Biocurator Assistant Task. This was a non-competitive, cooperative task in which the participants worked together to build BioC-compatible modules into an integrated pipeline to assist BioGRID curators. As an integral part of this task, a test collection of full text articles was developed that contained both biological entity annotations (gene/protein and organism/species) and molecular interaction annotations (protein–protein and genetic interactions (PPIs and GIs)). This collection, which we call the BioC-BioGRID corpus, was annotated by four BioGRID curators over three rounds of annotation and contains 120 full text articles curated in a dataset representing two major model organisms, namely budding yeast and human. The BioC-BioGRID corpus contains annotations for 6409 mentions of genes and their Entrez Gene IDs, 186 mentions of organism names and their NCBI Taxonomy IDs, 1867 mentions of PPIs and 701 annotations of PPI experimental evidence statements, 856 mentions of GIs and 399 annotations of GI evidence statements. The purpose, characteristics and possible future

  3. Computer simulation models as a tool to investigate the role of microRNAs in osteoarthritis.

    Directory of Open Access Journals (Sweden)

    Carole J Proctor

    Full Text Available The aim of this study was to show how computational models can be used to increase our understanding of the role of microRNAs in osteoarthritis (OA using miR-140 as an example. Bioinformatics analysis and experimental results from the literature were used to create and calibrate models of gene regulatory networks in OA involving miR-140 along with key regulators such as NF-κB, SMAD3, and RUNX2. The individual models were created with the modelling standard, Systems Biology Markup Language, and integrated to examine the overall effect of miR-140 on cartilage homeostasis. Down-regulation of miR-140 may have either detrimental or protective effects for cartilage, indicating that the role of miR-140 is complex. Studies of individual networks in isolation may therefore lead to different conclusions. This indicated the need to combine the five chosen individual networks involving miR-140 into an integrated model. This model suggests that the overall effect of miR-140 is to change the response to an IL-1 stimulus from a prolonged increase in matrix degrading enzymes to a pulse-like response so that cartilage degradation is temporary. Our current model can easily be modified and extended as more experimental data become available about the role of miR-140 in OA. In addition, networks of other microRNAs that are important in OA could be incorporated. A fully integrated model could not only aid our understanding of the mechanisms of microRNAs in ageing cartilage but could also provide a useful tool to investigate the effect of potential interventions to prevent cartilage loss.

  4. FragKB: structural and literature annotation resource of conserved peptide fragments and residues.

    Directory of Open Access Journals (Sweden)

    Ashish V Tendulkar

    Full Text Available BACKGROUND: FragKB (Fragment Knowledgebase is a repository of clusters of structurally similar fragments from proteins. Fragments are annotated with information at the level of sequence, structure and function, integrating biological descriptions derived from multiple existing resources and text mining. METHODOLOGY: FragKB contains approximately 400,000 conserved fragments from 4,800 representative proteins from PDB. Literature annotations are extracted from more than 1,700 articles and are available for over 12,000 fragments. The underlying systematic annotation workflow of FragKB ensures efficient update and maintenance of this database. The information in FragKB can be accessed through a web interface that facilitates sequence and structural visualization of fragments together with known literature information on the consequences of specific residue mutations and functional annotations of proteins and fragment clusters. FragKB is accessible online at http://ubio.bioinfo.cnio.es/biotools/fragkb/. SIGNIFICANCE: The information presented in FragKB can be used for modeling protein structures, for designing novel proteins and for functional characterization of related fragments. The current release is focused on functional characterization of proteins through inspection of conservation of the fragments.

  5. MicroRNA expression profiles in human cancer cells after ionizing radiation

    International Nuclear Information System (INIS)

    Niemoeller, Olivier M; Niyazi, Maximilian; Corradini, Stefanie; Zehentmayr, Franz; Li, Minglun; Lauber, Kirsten; Belka, Claus

    2011-01-01

    MicroRNAs are regulators of central cellular processes and are implicated in the pathogenesis and prognosis of human cancers. MicroRNAs also modulate responses to anti-cancer therapy. In the context of radiation oncology microRNAs were found to modulate cell death and proliferation after irradiation. However, changes in microRNA expression profiles in response to irradiation have not been comprehensively analyzed so far. The present study's intend is to present a broad screen of changes in microRNA expression following irradiation of different malignant cell lines. 1100 microRNAs (Sanger miRBase release version 14.0) were analyzed in six malignant cell lines following irradiation with clinically relevant doses of 2.0 Gy. MicroRNA levels 6 hours after irradiation were compared to microRNA levels in non-irradiated cells using the 'Geniom Biochip MPEA homo sapiens'. Hierarchical clustering analysis revealed a pattern, which significantly (p = 0.014) discerned irradiated from non-irradiated cells. The expression levels of a number of microRNAs known to be involved in the regulation of cellular processes like apoptosis, proliferation, invasion, local immune response and radioresistance (e. g. miR-1285, miR-24-1, miR-151-5p, let-7i) displayed 2 - 3-fold changes after irradiation. Moreover, several microRNAs previously not known to be radiation-responsive were discovered. Ionizing radiation induced significant changes in microRNA expression profiles in 3 glioma and 3 squamous cell carcinoma cell lines. The functional relevance of these changes is not addressed but should by analyzed by future work especially focusing on clinically relevant endpoints like radiation induced cell death, proliferation, migration and metastasis

  6. The effects of environmental chemical carcinogens on the microRNA machinery.

    Science.gov (United States)

    Izzotti, A; Pulliero, A

    2014-07-01

    The first evidence that microRNA expression is early altered by exposure to environmental chemical carcinogens in still healthy organisms was obtained for cigarette smoke. To date, the cumulative experimental data indicate that similar effects are caused by a variety of environmental carcinogens, including polycyclic aromatic hydrocarbons, nitropyrenes, endocrine disruptors, airborne mixtures, carcinogens in food and water, and carcinogenic drugs. Accordingly, the alteration of miRNA expression is a general mechanism that plays an important pathogenic role in linking exposure to environmental toxic agents with their pathological consequences, mainly including cancer development. This review summarizes the existing experimental evidence concerning the effects of chemical carcinogens on the microRNA machinery. For each carcinogen, the specific microRNA alteration signature, as detected in experimental studies, is reported. These data are useful for applying microRNA alterations as early biomarkers of biological effects in healthy organisms exposed to environmental carcinogens. However, microRNA alteration results in carcinogenesis only if accompanied by other molecular damages. As an example, microRNAs altered by chemical carcinogens often inhibits the expression of mutated oncogenes. The long-term exposure to chemical carcinogens causes irreversible suppression of microRNA expression thus allowing the transduction into proteins of mutated oncogenes. This review also analyzes the existing knowledge regarding the mechanisms by which environmental carcinogens alter microRNA expression. The underlying molecular mechanism involves p53-microRNA interconnection, microRNA adduct formation, and alterations of Dicer function. On the whole, reported findings provide evidence that microRNA analysis is a molecular toxicology tool that can elucidate the pathogenic mechanisms activated by environmental carcinogens. Copyright © 2014 Elsevier GmbH. All rights reserved.

  7. dictyBase 2015: Expanding data and annotations in a new software environment.

    Science.gov (United States)

    Basu, Siddhartha; Fey, Petra; Jimenez-Morales, David; Dodson, Robert J; Chisholm, Rex L

    2015-08-01

    dictyBase is the model organism database for the social amoeba Dictyostelium discoideum and related species. The primary mission of dictyBase is to provide the biomedical research community with well-integrated high quality data, and tools that enable original research. Data presented at dictyBase is obtained from sequencing centers, groups performing high throughput experiments such as large-scale mutagenesis studies, and RNAseq data, as well as a growing number of manually added functional gene annotations from the published literature, including Gene Ontology, strain, and phenotype annotations. Through the Dicty Stock Center we provide the community with an impressive amount of annotated strains and plasmids. Recently, dictyBase accomplished a major overhaul to adapt an outdated infrastructure to the current technological advances, thus facilitating the implementation of innovative tools and comparative genomics. It also provides new strategies for high quality annotations that enable bench researchers to benefit from the rapidly increasing volume of available data. dictyBase is highly responsive to its users needs, building a successful relationship that capitalizes on the vast efforts of the Dictyostelium research community. dictyBase has become the trusted data resource for Dictyostelium investigators, other investigators or organizations seeking information about Dictyostelium, as well as educators who use this model system. © 2015 Wiley Periodicals, Inc.

  8. Facilitating functional annotation of chicken microarray data

    Directory of Open Access Journals (Sweden)

    Gresham Cathy R

    2009-10-01

    Full Text Available Abstract Background Modeling results from chicken microarray studies is challenging for researchers due to little functional annotation associated with these arrays. The Affymetrix GenChip chicken genome array, one of the biggest arrays that serve as a key research tool for the study of chicken functional genomics, is among the few arrays that link gene products to Gene Ontology (GO. However the GO annotation data presented by Affymetrix is incomplete, for example, they do not show references linked to manually annotated functions. In addition, there is no tool that facilitates microarray researchers to directly retrieve functional annotations for their datasets from the annotated arrays. This costs researchers amount of time in searching multiple GO databases for functional information. Results We have improved the breadth of functional annotations of the gene products associated with probesets on the Affymetrix chicken genome array by 45% and the quality of annotation by 14%. We have also identified the most significant diseases and disorders, different types of genes, and known drug targets represented on Affymetrix chicken genome array. To facilitate functional annotation of other arrays and microarray experimental datasets we developed an Array GO Mapper (AGOM tool to help researchers to quickly retrieve corresponding functional information for their dataset. Conclusion Results from this study will directly facilitate annotation of other chicken arrays and microarray experimental datasets. Researchers will be able to quickly model their microarray dataset into more reliable biological functional information by using AGOM tool. The disease, disorders, gene types and drug targets revealed in the study will allow researchers to learn more about how genes function in complex biological systems and may lead to new drug discovery and development of therapies. The GO annotation data generated will be available for public use via AgBase website and

  9. Genomic Organization of Zebrafish microRNAs

    Directory of Open Access Journals (Sweden)

    Paydar Ima

    2008-05-01

    Full Text Available Abstract Background microRNAs (miRNAs are small (~22 nt non-coding RNAs that regulate cell movement, specification, and development. Expression of miRNAs is highly regulated, both spatially and temporally. Based on direct cloning, sequence conservation, and predicted secondary structures, a large number of miRNAs have been identified in higher eukaryotic genomes but whether these RNAs are simply a subset of a much larger number of noncoding RNA families is unknown. This is especially true in zebrafish where genome sequencing and annotation is not yet complete. Results We analyzed the zebrafish genome to identify the number and location of proven and predicted miRNAs resulting in the identification of 35 new miRNAs. We then grouped all 415 zebrafish miRNAs into families based on seed sequence identity as a means to identify possible functional redundancy. Based on genomic location and expression analysis, we also identified those miRNAs that are likely to be encoded as part of polycistronic transcripts. Lastly, as a resource, we compiled existing zebrafish miRNA expression data and, where possible, listed all experimentally proven mRNA targets. Conclusion Current analysis indicates the zebrafish genome encodes 415 miRNAs which can be grouped into 44 families. The largest of these families (the miR-430 family contains 72 members largely clustered in two main locations along chromosome 4. Thus far, most zebrafish miRNAs exhibit tissue specific patterns of expression.

  10. Systematic Prediction of the Impacts of Mutations in MicroRNA Seed Sequences

    Directory of Open Access Journals (Sweden)

    Bhattacharya Anindya

    2017-05-01

    Full Text Available MicroRNAs are a class of small non-coding RNAs that are involved in many important biological processes and the dysfunction of microRNA has been associated with many diseases. The seed region of a microRNA is of crucial importance to its target recognition. Mutations in microRNA seed regions may disrupt the binding of microRNAs to their original target genes and make them bind to new target genes. Here we use a knowledge-based computational method to systematically predict the functional effects of all the possible single nucleotide mutations in human microRNA seed regions. The result provides a comprehensive reference for the functional assessment of the impacts of possible natural and artificial single nucleotide mutations in microRNA seed regions.

  11. Dictionary-driven protein annotation.

    Science.gov (United States)

    Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

    2002-09-01

    Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were

  12. Identification and validation of human papillomavirus encoded microRNAs.

    Directory of Open Access Journals (Sweden)

    Kui Qian

    Full Text Available We report here identification and validation of the first papillomavirus encoded microRNAs expressed in human cervical lesions and cell lines. We established small RNA libraries from ten human papillomavirus associated cervical lesions including cancer and two human papillomavirus harboring cell lines. These libraries were sequenced using SOLiD 4 technology. We used the sequencing data to predict putative viral microRNAs and discovered nine putative papillomavirus encoded microRNAs. Validation was performed for five candidates, four of which were successfully validated by qPCR from cervical tissue samples and cell lines: two were encoded by HPV 16, one by HPV 38 and one by HPV 68. The expression of HPV 16 microRNAs was further confirmed by in situ hybridization, and colocalization with p16INK4A was established. Prediction of cellular target genes of HPV 16 encoded microRNAs suggests that they may play a role in cell cycle, immune functions, cell adhesion and migration, development, and cancer. Two putative viral target sites for the two validated HPV 16 miRNAs were mapped to the E5 gene, one in the E1 gene, two in the L1 gene and one in the LCR region. This is the first report to show that papillomaviruses encode their own microRNA species. Importantly, microRNAs were found in libraries established from human cervical disease and carcinoma cell lines, and their expression was confirmed in additional tissue samples. To our knowledge, this is also the first paper to use in situ hybridization to show the expression of a viral microRNA in human tissue.

  13. Characterization and identification of microRNA core promoters in four model species.

    Directory of Open Access Journals (Sweden)

    Xuefeng Zhou

    2007-03-01

    Full Text Available MicroRNAs are short, noncoding RNAs that play important roles in post-transcriptional gene regulation. Although many functions of microRNAs in plants and animals have been revealed in recent years, the transcriptional mechanism of microRNA genes is not well-understood. To elucidate the transcriptional regulation of microRNA genes, we study and characterize, in a genome scale, the promoters of intergenic microRNA genes in Caenorhabditis elegans, Homo sapiens, Arabidopsis thaliana, and Oryza sativa. We show that most known microRNA genes in these four species have the same type of promoters as protein-coding genes have. To further characterize the promoters of microRNA genes, we developed a novel promoter prediction method, called common query voting (CoVote, which is more effective than available promoter prediction methods. Using this new method, we identify putative core promoters of most known microRNA genes in the four model species. Moreover, we characterize the promoters of microRNA genes in these four species. We discover many significant, characteristic sequence motifs in these core promoters, several of which match or resemble the known cis-acting elements for transcription initiation. Among these motifs, some are conserved across different species while some are specific to microRNA genes of individual species.

  14. Tissue-specific regulation of mouse MicroRNA genes in endoderm-derived tissues

    OpenAIRE

    Gao, Yan; Schug, Jonathan; McKenna, Lindsay B.; Le Lay, John; Kaestner, Klaus H.; Greenbaum, Linda E.

    2010-01-01

    MicroRNAs fine-tune the activity of hundreds of protein-coding genes. The identification of tissue-specific microRNAs and their promoters has been constrained by the limited sensitivity of prior microRNA quantification methods. Here, we determine the entire microRNAome of three endoderm-derived tissues, liver, jejunum and pancreas, using ultra-high throughput sequencing. Although many microRNA genes are expressed at comparable levels, 162 microRNAs exhibited striking tissue-specificity. After...

  15. The effectiveness of annotated (vs. non-annotated) digital pathology slides as a teaching tool during dermatology and pathology residencies.

    Science.gov (United States)

    Marsch, Amanda F; Espiritu, Baltazar; Groth, John; Hutchens, Kelli A

    2014-06-01

    With today's technology, paraffin-embedded, hematoxylin & eosin-stained pathology slides can be scanned to generate high quality virtual slides. Using proprietary software, digital images can also be annotated with arrows, circles and boxes to highlight certain diagnostic features. Previous studies assessing digital microscopy as a teaching tool did not involve the annotation of digital images. The objective of this study was to compare the effectiveness of annotated digital pathology slides versus non-annotated digital pathology slides as a teaching tool during dermatology and pathology residencies. A study group composed of 31 dermatology and pathology residents was asked to complete an online pre-quiz consisting of 20 multiple choice style questions, each associated with a static digital pathology image. After completion, participants were given access to an online tutorial composed of digitally annotated pathology slides and subsequently asked to complete a post-quiz. A control group of 12 residents completed a non-annotated version of the tutorial. Nearly all participants in the study group improved their quiz score, with an average improvement of 17%, versus only 3% (P = 0.005) in the control group. These results support the notion that annotated digital pathology slides are superior to non-annotated slides for the purpose of resident education. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  16. Roles of microRNA-15 family in normal and pathological late lung development

    OpenAIRE

    Sakkas, Elpidoforos

    2016-01-01

    MicroRNAs are key regulators of organogenesis and during the last years many studies focused on microRNA expression during embryonic development. To date, there is no study to report possible roles of microRNAs in late lung development and especially during the alveolarization process. The objective of this study was to identify microRNAs that are deregulated under hyperoxic conditions and to assess whether microRNA expression can be modulated in vivo. Lung microRNA expression screening wa...

  17. Identification and differential expression of microRNAs in ovaries of laying and Broody geese (Anser cygnoides by Solexa sequencing.

    Directory of Open Access Journals (Sweden)

    Qi Xu

    Full Text Available BACKGROUND: Recent functional studies have demonstrated that the microRNAs (miRNAs play critical roles in ovarian gonadal development, steroidogenesis, apoptosis, and ovulation in mammals. However, little is known about the involvement of miRNAs in the ovarian function of fowl. The goose (Anas cygnoides is a commercially important food that is cultivated widely in China but the goose industry has been hampered by high broodiness and poor egg laying performance, which are influenced by ovarian function. METHODOLOGY/PRINCIPAL FINDINGS: In this study, the miRNA transcriptomes of ovaries from laying and broody geese were profiled using Solexa deep sequencing and bioinformatics was used to determine differential expression of the miRNAs. As a result, 11,350,396 and 9,890,887 clean reads were obtained in laying and broodiness goose, respectively, and 1,328 conserved known miRNAs and 22 novel potential miRNA candidates were identified. A total of 353 conserved microRNAs were significantly differentially expressed between laying and broody ovaries. Compared with miRNA expression in the laying ovary, 127 miRNAs were up-regulated and 126 miRNAs were down-regulated in the ovary of broody birds. A subset of the differentially expressed miRNAs (G-miR-320, G-miR-202, G-miR-146, and G-miR-143* were validated using real-time quantitative PCR. In addition, 130,458 annotated mRNA transcripts were identified as putative target genes. Gene ontology annotation and KEGG (Kyoto Encyclopedia of Genes and Genomes pathway analysis suggested that the differentially expressed miRNAs are involved in ovarian function, including hormone secretion, reproduction processes and so on. CONCLUSIONS: The present study provides the first global miRNA transcriptome data in A. cygnoides and identifies novel and known miRNAs that are differentially expressed between the ovaries of laying and broody geese. These findings contribute to our understanding of the functional involvement of mi

  18. Automatic annotation of head velocity and acceleration in Anvil

    DEFF Research Database (Denmark)

    Jongejan, Bart

    2012-01-01

    We describe an automatic face tracker plugin for the ANVIL annotation tool. The face tracker produces data for velocity and for acceleration in two dimensions. We compare the annotations generated by the face tracking algorithm with independently made manual annotations for head movements....... The annotations are a useful supplement to manual annotations and may help human annotators to quickly and reliably determine onset of head movements and to suggest which kind of head movement is taking place....

  19. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences

    OpenAIRE

    Huerta-Cepas, J.; Szklarczyk, D.; Forslund, K.; Cook, H.; Heller, D.; Walter, M.C.; Rattei, T.; Mende, D.R.; Sunagawa, S.; Kuhn, M.; Jensen, L.J.; von Mering, C.; Bork, P.

    2016-01-01

    eggNOG is a public resource that provides Orthologous Groups (OGs) of proteins at different taxonomic levels, each with integrated and summarized functional annotations. Developments since the latest public release include changes to the algorithm for creating OGs across taxonomic levels, making nested groups hierarchically consistent. This allows for a better propagation of functional terms across nested OGs and led to the novel annotation of 95 890 previously uncharacterized OGs, increasing...

  20. Mesotext. Framing and exploring annotations

    NARCIS (Netherlands)

    Boot, P.; Boot, P.; Stronks, E.

    2007-01-01

    From the introduction: Annotation is an important item on the wish list for digital scholarly tools. It is one of John Unsworth’s primitives of scholarship (Unsworth 2000). Especially in linguistics,a number of tools have been developed that facilitate the creation of annotations to source material

  1. Annotating images by mining image search results

    NARCIS (Netherlands)

    Wang, X.J.; Zhang, L.; Li, X.; Ma, W.Y.

    2008-01-01

    Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search

  2. MicroRNAs in the host response to viral infections of veterinary importance

    Directory of Open Access Journals (Sweden)

    Mohamed Samir Ahmed

    2016-10-01

    Full Text Available The discovery of small regulatory non-coding RNAs has been an exciting advance in the field of genomics. MicroRNAs (miRNAs are endogenous RNA molecules, approximately 22 nucleotides in length that regulate gene expression, mostly at the post-transcriptional level. MiRNA profiling technologies have made it possible to identify and quantify novel miRNAs and to study their regulation and potential roles in disease pathogenesis. Although miRNAs have been extensively investigated in viral infections of humans, their implications in viral diseases affecting animals of veterinary importance are much less understood. The number of annotated miRNAs in different animal species is growing continuously, and novel roles in regulating host-pathogen interactions are being discovered, for instance miRNA-mediated augmentation of viral transcription and replication. In this review, we present an overview of synthesis and function of miRNAs and an update on the current state of research on host-encoded miRNAs in the genesis of viral infectious diseases in their natural animal host as well as in selected in vivo and in vitro laboratory models.

  3. New research progress of microRNAs in retinoblastoma

    Directory of Open Access Journals (Sweden)

    Jing Zeng

    2014-11-01

    Full Text Available Retinoblastoma(RBis the most common intraocular malignancy of children with extremely poor prognosis. MicroRNAs are small non-coding single-stranded RNAs in eukaryotic cells, which regulate the expression of gene by mRNA degradation or translation inhibition. MicroRNAs, acting as oncogenes or tumor suppressor genes, are associated with the occurrence and development of RB directly, which is vital for the early diagnosis and clinical targeted therapy of RB. This review summarized the expression of microRNAs in RB and the related mechanism.

  4. MicroRNA profiling of primary cutaneous large B-cell lymphomas.

    Directory of Open Access Journals (Sweden)

    Lianne Koens

    Full Text Available Aberrant expression of microRNAs is widely accepted to be pathogenetically involved in nodal diffuse large B-cell lymphomas (DLBCLs. However, the microRNAs profiles of primary cutaneous large B-cell lymphomas (PCLBCLs are not yet described. Its two main subtypes, i.e., primary cutaneous diffuse large B-cell lymphoma, leg type (PCLBCL-LT and primary cutaneous follicle center lymphoma (PCFCL are characterized by an activated B-cell (ABC-genotype and a germinal center B-cell (GCB-genotype, respectively. We performed high-throughput sequencing analysis on frozen tumor biopsies from 19 cases of PCFCL and PCLBCL-LT to establish microRNA profiles. Cluster analysis of the complete microRNome could not distinguish between the two subtypes, but 16 single microRNAs were found to be differentially expressed. Single microRNA RT-qPCR was conducted on formalin-fixed paraffin-embedded tumor biopsies of 20 additional cases, confirming higher expression of miR-9-5p, miR-31-5p, miR-129-2-3p and miR-214-3p in PCFCL as compared to PCLBCL-LT. MicroRNAs previously described to be higher expressed in ABC-type as compared to GCB-type nodal DLBCL were not differentially expressed between PCFCL and PCLBCL-LT. In conclusion, PCFCL and PCLBCL-LT differ in their microRNA profiles. In contrast to their gene expression profile, they only show slight resemblance with the microRNA profiles found in GCB- and ABC-type nodal DLBCL.

  5. Teaching and Learning Communities through Online Annotation

    Science.gov (United States)

    van der Pluijm, B.

    2016-12-01

    What do colleagues do with your assigned textbook? What they say or think about the material? Want students to be more engaged in their learning experience? If so, online materials that complement standard lecture format provide new opportunity through managed, online group annotation that leverages the ubiquity of internet access, while personalizing learning. The concept is illustrated with the new online textbook "Processes in Structural Geology and Tectonics", by Ben van der Pluijm and Stephen Marshak, which offers a platform for sharing of experiences, supplementary materials and approaches, including readings, mathematical applications, exercises, challenge questions, quizzes, alternative explanations, and more. The annotation framework used is Hypothes.is, which offers a free, open platform markup environment for annotation of websites and PDF postings. The annotations can be public, grouped or individualized, as desired, including export access and download of annotations. A teacher group, hosted by a moderator/owner, limits access to members of a user group of teachers, so that its members can use, copy or transcribe annotations for their own lesson material. Likewise, an instructor can host a student group that encourages sharing of observations, questions and answers among students and instructor. Also, the instructor can create one or more closed groups that offers study help and hints to students. Options galore, all of which aim to engage students and to promote greater responsibility for their learning experience. Beyond new capacity, the ability to analyze student annotation supports individual learners and their needs. For example, student notes can be analyzed for key phrases and concepts, and identify misunderstandings, omissions and problems. Also, example annotations can be shared to enhance notetaking skills and to help with studying. Lastly, online annotation allows active application to lecture posted slides, supporting real-time notetaking

  6. Modulation of microRNA activity by semi-microRNAs (smiRNAs

    Directory of Open Access Journals (Sweden)

    Isabelle ePlante

    2012-06-01

    Full Text Available The ribonuclease Dicer plays a central role in the microRNA pathway by catalyzing the formation of 19 to 24-nucleotide (nt long microRNAs. Subsequently incorporated into Ago2 effector complexes, microRNAs are known to regulate messenger RNA (mRNA translation. Whether shorter RNA species derived from microRNAs exist and play a role in mRNA regulation remains unknown. Here, we report the serendipitous discovery of a 12-nt long RNA species corresponding to the 5’ region of the microRNA let-7, and tentatively termed semi-microRNA, or smiRNA. Using a smiRNA derived from the precursor of miR-223 as a model, we show that 12-nt long smiRNA species are devoid of any direct mRNA regulatory activity, as assessed in a reporter gene activity assay in transfected cultured human cells. However, smiR-223 was found to modulate the ability of the microRNA from which it derives to mediate translational repression or cleavage of reporter mRNAs. Our findings suggest that smiRNAs may be generated along the microRNA pathway and participate to the control of gene expression by regulating the activity of the related full-length mature microRNA in vivo.

  7. Displaying Annotations for Digitised Globes

    Science.gov (United States)

    Gede, Mátyás; Farbinger, Anna

    2018-05-01

    Thanks to the efforts of the various globe digitising projects, nowadays there are plenty of old globes that can be examined as 3D models on the computer screen. These globes usually contain a lot of interesting details that an average observer would not entirely discover for the first time. The authors developed a website that can display annotations for such digitised globes. These annotations help observers of the globe to discover all the important, interesting details. Annotations consist of a plain text title, a HTML formatted descriptive text and a corresponding polygon and are stored in KML format. The website is powered by the Cesium virtual globe engine.

  8. THE DIMENSIONS OF COMPOSITION ANNOTATION.

    Science.gov (United States)

    MCCOLLY, WILLIAM

    ENGLISH TEACHER ANNOTATIONS WERE STUDIED TO DETERMINE THE DIMENSIONS AND PROPERTIES OF THE ENTIRE SYSTEM FOR WRITING CORRECTIONS AND CRITICISMS ON COMPOSITIONS. FOUR SETS OF COMPOSITIONS WERE WRITTEN BY STUDENTS IN GRADES 9 THROUGH 13. TYPESCRIPTS OF THE COMPOSITIONS WERE ANNOTATED BY CLASSROOM ENGLISH TEACHERS. THEN, 32 ENGLISH TEACHERS JUDGED…

  9. Evaluation of three automated genome annotations for Halorhabdus utahensis.

    Directory of Open Access Journals (Sweden)

    Peter Bakke

    2009-07-01

    Full Text Available Genome annotations are accumulating rapidly and depend heavily on automated annotation systems. Many genome centers offer annotation systems but no one has compared their output in a systematic way to determine accuracy and inherent errors. Errors in the annotations are routinely deposited in databases such as NCBI and used to validate subsequent annotation errors. We submitted the genome sequence of halophilic archaeon Halorhabdus utahensis to be analyzed by three genome annotation services. We have examined the output from each service in a variety of ways in order to compare the methodology and effectiveness of the annotations, as well as to explore the genes, pathways, and physiology of the previously unannotated genome. The annotation services differ considerably in gene calls, features, and ease of use. We had to manually identify the origin of replication and the species-specific consensus ribosome-binding site. Additionally, we conducted laboratory experiments to test H. utahensis growth and enzyme activity. Current annotation practices need to improve in order to more accurately reflect a genome's biological potential. We make specific recommendations that could improve the quality of microbial annotation projects.

  10. microRNA Response to Listeria monocytogenes Infection in Epithelial Cells

    Science.gov (United States)

    Izar, Benjamin; Mannala, Gopala Krishna; Mraheil, Mobarak Abu; Chakraborty, Trinad; Hain, Torsten

    2012-01-01

    microRNAs represent a family of very small non-coding RNAs that control several physiologic and pathologic processes, including host immune response and cancer by antagonizing a number of target mRNAs. There is limited knowledge about cell expression and the regulatory role of microRNAs following bacterial infections. We investigated whether infection with a Gram-positive bacterium leads to altered expression of microRNAs involved in the host cell response in epithelial cells. Caco-2 cells were infected with Listeria monocytogenes EGD-e, a mutant strain (ΔinlAB or Δhly) or incubated with purified listeriolysin (LLO). Total RNA was isolated and microRNA and target gene expression was compared to the expression in non-infected cells using microRNA microarrays and qRT-PCR. We identified and validated five microRNAs (miR- 146b, miR-16, let-7a1, miR-145 and miR-155) that were significantly deregulated following listerial infection. We show that expression patterns of particular microRNAs strongly depend on pathogen localization and the presence of bacterial effector proteins. Strikingly, miR-155 which was shown to have an important role in inflammatory responses during infection was induced by wild-type bacteria, by LLO-deficient bacteria and following incubation with purified LLO. It was downregulated following ΔinlAB infection indicating a new potent role for internalins in listerial pathogenicity and miRNA regulation. Concurrently, we observed differences in target transcript expression of the investigated miRNAs. We provide first evidence that L. monocytogenes infection leads to deregulation of a set of microRNAs with important roles in host response. Distinct microRNA expression depends on both LLO and pathogen localization. PMID:22312311

  11. PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.

    Science.gov (United States)

    Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay

    2015-12-01

    A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. BG7: A New Approach for Bacterial Genome Annotation Designed for Next Generation Sequencing Data

    Science.gov (United States)

    Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Pareja, Eduardo; Tobes, Raquel

    2012-01-01

    BG7 is a new system for de novo bacterial, archaeal and viral genome annotation based on a new approach specifically designed for annotating genomes sequenced with next generation sequencing technologies. The system is versatile and able to annotate genes even in the step of preliminary assembly of the genome. It is especially efficient detecting unexpected genes horizontally acquired from bacterial or archaeal distant genomes, phages, plasmids, and mobile elements. From the initial phases of the gene annotation process, BG7 exploits the massive availability of annotated protein sequences in databases. BG7 predicts ORFs and infers their function based on protein similarity with a wide set of reference proteins, integrating ORF prediction and functional annotation phases in just one step. BG7 is especially tolerant to sequencing errors in start and stop codons, to frameshifts, and to assembly or scaffolding errors. The system is also tolerant to the high level of gene fragmentation which is frequently found in not fully assembled genomes. BG7 current version – which is developed in Java, takes advantage of Amazon Web Services (AWS) cloud computing features, but it can also be run locally in any operating system. BG7 is a fast, automated and scalable system that can cope with the challenge of analyzing the huge amount of genomes that are being sequenced with NGS technologies. Its capabilities and efficiency were demonstrated in the 2011 EHEC Germany outbreak in which BG7 was used to get the first annotations right the next day after the first entero-hemorrhagic E. coli genome sequences were made publicly available. The suitability of BG7 for genome annotation has been proved for Illumina, 454, Ion Torrent, and PacBio sequencing technologies. Besides, thanks to its plasticity, our system could be very easily adapted to work with new technologies in the future. PMID:23185310

  13. BG7: a new approach for bacterial genome annotation designed for next generation sequencing data.

    Directory of Open Access Journals (Sweden)

    Pablo Pareja-Tobes

    Full Text Available BG7 is a new system for de novo bacterial, archaeal and viral genome annotation based on a new approach specifically designed for annotating genomes sequenced with next generation sequencing technologies. The system is versatile and able to annotate genes even in the step of preliminary assembly of the genome. It is especially efficient detecting unexpected genes horizontally acquired from bacterial or archaeal distant genomes, phages, plasmids, and mobile elements. From the initial phases of the gene annotation process, BG7 exploits the massive availability of annotated protein sequences in databases. BG7 predicts ORFs and infers their function based on protein similarity with a wide set of reference proteins, integrating ORF prediction and functional annotation phases in just one step. BG7 is especially tolerant to sequencing errors in start and stop codons, to frameshifts, and to assembly or scaffolding errors. The system is also tolerant to the high level of gene fragmentation which is frequently found in not fully assembled genomes. BG7 current version - which is developed in Java, takes advantage of Amazon Web Services (AWS cloud computing features, but it can also be run locally in any operating system. BG7 is a fast, automated and scalable system that can cope with the challenge of analyzing the huge amount of genomes that are being sequenced with NGS technologies. Its capabilities and efficiency were demonstrated in the 2011 EHEC Germany outbreak in which BG7 was used to get the first annotations right the next day after the first entero-hemorrhagic E. coli genome sequences were made publicly available. The suitability of BG7 for genome annotation has been proved for Illumina, 454, Ion Torrent, and PacBio sequencing technologies. Besides, thanks to its plasticity, our system could be very easily adapted to work with new technologies in the future.

  14. Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease.

    Science.gov (United States)

    Sifrim, Alejandro; Van Houdt, Jeroen Kj; Tranchevent, Leon-Charles; Nowakowska, Beata; Sakai, Ryo; Pavlopoulos, Georgios A; Devriendt, Koen; Vermeesch, Joris R; Moreau, Yves; Aerts, Jan

    2012-01-01

    The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org.

  15. Annotated chemical patent corpus: a gold standard for text mining.

    Directory of Open Access Journals (Sweden)

    Saber A Akhondi

    Full Text Available Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.

  16. Diet-responsive microRNAs are likely exogenous

    Science.gov (United States)

    In a recent report Title "et al". fostered miRNA-375 and miR-200c knock-out pups to wild-type dams and arrived at the conclusion that milk microRNAs are bioavailable in trace amounts at best and that postprandial concentrations of microRNAs are too low to elicit biological effects. Their take home m...

  17. Identification of differentially expressed microRNAs in human male breast cancer

    Directory of Open Access Journals (Sweden)

    Schipper Elisa

    2010-03-01

    Full Text Available Abstract Background The discovery of small non-coding RNAs and the subsequent analysis of microRNA expression patterns in human cancer specimens have provided completely new insights into cancer biology. Genetic and epigenetic data indicate oncogenic or tumor suppressor function of these pleiotropic regulators. Therefore, many studies analyzed the expression and function of microRNA in human breast cancer, the most frequent malignancy in females. However, nothing is known so far about microRNA expression in male breast cancer, accounting for approximately 1% of all breast cancer cases. Methods The expression of 319 microRNAs was analyzed in 9 primary human male breast tumors and in epithelial cells from 15 male gynecomastia specimens using fluorescence-labeled bead technology. For identification of differentially expressed microRNAs data were analyzed by cluster analysis and selected statistical methods. Expression levels were validated for the most up- or down-regulated microRNAs in this training cohort using real-time PCR methodology as well as in an independent test cohort comprising 12 cases of human male breast cancer. Results Unsupervised cluster analysis separated very well male breast cancer samples and control specimens according to their microRNA expression pattern indicating cancer-specific alterations of microRNA expression in human male breast cancer. miR-21, miR519d, miR-183, miR-197, and miR-493-5p were identified as most prominently up-regulated, miR-145 and miR-497 as most prominently down-regulated in male breast cancer. Conclusions Male breast cancer displays several differentially expressed microRNAs. Not all of them are shared with breast cancer biopsies from female patients indicating male breast cancer specific alterations of microRNA expression.

  18. Identification of serum microRNA biomarkers for tuberculosis using RNA-seq.

    Directory of Open Access Journals (Sweden)

    Hongtai Zhang

    Full Text Available Tuberculosis (TB remains a significant human health issue. More effective biomarkers for use in tuberculosis prevention, diagnosis, and treatment, including markers that can discriminate between healthy individuals and those with latent infection, are urgently needed. To identify a set of such markers, we used Solexa sequencing to examine microRNA expression in the serum of patients with active disease, healthy individuals with latent TB, and those with or without prior BCG inoculation. We identified 24 microRNAs that are up-regulated (2.85-1285.93 fold and 6 microRNAs that are down-regulated (0.003-0.11 fold (P<0.05 in patients with active TB relative to the three groups of healthy controls. In addition, 75 microRNAs were up-regulated (2.05-2454.58 fold and 11 were down-regulated (0.001-0.42 fold (P<0.05 in latent-TB infected individuals relative to BCG- inoculated individuals. Of interest, 134 microRNAs were differentially-expressed in BCG-inoculated relative to un-inoculated individuals (18 up-regulated 2.9-499.29 fold, 116 down-regulated 0.0002-0.5 fold, providing insights into the effects of BCG inoculation at the microRNA level. Target prediction of differentially-expressed microRNAs by microRNA-Gene Network analysis and analysis of pathways affected suggest that regulation of the host immune system by microRNAs is likely to be one of the main factors in the pathogenesis of tuberculosis. qRT-PCR validation indicated that hsa-miR-196b and hsa-miR-376c have potential as markers for active TB disease. The microRNA differential-expression profiles generated in this study provide a good foundation for the development of markers for TB diagnosis, and for investigations on the role of microRNAs in BCG-inoculated and latent-infected individuals.

  19. Diverse Image Annotation

    KAUST Repository

    Wu, Baoyuan

    2017-11-09

    In this work we study the task of image annotation, of which the goal is to describe an image using a few tags. Instead of predicting the full list of tags, here we target for providing a short list of tags under a limited number (e.g., 3), to cover as much information as possible of the image. The tags in such a short list should be representative and diverse. It means they are required to be not only corresponding to the contents of the image, but also be different to each other. To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly. We further explore the semantic hierarchy and synonyms among the candidate tags, and require that two tags in a semantic hierarchy or in a pair of synonyms should not be selected simultaneously. This requirement is then embedded into the sampling algorithm according to the learned conditional DPP model. Besides, we find that traditional metrics for image annotation (e.g., precision, recall and F1 score) only consider the representation, but ignore the diversity. Thus we propose new metrics to evaluate the quality of the selected subset (i.e., the tag list), based on the semantic hierarchy and synonyms. Human study through Amazon Mechanical Turk verifies that the proposed metrics are more close to the humans judgment than traditional metrics. Experiments on two benchmark datasets show that the proposed method can produce more representative and diverse tags, compared with existing image annotation methods.

  20. Diverse Image Annotation

    KAUST Repository

    Wu, Baoyuan; Jia, Fan; Liu, Wei; Ghanem, Bernard

    2017-01-01

    In this work we study the task of image annotation, of which the goal is to describe an image using a few tags. Instead of predicting the full list of tags, here we target for providing a short list of tags under a limited number (e.g., 3), to cover as much information as possible of the image. The tags in such a short list should be representative and diverse. It means they are required to be not only corresponding to the contents of the image, but also be different to each other. To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly. We further explore the semantic hierarchy and synonyms among the candidate tags, and require that two tags in a semantic hierarchy or in a pair of synonyms should not be selected simultaneously. This requirement is then embedded into the sampling algorithm according to the learned conditional DPP model. Besides, we find that traditional metrics for image annotation (e.g., precision, recall and F1 score) only consider the representation, but ignore the diversity. Thus we propose new metrics to evaluate the quality of the selected subset (i.e., the tag list), based on the semantic hierarchy and synonyms. Human study through Amazon Mechanical Turk verifies that the proposed metrics are more close to the humans judgment than traditional metrics. Experiments on two benchmark datasets show that the proposed method can produce more representative and diverse tags, compared with existing image annotation methods.

  1. Annotating individual human genomes.

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A; Topol, Eric J; Schork, Nicholas J

    2011-10-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. Copyright © 2011 Elsevier Inc. All rights reserved.

  2. ANNOTATING INDIVIDUAL HUMAN GENOMES*

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A.; Topol, Eric J.; Schork, Nicholas J.

    2014-01-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely to amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. PMID:21839162

  3. The Role of microRNAs in the Pathogenesis of Herpesvirus Infection.

    Science.gov (United States)

    Piedade, Diogo; Azevedo-Pereira, José Miguel

    2016-06-02

    MicroRNAs (miRNAs) are small non-coding RNAs important in gene regulation. They are able to regulate mRNA translation through base-pair complementarity. Cellular miRNAs have been involved in the regulation of nearly all cellular pathways, and their deregulation has been associated with several diseases such as cancer. Given the importance of microRNAs to cell homeostasis, it is no surprise that viruses have evolved to take advantage of this cellular pathway. Viruses have been reported to be able to encode and express functional viral microRNAs that target both viral and cellular transcripts. Moreover, viral inhibition of key proteins from the microRNA pathway and important changes in cellular microRNA pool have been reported upon viral infection. In addition, viruses have developed multiple mechanisms to avoid being targeted by cellular microRNAs. This complex interaction between host and viruses to control the microRNA pathway usually favors viral infection and persistence by either reducing immune detection, avoiding apoptosis, promoting cell growth, or promoting lytic or latent infection. One of the best examples of this virus-host-microRNA interplay emanates from members of the Herperviridae family, namely the herpes simplex virus type 1 and type 2 (HSV-1 and HSV-2), human cytomegalovirus (HCMV), human herpesvirus 8 (HHV-8), and the Epstein-Barr virus (EBV). In this review, we will focus on the general functions of microRNAs and the interactions between herpesviruses, human hosts, and microRNAs and will delve into the related mechanisms that contribute to infection and pathogenesis.

  4. The GATO gene annotation tool for research laboratories

    Directory of Open Access Journals (Sweden)

    A. Fujita

    2005-11-01

    Full Text Available Large-scale genome projects have generated a rapidly increasing number of DNA sequences. Therefore, development of computational methods to rapidly analyze these sequences is essential for progress in genomic research. Here we present an automatic annotation system for preliminary analysis of DNA sequences. The gene annotation tool (GATO is a Bioinformatics pipeline designed to facilitate routine functional annotation and easy access to annotated genes. It was designed in view of the frequent need of genomic researchers to access data pertaining to a common set of genes. In the GATO system, annotation is generated by querying some of the Web-accessible resources and the information is stored in a local database, which keeps a record of all previous annotation results. GATO may be accessed from everywhere through the internet or may be run locally if a large number of sequences are going to be annotated. It is implemented in PHP and Perl and may be run on any suitable Web server. Usually, installation and application of annotation systems require experience and are time consuming, but GATO is simple and practical, allowing anyone with basic skills in informatics to access it without any special training. GATO can be downloaded at [http://mariwork.iq.usp.br/gato/]. Minimum computer free space required is 2 MB.

  5. Bridging the Gap: Enriching YouTube Videos with Jazz Music Annotations

    Directory of Open Access Journals (Sweden)

    Stefan Balke

    2018-02-01

    Full Text Available Web services allow permanent access to music from all over the world. Especially in the case of web services with user-supplied content, e.g., YouTube™, the available metadata is often incomplete or erroneous. On the other hand, a vast amount of high-quality and musically relevant metadata has been annotated in research areas such as Music Information Retrieval (MIR. Although they have great potential, these musical annotations are often inaccessible to users outside the academic world. With our contribution, we want to bridge this gap by enriching publicly available multimedia content with musical annotations available in research corpora, while maintaining easy access to the underlying data. Our web-based tools offer researchers and music lovers novel possibilities to interact with and navigate through the content. In this paper, we consider a research corpus called the Weimar Jazz Database (WJD as an illustrating example scenario. The WJD contains various annotations related to famous jazz solos. First, we establish a link between the WJD annotations and corresponding YouTube videos employing existing retrieval techniques. With these techniques, we were able to identify 988 corresponding YouTube videos for 329 solos out of 456 solos contained in the WJD. We then embed the retrieved videos in a recently developed web-based platform and enrich the videos with solo transcriptions that are part of the WJD. Furthermore, we integrate publicly available data resources from the Semantic Web in order to extend the presented information, for example, with a detailed discography or artists-related information. Our contribution illustrates the potential of modern web-based technologies for the digital humanities, and novel ways for improving access and interaction with digitized multimedia content.

  6. Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58

    International Nuclear Information System (INIS)

    Yu Jia-Feng; Sui Tian-Xiang; Wang Ji-Hua; Wang Hong-Mei; Wang Chun-Ling; Jing Li

    2015-01-01

    Agrobacterium tumefaciens strain C58 is a type of pathogen that can cause tumors in some dicotyledonous plants. Ever since the genome of A. tumefaciens strain C58 was sequenced, the quality of annotation of its protein-coding genes has been queried continually, because the annotation varies greatly among different databases. In this paper, the questionable hypothetical genes were re-predicted by integrating the TN curve and Z curve methods. As a result, 30 genes originally annotated as “hypothetical” were discriminated as being non-coding sequences. By testing the re-prediction program 10 times on data sets composed of the function-known genes, the mean accuracy of 99.99% and mean Matthews correlation coefficient value of 0.9999 were obtained. Further sequence analysis and COG analysis showed that the re-annotation results were very reliable. This work can provide an efficient tool and data resources for future studies of A. tumefaciens strain C58. (special topic)

  7. GSV Annotated Bibliography

    Energy Technology Data Exchange (ETDEWEB)

    Roberts, Randy S. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Pope, Paul A. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Jiang, Ming [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Trucano, Timothy G. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Aragon, Cecilia R. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Ni, Kevin [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Wei, Thomas [Argonne National Lab. (ANL), Argonne, IL (United States); Chilton, Lawrence K. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Bakel, Alan [Argonne National Lab. (ANL), Argonne, IL (United States)

    2010-09-14

    The following annotated bibliography was developed as part of the geospatial algorithm verification and validation (GSV) project for the Simulation, Algorithms and Modeling program of NA-22. Verification and Validation of geospatial image analysis algorithms covers a wide range of technologies. Papers in the bibliography are thus organized into the following five topic areas: Image processing and analysis, usability and validation of geospatial image analysis algorithms, image distance measures, scene modeling and image rendering, and transportation simulation models. Many other papers were studied during the course of the investigation including. The annotations for these articles can be found in the paper "On the verification and validation of geospatial image analysis algorithms".

  8. The BioC-BioGRID corpus: full text articles annotated for curation of protein-protein and genetic interactions.

    Science.gov (United States)

    Islamaj Dogan, Rezarta; Kim, Sun; Chatr-Aryamontri, Andrew; Chang, Christie S; Oughtred, Rose; Rust, Jennifer; Wilbur, W John; Comeau, Donald C; Dolinski, Kara; Tyers, Mike

    2017-01-01

    A great deal of information on the molecular genetics and biochemistry of model organisms has been reported in the scientific literature. However, this data is typically described in free text form and is not readily amenable to computational analyses. To this end, the BioGRID database systematically curates the biomedical literature for genetic and protein interaction data. This data is provided in a standardized computationally tractable format and includes structured annotation of experimental evidence. BioGRID curation necessarily involves substantial human effort by expert curators who must read each publication to extract the relevant information. Computational text-mining methods offer the potential to augment and accelerate manual curation. To facilitate the development of practical text-mining strategies, a new challenge was organized in BioCreative V for the BioC task, the collaborative Biocurator Assistant Task. This was a non-competitive, cooperative task in which the participants worked together to build BioC-compatible modules into an integrated pipeline to assist BioGRID curators. As an integral part of this task, a test collection of full text articles was developed that contained both biological entity annotations (gene/protein and organism/species) and molecular interaction annotations (protein-protein and genetic interactions (PPIs and GIs)). This collection, which we call the BioC-BioGRID corpus, was annotated by four BioGRID curators over three rounds of annotation and contains 120 full text articles curated in a dataset representing two major model organisms, namely budding yeast and human. The BioC-BioGRID corpus contains annotations for 6409 mentions of genes and their Entrez Gene IDs, 186 mentions of organism names and their NCBI Taxonomy IDs, 1867 mentions of PPIs and 701 annotations of PPI experimental evidence statements, 856 mentions of GIs and 399 annotations of GI evidence statements. The purpose, characteristics and possible future

  9. Differentially expressed microRNA in multiple sclerosis: A window into pathogenesis?

    DEFF Research Database (Denmark)

    Martin, Nellie Anne; Illés, Zsolt

    2014-01-01

    MicroRNA are small non-coding RNA that mediate mRNA translation repression or mRNA degradation, and thereby refine protein expression levels. More than 30–60% of all genes are regulated by microRNA. Exploring disease-related microRNA signatures is an emerging tool in biomarker discovery, and sile......MicroRNA are small non-coding RNA that mediate mRNA translation repression or mRNA degradation, and thereby refine protein expression levels. More than 30–60% of all genes are regulated by microRNA. Exploring disease-related microRNA signatures is an emerging tool in biomarker discovery......RNA related to multiple sclerosis has increased significantly in recent years. Differentially expressed microRNA have been identified in the whole blood, serum, plasma, cerebrospinal fluid, peripheral blood mononuclear cells, blood-derived cell subsets and brain lesions of patients with multiple sclerosis....... Most studies applied a non-candidate approach of screening by microarray and validation by quantitative polymerase chain reaction or next generation sequencing; others used a candidate-driven approach. Despite a relatively high number of multiple sclerosis-associated microRNA, just a few could...

  10. Solar Tutorial and Annotation Resource (STAR)

    Science.gov (United States)

    Showalter, C.; Rex, R.; Hurlburt, N. E.; Zita, E. J.

    2009-12-01

    We have written a software suite designed to facilitate solar data analysis by scientists, students, and the public, anticipating enormous datasets from future instruments. Our “STAR" suite includes an interactive learning section explaining 15 classes of solar events. Users learn software tools that exploit humans’ superior ability (over computers) to identify many events. Annotation tools include time slice generation to quantify loop oscillations, the interpolation of event shapes using natural cubic splines (for loops, sigmoids, and filaments) and closed cubic splines (for coronal holes). Learning these tools in an environment where examples are provided prepares new users to comfortably utilize annotation software with new data. Upon completion of our tutorial, users are presented with media of various solar events and asked to identify and annotate the images, to test their mastery of the system. Goals of the project include public input into the data analysis of very large datasets from future solar satellites, and increased public interest and knowledge about the Sun. In 2010, the Solar Dynamics Observatory (SDO) will be launched into orbit. SDO’s advancements in solar telescope technology will generate a terabyte per day of high-quality data, requiring innovation in data management. While major projects develop automated feature recognition software, so that computers can complete much of the initial event tagging and analysis, still, that software cannot annotate features such as sigmoids, coronal magnetic loops, coronal dimming, etc., due to large amounts of data concentrated in relatively small areas. Previously, solar physicists manually annotated these features, but with the imminent influx of data it is unrealistic to expect specialized researchers to examine every image that computers cannot fully process. A new approach is needed to efficiently process these data. Providing analysis tools and data access to students and the public have proven

  11. Discovering gene annotations in biomedical text databases

    Directory of Open Access Journals (Sweden)

    Ozsoyoglu Gultekin

    2008-03-01

    Full Text Available Abstract Background Genes and gene products are frequently annotated with Gene Ontology concepts based on the evidence provided in genomics articles. Manually locating and curating information about a genomic entity from the biomedical literature requires vast amounts of human effort. Hence, there is clearly a need forautomated computational tools to annotate the genes and gene products with Gene Ontology concepts by computationally capturing the related knowledge embedded in textual data. Results In this article, we present an automated genomic entity annotation system, GEANN, which extracts information about the characteristics of genes and gene products in article abstracts from PubMed, and translates the discoveredknowledge into Gene Ontology (GO concepts, a widely-used standardized vocabulary of genomic traits. GEANN utilizes textual "extraction patterns", and a semantic matching framework to locate phrases matching to a pattern and produce Gene Ontology annotations for genes and gene products. In our experiments, GEANN has reached to the precision level of 78% at therecall level of 61%. On a select set of Gene Ontology concepts, GEANN either outperforms or is comparable to two other automated annotation studies. Use of WordNet for semantic pattern matching improves the precision and recall by 24% and 15%, respectively, and the improvement due to semantic pattern matching becomes more apparent as the Gene Ontology terms become more general. Conclusion GEANN is useful for two distinct purposes: (i automating the annotation of genomic entities with Gene Ontology concepts, and (ii providing existing annotations with additional "evidence articles" from the literature. The use of textual extraction patterns that are constructed based on the existing annotations achieve high precision. The semantic pattern matching framework provides a more flexible pattern matching scheme with respect to "exactmatching" with the advantage of locating approximate

  12. Annotating the human genome with Disease Ontology

    Science.gov (United States)

    Osborne, John D; Flatow, Jared; Holko, Michelle; Lin, Simon M; Kibbe, Warren A; Zhu, Lihua (Julie); Danila, Maria I; Feng, Gang; Chisholm, Rex L

    2009-01-01

    Background The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Results We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. Conclusion The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome. PMID:19594883

  13. MIPS bacterial genomes functional annotation benchmark dataset.

    Science.gov (United States)

    Tetko, Igor V; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Fobo, Gisela; Ruepp, Andreas; Antonov, Alexey V; Surmeli, Dimitrij; Mewes, Hans-Wernen

    2005-05-15

    Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. BFAB is available at http://mips.gsf.de/proj/bfab

  14. Annotating non-coding regions of the genome.

    Science.gov (United States)

    Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

    2010-08-01

    Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.

  15. MicroRNAs expression profile in solid and unicystic ameloblastomas

    Science.gov (United States)

    Setién-Olarra, A.; Bediaga, N. G.; Aguirre-Echebarria, P.; Aguirre-Urizar, J. M.; Mosqueda-Taylor, A.

    2017-01-01

    Objectives Odontogenic tumors (OT) represent a specific pathological category that includes some lesions with unpredictable biological behavior. Although most of these lesions are benign, some, such as the ameloblastoma, exhibit local aggressiveness and high recurrence rates. The most common types of ameloblastoma are the solid/multicystic (SA) and the unicystic ameloblastoma (UA); the latter considered a much less aggressive entity as compared to the SA. The microRNA system regulates the expression of many human genes while its deregulation has been associated with neoplastic development. The aim of the current study was to determine the expression profiles of microRNAs present in the two most common types of ameloblastomas. Material & methods MicroRNA expression profiles were assessed using TaqMan® Low Density Arrays (TLDAs) in 24 samples (8 SA, 8 UA and 8 control samples). The findings were validated using quantitative RTqPCR in an independent cohort of 19 SA, 8 UA and 19 dentigerous cysts as controls. Results We identified 40 microRNAs differentially regulated in ameloblastomas, which are related to neoplastic development and differentiation, and with the osteogenic process. Further validation of the top ranked microRNAs revealed significant differences in the expression of 6 of them in relation to UA, 7 in relation to SA and 1 (miR-489) that was related to both types. Conclusion We identified a new microRNA signature for the ameloblastoma and for its main types, which may be useful to better understand the etiopathogenesis of this neoplasm. In addition, we identified a microRNA (miR-489) that is suggestive of differentiating among solid from unicystic ameloblastoma. PMID:29053755

  16. Identification and conformational analysis of putative microRNAs in Maruca vitrata (Lepidoptera: Pyralidae

    Directory of Open Access Journals (Sweden)

    C. Shruthi Sureshan

    2015-12-01

    Full Text Available MicroRNAs (miRNAs are a class of small RNAs, evolutionarily conserved endogenous non-coding RNAs that regulate their target mRNA expression by either inactivating or degrading mRNA genes; thus playing an important role in the growth and development of an organism. Maruca vitrata is an insect pest of leguminous plants like pigeon pea, cowpea and mung bean and is pantropical. In this study, we perform BLAST on all known miRNAs against the transcriptome data of M. vitrata and thirteen miRNAs were identified. These miRNAs were characterised and their target genes were identified using TargetScan and were functionally annotated using FlyBase. The importance of the structure of pre-miRNA in the Drosha activity led to study the backbone torsion angles of predicted pre-miRNAs (mvi-miR-9751, mvi-miR-649-3p, mvi-miR-4057 and mvi-miR-1271 to identify various nucleotide triplets that contribute to the variation of torsion angle values at various structural motifs of a pre-miRNA.

  17. A microRNA activity map of human mesenchymal tumors: connections to oncogenic pathways; an integrative transcriptomic study

    Directory of Open Access Journals (Sweden)

    Fountzilas Elena

    2012-07-01

    Full Text Available Abstract Background MicroRNAs (miRNAs are nucleic acid regulators of many human mRNAs, and are associated with many tumorigenic processes. miRNA expression levels have been used in profiling studies, but some evidence suggests that expression levels do not fully capture miRNA regulatory activity. In this study we integrate multiple gene expression datasets to determine miRNA activity patterns associated with cancer phenotypes and oncogenic pathways in mesenchymal tumors – a very heterogeneous class of malignancies. Results Using a computational method, we identified differentially activated miRNAs between 77 normal tissue specimens and 135 sarcomas and we validated many of these findings with microarray interrogation of an independent, paraffin-based cohort of 18 tumors. We also showed that miRNA activity is imperfectly correlated with miRNA expression levels. Using next-generation miRNA sequencing we identified potential base sequence alterations which may explain differential activity. We then analyzed miRNA activity changes related to the RAS-pathway and found 21 miRNAs that switch from silenced to activated status in parallel with RAS activation. Importantly, nearly half of these 21 miRNAs were predicted to regulate integral parts of the miRNA processing machinery, and our gene expression analysis revealed significant reductions of these transcripts in RAS-active tumors. These results suggest an association between RAS signaling and miRNA processing in which miRNAs may attenuate their own biogenesis. Conclusions Our study represents the first gene expression-based investigation of miRNA regulatory activity in human sarcomas, and our findings indicate that miRNA activity patterns derived from integrated transcriptomic data are reproducible and biologically informative in cancer. We identified an association between RAS signaling and miRNA processing, and demonstrated sequence alterations as plausible causes for differential miRNA activity

  18. Large-scale inference of gene function through phylogenetic annotation of Gene Ontology terms: case study of the apoptosis and autophagy cellular processes.

    Science.gov (United States)

    Feuermann, Marc; Gaudet, Pascale; Mi, Huaiyu; Lewis, Suzanna E; Thomas, Paul D

    2016-01-01

    We previously reported a paradigm for large-scale phylogenomic analysis of gene families that takes advantage of the large corpus of experimentally supported Gene Ontology (GO) annotations. This 'GO Phylogenetic Annotation' approach integrates GO annotations from evolutionarily related genes across ∼100 different organisms in the context of a gene family tree, in which curators build an explicit model of the evolution of gene functions. GO Phylogenetic Annotation models the gain and loss of functions in a gene family tree, which is used to infer the functions of uncharacterized (or incompletely characterized) gene products, even for human proteins that are relatively well studied. Here, we report our results from applying this paradigm to two well-characterized cellular processes, apoptosis and autophagy. This revealed several important observations with respect to GO annotations and how they can be used for function inference. Notably, we applied only a small fraction of the experimentally supported GO annotations to infer function in other family members. The majority of other annotations describe indirect effects, phenotypes or results from high throughput experiments. In addition, we show here how feedback from phylogenetic annotation leads to significant improvements in the PANTHER trees, the GO annotations and GO itself. Thus GO phylogenetic annotation both increases the quantity and improves the accuracy of the GO annotations provided to the research community. We expect these phylogenetically based annotations to be of broad use in gene enrichment analysis as well as other applications of GO annotations.Database URL: http://amigo.geneontology.org/amigo. © The Author(s) 2016. Published by Oxford University Press.

  19. The role of microRNA in diseases of the biliary system

    Directory of Open Access Journals (Sweden)

    A.E. Abaturov

    2017-10-01

    Full Text Available This literature review provides current information about role of microRNA in diseases of the biliary system. For writing the article, we used such databases, as Scopus, Web of Science, MedLine, PubMed, Google Scholar, CyberLeninka, RSCI. The mechanisms of formation and action of microRNA are demonstrated. The data of scientific researches on the association of various microRNAs in the development and progression of diseases of the biliary system are presented. The influence of ursodeoxycholic acid on the expression of microRNA is considered. Attention is focused on the therapeutic efficacy and benefits of using ursodeoxycholic acid in diseases of the biliary system due to the effect on the activity of the generation of some microRNAs.

  20. Integrated annotation and analysis of in situ hybridization images using the ImAnno system: application to the ear and sensory organs of the fetal mouse.

    Science.gov (United States)

    Romand, Raymond; Ripp, Raymond; Poidevin, Laetitia; Boeglin, Marcel; Geffers, Lars; Dollé, Pascal; Poch, Olivier

    2015-01-01

    An in situ hybridization (ISH) study was performed on 2000 murine genes representing around 10% of the protein-coding genes present in the mouse genome using data generated by the EURExpress consortium. This study was carried out in 25 tissues of late gestation embryos (E14.5), with a special emphasis on the developing ear and on five distinct developing sensory organs, including the cochlea, the vestibular receptors, the sensory retina, the olfactory organ, and the vibrissae follicles. The results obtained from an analysis of more than 11,000 micrographs have been integrated in a newly developed knowledgebase, called ImAnno. In addition to managing the multilevel micrograph annotations performed by human experts, ImAnno provides public access to various integrated databases and tools. Thus, it facilitates the analysis of complex ISH gene expression patterns, as well as functional annotation and interaction of gene sets. It also provides direct links to human pathways and diseases. Hierarchical clustering of expression patterns in the 25 tissues revealed three main branches corresponding to tissues with common functions and/or embryonic origins. To illustrate the integrative power of ImAnno, we explored the expression, function and disease traits of the sensory epithelia of the five presumptive sensory organs. The study identified 623 genes (out of 2000) concomitantly expressed in the five embryonic epithelia, among which many (∼12%) were involved in human disorders. Finally, various multilevel interaction networks were characterized, highlighting differential functional enrichments of directly or indirectly interacting genes. These analyses exemplify an under-represention of "sensory" functions in the sensory gene set suggests that E14.5 is a pivotal stage between the developmental stage and the functional phase that will be fully reached only after birth.

  1. Circulating microRNA-200 Family as Diagnostic Marker in Hepatocellular Carcinoma.

    Directory of Open Access Journals (Sweden)

    Sameer A Dhayat

    Full Text Available In this clinical study, we aimed to evaluate the role of circulating microRNA-200 family as a non-invasive tool to identify patients with cirrhosis-associated hepatocellular carcinoma (HCC.Prognosis of HCC remains poor with increasing incidence worldwide, mainly related to liver cirrhosis. So far, no reliable molecular targets exist for early detection of HCC at surgically manageable stages. Recently, we identified members of the microRNA-200 family as potential diagnostic markers of cirrhosis-associated HCC in patient tissue samples. Their value as circulating biomarkers for HCC remained undefined.Blood samples and clinicopathological data of consecutive patients with liver diseases were collected prospectively. Expression of the microRNA-200 family was investigated by qRT-PCR in blood serum samples of 22 HCC patients with and without cirrhosis. Serum samples of patients with non-cancerous chronic liver cirrhosis (n = 22 and of healthy volunteers (n = 15 served as controls.MicroRNA-141 and microRNA-200a were significantly downregulated in blood serum of patients with HCC compared to liver cirrhosis (p<0.007 and healthy controls (p<0.002. MicroRNA-141 and microRNA-200a could well discriminate patients with cirrhosis-associated HCC from healthy volunteers with area under the receiver-operating characteristic curve (AUC values of 0.85 and 0.82, respectively. Additionally, both microRNAs could differentiate between HCC and non-cancerous liver cirrhosis with a fair accuracy.Circulating microRNA-200 family members are significantly deregulated in patients with HCC and liver cirrhosis. Further studies are necessary to confirm the diagnostic value of the microRNA-200 family as accurate serum marker for cirrhosis-associated HCC.

  2. Systems Theory and Communication. Annotated Bibliography.

    Science.gov (United States)

    Covington, William G., Jr.

    This annotated bibliography presents annotations of 31 books and journal articles dealing with systems theory and its relation to organizational communication, marketing, information theory, and cybernetics. Materials were published between 1963 and 1992 and are listed alphabetically by author. (RS)

  3. The surplus value of semantic annotations

    NARCIS (Netherlands)

    Marx, M.

    2010-01-01

    We compare the costs of semantic annotation of textual documents to its benefits for information processing tasks. Semantic annotation can improve the performance of retrieval tasks and facilitates an improved search experience through faceted search, focused retrieval, better document summaries,

  4. Systematic validation of predicted microRNAs for cyclin D1

    International Nuclear Information System (INIS)

    Jiang, Qiong; Feng, Ming-Guang; Mo, Yin-Yuan

    2009-01-01

    MicroRNAs are the endogenous small non-coding RNA molecules capable of silencing protein coding genes at the posttranscriptional level. Based on computer-aided predictions, a single microRNA could have over a hundred of targets. On the other hand, a single protein-coding gene could be targeted by many potential microRNAs. However, only a relatively small number of these predicted microRNA/mRNA interactions are experimentally validated, and no systematic validation has been carried out using a reporter system. In this study, we used luciferease reporter assays to validate microRNAs that can silence cyclin D1 (CCND1) because CCND1 is a well known proto-oncogene implicated in a variety of types of cancers. We chose miRanda (http://www.microRNA.org) as a primary prediction method. We then cloned 51 of 58 predicted microRNA precursors into pCDH-CMV-MCS-EF1-copGFP and tested for their effect on the luciferase reporter carrying the 3'-untranslated region (UTR) of CCND1 gene. Real-time PCR revealed the 45 of 51 cloned microRNA precursors expressed a relatively high level of the exogenous microRNAs which were used in our validation experiments. By an arbitrary cutoff of 35% reduction, we identified 7 microRNAs that were able to suppress Luc-CCND1-UTR activity. Among them, 4 of them were previously validated targets and the rest 3 microRNAs were validated to be positive in this study. Of interest, we found that miR-503 not only suppressed the luciferase activity, but also suppressed the endogenous CCND1 both at protein and mRNA levels. Furthermore, we showed that miR-503 was able to reduce S phase cell populations and caused cell growth inhibition, suggesting that miR-503 may be a putative tumor suppressor. This study provides a more comprehensive picture of microRNA/CCND1 interactions and it further demonstrates the importance of experimental target validation

  5. MicroRNAs meet calcium: joint venture in ER proteostasis.

    Science.gov (United States)

    Finger, Fabian; Hoppe, Thorsten

    2014-11-04

    The endoplasmic reticulum (ER) is a cellular compartment that has a key function in protein translation and folding. Maintaining its integrity is of fundamental importance for organism's physiology and viability. The dynamic regulation of intraluminal ER Ca(2+) concentration directly influences the activity of ER-resident chaperones and stress response pathways that balance protein load and folding capacity. We review the emerging evidence that microRNAs play important roles in adjusting these processes to frequently changing intracellular and environmental conditions to modify ER Ca(2+) handling and storage and maintain ER homeostasis. Copyright © 2014, American Association for the Advancement of Science.

  6. MicroRNA pharmacogenomics

    DEFF Research Database (Denmark)

    Rukov, Jakob Lewin; Shomron, Noam

    2011-01-01

    polymorphisms, copy number variations or differences in gene expression levels of drug metabolizing or transporting genes and drug targets. In this review paper, we focus instead on microRNAs (miRNAs): small noncoding RNAs, prevalent in metazoans, that negatively regulate gene expression in many cellular...

  7. Protannotator: a semiautomated pipeline for chromosome-wise functional annotation of the "missing" human proteome.

    Science.gov (United States)

    Islam, Mohammad T; Garg, Gagan; Hancock, William S; Risk, Brian A; Baker, Mark S; Ranganathan, Shoba

    2014-01-03

    The chromosome-centric human proteome project (C-HPP) aims to define the complete set of proteins encoded in each human chromosome. The neXtProt database (September 2013) lists 20,128 proteins for the human proteome, of which 3831 human proteins (∼19%) are considered "missing" according to the standard metrics table (released September 27, 2013). In support of the C-HPP initiative, we have extended the annotation strategy developed for human chromosome 7 "missing" proteins into a semiautomated pipeline to functionally annotate the "missing" human proteome. This pipeline integrates a suite of bioinformatics analysis and annotation software tools to identify homologues and map putative functional signatures, gene ontology, and biochemical pathways. From sequential BLAST searches, we have primarily identified homologues from reviewed nonhuman mammalian proteins with protein evidence for 1271 (33.2%) "missing" proteins, followed by 703 (18.4%) homologues from reviewed nonhuman mammalian proteins and subsequently 564 (14.7%) homologues from reviewed human proteins. Functional annotations for 1945 (50.8%) "missing" proteins were also determined. To accelerate the identification of "missing" proteins from proteomics studies, we generated proteotypic peptides in silico. Matching these proteotypic peptides to ENCODE proteogenomic data resulted in proteomic evidence for 107 (2.8%) of the 3831 "missing proteins, while evidence from a recent membrane proteomic study supported the existence for another 15 "missing" proteins. The chromosome-wise functional annotation of all "missing" proteins is freely available to the scientific community through our web server (http://biolinfo.org/protannotator).

  8. PANNZER2: a rapid functional annotation web server.

    Science.gov (United States)

    Törönen, Petri; Medlar, Alan; Holm, Liisa

    2018-05-08

    The unprecedented growth of high-throughput sequencing has led to an ever-widening annotation gap in protein databases. While computational prediction methods are available to make up the shortfall, a majority of public web servers are hindered by practical limitations and poor performance. Here, we introduce PANNZER2 (Protein ANNotation with Z-scoRE), a fast functional annotation web server that provides both Gene Ontology (GO) annotations and free text description predictions. PANNZER2 uses SANSparallel to perform high-performance homology searches, making bulk annotation based on sequence similarity practical. PANNZER2 can output GO annotations from multiple scoring functions, enabling users to see which predictions are robust across predictors. Finally, PANNZER2 predictions scored within the top 10 methods for molecular function and biological process in the CAFA2 NK-full benchmark. The PANNZER2 web server is updated on a monthly schedule and is accessible at http://ekhidna2.biocenter.helsinki.fi/sanspanz/. The source code is available under the GNU Public Licence v3.

  9. Integration of miRNA and protein profiling reveals coordinated neuroadaptations in the alcohol-dependent mouse brain.

    Directory of Open Access Journals (Sweden)

    Giorgio Gorini

    Full Text Available The molecular mechanisms underlying alcohol dependence involve different neurochemical systems and are brain region-dependent. Chronic Intermittent Ethanol (CIE procedure, combined with a Two-Bottle Choice voluntary drinking paradigm, represents one of the best available animal models for alcohol dependence and relapse drinking. MicroRNAs, master regulators of the cellular transcriptome and proteome, can regulate their targets in a cooperative, combinatorial fashion, ensuring fine tuning and control over a large number of cellular functions. We analyzed cortex and midbrain microRNA expression levels using an integrative approach to combine and relate data to previous protein profiling from the same CIE-subjected samples, and examined the significance of the data in terms of relative contribution to alcohol consumption and dependence. MicroRNA levels were significantly altered in CIE-exposed dependent mice compared with their non-dependent controls. More importantly, our integrative analysis identified modules of coexpressed microRNAs that were highly correlated with CIE effects and predicted target genes encoding differentially expressed proteins. Coexpressed CIE-relevant proteins, in turn, were often negatively correlated with specific microRNA modules. Our results provide evidence that microRNA-orchestrated translational imbalances are driving the behavioral transition from alcohol consumption to dependence. This study represents the first attempt to combine ex vivo microRNA and protein expression on a global scale from the same mammalian brain samples. The integrative systems approach used here will improve our understanding of brain adaptive changes in response to drug abuse and suggests the potential therapeutic use of microRNAs as tools to prevent or compensate multiple neuroadaptations underlying addictive behavior.

  10. microRNA expression profiling on individual breast cancer patients identifies novel panel of circulating microRNA for early detection

    DEFF Research Database (Denmark)

    Hamam, Rimi; Ali, Arwa M.; Alsaleh, Khalid A.

    2016-01-01

    Breast cancer (BC) is the most common cancer type and the second cause of cancer-related death among women. Therefore, better understanding of breast cancer tumor biology and the identification of novel biomarkers is essential for the early diagnosis and for better disease stratification and mana......Breast cancer (BC) is the most common cancer type and the second cause of cancer-related death among women. Therefore, better understanding of breast cancer tumor biology and the identification of novel biomarkers is essential for the early diagnosis and for better disease stratification...... and management choices. Herein we developed a novel approach which relies on the isolation of circulating microRNAs through an enrichment step using speed-vacuum concentration which resulted in 5-fold increase in microRNA abundance. Global miRNA microarray expression profiling performed on individual samples...... of 46 BC and 14 controls. The expression of those microRNAs was overall higher in patients with stage I, II, and III, compared to stage IV, with potential utilization for early detection. The expression of this microRNA panel was slightly higher in the HER2 and TN compared to patients with luminal...

  11. PanCoreGen – profiling, detecting, annotating protein-coding genes in microbial genomes

    Science.gov (United States)

    Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V.

    2015-01-01

    A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen – a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars – Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. PMID:26456591

  12. GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.

    Science.gov (United States)

    Lee, Chi-Ching; Chen, Yi-Ping Phoebe; Yao, Tzu-Jung; Ma, Cheng-Yu; Lo, Wei-Cheng; Lyu, Ping-Chiang; Tang, Chuan Yi

    2013-04-10

    Sequencing of microbial genomes is important because of microbial-carrying antibiotic and pathogenetic activities. However, even with the help of new assembling software, finishing a whole genome is a time-consuming task. In most bacteria, pathogenetic or antibiotic genes are carried in genomic islands. Therefore, a quick genomic island (GI) prediction method is useful for ongoing sequencing genomes. In this work, we built a Web server called GI-POP (http://gipop.life.nthu.edu.tw) which integrates a sequence assembling tool, a functional annotation pipeline, and a high-performance GI predicting module, in a support vector machine (SVM)-based method called genomic island genomic profile scanning (GI-GPS). The draft genomes of the ongoing genome projects in contigs or scaffolds can be submitted to our Web server, and it provides the functional annotation and highly probable GI-predicting results. GI-POP is a comprehensive annotation Web server designed for ongoing genome project analysis. Researchers can perform annotation and obtain pre-analytic information include possible GIs, coding/non-coding sequences and functional analysis from their draft genomes. This pre-analytic system can provide useful information for finishing a genome sequencing project. Copyright © 2012 Elsevier B.V. All rights reserved.

  13. Vital analysis: field validation of a framework for annotating biological signals of first responders in action.

    Science.gov (United States)

    Gomes, P; Lopes, B; Coimbra, M

    2012-01-01

    First responders are professionals that are exposed to extreme stress and fatigue during extended periods of time. That is why it is necessary to research and develop technological solutions based on wearable sensors that can continuously monitor the health of these professionals in action, namely their stress and fatigue levels. In this paper we present the Vital Analysis smartphone-based framework, integrated into the broader Vital Responder project, that allows the annotation and contextualization of the signals collected during real action. After a contextual study we have implemented and deployed this framework in a firefighter team with 5 elements, from where we have collected over 3300 hours of annotations during 174 days, covering 382 different events. Results are analysed and discussed, validating the framework as a useful and usable tool for annotating biological signals of first responders in action.

  14. Intra-tumor heterogeneity of microRNA-92a, microRNA-375 and microRNA-424 in colorectal cancer

    DEFF Research Database (Denmark)

    Jepsen, Rikke Karlin; Novotny, Guy Wayne; Klarskov, Louise Laurberg

    2016-01-01

    Various microRNAs (miRNAs) have been investigated in order to improve diagnostics and risk assessment in colorectal cancer (CRC). To clarify the potential of miRNA profiling in CRC, knowledge of intra-tumor heterogeneity in expression levels is crucial. The study aim was to estimate the intra...

  15. Pulmonary microRNA profiling: implications in upper lobe predominant lung disease.

    Science.gov (United States)

    Armstrong, David A; Nymon, Amanda B; Ringelberg, Carol S; Lesseur, Corina; Hazlett, Haley F; Howard, Louisa; Marsit, Carmen J; Ashare, Alix

    2017-01-01

    Numerous pulmonary diseases manifest with upper lobe predominance including cystic fibrosis, smoking-related chronic obstructive pulmonary disease, and tuberculosis. Zonal hypoxia, characteristic of these pulmonary maladies, and oxygen stress in general is known to exert profound effects on various important aspects of cell biology. Lung macrophages are major participants in the pulmonary innate immune response and regional differences in macrophage responsiveness to hypoxia may contribute in the development of lung disease. MicroRNAs are ubiquitous regulators of human biology and emerging evidence indicates altered microRNA expression modulates respiratory disease processes. The objective of this study is to gain insight into the epigenetic and cellular mechanisms influencing regional differences in lung disease by investigating effect of hypoxia on regional microRNA expression in the lung. All studies were performed using primary alveolar macrophages ( n  = 10) or bronchoalveolar lavage fluid ( n  = 16) isolated from human subjects. MicroRNA was assayed via the NanoString nCounter microRNA assay. Divergent molecular patterns of microRNA expression were observed in alternate lung lobes, specifically noted was disparate expression of miR-93 and miR-4454 in alveolar macrophages along with altered expression of miR-451a and miR-663a in bronchoalveolar lavage fluid. Gene ontology was used to identify potential downstream targets of divergent microRNAs. Targets include cytokines and matrix metalloproteinases, molecules that could have a significant impact on pulmonary inflammation and fibrosis. Our findings show variant regional microRNA expression associated with hypoxia in alveolar macrophages and BAL fluid in the lung-upper vs lower lobe. Future studies should address whether these specific microRNAs may act intracellularly, in a paracrine/endocrine manner to direct the innate immune response or may ultimately be involved in pulmonary host-to-pathogen trans

  16. Adenoid cystic carcinomas of the salivary gland, lacrimal gland, and breast are morphologically and genetically similar but have distinct microRNA expression profiles.

    Science.gov (United States)

    Andreasen, Simon; Tan, Qihua; Agander, Tina Klitmøller; Steiner, Petr; Bjørndal, Kristine; Høgdall, Estrid; Larsen, Stine Rosenkilde; Erentaite, Daiva; Olsen, Caroline Holkmann; Ulhøi, Benedicte Parm; von Holstein, Sarah Linéa; Wessel, Irene; Heegaard, Steffen; Homøe, Preben

    2018-02-21

    Adenoid cystic carcinoma is among the most frequent malignancies in the salivary and lacrimal glands and has a grave prognosis characterized by frequent local recurrences, distant metastases, and tumor-related mortality. Conversely, adenoid cystic carcinoma of the breast is a rare type of triple-negative (estrogen and progesterone receptor, HER2) and basal-like carcinoma, which in contrast to other triple-negative and basal-like breast carcinomas has a very favorable prognosis. Irrespective of site, adenoid cystic carcinoma is characterized by gene fusions involving MYB, MYBL1, and NFIB, and the reason for the different clinical outcomes is unknown. In order to identify the molecular mechanisms underlying the discrepancy in clinical outcome, we characterized the phenotypic profiles, pattern of gene rearrangements, and global microRNA expression profiles of 64 salivary gland, 9 lacrimal gland, and 11 breast adenoid cystic carcinomas. All breast and lacrimal gland adenoid cystic carcinomas had triple-negative and basal-like phenotypes, while salivary gland tumors were indeterminate in 13% of cases. Aberrations in MYB and/or NFIB were found in the majority of cases in all three locations, whereas MYBL1 involvement was restricted to tumors in the salivary gland. Global microRNA expression profiling separated salivary and lacrimal gland adenoid cystic carcinoma from their respective normal glands but could not distinguish normal breast adenoid cystic carcinoma from normal breast tissue. Hierarchical clustering separated adenoid cystic carcinomas of salivary gland origin from those of the breast and placed lacrimal gland carcinomas in between these. Functional annotation of the microRNAs differentially expressed between salivary gland and breast adenoid cystic carcinoma showed these as regulating genes involved in metabolism, signal transduction, and genes involved in other cancers. In conclusion, microRNA dysregulation is the first class of molecules separating adenoid

  17. Regulation of MicroRNAs, and the Correlations of MicroRNAs and Their Targeted Genes by Zinc Oxide Nanoparticles in Ovarian Granulosa Cells.

    Directory of Open Access Journals (Sweden)

    Yong Zhao

    Full Text Available Zinc oxide (ZnO nanoparticles (NPs have been applied in numerous industrial products and personal care products like sunscreens and cosmetics. The released ZnO NPs from consumer and household products into the environment might pose potential health issues for animals and humans. In this study the expression of microRNAs and the correlations of microRNAs and their targeted genes in ZnO NPs treated chicken ovarian granulosa cells were investigated. ZnSO4 was used as the sole Zn2+ provider to differentiate the effects of NPs from Zn2+. It was found that ZnO-NP-5 μg/ml specifically regulated the expression of microRNAs involved in embryonic development although ZnO-NP-5 μg/ml and ZnSO4-10 μg/ml treatments produced the same intracellular Zn concentrations and resulted in similar cell growth inhibition. And ZnO-NP-5 μg/ml also specifically regulated the correlations of microRNAs and their targeted genes. This is the first investigation that intact NPs in ZnO-NP-5 μg/ml treatment specifically regulated the expression of microRNAs, and the correlations of microRNAs and their targeted genes compared to that by Zn2+. This expands our knowledge for biological effects of ZnO NPs and at the same time it raises the health concerns that ZnO NPs might adversely affect our biological systems, even the reproductive systems through regulation of specific signaling pathways.

  18. Genome Wide Re-Annotation of Caldicellulosiruptor saccharolyticus with New Insights into Genes Involved in Biomass Degradation and Hydrogen Production.

    Science.gov (United States)

    Chowdhary, Nupoor; Selvaraj, Ashok; KrishnaKumaar, Lakshmi; Kumar, Gopal Ramesh

    2015-01-01

    Caldicellulosiruptor saccharolyticus has proven itself to be an excellent candidate for biological hydrogen (H2) production, but still it has major drawbacks like sensitivity to high osmotic pressure and low volumetric H2 productivity, which should be considered before it can be used industrially. A whole genome re-annotation work has been carried out as an attempt to update the incomplete genome information that causes gap in the knowledge especially in the area of metabolic engineering, to improve the H2 producing capabilities of C. saccharolyticus. Whole genome re-annotation was performed through manual means for 2,682 Coding Sequences (CDSs). Bioinformatics tools based on sequence similarity, motif search, phylogenetic analysis and fold recognition were employed for re-annotation. Our methodology could successfully add functions for 409 hypothetical proteins (HPs), 46 proteins previously annotated as putative and assigned more accurate functions for the known protein sequences. Homology based gene annotation has been used as a standard method for assigning function to novel proteins, but over the past few years many non-homology based methods such as genomic context approaches for protein function prediction have been developed. Using non-homology based functional prediction methods, we were able to assign cellular processes or physical complexes for 249 hypothetical sequences. Our re-annotation pipeline highlights the addition of 231 new CDSs generated from MicroScope Platform, to the original genome with functional prediction for 49 of them. The re-annotation of HPs and new CDSs is stored in the relational database that is available on the MicroScope web-based platform. In parallel, a comparative genome analyses were performed among the members of genus Caldicellulosiruptor to understand the function and evolutionary processes. Further, with results from integrated re-annotation studies (homology and genomic context approach), we strongly suggest that Csac

  19. Genome Wide Re-Annotation of Caldicellulosiruptor saccharolyticus with New Insights into Genes Involved in Biomass Degradation and Hydrogen Production.

    Directory of Open Access Journals (Sweden)

    Nupoor Chowdhary

    Full Text Available Caldicellulosiruptor saccharolyticus has proven itself to be an excellent candidate for biological hydrogen (H2 production, but still it has major drawbacks like sensitivity to high osmotic pressure and low volumetric H2 productivity, which should be considered before it can be used industrially. A whole genome re-annotation work has been carried out as an attempt to update the incomplete genome information that causes gap in the knowledge especially in the area of metabolic engineering, to improve the H2 producing capabilities of C. saccharolyticus. Whole genome re-annotation was performed through manual means for 2,682 Coding Sequences (CDSs. Bioinformatics tools based on sequence similarity, motif search, phylogenetic analysis and fold recognition were employed for re-annotation. Our methodology could successfully add functions for 409 hypothetical proteins (HPs, 46 proteins previously annotated as putative and assigned more accurate functions for the known protein sequences. Homology based gene annotation has been used as a standard method for assigning function to novel proteins, but over the past few years many non-homology based methods such as genomic context approaches for protein function prediction have been developed. Using non-homology based functional prediction methods, we were able to assign cellular processes or physical complexes for 249 hypothetical sequences. Our re-annotation pipeline highlights the addition of 231 new CDSs generated from MicroScope Platform, to the original genome with functional prediction for 49 of them. The re-annotation of HPs and new CDSs is stored in the relational database that is available on the MicroScope web-based platform. In parallel, a comparative genome analyses were performed among the members of genus Caldicellulosiruptor to understand the function and evolutionary processes. Further, with results from integrated re-annotation studies (homology and genomic context approach, we strongly

  20. Identifying and annotating human bifunctional RNAs reveals their versatile functions.

    Science.gov (United States)

    Chen, Geng; Yang, Juan; Chen, Jiwei; Song, Yunjie; Cao, Ruifang; Shi, Tieliu; Shi, Leming

    2016-10-01

    Bifunctional RNAs that possess both protein-coding and noncoding functional properties were less explored and poorly understood. Here we systematically explored the characteristics and functions of such human bifunctional RNAs by integrating tandem mass spectrometry and RNA-seq data. We first constructed a pipeline to identify and annotate bifunctional RNAs, leading to the characterization of 132 high-confidence bifunctional RNAs. Our analyses indicate that bifunctional RNAs may be involved in human embryonic development and can be functional in diverse tissues. Moreover, bifunctional RNAs could interact with multiple miRNAs and RNA-binding proteins to exert their corresponding roles. Bifunctional RNAs may also function as competing endogenous RNAs to regulate the expression of many genes by competing for common targeting miRNAs. Finally, somatic mutations of diverse carcinomas may generate harmful effect on corresponding bifunctional RNAs. Collectively, our study not only provides the pipeline for identifying and annotating bifunctional RNAs but also reveals their important gene-regulatory functions.

  1. Rapid Generation of MicroRNA Sponges for MicroRNA Inhibition

    NARCIS (Netherlands)

    Kluiver, Joost; Gibcus, Johan H.; Hettinga, Chris; Adema, Annelies; Richter, Mareike K. S.; Halsema, Nancy; Slezak-Prochazka, Izabella; Ding, Ye; Kroesen, Bart-Jan; van den Berg, Anke

    2012-01-01

    MicroRNA (miRNA) sponges are transcripts with repeated miRNA antisense sequences that can sequester miRNAs from endogenous targets. MiRNA sponges are valuable tools for miRNA loss-of-function studies both in vitro and in vivo. We developed a fast and flexible method to generate miRNA sponges and

  2. Genetic control of functional traits related to photosynthesis and water use efficiency in Pinus pinaster Ait. drought response: integration of genome annotation, allele association and QTL detection for candidate gene identification.

    Science.gov (United States)

    de Miguel, Marina; Cabezas, José-Antonio; de María, Nuria; Sánchez-Gómez, David; Guevara, María-Ángeles; Vélez, María-Dolores; Sáez-Laguna, Enrique; Díaz, Luis-Manuel; Mancha, Jose-Antonio; Barbero, María-Carmen; Collada, Carmen; Díaz-Sala, Carmen; Aranda, Ismael; Cervera, María-Teresa

    2014-06-12

    Understanding molecular mechanisms that control photosynthesis and water use efficiency in response to drought is crucial for plant species from dry areas. This study aimed to identify QTL for these traits in a Mediterranean conifer and tested their stability under drought. High density linkage maps for Pinus pinaster were used in the detection of QTL for photosynthesis and water use efficiency at three water irrigation regimes. A total of 28 significant and 27 suggestive QTL were found. QTL detected for photochemical traits accounted for the higher percentage of phenotypic variance. Functional annotation of genes within the QTL suggested 58 candidate genes for the analyzed traits. Allele association analysis in selected candidate genes showed three SNPs located in a MYB transcription factor that were significantly associated with efficiency of energy capture by open PSII reaction centers and specific leaf area. The integration of QTL mapping of functional traits, genome annotation and allele association yielded several candidate genes involved with molecular control of photosynthesis and water use efficiency in response to drought in a conifer species. The results obtained highlight the importance of maintaining the integrity of the photochemical machinery in P. pinaster drought response.

  3. MitoBamAnnotator: A web-based tool for detecting and annotating heteroplasmy in human mitochondrial DNA sequences.

    Science.gov (United States)

    Zhidkov, Ilia; Nagar, Tal; Mishmar, Dan; Rubin, Eitan

    2011-11-01

    The use of Next-Generation Sequencing of mitochondrial DNA is becoming widespread in biological and clinical research. This, in turn, creates a need for a convenient tool that detects and analyzes heteroplasmy. Here we present MitoBamAnnotator, a user friendly web-based tool that allows maximum flexibility and control in heteroplasmy research. MitoBamAnnotator provides the user with a comprehensively annotated overview of mitochondrial genetic variation, allowing for an in-depth analysis with no prior knowledge in programming. Copyright © 2011 Elsevier B.V. and Mitochondria Research Society. All rights reserved. All rights reserved.

  4. Correction of the Caulobacter crescentus NA1000 genome annotation.

    Directory of Open Access Journals (Sweden)

    Bert Ely

    Full Text Available Bacterial genome annotations are accumulating rapidly in the GenBank database and the use of automated annotation technologies to create these annotations has become the norm. However, these automated methods commonly result in a small, but significant percentage of genome annotation errors. To improve accuracy and reliability, we analyzed the Caulobacter crescentus NA1000 genome utilizing computer programs Artemis and MICheck to manually examine the third codon position GC content, alignment to a third codon position GC frame plot peak, and matches in the GenBank database. We identified 11 new genes, modified the start site of 113 genes, and changed the reading frame of 38 genes that had been incorrectly annotated. Furthermore, our manual method of identifying protein-coding genes allowed us to remove 112 non-coding regions that had been designated as coding regions. The improved NA1000 genome annotation resulted in a reduction in the use of rare codons since noncoding regions with atypical codon usage were removed from the annotation and 49 new coding regions were added to the annotation. Thus, a more accurate codon usage table was generated as well. These results demonstrate that a comparison of the location of peaks third codon position GC content to the location of protein coding regions could be used to verify the annotation of any genome that has a GC content that is greater than 60%.

  5. Characterizing and annotating the genome using RNA-seq data.

    Science.gov (United States)

    Chen, Geng; Shi, Tieliu; Shi, Leming

    2017-02-01

    Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technologies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for improving diverse transcriptomic studies and the annotation of genetic variants based on RNA-seq data. Mapping RNA-seq reads to the genome and transcriptome represent two distinct methods for quantifying the expression of genes/transcripts. Besides the known genes annotated in current databases, many novel genes/transcripts (especially those long noncoding RNAs) still can be identified on the reference genome using RNA-seq. Moreover, owing to the incompleteness of current reference genomes, some novel genes are missing from them. Genome- guided and de novo transcriptome reconstruction are two effective and complementary strategies for identifying those novel genes/transcripts on or beyond the reference genome. In addition, integrating the genes of distinct databases to conduct transcriptomics and genetics studies can improve the results of corresponding analyses.

  6. IW-Scoring: an Integrative Weighted Scoring framework for annotating and prioritizing genetic variations in the noncoding genome.

    Science.gov (United States)

    Wang, Jun; Dayem Ullah, Abu Z; Chelala, Claude

    2018-01-30

    The vast majority of germline and somatic variations occur in the noncoding part of the genome, only a small fraction of which are believed to be functional. From the tens of thousands of noncoding variations detectable in each genome, identifying and prioritizing driver candidates with putative functional significance is challenging. To address this, we implemented IW-Scoring, a new Integrative Weighted Scoring model to annotate and prioritise functionally relevant noncoding variations. We evaluate 11 scoring methods, and apply an unsupervised spectral approach for subsequent selective integration into two linear weighted functional scoring schemas for known and novel variations. IW-Scoring produces stable high-quality performance as the best predictors for three independent data sets. We demonstrate the robustness of IW-Scoring in identifying recurrent functional mutations in the TERT promoter, as well as disease SNPs in proximity to consensus motifs and with gene regulatory effects. Using follicular lymphoma as a paradigmatic cancer model, we apply IW-Scoring to locate 11 recurrently mutated noncoding regions in 14 follicular lymphoma genomes, and validate 9 of these regions in an extension cohort, including the promoter and enhancer regions of PAX5. Overall, IW-Scoring demonstrates greater versatility in identifying trait- and disease-associated noncoding variants. Scores from IW-Scoring as well as other methods are freely available from http://www.snp-nexus.org/IW-Scoring/. © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. The MicroRNA Repertoire of Symbiodinium, the Dinoflagellate Symbiont of Reef-Building Corals

    KAUST Repository

    Baumgarten, Sebastian

    2013-07-01

    Animal and plant genomes produce numerous small RNAs (smRNAs) that regulate gene expression post-transcriptionally affecting metabolism, development, and epigenetic inheritance. In order to characterize the repertoire of endogenous microRNAs and potential gene targets, we conducted smRNA and mRNA expression profiling over nine experimental treatments of cultures from the dinoflagellate Symbiodinium sp. A1, a photosynthetic symbiont of scleractinian corals. We identified a total of 75 novel smRNAs in Symbiodinum sp. A1 that share stringent key features with functional microRNAs from other model organisms. A subset of 38 smRNAs was predicted independently over all nine treatments and their putative gene targets were identified. We found 3,187 animal-like target sites in the 3’UTRs of 12,858 mRNAs and 53 plantlike target sites in 51,917 genes. Furthermore, we identified the core RNAi protein machinery in Symbiodinium. Integration of smRNA and mRNA expression profiling identified a variety of processes that could be under microRNA control, e.g. regulation of translation, DNA modification, and chromatin silencing. Given that Symbiodinium seems to have a paucity of transcription factors and differentially expressed genes, identification and characterization of its smRNA repertoire establishes the possibility of a range of gene regulatory mechanisms in dinoflagellates acting post-transcriptionally.

  8. Annotation of regular polysemy and underspecification

    DEFF Research Database (Denmark)

    Martínez Alonso, Héctor; Pedersen, Bolette Sandford; Bel, Núria

    2013-01-01

    We present the result of an annotation task on regular polysemy for a series of seman- tic classes or dot types in English, Dan- ish and Spanish. This article describes the annotation process, the results in terms of inter-encoder agreement, and the sense distributions obtained with two methods...

  9. MicroRNAs in skin tissue engineering.

    Science.gov (United States)

    Miller, Kyle J; Brown, David A; Ibrahim, Mohamed M; Ramchal, Talisha D; Levinson, Howard

    2015-07-01

    35.2 million annual cases in the U.S. require clinical intervention for major skin loss. To meet this demand, the field of skin tissue engineering has grown rapidly over the past 40 years. Traditionally, skin tissue engineering relies on the "cell-scaffold-signal" approach, whereby isolated cells are formulated into a three-dimensional substrate matrix, or scaffold, and exposed to the proper molecular, physical, and/or electrical signals to encourage growth and differentiation. However, clinically available bioengineered skin equivalents (BSEs) suffer from a number of drawbacks, including time required to generate autologous BSEs, poor allogeneic BSE survival, and physical limitations such as mass transfer issues. Additionally, different types of skin wounds require different BSE designs. MicroRNA has recently emerged as a new and exciting field of RNA interference that can overcome the barriers of BSE design. MicroRNA can regulate cellular behavior, change the bioactive milieu of the skin, and be delivered to skin tissue in a number of ways. While it is still in its infancy, the use of microRNAs in skin tissue engineering offers the opportunity to both enhance and expand a field for which there is still a vast unmet clinical need. Here we give a review of skin tissue engineering, focusing on the important cellular processes, bioactive mediators, and scaffolds. We further discuss potential microRNA targets for each individual component, and we conclude with possible future applications. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. Integrative analysis of functional genomic annotations and sequencing data to identify rare causal variants via hierarchical modeling

    Directory of Open Access Journals (Sweden)

    Marinela eCapanu

    2015-05-01

    Full Text Available Identifying the small number of rare causal variants contributing to disease has beena major focus of investigation in recent years, but represents a formidable statisticalchallenge due to the rare frequencies with which these variants are observed. In thiscommentary we draw attention to a formal statistical framework, namely hierarchicalmodeling, to combine functional genomic annotations with sequencing data with theobjective of enhancing our ability to identify rare causal variants. Using simulations weshow that in all configurations studied, the hierarchical modeling approach has superiordiscriminatory ability compared to a recently proposed aggregate measure of deleteriousness,the Combined Annotation-Dependent Depletion (CADD score, supportingour premise that aggregate functional genomic measures can more accurately identifycausal variants when used in conjunction with sequencing data through a hierarchicalmodeling approach

  11. Experiments with crowdsourced re-annotation of a POS tagging data set

    DEFF Research Database (Denmark)

    Hovy, Dirk; Plank, Barbara; Søgaard, Anders

    2014-01-01

    Crowdsourcing lets us collect multiple annotations for an item from several annotators. Typically, these are annotations for non-sequential classification tasks. While there has been some work on crowdsourcing named entity annotations, researchers have assumed that syntactic tasks such as part......-of-speech (POS) tagging cannot be crowdsourced. This paper shows that workers can actually annotate sequential data almost as well as experts. Further, we show that the models learned from crowdsourced annotations fare as well as the models learned from expert annotations in downstream tasks....

  12. MPEG-7 based video annotation and browsing

    Science.gov (United States)

    Hoeynck, Michael; Auweiler, Thorsten; Wellhausen, Jens

    2003-11-01

    The huge amount of multimedia data produced worldwide requires annotation in order to enable universal content access and to provide content-based search-and-retrieval functionalities. Since manual video annotation can be time consuming, automatic annotation systems are required. We review recent approaches to content-based indexing and annotation of videos for different kind of sports and describe our approach to automatic annotation of equestrian sports videos. We especially concentrate on MPEG-7 based feature extraction and content description, where we apply different visual descriptors for cut detection. Further, we extract the temporal positions of single obstacles on the course by analyzing MPEG-7 edge information. Having determined single shot positions as well as the visual highlights, the information is jointly stored with meta-textual information in an MPEG-7 description scheme. Based on this information, we generate content summaries which can be utilized in a user-interface in order to provide content-based access to the video stream, but further for media browsing on a streaming server.

  13. TNF-α-Induced microRNAs Control Dystrophin Expression in Becker Muscular Dystrophy.

    Science.gov (United States)

    Fiorillo, Alyson A; Heier, Christopher R; Novak, James S; Tully, Christopher B; Brown, Kristy J; Uaesoontrachoon, Kitipong; Vila, Maria C; Ngheim, Peter P; Bello, Luca; Kornegay, Joe N; Angelini, Corrado; Partridge, Terence A; Nagaraju, Kanneboyina; Hoffman, Eric P

    2015-09-08

    The amount and distribution of dystrophin protein in myofibers and muscle is highly variable in Becker muscular dystrophy and in exon-skipping trials for Duchenne muscular dystrophy. Here, we investigate a molecular basis for this variability. In muscle from Becker patients sharing the same exon 45-47 in-frame deletion, dystrophin levels negatively correlate with microRNAs predicted to target dystrophin. Seven microRNAs inhibit dystrophin expression in vitro, and three are validated in vivo (miR-146b/miR-374a/miR-31). microRNAs are expressed in dystrophic myofibers and increase with age and disease severity. In exon-skipping-treated mdx mice, microRNAs are significantly higher in muscles with low dystrophin rescue. TNF-α increases microRNA levels in vitro whereas NFκB inhibition blocks this in vitro and in vivo. Collectively, these data show that microRNAs contribute to variable dystrophin levels in muscular dystrophy. Our findings suggest a model where chronic inflammation in distinct microenvironments induces pathological microRNAs, initiating a self-sustaining feedback loop that exacerbates disease progression. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  14. TNF-α-Induced microRNAs Control Dystrophin Expression in Becker Muscular Dystrophy

    Directory of Open Access Journals (Sweden)

    Alyson A. Fiorillo

    2015-09-01

    Full Text Available The amount and distribution of dystrophin protein in myofibers and muscle is highly variable in Becker muscular dystrophy and in exon-skipping trials for Duchenne muscular dystrophy. Here, we investigate a molecular basis for this variability. In muscle from Becker patients sharing the same exon 45–47 in-frame deletion, dystrophin levels negatively correlate with microRNAs predicted to target dystrophin. Seven microRNAs inhibit dystrophin expression in vitro, and three are validated in vivo (miR-146b/miR-374a/miR-31. microRNAs are expressed in dystrophic myofibers and increase with age and disease severity. In exon-skipping-treated mdx mice, microRNAs are significantly higher in muscles with low dystrophin rescue. TNF-α increases microRNA levels in vitro whereas NFκB inhibition blocks this in vitro and in vivo. Collectively, these data show that microRNAs contribute to variable dystrophin levels in muscular dystrophy. Our findings suggest a model where chronic inflammation in distinct microenvironments induces pathological microRNAs, initiating a self-sustaining feedback loop that exacerbates disease progression.

  15. A conformation-induced fluorescence method for microRNA detection

    DEFF Research Database (Denmark)

    Aw, Sherry S; Tang, Melissa Xm; Teo, Yin Nah

    2016-01-01

    and quantify microRNAs may aid research into novel aspects of microRNA biology and contribute to the development of diagnostics. By introducing an additional stem loop into the fluorescent RNA Spinach and altering its 3' and 5' ends, we have generated a new RNA, Pandan, that functions as the basis for a micro......MicroRNAs play important roles in a large variety of biological systems and processes through their regulation of target mRNA expression, and show promise as clinical biomarkers. However, their small size presents challenges for tagging or direct detection. Innovation in techniques to sense......RNA sensor. Pandan contains two sequence-variable stem loops that encode complementary sequence for a target microRNA of interest. In its sensor form, it requires the binding of a target microRNA in order to reconstitute the RNA scaffold for fluorophore binding and fluorescence. Binding of the target micro...

  16. mirPub: a database for searching microRNA publications.

    Science.gov (United States)

    Vergoulis, Thanasis; Kanellos, Ilias; Kostoulas, Nikos; Georgakilas, Georgios; Sellis, Timos; Hatzigeorgiou, Artemis; Dalamagas, Theodore

    2015-05-01

    Identifying, amongst millions of publications available in MEDLINE, those that are relevant to specific microRNAs (miRNAs) of interest based on keyword search faces major obstacles. References to miRNA names in the literature often deviate from standard nomenclature for various reasons, since even the official nomenclature evolves. For instance, a single miRNA name may identify two completely different molecules or two different names may refer to the same molecule. mirPub is a database with a powerful and intuitive interface, which facilitates searching for miRNA literature, addressing the aforementioned issues. To provide effective search services, mirPub applies text mining techniques on MEDLINE, integrates data from several curated databases and exploits data from its user community following a crowdsourcing approach. Other key features include an interactive visualization service that illustrates intuitively the evolution of miRNA data, tag clouds summarizing the relevance of publications to particular diseases, cell types or tissues and access to TarBase 6.0 data to oversee genes related to miRNA publications. mirPub is freely available at http://www.microrna.gr/mirpub/. vergoulis@imis.athena-innovation.gr or dalamag@imis.athena-innovation.gr Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.

  17. Ground Truth Annotation in T Analyst

    DEFF Research Database (Denmark)

    2015-01-01

    This video shows how to annotate the ground truth tracks in the thermal videos. The ground truth tracks are produced to be able to compare them to tracks obtained from a Computer Vision tracking approach. The program used for annotation is T-Analyst, which is developed by Aliaksei Laureshyn, Ph...

  18. Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58

    Science.gov (United States)

    Yu, Jia-Feng; Sui, Tian-Xiang; Wang, Hong-Mei; Wang, Chun-Ling; Jing, Li; Wang, Ji-Hua

    2015-12-01

    Agrobacterium tumefaciens strain C58 is a type of pathogen that can cause tumors in some dicotyledonous plants. Ever since the genome of A. tumefaciens strain C58 was sequenced, the quality of annotation of its protein-coding genes has been queried continually, because the annotation varies greatly among different databases. In this paper, the questionable hypothetical genes were re-predicted by integrating the TN curve and Z curve methods. As a result, 30 genes originally annotated as “hypothetical” were discriminated as being non-coding sequences. By testing the re-prediction program 10 times on data sets composed of the function-known genes, the mean accuracy of 99.99% and mean Matthews correlation coefficient value of 0.9999 were obtained. Further sequence analysis and COG analysis showed that the re-annotation results were very reliable. This work can provide an efficient tool and data resources for future studies of A. tumefaciens strain C58. Project supported by the National Natural Science Foundation of China (Grant Nos. 61302186 and 61271378) and the Funding from the State Key Laboratory of Bioelectronics of Southeast University.

  19. Coordinated action of histone modification and microRNA regulations in human genome.

    Science.gov (United States)

    Wang, Xuan; Zheng, Guantao; Dong, Dong

    2015-10-10

    Both histone modifications and microRNAs (miRNAs) play pivotal role in gene expression regulation. Although numerous studies have been devoted to explore the gene regulation by miRNA and epigenetic regulations, their coordinated actions have not been comprehensively examined. In this work, we systematically investigated the combinatorial relationship between miRNA and epigenetic regulation by taking advantage of recently published whole genome-wide histone modification data and high quality miRNA targeting data. The results showed that miRNA targets have distinct histone modification patterns compared with non-targets in their promoter regions. Based on this finding, we proposed a machine learning approach to fit predictive models on the task to discern whether a gene is targeted by a specific miRNA. We found a considerable advantage in both sensitivity and specificity in diverse human cell lines. Finally, we found that our predicted miRNA targets are consistently annotated with Gene Ontology terms. Our work is the first genome-wide investigation of the coordinated action of miRNA and histone modification regulations, which provide a guide to deeply understand the complexity of transcriptional regulation. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. The Role of MicroRNAs in Pancreatitis

    Science.gov (United States)

    2015-10-01

    AD______________ AWARD NUMBER: W81XWH-14-1-0469 TITLE: The Role of microRNAs in Pancreatitis PRINCIPAL INVESTIGATOR: Li, Yong RECIPIENT...The Role of MicroRNAs in Pancreatitis 5b. GRANT NUMBER W81XWH-14-1-0469 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S) 5d. PROJECT NUMBER Li, Yong 5e...AVAILABILITY STATEMENT Approved for Public Release; Distribution Unlimited 13. SUPPLEMENTARY NOTES 14. ABSTRACT Pancreatitis (inflammation of the

  1. Propagating annotations of molecular networks using in silico fragmentation.

    Science.gov (United States)

    da Silva, Ricardo R; Wang, Mingxun; Nothias, Louis-Félix; van der Hooft, Justin J J; Caraballo-Rodríguez, Andrés Mauricio; Fox, Evan; Balunas, Marcy J; Klassen, Jonathan L; Lopes, Norberto Peporine; Dorrestein, Pieter C

    2018-04-18

    The annotation of small molecules is one of the most challenging and important steps in untargeted mass spectrometry analysis, as most of our biological interpretations rely on structural annotations. Molecular networking has emerged as a structured way to organize and mine data from untargeted tandem mass spectrometry (MS/MS) experiments and has been widely applied to propagate annotations. However, propagation is done through manual inspection of MS/MS spectra connected in the spectral networks and is only possible when a reference library spectrum is available. One of the alternative approaches used to annotate an unknown fragmentation mass spectrum is through the use of in silico predictions. One of the challenges of in silico annotation is the uncertainty around the correct structure among the predicted candidate lists. Here we show how molecular networking can be used to improve the accuracy of in silico predictions through propagation of structural annotations, even when there is no match to a MS/MS spectrum in spectral libraries. This is accomplished through creating a network consensus of re-ranked structural candidates using the molecular network topology and structural similarity to improve in silico annotations. The Network Annotation Propagation (NAP) tool is accessible through the GNPS web-platform https://gnps.ucsd.edu/ProteoSAFe/static/gnps-theoretical.jsp.

  2. Gene calling and bacterial genome annotation with BG7.

    Science.gov (United States)

    Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

    2015-01-01

    New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related with biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is sequence error tolerant maintaining their capabilities for the annotation of highly fragmented genomes or for annotating mixed sequences coming from several genomes (as those obtained through metagenomics samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).

  3. Identification of serum microRNA biomarkers for tuberculosis using RNA-seq.

    Science.gov (United States)

    Zhang, Hongtai; Sun, Zhaogang; Wei, Wenjing; Liu, Zhonghui; Fleming, Joy; Zhang, Shuai; Lin, Nan; Wang, Ming; Chen, Maoshan; Xu, Yuhui; Zhou, Jie; Li, Chuanyou; Bi, Lijun; Zhou, Guangming

    2014-01-01

    Tuberculosis (TB) remains a significant human health issue. More effective biomarkers for use in tuberculosis prevention, diagnosis, and treatment, including markers that can discriminate between healthy individuals and those with latent infection, are urgently needed. To identify a set of such markers, we used Solexa sequencing to examine microRNA expression in the serum of patients with active disease, healthy individuals with latent TB, and those with or without prior BCG inoculation. We identified 24 microRNAs that are up-regulated (2.85-1285.93 fold) and 6 microRNAs that are down-regulated (0.003-0.11 fold) (PmicroRNAs were up-regulated (2.05-2454.58 fold) and 11 were down-regulated (0.001-0.42 fold) (PmicroRNAs were differentially-expressed in BCG-inoculated relative to un-inoculated individuals (18 up-regulated 2.9-499.29 fold, 116 down-regulated 0.0002-0.5 fold), providing insights into the effects of BCG inoculation at the microRNA level. Target prediction of differentially-expressed microRNAs by microRNA-Gene Network analysis and analysis of pathways affected suggest that regulation of the host immune system by microRNAs is likely to be one of the main factors in the pathogenesis of tuberculosis. qRT-PCR validation indicated that hsa-miR-196b and hsa-miR-376c have potential as markers for active TB disease. The microRNA differential-expression profiles generated in this study provide a good foundation for the development of markers for TB diagnosis, and for investigations on the role of microRNAs in BCG-inoculated and latent-infected individuals.

  4. Annotation of the Evaluative Language in a Dependency Treebank

    Directory of Open Access Journals (Sweden)

    Šindlerová Jana

    2017-12-01

    Full Text Available In the paper, we present our efforts to annotate evaluative language in the Prague Dependency Treebank 2.0. The project is a follow-up of the series of annotations of small plaintext corpora. It uses automatic identification of potentially evaluative nodes through mapping a Czech subjectivity lexicon to syntactically annotated data. These nodes are then manually checked by an annotator and either dismissed as standing in a non-evaluative context, or confirmed as evaluative. In the latter case, information about the polarity orientation, the source and target of evaluation is added by the annotator. The annotations unveiled several advantages and disadvantages of the chosen framework. The advantages involve more structured and easy-to-handle environment for the annotator, visibility of syntactic patterning of the evaluative state, effective solving of discontinuous structures or a new perspective on the influence of good/bad news. The disadvantages include little capability of treating cases with evaluation spread among more syntactically connected nodes at once, little capability of treating metaphorical expressions, or disregarding the effects of negation and intensification in the current scheme.

  5. A database of immunoglobulins with integrated tools: DIGIT.

    KAUST Repository

    Chailyan, Anna; Tramontano, Anna; Marcatili, Paolo

    2011-01-01

    The DIGIT (Database of ImmunoGlobulins with Integrated Tools) database (http://biocomputing.it/digit) is an integrated resource storing sequences of annotated immunoglobulin variable domains and enriched with tools for searching and analyzing them. The annotations in the database include information on the type of antigen, the respective germline sequences and on pairing information between light and heavy chains. Other annotations, such as the identification of the complementarity determining regions, assignment of their structural class and identification of mutations with respect to the germline, are computed on the fly and can also be obtained for user-submitted sequences. The system allows customized BLAST searches and automatic building of 3D models of the domains to be performed.

  6. A database of immunoglobulins with integrated tools: DIGIT.

    KAUST Repository

    Chailyan, Anna

    2011-11-10

    The DIGIT (Database of ImmunoGlobulins with Integrated Tools) database (http://biocomputing.it/digit) is an integrated resource storing sequences of annotated immunoglobulin variable domains and enriched with tools for searching and analyzing them. The annotations in the database include information on the type of antigen, the respective germline sequences and on pairing information between light and heavy chains. Other annotations, such as the identification of the complementarity determining regions, assignment of their structural class and identification of mutations with respect to the germline, are computed on the fly and can also be obtained for user-submitted sequences. The system allows customized BLAST searches and automatic building of 3D models of the domains to be performed.

  7. The caBIG annotation and image Markup project.

    Science.gov (United States)

    Channin, David S; Mongkolwat, Pattanasak; Kleper, Vladimir; Sepukar, Kastubh; Rubin, Daniel L

    2010-04-01

    Image annotation and markup are at the core of medical interpretation in both the clinical and the research setting. Digital medical images are managed with the DICOM standard format. While DICOM contains a large amount of meta-data about whom, where, and how the image was acquired, DICOM says little about the content or meaning of the pixel data. An image annotation is the explanatory or descriptive information about the pixel data of an image that is generated by a human or machine observer. An image markup is the graphical symbols placed over the image to depict an annotation. While DICOM is the standard for medical image acquisition, manipulation, transmission, storage, and display, there are no standards for image annotation and markup. Many systems expect annotation to be reported verbally, while markups are stored in graphical overlays or proprietary formats. This makes it difficult to extract and compute with both of them. The goal of the Annotation and Image Markup (AIM) project is to develop a mechanism, for modeling, capturing, and serializing image annotation and markup data that can be adopted as a standard by the medical imaging community. The AIM project produces both human- and machine-readable artifacts. This paper describes the AIM information model, schemas, software libraries, and tools so as to prepare researchers and developers for their use of AIM.

  8. Human Milk MicroRNA and Total RNA Differ Depending on Milk Fractionation.

    Science.gov (United States)

    Alsaweed, Mohammed; Hepworth, Anna R; Lefèvre, Christophe; Hartmann, Peter E; Geddes, Donna T; Hassiotou, Foteini

    2015-10-01

    MicroRNA have been recently discovered in human milk signifying potentially important functions for both the lactating breast and the infant. Whilst human milk microRNA have started to be explored, little data exist on the evaluation of sample processing, and analysis to ensure that a full spectrum of microRNA can be obtained. Human milk comprises three main fractions: cells, skim milk, and lipids. Typically, the skim milk fraction has been measured in isolation despite evidence that the lipid fraction may contain more microRNA. This study aimed to standardize isolation of microRNA and total RNA from all three fractions of human milk to determine the most appropriate sampling and analysis procedure for future studies. Three different methods from eight commercially available kits were tested for their efficacy in extracting total RNA and microRNA from the lipid, skim, and cell fractions of human milk. Each fraction yielded different concentrations of RNA and microRNA, with the highest quantities found in the cell and lipid fractions, and the lowest in skim milk. The column-based phenol-free method was the most efficient extraction method for all three milk fractions. Two microRNAs were expressed and validated in the three milk fractions by qPCR using the three recommended extraction kits for each fraction. High expression levels were identified in the skim and lipid milk factions for these microRNAs. These results suggest that careful consideration of both the human milk sample preparation and extraction protocols should be made prior to embarking upon research in this area. © 2015 The Authors. Journal of Cellular Biochemistry Published by Wiley Periodicals, Inc.

  9. Interoperable Multimedia Annotation and Retrieval for the Tourism Sector

    NARCIS (Netherlands)

    Chatzitoulousis, Antonios; Efraimidis, Pavlos S.; Athanasiadis, I.N.

    2015-01-01

    The Atlas Metadata System (AMS) employs semantic web annotation techniques in order to create an interoperable information annotation and retrieval platform for the tourism sector. AMS adopts state-of-the-art metadata vocabularies, annotation techniques and semantic web technologies.

  10. Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.

    Science.gov (United States)

    Cohen, K Bretonnel; Lanfranchi, Arrick; Choi, Miji Joo-Young; Bada, Michael; Baumgartner, William A; Panteleyeva, Natalya; Verspoor, Karin; Palmer, Martha; Hunter, Lawrence E

    2017-08-17

    Coreference resolution is the task of finding strings in text that have the same referent as other strings. Failures of coreference resolution are a common cause of false negatives in information extraction from the scientific literature. In order to better understand the nature of the phenomenon of coreference in biomedical publications and to increase performance on the task, we annotated the Colorado Richly Annotated Full Text (CRAFT) corpus with coreference relations. The corpus was manually annotated with coreference relations, including identity and appositives for all coreferring base noun phrases. The OntoNotes annotation guidelines, with minor adaptations, were used. Interannotator agreement ranges from 0.480 (entity-based CEAF) to 0.858 (Class-B3), depending on the metric that is used to assess it. The resulting corpus adds nearly 30,000 annotations to the previous release of the CRAFT corpus. Differences from related projects include a much broader definition of markables, connection to extensive annotation of several domain-relevant semantic classes, and connection to complete syntactic annotation. Tool performance was benchmarked on the data. A publicly available out-of-the-box, general-domain coreference resolution system achieved an F-measure of 0.14 (B3), while a simple domain-adapted rule-based system achieved an F-measure of 0.42. An ensemble of the two reached F of 0.46. Following the IDENTITY chains in the data would add 106,263 additional named entities in the full 97-paper corpus, for an increase of 76% percent in the semantic classes of the eight ontologies that have been annotated in earlier versions of the CRAFT corpus. The project produced a large data set for further investigation of coreference and coreference resolution in the scientific literature. The work raised issues in the phenomenon of reference in this domain and genre, and the paper proposes that many mentions that would be considered generic in the general domain are not

  11. MicroRNA-33 promotes the replicative senescence of mouse embryonic fibroblasts by suppressing CDK6

    Energy Technology Data Exchange (ETDEWEB)

    Xu, Shun; Huang, Haijiao; Li, Nanhong; Zhang, Bing; Jia, Yubin; Yang, Yukun; Yuan, Yuan; Xiong, Xing-dong; Wang, Dengchuan; Zheng, Hui-ling [Institute of Aging Research, Guangdong Medical University, Dongguan (China); Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, Dongguan (China); Institute of Biochemistry & Molecular Biology, Guangdong Medical University, Zhanjiang (China); Liu, Xinguang, E-mail: xgliu64@126.com [Institute of Aging Research, Guangdong Medical University, Dongguan (China); Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, Dongguan (China); Institute of Biochemistry & Molecular Biology, Guangdong Medical University, Zhanjiang (China)

    2016-05-13

    MicroRNAs are a large class of tiny noncoding RNAs, which have emerged as critical regulators of gene expression, and thus are involved in multiple cellular processes, including cellular senescence. MicroRNA-33 has previously been established to exert crucial effect on cell proliferation, lipid metabolism and cholesterol metabolism. Nonetheless, the association between microRNA-33 and cellular senescence and its underlying molecular mechanism are far to be elucidated. The present study has attempted to probe into the effect of microRNA-33 on MEFs senescence. Our data unveiled that microRNA-33 was dramatically down-regulated in senescent MEFs compared to the young MEFs, and ectopic expression of microRNA-33 promoted MEFs senescence, while knock-down of microRNA-33 exhibited a protective effect against senescence phenotype. Moreover, we verified CDK6 as a direct target of microRNA-33 in mouse. Silencing of CDK6 induced the premature senescence phenotype of MEFs similarly as microRNA-33, while enforced expression of CDK6 significantly reverse the senescence-induction effect of microRNA-33. Taken together, our results suggested that microRNA-33 enhanced the replicative senescence of MEFs potentially by suppressing CDK6 expression. -- Highlights: •MicroRNA-33 was dramatically down-regulated in senescent MEF cells. •Altered expression of microRNA-33 exerted a critical role in MEFs senescence. •MicroRNA-33 promoted the replicative senescence of MEFs via targeting of CDK6.

  12. MicroRNA-33 promotes the replicative senescence of mouse embryonic fibroblasts by suppressing CDK6

    International Nuclear Information System (INIS)

    Xu, Shun; Huang, Haijiao; Li, Nanhong; Zhang, Bing; Jia, Yubin; Yang, Yukun; Yuan, Yuan; Xiong, Xing-dong; Wang, Dengchuan; Zheng, Hui-ling; Liu, Xinguang

    2016-01-01

    MicroRNAs are a large class of tiny noncoding RNAs, which have emerged as critical regulators of gene expression, and thus are involved in multiple cellular processes, including cellular senescence. MicroRNA-33 has previously been established to exert crucial effect on cell proliferation, lipid metabolism and cholesterol metabolism. Nonetheless, the association between microRNA-33 and cellular senescence and its underlying molecular mechanism are far to be elucidated. The present study has attempted to probe into the effect of microRNA-33 on MEFs senescence. Our data unveiled that microRNA-33 was dramatically down-regulated in senescent MEFs compared to the young MEFs, and ectopic expression of microRNA-33 promoted MEFs senescence, while knock-down of microRNA-33 exhibited a protective effect against senescence phenotype. Moreover, we verified CDK6 as a direct target of microRNA-33 in mouse. Silencing of CDK6 induced the premature senescence phenotype of MEFs similarly as microRNA-33, while enforced expression of CDK6 significantly reverse the senescence-induction effect of microRNA-33. Taken together, our results suggested that microRNA-33 enhanced the replicative senescence of MEFs potentially by suppressing CDK6 expression. -- Highlights: •MicroRNA-33 was dramatically down-regulated in senescent MEF cells. •Altered expression of microRNA-33 exerted a critical role in MEFs senescence. •MicroRNA-33 promoted the replicative senescence of MEFs via targeting of CDK6.

  13. A Novel Approach to Semantic and Coreference Annotation at LLNL

    Energy Technology Data Exchange (ETDEWEB)

    Firpo, M

    2005-02-04

    A case is made for the importance of high quality semantic and coreference annotation. The challenges of providing such annotation are described. Asperger's Syndrome is introduced, and the connections are drawn between the needs of text annotation and the abilities of persons with Asperger's Syndrome to meet those needs. Finally, a pilot program is recommended wherein semantic annotation is performed by people with Asperger's Syndrome. The primary points embodied in this paper are as follows: (1) Document annotation is essential to the Natural Language Processing (NLP) projects at Lawrence Livermore National Laboratory (LLNL); (2) LLNL does not currently have a system in place to meet its need for text annotation; (3) Text annotation is challenging for a variety of reasons, many related to its very rote nature; (4) Persons with Asperger's Syndrome are particularly skilled at rote verbal tasks, and behavioral experts agree that they would excel at text annotation; and (6) A pilot study is recommend in which two to three people with Asperger's Syndrome annotate documents and then the quality and throughput of their work is evaluated relative to that of their neuro-typical peers.

  14. MicroRNA and gene signature of severe cutaneous drug ...

    African Journals Online (AJOL)

    Purpose: To build a microRNA and gene signature of severe cutaneous adverse drug reactions (SCAR), including Stevens-Johnson syndrome (SJS) and toxic epidermal necrolysis (TEN). Methods: MicroRNA expression profiles were downloaded from miRNA expression profile of patients' skin suffering from TEN using an ...

  15. INDIGO - INtegrated data warehouse of microbial genomes with examples from the red sea extremophiles.

    KAUST Repository

    Alam, Intikhab

    2013-12-06

    The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes.

  16. INDIGO - INtegrated data warehouse of microbial genomes with examples from the red sea extremophiles.

    KAUST Repository

    Alam, Intikhab; Antunes, André ; Kamau, Allan; Ba Alawi, Wail; Kalkatawi, Manal M.; Stingl, Ulrich; Bajic, Vladimir B.

    2013-01-01

    The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes.

  17. Plann: A command-line application for annotating plastome sequences.

    Science.gov (United States)

    Huang, Daisie I; Cronk, Quentin C B

    2015-08-01

    Plann automates the process of annotating a plastome sequence in GenBank format for either downstream processing or for GenBank submission by annotating a new plastome based on a similar, well-annotated plastome. Plann is a Perl script to be executed on the command line. Plann compares a new plastome sequence to the features annotated in a reference plastome and then shifts the intervals of any matching features to the locations in the new plastome. Plann's output can be used in the National Center for Biotechnology Information's tbl2asn to create a Sequin file for GenBank submission. Unlike Web-based annotation packages, Plann is a locally executable script that will accurately annotate a plastome sequence to a locally specified reference plastome. Because it executes from the command line, it is ready to use in other software pipelines and can be easily rerun as a draft plastome is improved.

  18. Transcriptome dynamics of the microRNA inhibition response

    DEFF Research Database (Denmark)

    Wen, Jiayu; Leucci, Elenora; Vendramin, Roberto

    2015-01-01

    We report a high-resolution time series study of transcriptome dynamics following antimiR-mediated inhibition of miR-9 in a Hodgkin lymphoma cell-line-the first such dynamic study of the microRNA inhibition response-revealing both general and specific aspects of the physiological response. We show...... validate the key observations with independent time series qPCR and we experimentally validate key predicted miR-9 targets. Methodologically, we developed sensitive functional data analytic predictive methods to analyse the weak response inherent in microRNA inhibition experiments. The methods...... of this study will be applicable to similar high-resolution time series transcriptome analyses and provides the context for more accurate experimental design and interpretation of future microRNA inhibition studies....

  19. Identification and Characterization of MicroRNAs in the Liver of Blunt Snout Bream (Megalobrama amblycephala Infected by Aeromonas hydrophila

    Directory of Open Access Journals (Sweden)

    Lei Cui

    2016-11-01

    Full Text Available MicroRNAs (miRNAs are small RNA molecules that play key roles in regulation of various biological processes. In order to better understand the biological significance of miRNAs in the context of Aeromonas hydrophila infection in Megalobrama amblycephala, small RNA libraries obtained from fish liver at 0 (non-infection, 4, and 24 h post infection (poi were sequenced using Illumina deep sequencing technology. A total of 11,244,207, 9,212,958, and 7,939,157 clean reads were obtained from these three RNA libraries, respectively. Bioinformatics analysis identified 171 conserved miRNAs and 62 putative novel miRNAs. The existence of ten randomly selected novel miRNAs was validated by RT-PCR. Pairwise comparison suggested that 61 and 44 miRNAs were differentially expressed at 4 and 24 h poi, respectively. Furthermore, the expression profiles of nine randomly selected miRNAs were validated by qRT-PCR. MicroRNA target prediction, gene ontology (GO annotation, and Kyoto Encylopedia of Genes and Genomes (KEGG analysis indicated that a variety of biological pathways could be affected by A. hydrophila infection. Additionally, transferrin (TF and transferrin receptor (TFR genes were confirmed to be direct targets of miR-375. These results will expand our knowledge of the role of miRNAs in the immune response of M. amblycephala to A. hydrophila infection, and facilitate the development of effective strategies against A. hydrophila infection in M. amblycephala.

  20. Semantator: annotating clinical narratives with semantic web ontologies.

    Science.gov (United States)

    Song, Dezhao; Chute, Christopher G; Tao, Cui

    2012-01-01

    To facilitate clinical research, clinical data needs to be stored in a machine processable and understandable way. Manual annotating clinical data is time consuming. Automatic approaches (e.g., Natural Language Processing systems) have been adopted to convert such data into structured formats; however, the quality of such automatically extracted data may not always be satisfying. In this paper, we propose Semantator, a semi-automatic tool for document annotation with Semantic Web ontologies. With a loaded free text document and an ontology, Semantator supports the creation/deletion of ontology instances for any document fragment, linking/disconnecting instances with the properties in the ontology, and also enables automatic annotation by connecting to the NCBO annotator and cTAKES. By representing annotations in Semantic Web standards, Semantator supports reasoning based upon the underlying semantics of the owl:disjointWith and owl:equivalentClass predicates. We present discussions based on user experiences of using Semantator.

  1. MicroRNA expression in multiple myeloma is associated with genetic subtype, isotype and survival

    Directory of Open Access Journals (Sweden)

    Pezzella Francesco

    2011-05-01

    Full Text Available Abstract Background MicroRNAs are small RNA species that regulate gene expression post-transcriptionally and are aberrantly expressed in many cancers including hematological malignancies. However, the role of microRNAs in the pathogenesis of multiple myeloma (MM is only poorly understood. We therefore used microarray analysis to elucidate the complete miRNome (miRBase version 13.0 of purified tumor (CD138+ cells from 33 patients with MM, 5 patients with monoclonal gammopathy of undetermined significance (MGUS and 9 controls. Results Unsupervised cluster analysis revealed that MM and MGUS samples have a distinct microRNA expression profile from control CD138+ cells. The majority of microRNAs aberrantly expressed in MM (109/129 were up-regulated. A comparison of these microRNAs with those aberrantly expressed in other B-cell and T-cell malignancies revealed a surprising degree of similarity (~40% suggesting the existence of a common lymphoma microRNA signature. We identified 39 microRNAs associated with the pre-malignant condition MGUS. Twenty-three (59% of these were also aberrantly expressed in MM suggesting common microRNA expression events in MM progression. MM is characterized by multiple chromosomal abnormalities of varying prognostic significance. We identified specific microRNA signatures associated with the most common IgH translocations (t(4;14 and t(11;14 and del(13q. Expression levels of these microRNAs were distinct between the genetic subtypes (by cluster analysis and correctly predicted these abnormalities in > 85% of cases using the support vector machine algorithm. Additionally, we identified microRNAs associated with light chain only myeloma, as well as IgG and IgA-type MM. Finally, we identified 32 microRNAs associated with event-free survival (EFS in MM, ten of which were significant by univariate (logrank survival analysis. Conclusions In summary, this work has identified aberrantly expressed microRNAs associated with the

  2. CGKB: an annotation knowledge base for cowpea (Vigna unguiculata L. methylation filtered genomic genespace sequences

    Directory of Open Access Journals (Sweden)

    Spraggins Thomas A

    2007-04-01

    potential domains on annotated GSS were analyzed using the HMMER package against the Pfam database. The annotated GSS were also assigned with Gene Ontology annotation terms and integrated with 228 curated plant metabolic pathways from the Arabidopsis Information Resource (TAIR knowledge base. The UniProtKB-Swiss-Prot ENZYME database was used to assign putative enzymatic function to each GSS. Each GSS was also analyzed with the Tandem Repeat Finder (TRF program in order to identify potential SSRs for molecular marker discovery. The raw sequence data, processed annotation, and SSR results were stored in relational tables designed in key-value pair fashion using a PostgreSQL relational database management system. The biological knowledge derived from the sequence data and processed results are represented as views or materialized views in the relational database management system. All materialized views are indexed for quick data access and retrieval. Data processing and analysis pipelines were implemented using the Perl programming language. The web interface was implemented in JavaScript and Perl CGI running on an Apache web server. The CPU intensive data processing and analysis pipelines were run on a computer cluster of more than 30 dual-processor Apple XServes. A job management system called Vela was created as a robust way to submit large numbers of jobs to the Portable Batch System (PBS. Conclusion CGKB is an integrated and annotated resource for cowpea GSS with features of homology-based and HMM-based annotations, enzyme and pathway annotations, GO term annotation, toolkits, and a large number of other facilities to perform complex queries. The cowpea GSS, chloroplast sequences, mitochondrial sequences, retroelements, and SSR sequences are available as FASTA formatted files and downloadable at CGKB. This database and web interface are publicly accessible at http://cowpeagenomics.med.virginia.edu/CGKB/.

  3. Identification of reference genes for relative quantification of circulating microRNAs in bovine serum.

    Directory of Open Access Journals (Sweden)

    In-Seon Bae

    Full Text Available Circulating microRNAs in body fluids have been implicated as promising biomarkers for physiopathology disorders. Currently, the expression levels of circulating microRNAs are estimated by reverse transcription quantitative real-time polymerase chain reaction. Use of appropriate reference microRNAs for normalization is critical for accurate microRNA expression analysis. However, no study has systematically investigated reference genes for evaluating circulating microRNA expression in cattle. In this study, we describe the identification and characterization of appropriate reference microRNAs for use in the normalization of circulating microRNA levels in bovine serum. We evaluated the expression stability of ten candidate reference genes in bovine serum by using reverse transcription quantitative real-time polymerase chain reaction. Data were analyzed using geNorm, NormFinder, and BestKeeper statistical algorithms. The results consistently showed that a combination of miR-93 and miR-127 provided the most stably expressed reference. The suitability of these microRNAs was validated, and even when compared among different genders or breeds, the combination of miR-93 and miR-127 was ranked as the most stable microRNA reference. Therefore, we conclude that this combination is the optimal endogenous reference for reverse transcription quantitative real-time polymerase chain reaction-based detection of microRNAs in bovine serum. The data presented in this study are crucial to successful biomarker discovery and validation for the diagnosis of physiopathological conditions in cattle.

  4. MEETING: Chlamydomonas Annotation Jamboree - October 2003

    Energy Technology Data Exchange (ETDEWEB)

    Grossman, Arthur R

    2007-04-13

    Shotgun sequencing of the nuclear genome of Chlamydomonas reinhardtii (Chlamydomonas throughout) was performed at an approximate 10X coverage by JGI. Roughly half of the genome is now contained on 26 scaffolds, all of which are at least 1.6 Mb, and the coverage of the genome is ~95%. There are now over 200,000 cDNA sequence reads that we have generated as part of the Chlamydomonas genome project (Grossman, 2003; Shrager et al., 2003; Grossman et al. 2007; Merchant et al., 2007); other sequences have also been generated by the Kasuza sequence group (Asamizu et al., 1999; Asamizu et al., 2000) or individual laboratories that have focused on specific genes. Shrager et al. (2003) placed the reads into distinct contigs (an assemblage of reads with overlapping nucleotide sequences), and contigs that group together as part of the same genes have been designated ACEs (assembly of contigs generated from EST information). All of the reads have also been mapped to the Chlamydomonas nuclear genome and the cDNAs and their corresponding genomic sequences have been reassembled, and the resulting assemblage is called an ACEG (an Assembly of contiguous EST sequences supported by genomic sequence) (Jain et al., 2007). Most of the unique genes or ACEGs are also represented by gene models that have been generated by the Joint Genome Institute (JGI, Walnut Creek, CA). These gene models have been placed onto the DNA scaffolds and are presented as a track on the Chlamydomonas genome browser associated with the genome portal (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html). Ultimately, the meeting grant awarded by DOE has helped enormously in the development of an annotation pipeline (a set of guidelines used in the annotation of genes) and resulted in high quality annotation of over 4,000 genes; the annotators were from both Europe and the USA. Some of the people who led the annotation initiative were Arthur Grossman, Olivier Vallon, and Sabeeha Merchant (with many individual

  5. STINGRAY: system for integrated genomic resources and analysis.

    Science.gov (United States)

    Wagner, Glauber; Jardim, Rodrigo; Tschoeke, Diogo A; Loureiro, Daniel R; Ocaña, Kary A C S; Ribeiro, Antonio C B; Emmel, Vanessa E; Probst, Christian M; Pitaluga, André N; Grisard, Edmundo C; Cavalcanti, Maria C; Campos, Maria L M; Mattoso, Marta; Dávila, Alberto M R

    2014-03-07

    The STINGRAY system has been conceived to ease the tasks of integrating, analyzing, annotating and presenting genomic and expression data from Sanger and Next Generation Sequencing (NGS) platforms. STINGRAY includes: (a) a complete and integrated workflow (more than 20 bioinformatics tools) ranging from functional annotation to phylogeny; (b) a MySQL database schema, suitable for data integration and user access control; and (c) a user-friendly graphical web-based interface that makes the system intuitive, facilitating the tasks of data analysis and annotation. STINGRAY showed to be an easy to use and complete system for analyzing sequencing data. While both Sanger and NGS platforms are supported, the system could be faster using Sanger data, since the large NGS datasets could potentially slow down the MySQL database usage. STINGRAY is available at http://stingray.biowebdb.org and the open source code at http://sourceforge.net/projects/stingray-biowebdb/.

  6. Ontology modularization to improve semantic medical image annotation.

    Science.gov (United States)

    Wennerberg, Pinar; Schulz, Klaus; Buitelaar, Paul

    2011-02-01

    Searching for medical images and patient reports is a significant challenge in a clinical setting. The contents of such documents are often not described in sufficient detail thus making it difficult to utilize the inherent wealth of information contained within them. Semantic image annotation addresses this problem by describing the contents of images and reports using medical ontologies. Medical images and patient reports are then linked to each other through common annotations. Subsequently, search algorithms can more effectively find related sets of documents on the basis of these semantic descriptions. A prerequisite to realizing such a semantic search engine is that the data contained within should have been previously annotated with concepts from medical ontologies. One major challenge in this regard is the size and complexity of medical ontologies as annotation sources. Manual annotation is particularly time consuming labor intensive in a clinical environment. In this article we propose an approach to reducing the size of clinical ontologies for more efficient manual image and text annotation. More precisely, our goal is to identify smaller fragments of a large anatomy ontology that are relevant for annotating medical images from patients suffering from lymphoma. Our work is in the area of ontology modularization, which is a recent and active field of research. We describe our approach, methods and data set in detail and we discuss our results. Copyright © 2010 Elsevier Inc. All rights reserved.

  7. Orchid: a novel management, annotation and machine learning framework for analyzing cancer mutations.

    Science.gov (United States)

    Cario, Clinton L; Witte, John S

    2018-03-15

    As whole-genome tumor sequence and biological annotation datasets grow in size, number and content, there is an increasing basic science and clinical need for efficient and accurate data management and analysis software. With the emergence of increasingly sophisticated data stores, execution environments and machine learning algorithms, there is also a need for the integration of functionality across frameworks. We present orchid, a python based software package for the management, annotation and machine learning of cancer mutations. Building on technologies of parallel workflow execution, in-memory database storage and machine learning analytics, orchid efficiently handles millions of mutations and hundreds of features in an easy-to-use manner. We describe the implementation of orchid and demonstrate its ability to distinguish tissue of origin in 12 tumor types based on 339 features using a random forest classifier. Orchid and our annotated tumor mutation database are freely available at https://github.com/wittelab/orchid. Software is implemented in python 2.7, and makes use of MySQL or MemSQL databases. Groovy 2.4.5 is optionally required for parallel workflow execution. JWitte@ucsf.edu. Supplementary data are available at Bioinformatics online.

  8. [Prescription annotations in Welfare Pharmacy].

    Science.gov (United States)

    Han, Yi

    2018-03-01

    Welfare Pharmacy contains medical formulas documented by the government and official prescriptions used by the official pharmacy in the pharmaceutical process. In the last years of Southern Song Dynasty, anonyms gave a lot of prescription annotations, made textual researches for the name, source, composition and origin of the prescriptions, and supplemented important historical data of medical cases and researched historical facts. The annotations of Welfare Pharmacy gathered the essence of medical theory, and can be used as precious materials to correctly understand the syndrome differentiation, compatibility regularity and clinical application of prescriptions. This article deeply investigated the style and form of the prescription annotations in Welfare Pharmacy, the name of prescriptions and the evolution of terminology, the major functions of the prescriptions, processing methods, instructions for taking medicine and taboos of prescriptions, the medical cases and clinical efficacy of prescriptions, the backgrounds, sources, composition and cultural meanings of prescriptions, proposed that the prescription annotations played an active role in the textual dissemination, patent medicine production and clinical diagnosis and treatment of Welfare Pharmacy. This not only helps understand the changes in the names and terms of traditional Chinese medicines in Welfare Pharmacy, but also provides the basis for understanding the knowledge sources, compatibility regularity, important drug innovations and clinical medications of prescriptions in Welfare Pharmacy. Copyright© by the Chinese Pharmaceutical Association.

  9. A Cancer-Indicative microRNA Pattern in Normal Prostate Tissue

    Directory of Open Access Journals (Sweden)

    Thorsten Schlomm

    2013-03-01

    Full Text Available We analyzed the levels of selected micro-RNAs in normal prostate tissue to assess their potential to indicate tumor foci elsewhere in the prostate. Histologically normal prostate tissue samples from 31 prostate cancer patients and two cancer negative control groups with either unsuspicious or elevated prostate specific antigen (PSA levels (14 and 17 individuals, respectively were analyzed. Based on the expression analysis of 157 microRNAs in a pool of prostate tissue samples and information from data bases/literature, we selected eight microRNAs for quantification by real-time polymerase chain reactions (RT-PCRs. Selected miRNAs were analyzed in histologically tumor-free biopsy samples from patients and healthy controls. We identified seven microRNAs (miR-124a, miR-146a & b, miR-185, miR-16 and let-7a & b, which displayed significant differential expression in normal prostate tissue from men with prostate cancer compared to both cancer negative control groups. Four microRNAs (miR-185, miR-16 and let-7a and let-7b remained to significantly discriminate normal tissues from prostate cancer patients from those of the cancer negative control group with elevated PSA levels. The transcript levels of these microRNAs were highly indicative for the presence of cancer in the prostates, independently of the PSA level. Our results suggest a microRNA-pattern in histologically normal prostate tissue, indicating prostate cancer elsewhere in the organ.

  10. An imprinted non-coding genomic cluster at 14q32 defines clinically relevant molecular subtypes in osteosarcoma across multiple independent datasets

    OpenAIRE

    Hill, Katherine E.; Kelly, Andrew D.; Kuijjer, Marieke L.; Barry, William; Rattani, Ahmed; Garbutt, Cassandra C.; Kissick, Haydn; Janeway, Katherine; Perez-Atayde, Antonio; Goldsmith, Jeffrey; Gebhardt, Mark C.; Arredouani, Mohamed S.; Cote, Greg; Hornicek, Francis; Choy, Edwin

    2017-01-01

    Background: A microRNA (miRNA) collection on the imprinted 14q32 MEG3 region has been associated with outcome in osteosarcoma. We assessed the clinical utility of this miRNA set and their association with methylation status. Methods: We integrated coding and non-coding RNA data from three independent annotated clinical osteosarcoma cohorts (n = 65, n = 27, and n = 25) and miRNA and methylation data from one in vitro (19 cell lines) and one clinical (NCI Therapeutically Applicable Research to ...

  11. Annotating abstract pronominal anaphora in the DAD project

    DEFF Research Database (Denmark)

    Navarretta, Costanza; Olsen, Sussi Anni

    2008-01-01

    n this paper we present an extension of the MATE/GNOME annotation scheme for anaphora (Poesio 2004) which accounts for abstract anaphora in Danish and Italian. By abstract anaphora it is here meant pronouns whose linguistic antecedents are verbal phrases, clauses and discourse segments. The exten......n this paper we present an extension of the MATE/GNOME annotation scheme for anaphora (Poesio 2004) which accounts for abstract anaphora in Danish and Italian. By abstract anaphora it is here meant pronouns whose linguistic antecedents are verbal phrases, clauses and discourse segments....... The extended scheme, which we call the DAD annotation scheme, allows to annotate information about abstract anaphora which is important to investigate their use, see Webber (1988), Gundel et al. (2003), Navarretta (2004) and which can influence their automatic treatment. Intercoder agreement scores obtained...... by applying the DAD annotation scheme on texts and dialogues in the two languages are given and show that th information proposed in the scheme can be recognised in a reliable way....

  12. Role of microRNA-7 and selenoprotein P in hepatocellular carcinoma.

    Science.gov (United States)

    Tarek, Marwa; Louka, Manal Louis; Khairy, Eman; Ali-Labib, Randa; Zakaria Zaky, Doaa; Montasser, Iman F

    2017-05-01

    There is an obvious need to diagnose hepatocellular carcinoma using novel non-invasive and sensitive biomarkers. In this regard, the aim of this study was to evaluate and correlate both relative quantification of microRNA-7 using quantitative real time polymerase chain reaction and quantitative analysis of selenoprotein P using enzyme-linked immunosorbent assay in sera of hepatocellular carcinoma patients, chronic liver disease patients, as well as normal healthy subjects in order to establish a new diagnostic biomarker with a valid non-invasive technique. In addition, this study aimed to investigate whether changes in selenium supply affect microRNA-7 expression and selenoprotein P levels in human hepatocarcinoma cell line (HepG2). The results showed a highly significant decrease in serum microRNA-7 relative quantification values and selenoprotein P levels in malignant group in comparison with benign and control groups. The best cutoff for serum microRNA-7 and selenoprotein P to discriminate hepatocellular carcinoma group from benign and control groups was 0.06 and 4.30 mg/L, respectively. Furthermore, this study showed that changes in selenium supply to HepG2 cell line can alter the microRNA-7 profile and are paralleled by changes in the concentration of its target protein (selenoprotein P). Hence, serum microRNA-7 and selenoprotein P appear to be potential non-invasive diagnostic markers for hepatocellular carcinoma. Moreover, the results suggest that selenium could be used as an anticancer therapy for hepatocellular carcinoma by affecting both microRNA-7 and selenoprotein P.

  13. MicroRNA-122 is involved in oxidative stress in isoniazid-induced liver injury in mice.

    Science.gov (United States)

    Song, L; Zhang, Z R; Zhang, J L; Zhu, X B; He, L; Shi, Z; Gao, L; Li, Y; Hu, B; Feng, F M

    2015-10-27

    Many studies have shown that the pathogenesis of liver injury includes oxidative stress. MicroRNA-122 may be a marker for the early diagnosis of drug-induced liver injury. However, the relationship between microRNA-122 and oxidative stress in anti-tuberculosis drug-induced liver injury remains unknown. We measured changes in tissue microRNA-122 levels and indices of oxidative stress during liver injury in mice after administration of isoniazid, a first-line anti-tuberculosis drug. We quantified microRNA-122 expression and indices of oxidative stress at 7 time points, including 1, 3, and 5 days and 1, 2, 3, and 4 weeks. The tissue microRNA-122 levels and oxidative stress significantly changed at 3 and 5 days, suggesting that isoniazid-induced liver injury reduces oxidative stress and microRNA-122 expression compared to in the control group (P microRNA-122, began to change at 5 days (P microRNA-122 profile may affect oxidative stress by regulating mitochondrial ribosome protein S11 gene during isoniazid-induced liver injury, which may contribute to the response mechanisms of microRNA-122 and oxidative stress.

  14. MicroRNA and gene signature of severe cutaneous drug ...

    African Journals Online (AJOL)

    greater than 30 % of the same patients [5]. Nevertheless, the mechanisms of SJS and TEN are not fully elucidated. MicroRNAs or miRs are single stranded RNAs that are capable of posttranscriptional gene regulation via targeting their Mrna [6]. MicroRNAs are very important regulators in many human diseases, for instance,.

  15. Quick Pad Tagger : An Efficient Graphical User Interface for Building Annotated Corpora with Multiple Annotation Layers

    OpenAIRE

    Marc Schreiber; Kai Barkschat; Bodo Kraft; Albert Zundorf

    2015-01-01

    More and more domain specific applications in the internet make use of Natural Language Processing (NLP) tools (e. g. Information Extraction systems). The output quality of these applications relies on the output quality of the used NLP tools. Often, the quality can be increased by annotating a domain specific corpus. However, annotating a corpus is a time consuming and exhaustive task. To reduce the annota tion time we present...

  16. The role of microRNA-200 in progression of human colorectal and breast cancer.

    Directory of Open Access Journals (Sweden)

    Linda Bojmar

    Full Text Available The role of the epithelial-mesenchymal transition (EMT in cancer has been studied extensively in vitro, but involvement of the EMT in tumorigenesis in vivo is largely unknown. We investigated the potential of microRNAs as clinical markers and analyzed participation of the EMT-associated microRNA-200-ZEB-E-cadherin pathway in cancer progression. Expression of the microRNA-200 family was quantified by real-time RT-PCR analysis of fresh-frozen and microdissected formalin-fixed paraffin-embedded primary colorectal tumors, normal colon mucosa, and matched liver metastases. MicroRNA expression was validated by in situ hybridization and after in vitro culture of the malignant cells. To assess EMT as a predictive marker, factors considered relevant in colorectal cancer were investigated in 98 primary breast tumors from a treatment-randomized study. Associations between the studied EMT-markers were found in primary breast tumors and in colorectal liver metastases. MicroRNA-200 expression in epithelial cells was lower in malignant mucosa than in normal mucosa, and was also decreased in metastatic compared to non-metastatic colorectal cancer. Low microRNA-200 expression in colorectal liver metastases was associated with bad prognosis. In breast cancer, low levels of microRNA-200 were related to reduced survival and high expression of microRNA-200 was predictive of benefit from radiotheraphy. MicroRNA-200 was associated with ER positive status, and inversely correlated to HER2 and overactivation of the PI3K/AKT pathway, that was associated with high ZEB1 mRNA expression. Our findings suggest that the stability of microRNAs makes them suitable as clinical markers and that the EMT-related microRNA-200-ZEB-E-cadherin signaling pathway is connected to established clinical characteristics and can give useful prognostic and treatment-predictive information in progressive breast and colorectal cancers.

  17. Circulating microRNA expression profiles associated with systemic lupus erythematosus

    DEFF Research Database (Denmark)

    Carlsen, Anting Liu; Schetter, Aaron J; Nielsen, Christoffer

    2013-01-01

    OBJECTIVE: To evaluate the specificity of expression patterns of cell-free, circulating microRNAs in systemic lupus erythematosus (SLE). METHODS: Total RNA was purified from plasma and 45 different specific mature microRNAs were determined using quantitative reverse transcription polymerase chain...

  18. Supplementary Material for: BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal M.; Alam, Intikhab; Bajic, Vladimir B.

    2015-01-01

    Abstract Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACONâ s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .

  19. Extending eScience Provenance with User-Submitted Semantic Annotations

    Science.gov (United States)

    Michaelis, J.; Zednik, S.; West, P.; Fox, P. A.; McGuinness, D. L.

    2010-12-01

    eScience based systems generate provenance of their data products, related to such things as: data processing, data collection conditions, expert evaluation, and data product quality. Recent advances in web-based technology offer users the possibility of making annotations to both data products and steps in accompanying provenance traces, thereby expanding the utility of such provenance for others. These contributing users may have varying backgrounds, ranging from system experts to outside domain experts to citizen scientists. Furthermore, such users may wish to make varying types of annotations - ranging from documenting the purpose of a provenance step to raising concerns about the quality of data dependencies. Semantic Web technologies allow for such kinds of rich annotations to be made to provenance through the use of ontology vocabularies for (i) organizing provenance, and (ii) organizing user/annotation classifications. Furthermore, through Linked Data practices, Semantic linkages may be made from provenance steps to external data of interest. A desire for Semantically-annotated provenance has been motivated by data management issues in the Mauna Loa Solar Observatory’s (MLSO) Advanced Coronal Observing System (ACOS). In ACOS, photomoeter-based readings are taken of solar activity and subsequently processed into final data products consumable by end users. At intermediate stages of ACOS processing, factors such as evaluations by human experts and weather conditions are logged, which could impact data product quality. If such factors are linked via user-submitted annotations to provenance, it could be significantly beneficial for other users. Likewise, the background of a user could impact the credibility of their annotations. For example, an annotation made by a citizen scientist describing the purpose of a provenance step may not be as reliable as a similar annotation made by an ACOS project member. For this work, we have developed a software package that

  20. Harnessing Collaborative Annotations on Online Formative Assessments

    Science.gov (United States)

    Lin, Jian-Wei; Lai, Yuan-Cheng

    2013-01-01

    This paper harnesses collaborative annotations by students as learning feedback on online formative assessments to improve the learning achievements of students. Through the developed Web platform, students can conduct formative assessments, collaboratively annotate, and review historical records in a convenient way, while teachers can generate…

  1. Research resources: comparative microRNA profiles in human corona radiata cells and cumulus oophorus cells detected by next-generation small RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Xian-Hong Tong

    Full Text Available During folliculogenesis, cumulus cells surrounding the oocyte differentiate into corona radiata cells (CRCs and cumulus oophorus cells (COCs, which are involved in gonadal steroidogenesis and the development of germ cells. Several studies suggested that microRNAs (miRNAs play an important regulatory role at the post-transcriptional level in cumulus cells. However, comparative miRNA profiles and associated processes in human CRCs and COCs have not been reported before. In this study, miRNA profiles were obtained from CRCs and COCs using next generation sequencing in women undergoing controlled ovarian stimulation for IVF. A total of 785 and 799 annotated miRNAs were identified in CRCs and COCs, while high expression levels of six novel miRNAs were detected both in CRCs and in COCs. In addition, different expression patterns in CRCs and COCs were detected in 72 annotated miRNAs. To confirm the miRNA profile in COCs and CRCs, quantitative real-time PCR was used to validate the expression of annotated miRNAs, differentially expressed miRNAs, and novel miRNAs. The miRNAs in the let-7 family were found to be involved in the regulation of a broad range of biological processes in both cumulus cell populations, which was accompanied by a large amount of miRNA editing. Bioinformatics analysis showed that amino acid and energy metabolism were targeted significantly by miRNAs that were differentially expressed between CRCs and COCs. Our work extends the current knowledge of the regulatory role of miRNAs and their targeted pathways in folliculogenesis, and provides novel candidates for molecular biomarkers in the research of female infertility.

  2. Crowdsourcing and annotating NER for Twitter #drift

    DEFF Research Database (Denmark)

    Fromreide, Hege; Hovy, Dirk; Søgaard, Anders

    2014-01-01

    We present two new NER datasets for Twitter; a manually annotated set of 1,467 tweets (kappa=0.942) and a set of 2,975 expert-corrected, crowdsourced NER annotated tweets from the dataset described in Finin et al. (2010). In our experiments with these datasets, we observe two important points: (a......) language drift on Twitter is significant, and while off-the-shelf systems have been reported to perform well on in-sample data, they often perform poorly on new samples of tweets, (b) state-of-the-art performance across various datasets can beobtained from crowdsourced annotations, making it more feasible...

  3. SNAD: sequence name annotation-based designer

    Directory of Open Access Journals (Sweden)

    Gorbalenya Alexander E

    2009-08-01

    Full Text Available Abstract Background A growing diversity of biological data is tagged with unique identifiers (UIDs associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a tedious exercise prone to mistakes and omissions. Results Here we introduce SNAD (Sequence Name Annotation-based Designer that mediates automatic conversion of sequence UIDs (associated with multiple alignment or phylogenetic tree, or supplied as plain text list into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. Conclusion A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.

  4. Identifying and exploiting trait-relevant tissues with multiple functional annotations in genome-wide association studies

    Science.gov (United States)

    Zhang, Shujun

    2018-01-01

    Genome-wide association studies (GWASs) have identified many disease associated loci, the majority of which have unknown biological functions. Understanding the mechanism underlying trait associations requires identifying trait-relevant tissues and investigating associations in a trait-specific fashion. Here, we extend the widely used linear mixed model to incorporate multiple SNP functional annotations from omics studies with GWAS summary statistics to facilitate the identification of trait-relevant tissues, with which to further construct powerful association tests. Specifically, we rely on a generalized estimating equation based algorithm for parameter inference, a mixture modeling framework for trait-tissue relevance classification, and a weighted sequence kernel association test constructed based on the identified trait-relevant tissues for powerful association analysis. We refer to our analytic procedure as the Scalable Multiple Annotation integration for trait-Relevant Tissue identification and usage (SMART). With extensive simulations, we show how our method can make use of multiple complementary annotations to improve the accuracy for identifying trait-relevant tissues. In addition, our procedure allows us to make use of the inferred trait-relevant tissues, for the first time, to construct more powerful SNP set tests. We apply our method for an in-depth analysis of 43 traits from 28 GWASs using tissue-specific annotations in 105 tissues derived from ENCODE and Roadmap. Our results reveal new trait-tissue relevance, pinpoint important annotations that are informative of trait-tissue relationship, and illustrate how we can use the inferred trait-relevant tissues to construct more powerful association tests in the Wellcome trust case control consortium study. PMID:29377896

  5. Identifying and exploiting trait-relevant tissues with multiple functional annotations in genome-wide association studies.

    Directory of Open Access Journals (Sweden)

    Xingjie Hao

    2018-01-01

    Full Text Available Genome-wide association studies (GWASs have identified many disease associated loci, the majority of which have unknown biological functions. Understanding the mechanism underlying trait associations requires identifying trait-relevant tissues and investigating associations in a trait-specific fashion. Here, we extend the widely used linear mixed model to incorporate multiple SNP functional annotations from omics studies with GWAS summary statistics to facilitate the identification of trait-relevant tissues, with which to further construct powerful association tests. Specifically, we rely on a generalized estimating equation based algorithm for parameter inference, a mixture modeling framework for trait-tissue relevance classification, and a weighted sequence kernel association test constructed based on the identified trait-relevant tissues for powerful association analysis. We refer to our analytic procedure as the Scalable Multiple Annotation integration for trait-Relevant Tissue identification and usage (SMART. With extensive simulations, we show how our method can make use of multiple complementary annotations to improve the accuracy for identifying trait-relevant tissues. In addition, our procedure allows us to make use of the inferred trait-relevant tissues, for the first time, to construct more powerful SNP set tests. We apply our method for an in-depth analysis of 43 traits from 28 GWASs using tissue-specific annotations in 105 tissues derived from ENCODE and Roadmap. Our results reveal new trait-tissue relevance, pinpoint important annotations that are informative of trait-tissue relationship, and illustrate how we can use the inferred trait-relevant tissues to construct more powerful association tests in the Wellcome trust case control consortium study.

  6. Creating Gaze Annotations in Head Mounted Displays

    DEFF Research Database (Denmark)

    Mardanbeigi, Diako; Qvarfordt, Pernilla

    2015-01-01

    To facilitate distributed communication in mobile settings, we developed GazeNote for creating and sharing gaze annotations in head mounted displays (HMDs). With gaze annotations it possible to point out objects of interest within an image and add a verbal description. To create an annota- tion...

  7. Circulating MicroRNAs as Potential Molecular Biomarkers in Pathophysiological Evolution of Pregnancy

    Directory of Open Access Journals (Sweden)

    Dragos Cretoiu

    2016-01-01

    Full Text Available MicroRNAs represent nonprotein coding small RNA molecules that are very stable to degradation and responsible for gene silencing in most eukaryotic cells. Increased evidence has been accumulating over the years about their potential value as biomarkers for several diseases. MicroRNAs were predicted to be involved in nearly all biological processes from development to oncogenesis. In this review, we address the importance of circulating microRNAs in different conditions associated with pregnancy starting with the implantation period to preeclampsia and we shortly describe the correlation between placental circulating miRNAs and pregnancy status. We also discuss the importance of microRNAs in recurrent abortion and ectopic pregnancy.

  8. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  9. MicroRNAs, the DNA damage response and cancer

    International Nuclear Information System (INIS)

    Wouters, Maikel D.; Gent, Dik C. van; Hoeijmakers, Jan H.J.; Pothof, Joris

    2011-01-01

    Many carcinogenic agents such as ultra-violet light from the sun and various natural and man-made chemicals act by damaging the DNA. To deal with these potentially detrimental effects of DNA damage, cells induce a complex DNA damage response (DDR) that includes DNA repair, cell cycle checkpoints, damage tolerance systems and apoptosis. This DDR is a potent barrier against carcinogenesis and defects within this response are observed in many, if not all, human tumors. DDR defects fuel the evolution of precancerous cells to malignant tumors, but can also induce sensitivity to DNA damaging agents in cancer cells, which can be therapeutically exploited by the use of DNA damaging treatment modalities. Regulation of and coordination between sub-pathways within the DDR is important for maintaining genome stability. Although regulation of the DDR has been extensively studied at the transcriptional and post-translational level, less is known about post-transcriptional gene regulation by microRNAs, the topic of this review. More specifically, we highlight current knowledge about DNA damage responsive microRNAs and microRNAs that regulate DNA damage response genes. We end by discussing the role of DNA damage response microRNAs in cancer etiology and sensitivity to ionizing radiation and other DNA damaging therapeutic agents.

  10. Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA.

    Directory of Open Access Journals (Sweden)

    Kumar Parijat Tripathi

    Full Text Available RNA-seq is a new tool to measure RNA transcript counts, using high-throughput sequencing at an extraordinary accuracy. It provides quantitative means to explore the transcriptome of an organism of interest. However, interpreting this extremely large data into biological knowledge is a problem, and biologist-friendly tools are lacking. In our lab, we developed Transcriptator, a web application based on a computational Python pipeline with a user-friendly Java interface. This pipeline uses the web services available for BLAST (Basis Local Search Alignment Tool, QuickGO and DAVID (Database for Annotation, Visualization and Integrated Discovery tools. It offers a report on statistical analysis of functional and Gene Ontology (GO annotation's enrichment. It helps users to identify enriched biological themes, particularly GO terms, pathways, domains, gene/proteins features and protein-protein interactions related informations. It clusters the transcripts based on functional annotations and generates a tabular report for functional and gene ontology annotations for each submitted transcript to the web server. The implementation of QuickGo web-services in our pipeline enable the users to carry out GO-Slim analysis, whereas the integration of PORTRAIT (Prediction of transcriptomic non coding RNA (ncRNA by ab initio methods helps to identify the non coding RNAs and their regulatory role in transcriptome. In summary, Transcriptator is a useful software for both NGS and array data. It helps the users to characterize the de-novo assembled reads, obtained from NGS experiments for non-referenced organisms, while it also performs the functional enrichment analysis of differentially expressed transcripts/genes for both RNA-seq and micro-array experiments. It generates easy to read tables and interactive charts for better understanding of the data. The pipeline is modular in nature, and provides an opportunity to add new plugins in the future. Web application is

  11. Understanding alcoholism through microRNA signatures in brains of human alcoholics

    Directory of Open Access Journals (Sweden)

    R. Dayne eMayfield

    2012-04-01

    Full Text Available Advances in the fields of genomics and genetics in the last decade have identified a large number of genes that can potentially influence alcohol-drinking behavior in humans as well as animal models. Consequently, the task of identifying efficient molecular targets that could be used to develop effective therapeutics against the disease has become increasingly daunting. One of the reasons for this is the fact that each of the many alcohol-responsive genes only contributes a small effect to the overall mechanism and disease phenotype, as is characteristic of complex traits. Current research trends are hence shifting towards the analysis of gene networks rather than emphasizing individual genes. The discovery of microRNAs and their mechanisms of action on regulation of transcript level and protein translation have made evident the utility of these small non-coding RNA molecules that act as central coordinators of multiple cross-communicating cellular pathways. Cells exploit the fact that a single microRNA can target hundreds of mRNA transcripts and that a single mRNA transcript can be simultaneously targeted by distinct microRNAs, to ensure fine-tuned and/or redundant control over a large number of cellular functions. By the same token, we can use these properties of microRNAs to develop novel, targeted strategies to combat complex disorders. In this review, we will focus on recent discoveries of microRNA signatures in brain of human alcoholics supporting the hypothesis that changes in gene expression and regulation by microRNAs are responsible for long-term neuroadaptations occurring during development of alcoholism. We also discuss insights into the potential modulation of epigenetic regulators by a subset of microRNAs. Taken together, microRNA activity may be controlling many of the cellular mechanisms already known to be involved in the development of alcoholism, and suggests potential targets for the development of novel therapeutic

  12. Cellular Response to Ionizing Radiation: A MicroRNA Story

    Science.gov (United States)

    Halimi, Mohammad; Asghari, S. Mohsen; Sariri, Reyhaneh; Moslemi, Dariush; Parsian, Hadi

    2012-01-01

    MicroRNAs (miRNAs) represent a class of small non-coding RNA molecules that regulate gene expression at the post-transcriptional level. They play a crucial role in diverse cellular pathways. Ionizing radiation (IR) is one of the most important treatment protocols for patients that suffer from cancer and affects directly or indirectly cellular integration. Recently it has been discovered that microRNA-mediated gene regulation interferes with radio-related pathways in ionizing radiation. Here, we review the recent discoveries about miRNAs in cellular response to IR. Thoroughly understanding the mechanism of miRNAs in radiation response, it will be possible to design new strategies for improving radiotherapy efficiency and ultimately cancer treatment. PMID:24551775

  13. Blood microRNAs in Low or No Risk Ischemic Stroke Patients

    Directory of Open Access Journals (Sweden)

    Jun Rong Tan

    2013-01-01

    Full Text Available Ischemic stroke is a multi-factorial disease where some patients present themselves with little or no risk factors. Blood microRNA expression profiles are becoming useful in the diagnosis and prognosis of human diseases. We therefore investigated the blood microRNA profiles in young stroke patients who presented with minimal or absence of risk factors for stroke such as type 2 diabetes, dyslipidemia and hypertension. Blood microRNA profiles from these patients varied with stroke subtypes as well as different functional outcomes (based on modified Rankin Score. These microRNAs have been shown to target genes that are involved in stroke pathogenesis. The findings from our study suggest that molecular mechanisms in stroke pathogenesis involving low or no risk ischemic stroke patients could differ substantially from those with pre-existing risk factors.

  14. Current status of research on microRNA associated with colorectal cancer liver metastasis

    Directory of Open Access Journals (Sweden)

    WANG Dongxu

    2016-12-01

    Full Text Available Tumor metastasis is a complicated process with multiple steps, and liver metastasis is the most common metastatic mode of colorectal cancer. Deep understanding and study of metastatic mechanism helps to find solutions for colorectal cancer liver metastasis. Recent studies have shown that microRNA are involved in tumor metastasis and recurrence, and studies on microRNA associated with colorectal cancer liver metastasis can provide new thoughts for the development and progression, diagnosis and treatment, and prognosis of the disease. This article summarizes the research advances in microRNA associated with colorectal cancer liver metastasis and reviews the biological function and molecular mechanism of microRNA, which suggests that microRNA have a vital significance in the field of tumor metastasis, especially colorectal cancer liver metastasis.

  15. Functional annotation of hierarchical modularity.

    Directory of Open Access Journals (Sweden)

    Kanchana Padmanabhan

    Full Text Available In biological networks of molecular interactions in a cell, network motifs that are biologically relevant are also functionally coherent, or form functional modules. These functionally coherent modules combine in a hierarchical manner into larger, less cohesive subsystems, thus revealing one of the essential design principles of system-level cellular organization and function-hierarchical modularity. Arguably, hierarchical modularity has not been explicitly taken into consideration by most, if not all, functional annotation systems. As a result, the existing methods would often fail to assign a statistically significant functional coherence score to biologically relevant molecular machines. We developed a methodology for hierarchical functional annotation. Given the hierarchical taxonomy of functional concepts (e.g., Gene Ontology and the association of individual genes or proteins with these concepts (e.g., GO terms, our method will assign a Hierarchical Modularity Score (HMS to each node in the hierarchy of functional modules; the HMS score and its p-value measure functional coherence of each module in the hierarchy. While existing methods annotate each module with a set of "enriched" functional terms in a bag of genes, our complementary method provides the hierarchical functional annotation of the modules and their hierarchically organized components. A hierarchical organization of functional modules often comes as a bi-product of cluster analysis of gene expression data or protein interaction data. Otherwise, our method will automatically build such a hierarchy by directly incorporating the functional taxonomy information into the hierarchy search process and by allowing multi-functional genes to be part of more than one component in the hierarchy. In addition, its underlying HMS scoring metric ensures that functional specificity of the terms across different levels of the hierarchical taxonomy is properly treated. We have evaluated our

  16. Fluid Annotations in a Open World

    DEFF Research Database (Denmark)

    Zellweger, Polle Trescott; Bouvin, Niels Olof; Jehøj, Henning

    2001-01-01

    Fluid Documents use animated typographical changes to provide a novel and appealing user experience for hypertext browsing and for viewing document annotations in context. This paper describes an effort to broaden the utility of Fluid Documents by using the open hypermedia Arakne Environment to l...... to layer fluid annotations and links on top of abitrary HTML pages on the World Wide Web. Changes to both Fluid Documents and Arakne are required....

  17. 14q32-encoded microRNAs mediate an oligometastatic phenotype.

    Science.gov (United States)

    Uppal, Abhineet; Wightman, Sean C; Mallon, Stephen; Oshima, Go; Pitroda, Sean P; Zhang, Qingbei; Huang, Xiaona; Darga, Thomas E; Huang, Lei; Andrade, Jorge; Liu, Huiping; Ferguson, Mark K; Greene, Geoffrey L; Posner, Mitchell C; Hellman, Samuel; Khodarev, Nikolai N; Weichselbaum, Ralph R

    2015-02-28

    Oligometastasis is a clinically distinct subset of metastasis characterized by a limited number of metastases potentially curable with localized therapies. We analyzed pathways targeted by microRNAs over-expressed in clinical oligometastasis samples and identified suppression of cellular adhesion, invasion, and motility pathways in association with the oligometastatic phenotype. We identified miR-127-5p, miR-544a, and miR-655-3p encoded in the 14q32 microRNA cluster as co-regulators of multiple metastatic pathways through repression of shared target genes. These microRNAs suppressed cellular adhesion and invasion and inhibited metastasis development in an animal model of breast cancer lung colonization. Target genes, including TGFBR2 and ROCK2, were key mediators of these effects. Understanding the role of microRNAs expressed in oligometastases may lead to improved identification of and interventions for patients with curable metastatic disease, as well as an improved understanding of the molecular basis of this unique clinical entity.

  18. No miR quirk: dysregulation of microRNAs in pancreatic ductal adenocarcinoma.

    Science.gov (United States)

    Cheung, Philip Y; Szafranska-Schwarzbach, Anna E; Schlageter, Annette M; Andruss, Bernard F; Weiss, Glen J

    2012-01-01

    MicroRNAs are post-transcriptional regulators of gene expression with tissue-specific expression profiles. Dysregulation of microRNAs has been shown to play a role in carcinogenesis. Although progress has been made in the diagnosis and treatment of many cancers, pancreatic cancer remains an intractable public health problem, causing 6.58% of cancer deaths despite making up less than 3% of cancer diagnoses in the United States. No screening, diagnostic or imaging techniques exist with the sensitivity to detect pancreatic cancer in its early, operable stages. Risk factors include numerous inherited syndromes, diabetes mellitus, and hepatitis C virus infection. Here we review the literature regarding dysregulation of microRNA expression in native pancreas, pancreatic ductal adenocarcinoma (the dominant form of pancreatic cancer), and its risk factors to illuminate the biology and progression of this disease. We explore promising evidence for the use of microRNAs as prognostic and diagnostic tools, and discuss emerging reports on microRNA therapeutics.

  19. Let-7 microRNAs are developmentally regulated in circulating human erythroid cells

    Directory of Open Access Journals (Sweden)

    Reed Christopher

    2009-11-01

    Full Text Available Abstract Background MicroRNAs are ~22nt-long small non-coding RNAs that negatively regulate protein expression through mRNA degradation or translational repression in eukaryotic cells. Based upon their importance in regulating development and terminal differentiation in model systems, erythrocyte microRNA profiles were examined at birth and in adults to determine if changes in their abundance coincide with the developmental phenomenon of hemoglobin switching. Methods Expression profiling of microRNA was performed using total RNA from four adult peripheral blood samples compared to four cord blood samples after depletion of plasma, platelets, and nucleated cells. Labeled RNAs were hybridized to custom spotted arrays containing 474 human microRNA species (miRBase release 9.1. Total RNA from Epstein-Barr virus (EBV-transformed lymphoblastoid cell lines provided a hybridization reference for all samples to generate microRNA abundance profile for each sample. Results Among 206 detected miRNAs, 79% of the microRNAs were present at equivalent levels in both cord and adult cells. By comparison, 37 microRNAs were up-regulated and 4 microRNAs were down-regulated in adult erythroid cells (fold change > 2; p let-7 miRNA family consistently demonstrated increased abundance in the adult samples by array-based analyses that were confirmed by quantitative PCR (4.5 to 18.4 fold increases in 6 of 8 let-7 miRNA. Profiling studies of messenger RNA (mRNA in these cells additionally demonstrated down-regulation of ten let-7 target genes in the adult cells. Conclusion These data suggest that a consistent pattern of up-regulation among let-7 miRNA in circulating erythroid cells occurs in association with hemoglobin switching during the fetal-to-adult developmental transition in humans.

  20. MicroRNA expression in benign breast tissue and risk of subsequent invasive breast cancer.

    Science.gov (United States)

    Rohan, Thomas; Ye, Kenny; Wang, Yihong; Glass, Andrew G; Ginsberg, Mindy; Loudig, Olivier

    2018-01-01

    MicroRNAs are endogenous, small non-coding RNAs that control gene expression by directing their target mRNAs for degradation and/or posttranscriptional repression. Abnormal expression of microRNAs is thought to contribute to the development and progression of cancer. A history of benign breast disease (BBD) is associated with increased risk of subsequent breast cancer. However, no large-scale study has examined the association between microRNA expression in BBD tissue and risk of subsequent invasive breast cancer (IBC). We conducted discovery and validation case-control studies nested in a cohort of 15,395 women diagnosed with BBD in a large health plan between 1971 and 2006 and followed to mid-2015. Cases were women with BBD who developed subsequent IBC; controls were matched 1:1 to cases on age, age at diagnosis of BBD, and duration of plan membership. The discovery stage (316 case-control pairs) entailed use of the Illumina MicroRNA Expression Profiling Assay (in duplicate) to identify breast cancer-associated microRNAs. MicroRNAs identified at this stage were ranked by the strength of the correlation between Illumina array and quantitative PCR results for 15 case-control pairs. The top ranked 14 microRNAs entered the validation stage (165 case-control pairs) which was conducted using quantitative PCR (in triplicate). In both stages, linear regression was used to evaluate the association between the mean expression level of each microRNA (response variable) and case-control status (independent variable); paired t-tests were also used in the validation stage. None of the 14 validation stage microRNAs was associated with breast cancer risk. The results of this study suggest that microRNA expression in benign breast tissue does not influence the risk of subsequent IBC.

  1. Black English Annotations for Elementary Reading Programs.

    Science.gov (United States)

    Prasad, Sandre

    This report describes a program that uses annotations in the teacher's editions of existing reading programs to indicate the characteristics of black English that may interfere with the reading process of black children. The first part of the report provides a rationale for the annotation approach, explaining that the discrepancy between written…

  2. Special Issue: Annotated Bibliography for Volumes XIX-XXXII.

    Science.gov (United States)

    Pullin, Richard A.

    1998-01-01

    This annotated bibliography lists 310 articles from the "Journal of Cooperative Education" from Volumes XIX-XXXII, 1983-1997. Annotations are presented in the order they appear in the journal; author and subject indexes are provided. (JOW)

  3. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    Directory of Open Access Journals (Sweden)

    Gustavo Arango-Argoty

    Full Text Available Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/, which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.

  4. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    Science.gov (United States)

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.

  5. MIPS: analysis and annotation of genome information in 2007.

    Science.gov (United States)

    Mewes, H W; Dietmann, S; Frishman, D; Gregory, R; Mannhaupt, G; Mayer, K F X; Münsterkötter, M; Ruepp, A; Spannagl, M; Stümpflen, V; Rattei, T

    2008-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. To cope with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).

  6. MetaStorm: A Public Resource for Customizable Metagenomics Annotation

    Science.gov (United States)

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S.; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution. PMID:27632579

  7. Pratiques d'annotations à l'ère des médias numériques : étude de cas de l'architexte Diigo.

    OpenAIRE

    Kerneis , Jacques; Thiault , Florence

    2014-01-01

    International audience; The integration of technology in higher education raises questions about the potential of digital university teaching. In this article, we approach the influence of environments (hardware or platforms) on the practice of digital annotation. Our project is to analyze the types of use produced by the architext of an online management tool of bookmarks Diigo. We rely on two case studies to identify annotation practices. A first study relates to the practices which student...

  8. Identification and Characterization of 293T Cell-Derived Exosomes by Profiling the Protein, mRNA and MicroRNA Components.

    Directory of Open Access Journals (Sweden)

    Jing Li

    Full Text Available Cell-derived exosomes are leading candidates for in vivo drug delivery carriers. In particular, exosomes derived from 293T cells are used most frequently, although exosome dosing has varied greatly among studies. Considering their biological origin, it is crucial to characterize the molecular composition of exosomes if large doses are to be administered in clinical settings. In this study, we present the first comprehensive analysis of the protein, messenger RNA and microRNA profiles of 293T cell-derived exosomes; then, we characterized these data using Gene Ontology annotation and Kyoto Encyclopedia for Genes and Genomes pathway analysis. Our study will provide the basis for the selection of 293T cell-derived exosome drug delivery systems. Profiling the exosomal signatures of 293T cells will lead to a better understanding of 293T exosome biology and will aid in the identification of any harmful factors in exosomes that could cause adverse clinical effects.

  9. Lost in translation. New unexplored avenues for neuropsychopharmacology: epigenetics and microRNAs.

    Science.gov (United States)

    Tardito, Daniela; Mallei, Alessandra; Popoli, Maurizio

    2013-02-01

    Mood and anxiety disorders are among the major causes of disability worldwide. Despite clear need for better therapies, efforts to develop novel drugs have been relatively unsuccessful. One major reason is lack of translation into neuropsychopharmacology of the impressive recent array of knowledge accrued by clinical and preclinical researches on the brain. Here focus is on epigenetics mechanisms, including microRNAs, which seem particularly promising for the identification of new targets for alternative pharmacological approaches. First, the current knowledge about epigenetic mechanisms, including DNA methylation, posttranslational modification of histone proteins, focusing on histone methylation and acetylation, and posttranscriptional modulation of gene expression by microRNAs is described. Then evidence showing involvement of epigenetics and microRNAs in the pathophysiology of mood and anxiety disorders as well as evidence showing that some of the currently employed antidepressants and mood stabilizers also affect epigenetic and microRNA mechanisms are reviewed. Finally current evidence and novel approaches in favor of drugs regulating epigenetic and microRNA mechanisms as potential therapeutics for these disorders are discussed. Although still in its infancy, research investigating the effects of pharmacological modulation of epigenetic and microRNA mechanisms in neuropsychiatric disorders continues to provide encouraging findings, suggesting new avenues for treatment of mood and anxiety disorders.

  10. The Bologna Annotation Resource (BAR 3.0): improving protein functional annotation.

    Science.gov (United States)

    Profiti, Giuseppe; Martelli, Pier Luigi; Casadio, Rita

    2017-07-03

    BAR 3.0 updates our server BAR (Bologna Annotation Resource) for predicting protein structural and functional features from sequence. We increase data volume, query capabilities and information conveyed to the user. The core of BAR 3.0 is a graph-based clustering procedure of UniProtKB sequences, following strict pairwise similarity criteria (sequence identity ≥40% with alignment coverage ≥90%). Each cluster contains the available annotation downloaded from UniProtKB, GO, PFAM and PDB. After statistical validation, GO terms and PFAM domains are cluster-specific and annotate new sequences entering the cluster after satisfying similarity constraints. BAR 3.0 includes 28 869 663 sequences in 1 361 773 clusters, of which 22.2% (22 241 661 sequences) and 47.4% (24 555 055 sequences) have at least one validated GO term and one PFAM domain, respectively. 1.4% of the clusters (36% of all sequences) include PDB structures and the cluster is associated to a hidden Markov model that allows building template-target alignment suitable for structural modeling. Some other 3 399 026 sequences are singletons. BAR 3.0 offers an improved search interface, allowing queries by UniProtKB-accession, Fasta sequence, GO-term, PFAM-domain, organism, PDB and ligand/s. When evaluated on the CAFA2 targets, BAR 3.0 largely outperforms our previous version and scores among state-of-the-art methods. BAR 3.0 is publicly available and accessible at http://bar.biocomp.unibo.it/bar3. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Role of MicroRNAs in Renin-Angiotensin-Aldosterone System-Mediated Cardiovascular Inflammation and Remodeling

    Directory of Open Access Journals (Sweden)

    Maricica Pacurari

    2015-01-01

    Full Text Available MicroRNAs are endogenous regulators of gene expression either by inhibiting translation or protein degradation. Recent studies indicate that microRNAs play a role in cardiovascular disease and renin-angiotensin-aldosterone system- (RAAS- mediated cardiovascular inflammation, either as mediators or being targeted by RAAS pharmacological inhibitors. The exact role(s of microRNAs in RAAS-mediated cardiovascular inflammation and remodeling is/are still in early stage of investigation. However, few microRNAs have been shown to play a role in RAAS signaling, particularly miR-155, miR-146a/b, miR-132/122, and miR-483-3p. Identification of specific microRNAs and their targets and elucidating microRNA-regulated mechanisms associated RAS-mediated cardiovascular inflammation and remodeling might lead to the development of novel pharmacological strategies to target RAAS-mediated vascular pathologies. This paper reviews microRNAs role in inflammatory factors mediating cardiovascular inflammation and RAAS genes and the effect of RAAS pharmacological inhibition on microRNAs and the resolution of RAAS-mediated cardiovascular inflammation and remodeling. Also, this paper discusses the advances on microRNAs-based therapeutic approaches that may be important in targeting RAAS signaling.

  12. Computational prediction and experimental validation of Ciona intestinalis microRNA genes

    Directory of Open Access Journals (Sweden)

    Pasquinelli Amy E

    2007-11-01

    Full Text Available Abstract Background This study reports the first collection of validated microRNA genes in the sea squirt, Ciona intestinalis. MicroRNAs are processed from hairpin precursors to ~22 nucleotide RNAs that base pair to target mRNAs and inhibit expression. As a member of the subphylum Urochordata (Tunicata whose larval form has a notochord, the sea squirt is situated at the emergence of vertebrates, and therefore may provide information about the evolution of molecular regulators of early development. Results In this study, computational methods were used to predict 14 microRNA gene families in Ciona intestinalis. The microRNA prediction algorithm utilizes configurable microRNA sequence conservation and stem-loop specificity parameters, grouping by miRNA family, and phylogenetic conservation to the related species, Ciona savignyi. The expression for 8, out of 9 attempted, of the putative microRNAs in the adult tissue of Ciona intestinalis was validated by Northern blot analyses. Additionally, a target prediction algorithm was implemented, which identified a high confidence list of 240 potential target genes. Over half of the predicted targets can be grouped into the gene ontology categories of metabolism, transport, regulation of transcription, and cell signaling. Conclusion The computational techniques implemented in this study can be applied to other organisms and serve to increase the understanding of the origins of non-coding RNAs, embryological and cellular developmental pathways, and the mechanisms for microRNA-controlled gene regulatory networks.

  13. The miR-10 microRNA precursor family

    DEFF Research Database (Denmark)

    Tehler, Disa; Høyland-Kroghsbo, Nina Molin; Lund, Anders H

    2011-01-01

    The miR-10 microRNA precursor family encodes a group of short non-coding RNAs involved in gene regulation. The miR-10 family is highly conserved and has sparked the interest of many research groups because of the genomic localization in the vicinity of, coexpression with and regulation of the Hox...... gene developmental regulators. Here, we review the current knowledge of the evolution, physiological function and involvement in cancer of this family of microRNAs....

  14. BAT: An open-source, web-based audio events annotation tool

    OpenAIRE

    Blai Meléndez-Catalan, Emilio Molina, Emilia Gómez

    2017-01-01

    In this paper we present BAT (BMAT Annotation Tool), an open-source, web-based tool for the manual annotation of events in audio recordings developed at BMAT (Barcelona Music and Audio Technologies). The main feature of the tool is that it provides an easy way to annotate the salience of simultaneous sound sources. Additionally, it allows to define multiple ontologies to adapt to multiple tasks and offers the possibility to cross-annotate audio data. Moreover, it is easy to install and deploy...

  15. Deregulated Cardiac Specific MicroRNAs in Postnatal Heart Growth

    Directory of Open Access Journals (Sweden)

    Pujiao Yu

    2016-01-01

    Full Text Available The heart is recognized as an organ that is terminally differentiated by adulthood. However, during the process of human development, the heart is the first organ with function in the embryo and grows rapidly during the postnatal period. MicroRNAs (miRNAs, miRs, as regulators of gene expression, play important roles during the development of multiple systems. However, the role of miRNAs in postnatal heart growth is still unclear. In this study, by using qRT-PCR, we compared the expression of seven cardiac- or muscle-specific miRNAs that may be related to heart development in heart tissue from mice at postnatal days 0, 3, 8, and 14. Four miRNAs—miR-1a-3p, miR-133b-3p, miR-208b-3p, and miR-206-3p—were significantly decreased while miR-208a-3p was upregulated during the postnatal heart growth period. Based on these results, GeneSpring GX was used to predict potential downstream targets by performing a 3-way comparison of predictions from the miRWalk, PITA, and microRNAorg databases. Gene Ontology (GO and Kyoto Encyclopedia of Genes and Genomes (KEGG analysis were used to identify potential functional annotations and signaling pathways related to postnatal heart growth. This study describes expression changes of cardiac- and muscle-specific miRNAs during postnatal heart growth and may provide new therapeutic targets for cardiovascular diseases.

  16. Identification of Conserved and Novel MicroRNAs in Blueberry

    Directory of Open Access Journals (Sweden)

    Junyang Yue

    2017-06-01

    Full Text Available MicroRNAs (miRNAs are a class of small endogenous RNAs that play important regulatory roles in cells by negatively affecting gene expression at both transcriptional and post-transcriptional levels. There have been extensive studies aiming to identify miRNAs and to elucidate their functions in various plant species. In the present study, we employed the high-throughput sequencing technology to profile miRNAs in blueberry fruits. A total of 9,992,446 small RNA tags with sizes ranged from 18 to 30 nt were obtained, indicating that blueberry fruits have a large and diverse small RNA population. Bioinformatic analysis identified 412 conserved miRNAs belonging to 29 families, and 35 predicted novel miRNAs that are likely to be unique to blueberries. Among them, expression profiles of five conserved miRNAs were validated by stem loop qRT-PCR. Furthermore, the potential target genes of conserved and novel miRNAs were predicted and subjected to Gene Ontology (GO annotation. Enrichment analysis of the GO-represented biological processes and molecular functions revealed that these target genes were potentially involved in a wide range of metabolic pathways and developmental processes. Particularly, anthocyanin biosynthesis has been predicted to be directly or indirectly regulated by diverse miRNA families. This study is the first report on genome-wide miRNA profile analysis in blueberry and it provides a useful resource for further elucidation of the functional roles of miRNAs during fruit development and ripening.

  17. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

    Science.gov (United States)

    Wang, Jia; Chen, Dijun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-Ling

    2014-01-01

    Citrus is one of the most important and widely grown fruit crop with global production ranking firstly among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, which is freely available at http://citrus.hzau.edu.cn/.

  18. Annotating images by mining image search results.

    Science.gov (United States)

    Wang, Xin-Jing; Zhang, Lei; Li, Xirong; Ma, Wei-Ying

    2008-11-01

    Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

  19. Collaborative Paper-Based Annotation of Lecture Slides

    Science.gov (United States)

    Steimle, Jurgen; Brdiczka, Oliver; Muhlhauser, Max

    2009-01-01

    In a study of notetaking in university courses, we found that the large majority of students prefer paper to computer-based media like Tablet PCs for taking notes and making annotations. Based on this finding, we developed CoScribe, a concept and system which supports students in making collaborative handwritten annotations on printed lecture…

  20. Music journals in South Africa 1854-2010: an annotated bibliography

    African Journals Online (AJOL)

    Music journals in South Africa 1854-2010: an annotated bibliography. ... The article focuses on presenting an annotated bibliography of music journalism in South Africa from as early as 1854 until 2010. Most of ... Key words: annotated bibliography, electronic journals, music journals, periodicals, South African music history ...

  1. MicroRNAs take part in pathophysiology and pathogenesis of Male Pattern Baldness.

    Science.gov (United States)

    Goodarzi, Hamed R; Abbasi, Ali; Saffari, Mojtaba; Tabei, Mohammad B; Noori Daloii, Mohammad R

    2010-07-01

    Male Pattern Baldness (MPB) or androgenetic alopecia is a common form of hair loss with androgens and genetics having etiological significance. Androgens are thought to pathophysiologically power on cascades of chronically dramatic alterations in genetically susceptible scalp dermal papillas, specialized cells in hair follicles in which androgens react, and finally resulting in a patterned alopecia. However, the exact mechanisms through which androgens, positive regulators of growth and anabolism in most body sites, paradoxically exert their effects on balding hair follicles, are not yet known. The role of microRNAs, a recently discovered class of non-coding RNAs, with a wide range of regulatory functions, has been documented in hair follicle formation and their deregulation in cancer of prostate, a target organ of androgens has also been delineated. Yet, there is a lack of knowledge in agreement with microRNAs' contribution in pathophysiology of MPB. To investigate the role of microRNAs in pathogenesis of MPB, we selected seven microRNAs, predicted bioinformatically on a reverse engineering basis, from previously published microarray gene expression data and analyzed their expression in balding relative to non-balding dermal papillas. We found for the first time upregulation of four microRNAs (miR-221, miR-125b, miR-106b and miR-410) that could participate in pathogenesis of MPB. Regarding microRNAs' therapeutic potential and accessibility of hair follicles for gene therapy, these microRNAs can be considered as good candidates for a new revolutionized generation of treatments.

  2. DNA methylation, microRNAs, and their crosstalk as potential biomarkers in hepatocellular carcinoma

    Science.gov (United States)

    Anwar, Sumadi Lukman; Lehmann, Ulrich

    2014-01-01

    Epigenetic alterations have been identified as a major characteristic in human cancers. Advances in the field of epigenetics have contributed significantly in refining our knowledge of molecular mechanisms underlying malignant transformation. DNA methylation and microRNA expression are epigenetic mechanisms that are widely altered in human cancers including hepatocellular carcinoma (HCC), the third leading cause of cancer related mortality worldwide. Both DNA methylation and microRNA expression patterns are regulated in developmental stage specific-, cell type specific- and tissue-specific manner. The aberrations are inferred in the maintenance of cancer stem cells and in clonal cell evolution during carcinogenesis. The availability of genome-wide technologies for DNA methylation and microRNA profiling has revolutionized the field of epigenetics and led to the discovery of a number of epigenetically silenced microRNAs in cancerous cells and primary tissues. Dysregulation of these microRNAs affects several key signalling pathways in hepatocarcinogenesis suggesting that modulation of DNA methylation and/or microRNA expression can serve as new therapeutic targets for HCC. Accumulative evidence shows that aberrant DNA methylation of certain microRNA genes is an event specifically found in HCC which correlates with unfavorable outcomes. Therefore, it can potentially serve as a biomarker for detection as well as for prognosis, monitoring and predicting therapeutic responses in HCC. PMID:24976726

  3. ComPath: comparative enzyme analysis and annotation in pathway/subsystem contexts

    Directory of Open Access Journals (Sweden)

    Kim Sun

    2008-03-01

    Full Text Available Abstract Background Once a new genome is sequenced, one of the important questions is to determine the presence and absence of biological pathways. Analysis of biological pathways in a genome is a complicated task since a number of biological entities are involved in pathways and biological pathways in different organisms are not identical. Computational pathway identification and analysis thus involves a number of computational tools and databases and typically done in comparison with pathways in other organisms. This computational requirement is much beyond the capability of biologists, so information systems for reconstructing, annotating, and analyzing biological pathways are much needed. We introduce a new comparative pathway analysis workbench, ComPath, which integrates various resources and computational tools using an interactive spreadsheet-style web interface for reliable pathway analyses. Results ComPath allows users to compare biological pathways in multiple genomes using a spreadsheet style web interface where various sequence-based analysis can be performed either to compare enzymes (e.g. sequence clustering and pathways (e.g. pathway hole identification, to search a genome for de novo prediction of enzymes, or to annotate a genome in comparison with reference genomes of choice. To fill in pathway holes or make de novo enzyme predictions, multiple computational methods such as FASTA, Whole-HMM, CSR-HMM (a method of our own introduced in this paper, and PDB-domain search are integrated in ComPath. Our experiments show that FASTA and CSR-HMM search methods generally outperform Whole-HMM and PDB-domain search methods in terms of sensitivity, but FASTA search performs poorly in terms of specificity, detecting more false positive as E-value cutoff increases. Overall, CSR-HMM search method performs best in terms of both sensitivity and specificity. Gene neighborhood and pathway neighborhood (global network visualization tools can be used

  4. An annotated corpus with nanomedicine and pharmacokinetic parameters.

    Science.gov (United States)

    Lewinski, Nastassja A; Jimenez, Ivan; McInnes, Bridget T

    2017-01-01

    A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration's Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided.

  5. Plann: A command-line application for annotating plastome sequences1

    Science.gov (United States)

    Huang, Daisie I.; Cronk, Quentin C. B.

    2015-01-01

    Premise of the study: Plann automates the process of annotating a plastome sequence in GenBank format for either downstream processing or for GenBank submission by annotating a new plastome based on a similar, well-annotated plastome. Methods and Results: Plann is a Perl script to be executed on the command line. Plann compares a new plastome sequence to the features annotated in a reference plastome and then shifts the intervals of any matching features to the locations in the new plastome. Plann’s output can be used in the National Center for Biotechnology Information’s tbl2asn to create a Sequin file for GenBank submission. Conclusions: Unlike Web-based annotation packages, Plann is a locally executable script that will accurately annotate a plastome sequence to a locally specified reference plastome. Because it executes from the command line, it is ready to use in other software pipelines and can be easily rerun as a draft plastome is improved. PMID:26312193

  6. MicroRNAs in Heart Failure, Cardiac Transplantation, and Myocardial Recovery: Biomarkers with Therapeutic Potential.

    Science.gov (United States)

    Shah, Palak; Bristow, Michael R; Port, J David

    2017-12-01

    Heart failure is increasing in prevalence with a lack of recently developed therapies that produce major beneficial effects on its associated mortality. MicroRNAs are small non-coding RNA molecules that regulate gene expression, are differentially regulated in heart failure, and are found in the circulation serving as a biomarker of heart failure. Data suggests that microRNAs may be used to detect allograft rejection in cardiac transplantation and may predict the degree of myocardial recovery in patients with a left ventricular assist device or treated with beta-blocker therapy. Given their role in regulating cellular function, microRNAs are an intriguing target for oligonucleotide therapeutics, designed to mimic or antagonize (antagomir) their biological effects. We review the current state of microRNAs as biomarkers of heart failure and associated conditions, the mechanisms by which microRNAs control cellular function, and how specific microRNAs may be targeted with novel therapeutics designed to treat heart failure.

  7. INDIGO - INtegrated data warehouse of microbial genomes with examples from the red sea extremophiles.

    Science.gov (United States)

    Alam, Intikhab; Antunes, André; Kamau, Allan Anthony; Ba Alawi, Wail; Kalkatawi, Manal; Stingl, Ulrich; Bajic, Vladimir B

    2013-01-01

    The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes. We developed a data warehouse system (INDIGO) that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO offers an opportunity to construct complex queries and combine annotations from multiple sources starting from genomic sequence to protein domain, gene ontology and pathway levels. This data warehouse is aimed at being populated with information from genomes of pure cultures and uncultured single cells of Red Sea bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea - extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of utilizing the system to gain new insights into specific aspects on the unique lifestyle and adaptations of these organisms to extreme environments. We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources to be used for annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes. It is aimed to serve as a resource dedicated to the Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG) pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo.

  8. MicroRNAs as Biomarkers for Liver Disease and Hepatocellular Carcinoma

    Directory of Open Access Journals (Sweden)

    C. Nelson Hayes

    2016-02-01

    Full Text Available Serum levels of liver enzymes, such as alanine transaminase, aspartate transaminase, and α-fetoprotein, provide insight into liver function and are used during treatment of liver disease, but such information is limited. In the case of hepatocellular carcinoma (HCC, which is often not detected until an advanced stage, more sensitive biomarkers may help to achieve earlier detection. Serum also contains microRNAs, a class of small non-coding RNAs that play an important role in regulating gene expression. miR-122 is specific to the liver and correlates strongly with liver enzyme levels and necroinflammatory activity, and other microRNAs are correlated with the degree of fibrosis. miR-122 has also been found to be required for hepatitis C virus (HCV infection, whereas other microRNAs have been shown to play antiviral roles. miR-125a-5p and miR-1231 have been shown to directly target hepatitis B virus (HBV transcripts, and others are up- or down-regulated in infected individuals. MicroRNA profiles also differ in the case of HBV and HCV infection as well as between HBeAg-positive and negative patients, and in patients with occult versus active HBV infection. In such patients, monitoring of changes in microRNA profiles might provide earlier warning of neoplastic changes preceding HCC.

  9. Evaluation of web-based annotation of ophthalmic images for multicentric clinical trials.

    Science.gov (United States)

    Chalam, K V; Jain, P; Shah, V A; Shah, Gaurav Y

    2006-06-01

    An Internet browser-based annotation system can be used to identify and describe features in digitalized retinal images, in multicentric clinical trials, in real time. In this web-based annotation system, the user employs a mouse to draw and create annotations on a transparent layer, that encapsulates the observations and interpretations of a specific image. Multiple annotation layers may be overlaid on a single image. These layers may correspond to annotations by different users on the same image or annotations of a temporal sequence of images of a disease process, over a period of time. In addition, geometrical properties of annotated figures may be computed and measured. The annotations are stored in a central repository database on a server, which can be retrieved by multiple users in real time. This system facilitates objective evaluation of digital images and comparison of double-blind readings of digital photographs, with an identifiable audit trail. Annotation of ophthalmic images allowed clinically feasible and useful interpretation to track properties of an area of fundus pathology. This provided an objective method to monitor properties of pathologies over time, an essential component of multicentric clinical trials. The annotation system also allowed users to view stereoscopic images that are stereo pairs. This web-based annotation system is useful and valuable in monitoring patient care, in multicentric clinical trials, telemedicine, teaching and routine clinical settings.

  10. An annotated bibliography of completed and in-progress behavioral research for the Office of Buildings and Community Systems. [About 1000 items, usually with abstracts

    Energy Technology Data Exchange (ETDEWEB)

    Weijo, R.O.; Roberson, B.F.; Eckert, R.; Anderson, M.R.

    1988-05-01

    This report provides an annotated bibliography of completed and in-progress consumer decision research useful for technology transfer and commercialization planning by the US Department of Energy's (DOE) Office of Buildings and Community Systems (OBCS). This report attempts to integrate the consumer research studies conducted across several public and private organizations over the last four to five years. Some of the sources of studies included in this annotated bibliography are DOE National Laboratories, public and private utilities, trade associations, states, and nonprofit organizations. This study divides the articles identified in this annotated bibliography into sections that are consistent with or similar to the system of organization used by OBCS.

  11. Essential Annotation Schema for Ecology (EASE)-A framework supporting the efficient data annotation and faceted navigation in ecology.

    Science.gov (United States)

    Pfaff, Claas-Thido; Eichenberg, David; Liebergesell, Mario; König-Ries, Birgitta; Wirth, Christian

    2017-01-01

    Ecology has become a data intensive science over the last decades which often relies on the reuse of data in cross-experimental analyses. However, finding data which qualifies for the reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only) is the most widely used search strategy although it is known to be inaccurate. Faceted navigation is providing a filter mechanism which is based on fine granular metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters which allows to refine and improve the results towards more relevance. We developed a framework for the efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines.

  12. Essential Annotation Schema for Ecology (EASE-A framework supporting the efficient data annotation and faceted navigation in ecology.

    Directory of Open Access Journals (Sweden)

    Claas-Thido Pfaff

    Full Text Available Ecology has become a data intensive science over the last decades which often relies on the reuse of data in cross-experimental analyses. However, finding data which qualifies for the reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only is the most widely used search strategy although it is known to be inaccurate. Faceted navigation is providing a filter mechanism which is based on fine granular metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters which allows to refine and improve the results towards more relevance. We developed a framework for the efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines.

  13. The MicroRNA Interaction Network of Lipid Diseases

    Science.gov (United States)

    Kandhro, Abdul H.; Shoombuatong, Watshara; Nantasenamat, Chanin; Prachayasittikul, Virapong; Nuchnoi, Pornlada

    2017-01-01

    Background: Dyslipidemia is one of the major forms of lipid disorder, characterized by increased triglycerides (TGs), increased low-density lipoprotein-cholesterol (LDL-C), and decreased high-density lipoprotein-cholesterol (HDL-C) levels in blood. Recently, MicroRNAs (miRNAs) have been reported to involve in various biological processes; their potential usage being a biomarkers and in diagnosis of various diseases. Computational approaches including text mining have been used recently to analyze abstracts from the public databases to observe the relationships/associations between the biological molecules, miRNAs, and disease phenotypes. Materials and Methods: In the present study, significance of text mined extracted pair associations (miRNA-lipid disease) were estimated by one-sided Fisher's exact test. The top 20 significant miRNA-disease associations were visualized on Cytoscape. The CyTargetLinker plug-in tool on Cytoscape was used to extend the network and predicts new miRNA target genes. The Biological Networks Gene Ontology (BiNGO) plug-in tool on Cytoscape was used to retrieve gene ontology (GO) annotations for the targeted genes. Results: We retrieved 227 miRNA-lipid disease associations including 148 miRNAs. The top 20 significant miRNAs analysis on CyTargetLinker provides defined, predicted and validated gene targets, further targeted genes analyzed by BiNGO showed targeted genes were significantly associated with lipid, cholesterol, apolipoprotein, and fatty acids GO terms. Conclusion: We are the first to provide a reliable miRNA-lipid disease association network based on text mining. This could help future experimental studies that aim to validate predicted gene targets. PMID:29018475

  14. Diverse microRNAs with convergent functions regulate tumorigenesis.

    Science.gov (United States)

    Zhu, Min-Yan; Zhang, Wei; Yang, Tao

    2016-02-01

    MicroRNAs (miRNAs) regulate several biological processes, including tumorigenesis. In order to comprehend the roles of miRNAs in cancer, various screens were performed to investigate the changes in the expression levels of miRNAs that occur in different types of cancer. The present review focuses on the results of five recent screens, whereby a number of overlapping miRNAs were identified to be downregulated or differentially regulated, whereas no miRNAs were observed to be frequently upregulated. Furthermore, the majority of the miRNAs that were common to >1 screen were involved in signaling networks, including wingless-related integration site, receptor tyrosine kinase and transforming growth factor-β, or in cell cycle checkpoint control. The present review will discuss the aforementioned miRNAs implicated in cell cycle checkpoint control and signaling networks.

  15. MicroRNAs in Cardiometabolic Diseases

    Directory of Open Access Journals (Sweden)

    Anna Meiliana

    2013-08-01

    Full Text Available BACKGROUND: MicroRNAs (miRNAs are ~22-nucleotide noncoding RNAs with critical functions in multiple physiological and pathological processes. An explosion of reports on the discovery and characterization of different miRNA species and their involvement in almost every aspect of cardiac biology and diseases has established an exciting new dimension in gene regulation networks for cardiac development and pathogenesis. CONTENT: Alterations in the metabolic control of lipid and glucose homeostasis predispose an individual to develop cardiometabolic diseases, such as type 2 diabetes mellitus and atherosclerosis. Work over the last years has suggested that miRNAs play an important role in regulating these physiological processes. Besides a cell-specific transcription factor profile, cell-specific miRNA-regulated gene expression is integral to cell fate and activation decisions. Thus, the cell types involved in atherosclerosis, vascular disease, and its myocardial sequelae may be differentially regulated by distinct miRNAs, thereby controlling highly complex processes, for example, smooth muscle cell phenotype and inflammatory responses of endothelial cells or macrophages. The recent advancements in using miRNAs as circulating biomarkers or therapeutic modalities, will hopefully be able to provide a strong basis for future research to further expand our insights into miRNA function in cardiovascular biology. SUMMARY: MiRNAs are small, noncoding RNAs that function as post-transcriptional regulators of gene expression. They are potent modulators of diverse biological processes and pathologies. Recent findings demonstrated the importance of miRNAs in the vasculature and the orchestration of lipid metabolism and glucose homeostasis. MiRNA networks represent an additional layer of regulation for gene expression that absorbs perturbations and ensures the robustness of biological systems. A detailed understanding of the molecular and cellular mechanisms of mi

  16. The analysis of novel microRNA mimic sequences in cancer cells reveals lack of specificity in stem-loop RT-qPCR-based microRNA detection.

    Science.gov (United States)

    Winata, Patrick; Williams, Marissa; McGowan, Eileen; Nassif, Najah; van Zandwijk, Nico; Reid, Glen

    2017-11-17

    MicroRNAs are frequently downregulated in cancer, and restoring expression has tumour suppressive activity in tumour cells. Our recent phase I clinical trial investigated microRNA-based therapy in patients with malignant pleural mesothelioma. Treatment with TargomiRs, microRNA mimics with novel sequence packaged in EGFR antibody-targeted bacterial minicells, revealed clear signs of clinical activity. In order to detect delivery of microRNA mimics to tumour cells in future clinical trials, we tested hydrolysis probe-based assays specific for the sequence of the novel mimics in transfected mesothelioma cell lines using RT-qPCR. The custom assays efficiently and specifically amplified the consensus mimics. However, we found that these assays gave a signal when total RNA from untransfected and control mimic-transfected cells were used as templates. Further investigation revealed that the reverse transcription step using stem-loop primers appeared to introduce substantial non-specific amplification with either total RNA or synthetic RNA templates. This suggests that reverse transcription using stem-loop primers suffers from an intrinsic lack of specificity for the detection of highly similar microRNAs in the same family, especially when analysing total RNA. These results suggest that RT-qPCR is unlikely to be an effective means to detect delivery of microRNA mimic-based drugs to tumour cells in patients.

  17. Search for microRNAs expressed by intracellular bacterial pathogens in infected mammalian cells.

    Science.gov (United States)

    Furuse, Yuki; Finethy, Ryan; Saka, Hector A; Xet-Mull, Ana M; Sisk, Dana M; Smith, Kristen L Jurcic; Lee, Sunhee; Coers, Jörn; Valdivia, Raphael H; Tobin, David M; Cullen, Bryan R

    2014-01-01

    MicroRNAs are expressed by all multicellular organisms and play a critical role as post-transcriptional regulators of gene expression. Moreover, different microRNA species are known to influence the progression of a range of different diseases, including cancer and microbial infections. A number of different human viruses also encode microRNAs that can attenuate cellular innate immune responses and promote viral replication, and a fungal pathogen that infects plants has recently been shown to express microRNAs in infected cells that repress host cell immune responses and promote fungal pathogenesis. Here, we have used deep sequencing of total expressed small RNAs, as well as small RNAs associated with the cellular RNA-induced silencing complex RISC, to search for microRNAs that are potentially expressed by intracellular bacterial pathogens and translocated into infected animal cells. In the case of Legionella and Chlamydia and the two mycobacterial species M. smegmatis and M. tuberculosis, we failed to detect any bacterial small RNAs that had the characteristics expected for authentic microRNAs, although large numbers of small RNAs of bacterial origin could be recovered. However, a third mycobacterial species, M. marinum, did express an ∼ 23-nt small RNA that was bound by RISC and derived from an RNA stem-loop with the characteristics expected for a pre-microRNA. While intracellular expression of this candidate bacterial microRNA was too low to effectively repress target mRNA species in infected cultured cells in vitro, artificial overexpression of this potential bacterial pre-microRNA did result in the efficient repression of a target mRNA. This bacterial small RNA therefore represents the first candidate microRNA of bacterial origin.

  18. Search for microRNAs expressed by intracellular bacterial pathogens in infected mammalian cells.

    Directory of Open Access Journals (Sweden)

    Yuki Furuse

    Full Text Available MicroRNAs are expressed by all multicellular organisms and play a critical role as post-transcriptional regulators of gene expression. Moreover, different microRNA species are known to influence the progression of a range of different diseases, including cancer and microbial infections. A number of different human viruses also encode microRNAs that can attenuate cellular innate immune responses and promote viral replication, and a fungal pathogen that infects plants has recently been shown to express microRNAs in infected cells that repress host cell immune responses and promote fungal pathogenesis. Here, we have used deep sequencing of total expressed small RNAs, as well as small RNAs associated with the cellular RNA-induced silencing complex RISC, to search for microRNAs that are potentially expressed by intracellular bacterial pathogens and translocated into infected animal cells. In the case of Legionella and Chlamydia and the two mycobacterial species M. smegmatis and M. tuberculosis, we failed to detect any bacterial small RNAs that had the characteristics expected for authentic microRNAs, although large numbers of small RNAs of bacterial origin could be recovered. However, a third mycobacterial species, M. marinum, did express an ∼ 23-nt small RNA that was bound by RISC and derived from an RNA stem-loop with the characteristics expected for a pre-microRNA. While intracellular expression of this candidate bacterial microRNA was too low to effectively repress target mRNA species in infected cultured cells in vitro, artificial overexpression of this potential bacterial pre-microRNA did result in the efficient repression of a target mRNA. This bacterial small RNA therefore represents the first candidate microRNA of bacterial origin.

  19. Annotating Logical Forms for EHR Questions.

    Science.gov (United States)

    Roberts, Kirk; Demner-Fushman, Dina

    2016-05-01

    This paper discusses the creation of a semantically annotated corpus of questions about patient data in electronic health records (EHRs). The goal is to provide the training data necessary for semantic parsers to automatically convert EHR questions into a structured query. A layered annotation strategy is used which mirrors a typical natural language processing (NLP) pipeline. First, questions are syntactically analyzed to identify multi-part questions. Second, medical concepts are recognized and normalized to a clinical ontology. Finally, logical forms are created using a lambda calculus representation. We use a corpus of 446 questions asking for patient-specific information. From these, 468 specific questions are found containing 259 unique medical concepts and requiring 53 unique predicates to represent the logical forms. We further present detailed characteristics of the corpus, including inter-annotator agreement results, and describe the challenges automatic NLP systems will face on this task.

  20. Expressed Peptide Tags: An additional layer of data for genome annotation

    Energy Technology Data Exchange (ETDEWEB)

    Savidor, Alon [ORNL; Donahoo, Ryan S [ORNL; Hurtado-Gonzales, Oscar [University of Tennessee, Knoxville (UTK); Verberkmoes, Nathan C [ORNL; Shah, Manesh B [ORNL; Lamour, Kurt H [ORNL; McDonald, W Hayes [ORNL

    2006-01-01

    While genome sequencing is becoming ever more routine, genome annotation remains a challenging process. Identification of the coding sequences within the genomic milieu presents a tremendous challenge, especially for eukaryotes with their complex gene architectures. Here we present a method to assist the annotation process through the use of proteomic data and bioinformatics. Mass spectra of digested protein preparations of the organism of interest were acquired and searched against a protein database created by a six frame translation of the genome. The identified peptides were mapped back to the genome, compared to the current annotation, and then categorized as supporting or extending the current genome annotation. We named the classified peptides Expressed Peptide Tags (EPTs). The well annotated bacterium Rhodopseudomonas palustris was used as a control for the method and showed high degree of correlation between EPT mapping and the current annotation, with 86% of the EPTs confirming existing gene calls and less than 1% of the EPTs expanding on the current annotation. The eukaryotic plant pathogens Phytophthora ramorum and Phytophthora sojae, whose genomes have been recently sequenced and are much less well annotated, were also subjected to this method. A series of algorithmic steps were taken to increase the confidence of EPT identification for these organisms, including generation of smaller sub-databases to be searched against, and definition of EPT criteria that accommodates the more complex eukaryotic gene architecture. As expected, the analysis of the Phytophthora species showed less correlation between EPT mapping and their current annotation. While ~77% of Phytophthora EPTs supported the current annotation, a portion of them (7.2% and 12.6% for P. ramorum and P. sojae, respectively) suggested modification to current gene calls or identified novel genes that were missed by the current genome annotation of these organisms.

  1. Identification and pathway analysis of microRNAs with no previous involvement in breast cancer.

    Directory of Open Access Journals (Sweden)

    Sandra Romero-Cordoba

    Full Text Available microRNA expression signatures can differentiate normal and breast cancer tissues and can define specific clinico-pathological phenotypes in breast tumors. In order to further evaluate the microRNA expression profile in breast cancer, we analyzed the expression of 667 microRNAs in 29 tumors and 21 adjacent normal tissues using TaqMan Low-density arrays. 130 miRNAs showed significant differential expression (adjusted P value = 0.05, Fold Change = 2 in breast tumors compared to the normal adjacent tissue. Importantly, the role of 43 of these microRNAs has not been previously reported in breast cancer, including several evolutionary conserved microRNA*, showing similar expression rates to that of their corresponding leading strand. The expression of 14 microRNAs was replicated in an independent set of 55 tumors. Bioinformatic analysis of mRNA targets of the altered miRNAs, identified oncogenes like ERBB2, YY1, several MAP kinases, and known tumor-suppressors like FOXA1 and SMAD4. Pathway analysis identified that some biological process which are important in breast carcinogenesis are affected by the altered microRNA expression, including signaling through MAP kinases and TP53 pathways, as well as biological processes like cell death and communication, focal adhesion and ERBB2-ERBB3 signaling. Our data identified the altered expression of several microRNAs whose aberrant expression might have an important impact on cancer-related cellular pathways and whose role in breast cancer has not been previously described.

  2. MicroRNA-target binding structures mimic microRNA duplex structures in humans.

    Directory of Open Access Journals (Sweden)

    Xi Chen

    Full Text Available Traditionally, researchers match a microRNA guide strand to mRNA sequences using sequence comparisons to predict its potential target genes. However, many of the predictions can be false positives due to limitations in sequence comparison alone. In this work, we consider the association of two related RNA structures that share a common guide strand: the microRNA duplex and the microRNA-target binding structure. We have analyzed thousands of such structure pairs and found many of them share high structural similarity. Therefore, we conclude that when predicting microRNA target genes, considering just the microRNA guide strand matches to gene sequences may not be sufficient--the microRNA duplex structure formed by the guide strand and its companion passenger strand must also be considered. We have developed software to translate RNA binding structure into encoded representations, and we have also created novel automatic comparison methods utilizing such encoded representations to determine RNA structure similarity. Our software and methods can be utilized in the other RNA secondary structure comparisons as well.

  3. MicroRNA expression profiles associated with pancreatic adenocarcinoma and ampullary adenocarcinoma

    DEFF Research Database (Denmark)

    Schultz, Nicolai A; Werner, Jens; Willenbrock, Hanni

    2012-01-01

    MicroRNAs have potential as diagnostic cancer biomarkers. The aim of this study was (1) to define microRNA expression patterns in formalin-fixed parafin-embedded tissue from pancreatic ductal adenocarcinoma, ampullary adenocarcinoma, normal pancreas and chronic pancreatitis without using micro-di...

  4. Multivendor Spectral-Domain Optical Coherence Tomography Dataset, Observer Annotation Performance Evaluation, and Standardized Evaluation Framework for Intraretinal Cystoid Fluid Segmentation

    Directory of Open Access Journals (Sweden)

    Jing Wu

    2016-01-01

    Full Text Available Development of image analysis and machine learning methods for segmentation of clinically significant pathology in retinal spectral-domain optical coherence tomography (SD-OCT, used in disease detection and prediction, is limited due to the availability of expertly annotated reference data. Retinal segmentation methods use datasets that either are not publicly available, come from only one device, or use different evaluation methodologies making them difficult to compare. Thus we present and evaluate a multiple expert annotated reference dataset for the problem of intraretinal cystoid fluid (IRF segmentation, a key indicator in exudative macular disease. In addition, a standardized framework for segmentation accuracy evaluation, applicable to other pathological structures, is presented. Integral to this work is the dataset used which must be fit for purpose for IRF segmentation algorithm training and testing. We describe here a multivendor dataset comprised of 30 scans. Each OCT scan for system training has been annotated by multiple graders using a proprietary system. Evaluation of the intergrader annotations shows a good correlation, thus making the reproducibly annotated scans suitable for the training and validation of image processing and machine learning based segmentation methods. The dataset will be made publicly available in the form of a segmentation Grand Challenge.

  5. Fuzzy Emotional Semantic Analysis and Automated Annotation of Scene Images

    Directory of Open Access Journals (Sweden)

    Jianfang Cao

    2015-01-01

    Full Text Available With the advances in electronic and imaging techniques, the production of digital images has rapidly increased, and the extraction and automated annotation of emotional semantics implied by images have become issues that must be urgently addressed. To better simulate human subjectivity and ambiguity for understanding scene images, the current study proposes an emotional semantic annotation method for scene images based on fuzzy set theory. A fuzzy membership degree was calculated to describe the emotional degree of a scene image and was implemented using the Adaboost algorithm and a back-propagation (BP neural network. The automated annotation method was trained and tested using scene images from the SUN Database. The annotation results were then compared with those based on artificial annotation. Our method showed an annotation accuracy rate of 91.2% for basic emotional values and 82.4% after extended emotional values were added, which correspond to increases of 5.5% and 8.9%, respectively, compared with the results from using a single BP neural network algorithm. Furthermore, the retrieval accuracy rate based on our method reached approximately 89%. This study attempts to lay a solid foundation for the automated emotional semantic annotation of more types of images and therefore is of practical significance.

  6. Multiview Hessian regularization for image annotation.

    Science.gov (United States)

    Liu, Weifeng; Tao, Dacheng

    2013-07-01

    The rapid development of computer hardware and Internet technology makes large scale data dependent models computationally tractable, and opens a bright avenue for annotating images through innovative machine learning algorithms. Semisupervised learning (SSL) therefore received intensive attention in recent years and was successfully deployed in image annotation. One representative work in SSL is Laplacian regularization (LR), which smoothes the conditional distribution for classification along the manifold encoded in the graph Laplacian, however, it is observed that LR biases the classification function toward a constant function that possibly results in poor generalization. In addition, LR is developed to handle uniformly distributed data (or single-view data), although instances or objects, such as images and videos, are usually represented by multiview features, such as color, shape, and texture. In this paper, we present multiview Hessian regularization (mHR) to address the above two problems in LR-based image annotation. In particular, mHR optimally combines multiple HR, each of which is obtained from a particular view of instances, and steers the classification function that varies linearly along the data manifold. We apply mHR to kernel least squares and support vector machines as two examples for image annotation. Extensive experiments on the PASCAL VOC'07 dataset validate the effectiveness of mHR by comparing it with baseline algorithms, including LR and HR.

  7. Ten steps to get started in Genome Assembly and Annotation

    Science.gov (United States)

    Dominguez Del Angel, Victoria; Hjerde, Erik; Sterck, Lieven; Capella-Gutierrez, Salvadors; Notredame, Cederic; Vinnere Pettersson, Olga; Amselem, Joelle; Bouri, Laurent; Bocs, Stephanie; Klopp, Christophe; Gibrat, Jean-Francois; Vlasova, Anna; Leskosek, Brane L.; Soler, Lucile; Binzer-Panchal, Mahesh; Lantz, Henrik

    2018-01-01

    As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR). PMID:29568489

  8. Sharing Map Annotations in Small Groups: X Marks the Spot

    Science.gov (United States)

    Congleton, Ben; Cerretani, Jacqueline; Newman, Mark W.; Ackerman, Mark S.

    Advances in location-sensing technology, coupled with an increasingly pervasive wireless Internet, have made it possible (and increasingly easy) to access and share information with context of one’s geospatial location. We conducted a four-phase study, with 27 students, to explore the practices surrounding the creation, interpretation and sharing of map annotations in specific social contexts. We found that annotation authors consider multiple factors when deciding how to annotate maps, including the perceived utility to the audience and how their contributions will reflect on the image they project to others. Consumers of annotations value the novelty of information, but must be convinced of the author’s credibility. In this paper we describe our study, present the results, and discuss implications for the design of software for sharing map annotations.

  9. Annotation-based feature extraction from sets of SBML models.

    Science.gov (United States)

    Alm, Rebekka; Waltemath, Dagmar; Wolfien, Markus; Wolkenhauer, Olaf; Henkel, Ron

    2015-01-01

    Model repositories such as BioModels Database provide computational models of biological systems for the scientific community. These models contain rich semantic annotations that link model entities to concepts in well-established bio-ontologies such as Gene Ontology. Consequently, thematically similar models are likely to share similar annotations. Based on this assumption, we argue that semantic annotations are a suitable tool to characterize sets of models. These characteristics improve model classification, allow to identify additional features for model retrieval tasks, and enable the comparison of sets of models. In this paper we discuss four methods for annotation-based feature extraction from model sets. We tested all methods on sets of models in SBML format which were composed from BioModels Database. To characterize each of these sets, we analyzed and extracted concepts from three frequently used ontologies, namely Gene Ontology, ChEBI and SBO. We find that three out of the methods are suitable to determine characteristic features for arbitrary sets of models: The selected features vary depending on the underlying model set, and they are also specific to the chosen model set. We show that the identified features map on concepts that are higher up in the hierarchy of the ontologies than the concepts used for model annotations. Our analysis also reveals that the information content of concepts in ontologies and their usage for model annotation do not correlate. Annotation-based feature extraction enables the comparison of model sets, as opposed to existing methods for model-to-keyword comparison, or model-to-model comparison.

  10. In Situ Detection of MicroRNA Expression with RNAscope Probes.

    Science.gov (United States)

    Yin, Viravuth P

    2018-01-01

    Elucidating the spatial resolution of gene transcripts provides important insight into potential gene function. MicroRNAs are short, singled-stranded noncoding RNAs that control gene expression through base-pair complementarity with target mRNAs in the 3' untranslated region (UTR) and inhibiting protein expression. However, given their small size of ~22- to 24-nt and low expression levels, standard in situ hybridization detection methods are not amendable for microRNA spatial resolution. Here, I describe a technique that employs RNAscope probe design and propriety amplification technology that provides simultaneous single molecule detection of individual microRNA and its target gene. This method allows for rapid and sensitive detection of noncoding RNA transcripts in frozen tissue sections.

  11. The Plant Genome Integrative Explorer Resource: PlantGenIE.org.

    Science.gov (United States)

    Sundell, David; Mannapperuma, Chanaka; Netotea, Sergiu; Delhomme, Nicolas; Lin, Yao-Cheng; Sjödin, Andreas; Van de Peer, Yves; Jansson, Stefan; Hvidsten, Torgeir R; Street, Nathaniel R

    2015-12-01

    Accessing and exploring large-scale genomics data sets remains a significant challenge to researchers without specialist bioinformatics training. We present the integrated PlantGenIE.org platform for exploration of Populus, conifer and Arabidopsis genomics data, which includes expression networks and associated visualization tools. Standard features of a model organism database are provided, including genome browsers, gene list annotation, Blast homology searches and gene information pages. Community annotation updating is supported via integration of WebApollo. We have produced an RNA-sequencing (RNA-Seq) expression atlas for Populus tremula and have integrated these data within the expression tools. An updated version of the ComPlEx resource for performing comparative plant expression analyses of gene coexpression network conservation between species has also been integrated. The PlantGenIE.org platform provides intuitive access to large-scale and genome-wide genomics data from model forest tree species, facilitating both community contributions to annotation improvement and tools supporting use of the included data resources to inform biological insight. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  12. Further Development of an Exhaled microRNA Biomarker of Lung Cancer Risk

    Science.gov (United States)

    2017-08-01

    AWARD NUMBER: W81XWH-16-1-0328 TITLE: Further Development of an Exhaled microRNA Biomarker of Lung Cancer Risk PRINCIPAL INVESTIGATOR: Dr...4. TITLE AND SUBTITLE Further Development of an Exhaled microRNA Biomarker of Lung Cancer Risk 5b. GRANT NUMBER W81XWH 16-1-0328 5c. PROGRAM...devise a non-invasive airway based exhaled microRNA metric for lung cancer risk, initial work to be tested in a case control study. We expanded the

  13. A plasma microRNA signature as a biomarker for acquired aplastic anemia.

    Science.gov (United States)

    Hosokawa, Kohei; Kajigaya, Sachiko; Feng, Xingmin; Desierto, Marie J; Fernandez Ibanez, Maria Del Pilar; Rios, Olga; Weinstein, Barbara; Scheinberg, Phillip; Townsley, Danielle M; Young, Neal S

    2017-01-01

    Aplastic anemia is an acquired bone marrow failure characterized by marrow hypoplasia, a paucity of hematopoietic stem and progenitor cells, and pancytopenia of the peripheral blood, due to immune attack on the bone marrow. In aplastic anemia, a major challenge is to develop immune biomarkers to monitor the disease. We measured circulating microRNAs in plasma samples of aplastic anemia patients in order to identify disease-specific microRNAs. A total of 179 microRNAs were analyzed in 35 plasma samples from 13 aplastic anemia patients, 11 myelodysplastic syndrome patients, and 11 healthy controls using the Serum/Plasma Focus microRNA Polymerase Chain Reaction Panel. Subsequently, 19 microRNAs from the discovery set were investigated in the 108 plasma samples from 41 aplastic anemia patients, 24 myelodysplastic syndrome patients, and 43 healthy controls for validation, confirming that 3 microRNAs could be validated as dysregulated (>1.5-fold change) in aplastic anemia, compared to healthy controls. MiR-150-5p (induction of T-cell differentiation) and miR-146b-5p (involvement in the feedback regulation of innate immune response) were elevated in aplastic anemia plasma, whereas miR-1 was decreased in aplastic anemia. By receiver operating characteristic curve analysis, we developed a logistic model with these 3 microRNAs that enabled us to predict the probability of a diagnosis of aplastic anemia with an area under the curve of 0.86. Dysregulated expression levels of the microRNAs became normal after immunosuppressive therapy at 6 months. Specifically, miR-150-5p expression was significantly reduced after successful immunosuppressive therapy, but did not change in non-responders. We propose 3 novel plasma biomarkers in aplastic anemia, in which miR-150-5p, miR-146b-5p, and miR-1 can serve for diagnosis and miR-150-5p for disease monitoring. Clinicaltrials.gov identifiers:00260689, 00217594, 00961064. Copyright© Ferrata Storti Foundation.

  14. Role of microRNAs in the immune system, inflammation and cancer.

    Science.gov (United States)

    Raisch, Jennifer; Darfeuille-Michaud, Arlette; Nguyen, Hang Thi Thu

    2013-05-28

    MicroRNAs, a key class of gene expression regulators, have emerged as crucial players in various biological processes such as cellular proliferation and differentiation, development and apoptosis. In addition, microRNAs are coming to light as crucial regulators of innate and adaptive immune responses, and their abnormal expression and/or function in the immune system have been linked to multiple human diseases including inflammatory disorders, such as inflammatory bowel disease, and cancers. In this review, we discuss our current understanding of microRNAs with a focus on their role and mode of action in regulating the immune system during inflammation and carcinogenesis.

  15. Roadmap for annotating transposable elements in eukaryote genomes.

    Science.gov (United States)

    Permal, Emmanuelle; Flutre, Timothée; Quesneville, Hadi

    2012-01-01

    Current high-throughput techniques have made it feasible to sequence even the genomes of non-model organisms. However, the annotation process now represents a bottleneck to genome analysis, especially when dealing with transposable elements (TE). Combined approaches, using both de novo and knowledge-based methods to detect TEs, are likely to produce reasonably comprehensive and sensitive results. This chapter provides a roadmap for researchers involved in genome projects to address this issue. At each step of the TE annotation process, from the identification of TE families to the annotation of TE copies, we outline the tools and good practices to be used.

  16. Clinical applications of microRNAs [v3; ref status: indexed, http://f1000r.es/218

    Directory of Open Access Journals (Sweden)

    Per Hydbring

    2013-10-01

    Full Text Available MicroRNAs represent a class of small RNAs derived from polymerase II controlled transcriptional regions. The primary transcript forms one or several bulging double stranded hairpins which are processed by Drosha and Dicer into hetero-duplexes. The targeting microRNA strand of the duplex is incorporated into the RNA Induced Silencing Complex from where it silences up to hundreds of mRNA transcript by inducing mRNA degradation or blocking protein translation. Apart from involvement in a variety of biological processes, microRNAs were early recognized for their potential in disease diagnostics and therapeutics. Due to their stability, microRNAs could be used as biomarkers. Currently, there are microRNA panels helping physicians determining the origins of cancer in disseminated tumors. The development of microRNA therapeutics has proved more challenging mainly due to delivery issues. However, one drug is already in clinical trials and several more await entering clinical phases. This review summarizes what has been recognized pre-clinically and clinically on diagnostic microRNAs. In addition, it highlights individual microRNA drugs in running platforms driven by four leading microRNA-therapeutic companies.

  17. Centrifugation: an important pre-analytic procedure that influences plasma microRNA quantification during blood processing.

    Science.gov (United States)

    Zheng, Xiao-Hui; Cui, Cui; Zhou, Xin-Xi; Zeng, Yi-Xin; Jia, Wei-Hua

    2013-12-01

    Circulating microRNAs are robustly present in plasma or serum and have become a research focus as biomarkers for tumor diagnosis and prognosis. Centrifugation is a necessary procedure for obtaining high-quality blood supernatant. Herein, we investigated one-step and two-step centrifugations, two centrifugal methods routinely used in microRNA study, to explore their effects on plasma microRNA quantification. The microRNAs obtained from one-step and two-step centrifugations were quantified by microarray and TaqMan-based real-time quantitative polymerase chain reaction (Q-PCR). Dynamic light scattering was performed to explore the difference underlying the two centrifugal methods. The results from the microarray containing 1,347 microRNAs showed that the signal detection rate was greatly decreased in the plasma sample prepared by two-step centrifugation. More importantly, the microRNAs missing in this plasma sample could be recovered and detected in the precipitate generated from the second centrifugation. Consistent with the results from microarray, a marked decrease of three representative microRNAs in two-step centrifugal plasma was validated by Q-PCR. According to the size distribution of all nanoparticles in plasma, there were fewer nanoparticles with size >1,000 nm in two-step centrifugal plasma. Our experiments directly demonstrated that different centrifugation methods produced distinct quantities of plasma microRNAs. Thus, exosomes or protein complexes containing microRNAs may be involved in large nanoparticle formation and may be precipitated after two-step centrifugation. Our results remind us that sample processing methods should be first considered in conducting research.

  18. The Effects of Multimedia Annotations on Iranian EFL Learners’ L2 Vocabulary Learning

    Directory of Open Access Journals (Sweden)

    Saeideh Ahangari

    2010-05-01

    Full Text Available In our modern technological world, Computer-Assisted Language learning (CALL is a new realm towards learning a language in general, and learning L2 vocabulary in particular. It is assumed that the use of multimedia annotations promotes language learners’ vocabulary acquisition. Therefore, this study set out to investigate the effects of different multimedia annotations (still picture annotations, dynamic picture annotations, and written annotations on L2 vocabulary learning. To fulfill this objective, the researchers selected sixty four EFL learners as the participants of this study. The participants were randomly assigned to one of the four groups: a control group that received no annotations and three experimental groups that received:  still picture annotations, dynamic picture annotations, and written annotations. Each participant was required to take a pre-test. A vocabulary post- test was also designed and administered to the participants in order to assess the efficacy of each annotation. First for each group a paired t-test was conducted between their pre and post test scores in order to observe their improvement; then through an ANCOVA test the performance of four groups was compared. The results showed that using multimedia annotations resulted in a significant difference in the participants’ vocabulary learning. Based on the results of the present study, multimedia annotations are suggested as a vocabulary teaching strategy.

  19. Evidence for X-chromosomal schizophrenia associated with microRNA alterations.

    Directory of Open Access Journals (Sweden)

    Jinong Feng

    2009-07-01

    Full Text Available Schizophrenia is a severe disabling brain disease affecting about 1% of the population. Individual microRNAs (miRNAs affect moderate downregulation of gene expression. In addition, components required for miRNA processing and/or function have also been implicated in X-linked mental retardation, neurological and neoplastic diseases, pointing to the wide ranging involvement of miRNAs in disease.To explore the role of miRNAs in schizophrenia, 59 microRNA genes on the X-chromosome were amplified and sequenced in males with (193 and without (191 schizophrenia spectrum disorders to test the hypothesis that ultra-rare mutations in microRNA collectively contribute to the risk of schizophrenia. Here we provide the first association of microRNA gene dysfunction with schizophrenia. Eight ultra-rare variants in the precursor or mature miRNA were identified in eight distinct miRNA genes in 4% of analyzed males with schizophrenia. One ultra-rare variant was identified in a control sample (with a history of depression (8/193 versus 1/191, p = 0.02 by one-sided Fisher's exact test, odds ratio = 8.2. These variants were not found in an additional 7,197 control X-chromosomes.Functional analyses of ectopically expressed copies of the variant miRNA precursors demonstrate loss of function, gain of function or altered expression levels. While confirmation is required, this study suggests that microRNA mutations can contribute to schizophrenia.

  20. INDIGO - INtegrated data warehouse of microbial genomes with examples from the red sea extremophiles.

    Directory of Open Access Journals (Sweden)

    Intikhab Alam

    Full Text Available The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes.We developed a data warehouse system (INDIGO that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO offers an opportunity to construct complex queries and combine annotations from multiple sources starting from genomic sequence to protein domain, gene ontology and pathway levels. This data warehouse is aimed at being populated with information from genomes of pure cultures and uncultured single cells of Red Sea bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea - extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of utilizing the system to gain new insights into specific aspects on the unique lifestyle and adaptations of these organisms to extreme environments.We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources to be used for annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes. It is aimed to serve as a resource dedicated to the Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo.

  1. Evaluating Functional Annotations of Enzymes Using the Gene Ontology.

    Science.gov (United States)

    Holliday, Gemma L; Davidson, Rebecca; Akiva, Eyal; Babbitt, Patricia C

    2017-01-01

    The Gene Ontology (GO) (Ashburner et al., Nat Genet 25(1):25-29, 2000) is a powerful tool in the informatics arsenal of methods for evaluating annotations in a protein dataset. From identifying the nearest well annotated homologue of a protein of interest to predicting where misannotation has occurred to knowing how confident you can be in the annotations assigned to those proteins is critical. In this chapter we explore what makes an enzyme unique and how we can use GO to infer aspects of protein function based on sequence similarity. These can range from identification of misannotation or other errors in a predicted function to accurate function prediction for an enzyme of entirely unknown function. Although GO annotation applies to any gene products, we focus here a describing our approach for hierarchical classification of enzymes in the Structure-Function Linkage Database (SFLD) (Akiva et al., Nucleic Acids Res 42(Database issue):D521-530, 2014) as a guide for informed utilisation of annotation transfer based on GO terms.

  2. miRDis: a Web tool for endogenous and exogenous microRNA discovery based on deep-sequencing data analysis.

    Science.gov (United States)

    Zhang, Hanyuan; Vieira Resende E Silva, Bruno; Cui, Juan

    2018-05-01

    Small RNA sequencing is the most widely used tool for microRNA (miRNA) discovery, and shows great potential for the efficient study of miRNA cross-species transport, i.e., by detecting the presence of exogenous miRNA sequences in the host species. Because of the increased appreciation of dietary miRNAs and their far-reaching implication in human health, research interests are currently growing with regard to exogenous miRNAs bioavailability, mechanisms of cross-species transport and miRNA function in cellular biological processes. In this article, we present microRNA Discovery (miRDis), a new small RNA sequencing data analysis pipeline for both endogenous and exogenous miRNA detection. Specifically, we developed and deployed a Web service that supports the annotation and expression profiling data of known host miRNAs and the detection of novel miRNAs, other noncoding RNAs, and the exogenous miRNAs from dietary species. As a proof-of-concept, we analyzed a set of human plasma sequencing data from a milk-feeding study where 225 human miRNAs were detected in the plasma samples and 44 show elevated expression after milk intake. By examining the bovine-specific sequences, data indicate that three bovine miRNAs (bta-miR-378, -181* and -150) are present in human plasma possibly because of the dietary uptake. Further evaluation based on different sets of public data demonstrates that miRDis outperforms other state-of-the-art tools in both detection and quantification of miRNA from either animal or plant sources. The miRDis Web server is available at: http://sbbi.unl.edu/miRDis/index.php.

  3. The Regulatory Roles of MicroRNAs in Bone Remodeling and Perspectives as Biomarkers in Osteoporosis

    Directory of Open Access Journals (Sweden)

    Mengge Sun

    2016-01-01

    Full Text Available MicroRNAs are involved in many cellular and molecular activities and played important roles in many biological and pathological processes, such as tissue formation, cancer development, diabetes, neurodegenerative diseases, and cardiovascular diseases. Recently, it has been reported that microRNAs can modulate the differentiation and activities of osteoblasts and osteoclasts, the key cells that are involved in bone remodeling process. Meanwhile, the results from our and other research groups showed that the expression profiles of microRNAs in the serum and bone tissues are significantly different in postmenopausal women with or without fractures compared to the control. Therefore, it can be postulated that microRNAs might play important roles in bone remodeling and that they are very likely to be involved in the pathological process of postmenopausal osteoporosis. In this review, we will present the updated research on the regulatory roles of microRNAs in osteoblasts and osteoclasts and the expression profiles of microRNAs in osteoporosis and osteoporotic fracture patients. The perspective of serum microRNAs as novel biomarkers in bone loss disorders such as osteoporosis has also been discussed.

  4. MicroRNA-449a deficiency promotes colon carcinogenesis.

    Science.gov (United States)

    Niki, Masanori; Nakajima, Kohei; Ishikawa, Daichi; Nishida, Jun; Ishifune, Chieko; Tsukumo, Shin-Ichi; Shimada, Mitsuo; Nagahiro, Shinji; Mitamura, Yoshinori; Yasutomo, Koji

    2017-09-06

    MicroRNAs have broad roles in tumorigenesis and cell differentiation through regulation of target genes. Notch signaling also controls cell differentiation and tumorigenesis. However, the mechanisms through which Notch mediates microRNA expression are still unclear. In this study, we aimed to identify microRNAs regulated by Notch signaling. Our analysis found that microRNA-449a (miR-449a) was indirectly regulated by Notch signaling. Although miR-449a-deficient mice did not show any Notch-dependent defects in immune cell development, treatment of miR-449a-deficient mice with azoxymethane (AOM) or dextran sodium sulfate (DSS) increased the numbers and sizes of colon tumors. These effects were associated with an increase in intestinal epithelial cell proliferation following AOM/DSS treatment. In patients with colon cancer, miR-449a expression was inversely correlated with disease-free survival and histological scores and was positively correlated with the expression of MLH1 for which loss-of function mutations have been shown to be involved in colon cancer. Colon tissues of miR-449a-deficient mice showed reduced Mlh1 expression compared with those of wild-type mice. Thus, these data suggested that miR-449a acted as a key regulator of colon tumorigenesis by controlling the proliferation of intestinal epithelial cells. Additionally, activation of miR-449a may represent an effective therapeutic strategy and prognostic marker in colon cancer.

  5. AutoFACT: An Automatic Functional Annotation and Classification Tool

    Directory of Open Access Journals (Sweden)

    Lang B Franz

    2005-06-01

    Full Text Available Abstract Background Assignment of function to new molecular sequence data is an essential step in genomics projects. The usual process involves similarity searches of a given sequence against one or more databases, an arduous process for large datasets. Results We present AutoFACT, a fully automated and customizable annotation tool that assigns biologically informative functions to a sequence. Key features of this tool are that it (1 analyzes nucleotide and protein sequence data; (2 determines the most informative functional description by combining multiple BLAST reports from several user-selected databases; (3 assigns putative metabolic pathways, functional classes, enzyme classes, GeneOntology terms and locus names; and (4 generates output in HTML, text and GFF formats for the user's convenience. We have compared AutoFACT to four well-established annotation pipelines. The error rate of functional annotation is estimated to be only between 1–2%. Comparison of AutoFACT to the traditional top-BLAST-hit annotation method shows that our procedure increases the number of functionally informative annotations by approximately 50%. Conclusion AutoFACT will serve as a useful annotation tool for smaller sequencing groups lacking dedicated bioinformatics staff. It is implemented in PERL and runs on LINUX/UNIX platforms. AutoFACT is available at http://megasun.bch.umontreal.ca/Software/AutoFACT.htm.

  6. Clever generation of rich SPARQL queries from annotated relational schema: application to Semantic Web Service creation for biological databases.

    Science.gov (United States)

    Wollbrett, Julien; Larmande, Pierre; de Lamotte, Frédéric; Ruiz, Manuel

    2013-04-15

    In recent years, a large amount of "-omics" data have been produced. However, these data are stored in many different species-specific databases that are managed by different institutes and laboratories. Biologists often need to find and assemble data from disparate sources to perform certain analyses. Searching for these data and assembling them is a time-consuming task. The Semantic Web helps to facilitate interoperability across databases. A common approach involves the development of wrapper systems that map a relational database schema onto existing domain ontologies. However, few attempts have been made to automate the creation of such wrappers. We developed a framework, named BioSemantic, for the creation of Semantic Web Services that are applicable to relational biological databases. This framework makes use of both Semantic Web and Web Services technologies and can be divided into two main parts: (i) the generation and semi-automatic annotation of an RDF view; and (ii) the automatic generation of SPARQL queries and their integration into Semantic Web Services backbones. We have used our framework to integrate genomic data from different plant databases. BioSemantic is a framework that was designed to speed integration of relational databases. We present how it can be used to speed the development of Semantic Web Services for existing relational biological databases. Currently, it creates and annotates RDF views that enable the automatic generation of SPARQL queries. Web Services are also created and deployed automatically, and the semantic annotations of our Web Services are added automatically using SAWSDL attributes. BioSemantic is downloadable at http://southgreen.cirad.fr/?q=content/Biosemantic.

  7. Circulating, Cell-Free Micro-RNA Profiles Reflect Discordant Development of Dementia in Monozygotic Twins

    DEFF Research Database (Denmark)

    Mengel-From, Jonas; Rønne, Mette E; Carlsen, Anting L

    2018-01-01

    We aim to examine if circulating micro-RNA and cytokine levels associate with dementia diagnosis and cognitive scores. To test our hypothesis, we use plasma donated from 48 monozygotic twin pairs in 1997 and 46 micro-RNAs and 10 cytokines were quantified using microfluidic RT-qPCR and multiplex...... solid-phase immunoassays, respectively. Micro-RNA and cytokine profiling were examined for associations with dementia diagnoses in a longitudinal registry study or with cognitive scores at baseline. Thirty-six micro-RNAs and all cytokines were detected consistently. Micro-RNA profiles associate...... with diagnoses and cognitive scores at statistically significant levels while cytokine only showed trends pointing at chronic inflammation in twins having or developing dementia. The most notable findings were decreased miR-106a and miR-210, and increased miR-106b expression in twins with a dementia diagnosis...

  8. Essential Annotation Schema for Ecology (EASE)—A framework supporting the efficient data annotation and faceted navigation in ecology

    Science.gov (United States)

    Eichenberg, David; Liebergesell, Mario; König-Ries, Birgitta; Wirth, Christian

    2017-01-01

    Ecology has become a data intensive science over the last decades which often relies on the reuse of data in cross-experimental analyses. However, finding data which qualifies for the reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only) is the most widely used search strategy although it is known to be inaccurate. Faceted navigation is providing a filter mechanism which is based on fine granular metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters which allows to refine and improve the results towards more relevance. We developed a framework for the efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines. PMID:29023519

  9. IMG 4 version of the integrated microbial genomes comparative analysis system

    Science.gov (United States)

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Pillay, Manoj; Ratner, Anna; Huang, Jinghua; Woyke, Tanja; Huntemann, Marcel; Anderson, Iain; Billis, Konstantinos; Varghese, Neha; Mavromatis, Konstantinos; Pati, Amrita; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2014-01-01

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu). PMID:24165883

  10. IMG 4 version of the integrated microbial genomes comparative analysis system

    Energy Technology Data Exchange (ETDEWEB)

    Markowitz, Victor M. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Chen, I-Min A. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Palaniappan, Krishna [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Chu, Ken [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Szeto, Ernest [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Pillay, Manoj [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Ratner, Anna [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Huang, Jinghua [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Biological Data Management and Technology Center. Computational Research Division; Woyke, Tanja [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Huntemann, Marcel [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Anderson, Iain [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Billis, Konstantinos [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Varghese, Neha [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Mavromatis, Konstantinos [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Pati, Amrita [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Ivanova, Natalia N. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program; Kyrpides, Nikos C. [USDOE Joint Genome Institute (JGI), Walnut Creek, CA (United States). Microbial Genome and Metagenome Program

    2013-10-27

    The Integrated Microbial Genomes (IMG) data warehouse integrates genomes from all three domains of life, as well as plasmids, viruses and genome fragments. IMG provides tools for analyzing and reviewing the structural and functional annotations of genomes in a comparative context. IMG’s data content and analytical capabilities have increased continuously since its first version released in 2005. Since the last report published in the 2012 NAR Database Issue, IMG’s annotation and data integration pipelines have evolved while new tools have been added for recording and analyzing single cell genomes, RNA Seq and biosynthetic cluster data. Finally, different IMG datamarts provide support for the analysis of publicly available genomes (IMG/W: http://img.jgi.doe.gov/w), expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er) and teaching and training in the area of microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu).

  11. MicroRNA-608 and microRNA-34a regulate chordoma malignancy by targeting EGFR, Bcl-xL and MET.

    Directory of Open Access Journals (Sweden)

    Ying Zhang

    Full Text Available Chordomas are rare malignant tumors that originate from the notochord remnants and occur in the skull base, spine and sacrum. Due to a very limited understanding of the molecular pathogenesis of chordoma, there are no adjuvant and molecular therapies besides surgical resection and radiation therapy. microRNAs (miRNAs are small noncoding regulatory RNA molecules with critical roles in cancer. The role of miRNAs in chordomas is mostly unknown. We uncover microRNA-608 (miR-608 and microRNA-34a (miR-34a as novel tumor suppressive microRNAs that regulate malignancy in chordoma. We find that miR-608 and miR-34a expressions are downregulated in human chordoma cell lines and primary cells at least partially via alteration of their genes' copy numbers. We identify the commonly deregulated oncogenes EGFR and Bcl-xL as direct targets of miR-608 and the receptor tyrosine kinase MET as direct target of miR-34a. We show that EGFR and MET activations promote chordoma cell proliferation and invasion and that pharmacological inhibition of EGFR and MET inhibits chordoma cell proliferation and survival. We demonstrate that restoration of miR-608 and miR-34a inhibits cell proliferation and invasion and induces apoptosis in chordoma cells. We find that miR-34a inversely correlates with MET expression and miR-608 inversely correlates with EGFR expression in chordoma cells. These findings demonstrate for the first time that miR-608 and miR-34a regulate chordoma malignancy by regulating EGFR, MET and Bcl-xL.

  12. MicroRNAs and potential target interactions in psoriasis

    DEFF Research Database (Denmark)

    Zibert, John Robert; Løvendorf, Marianne B.; Litman, Thomas

    2010-01-01

    BACKGROUND: Psoriasis is a chronic inflammatory skin disease often seen in patients with a genetic susceptibility. MicroRNAs (miRNA) are endogenous, short RNA molecules that can bind to parts of mRNA target genes, thus inhibiting their translation and causing accelerated turnover or transcript...... degradation. MicroRNAs are important in the pathogenesis of human diseases such as immunological disorders, as they regulate a broad range of biological processes. OBJECTIVE: We investigated miRNA-mRNA interactions in involved (PP) and non-involved (PN) psoriatic skin compared with healthy skin (NN). METHODS...

  13. MicroRNA in oral cancer research: future prospects.

    Science.gov (United States)

    Sarode, Sachin C; Sarode, Gargi S; Patil, Shankargouda

    2014-09-01

    MicroRNA (miRNA) and related therapeutic approaches hold great promise in the field of cancer managements. Various studies on epithelial malignancies have shown encouraging results on various fronts. Its association with invasion, tumor growth, epithelial mesenchymal transition (EMT), angiogenesis, cancer stem cells (CSCs), metastasis and refects the diversified role of miRNA. Moreover, miRNA plays an important role in determining the prognosis of the patients. MicroRNAs interactions with each other and with external factors [human papilloma virus (HPV) (like oncoproteins)] intrigue us to explore more deep into this fascinating world.(1.)

  14. IntelliGO: a new vector-based semantic similarity measure including annotation origin

    Directory of Open Access Journals (Sweden)

    Devignes Marie-Dominique

    2010-12-01

    Full Text Available Abstract Background The Gene Ontology (GO is a well known controlled vocabulary describing the biological process, molecular function and cellular component aspects of gene annotation. It has become a widely used knowledge source in bioinformatics for annotating genes and measuring their semantic similarity. These measures generally involve the GO graph structure, the information content of GO aspects, or a combination of both. However, only a few of the semantic similarity measures described so far can handle GO annotations differently according to their origin (i.e. their evidence codes. Results We present here a new semantic similarity measure called IntelliGO which integrates several complementary properties in a novel vector space model. The coefficients associated with each GO term that annotates a given gene or protein include its information content as well as a customized value for each type of GO evidence code. The generalized cosine similarity measure, used for calculating the dot product between two vectors, has been rigorously adapted to the context of the GO graph. The IntelliGO similarity measure is tested on two benchmark datasets consisting of KEGG pathways and Pfam domains grouped as clans, considering the GO biological process and molecular function terms, respectively, for a total of 683 yeast and human genes and involving more than 67,900 pair-wise comparisons. The ability of the IntelliGO similarity measure to express the biological cohesion of sets of genes compares favourably to four existing similarity measures. For inter-set comparison, it consistently discriminates between distinct sets of genes. Furthermore, the IntelliGO similarity measure allows the influence of weights assigned to evidence codes to be checked. Finally, the results obtained with a complementary reference technique give intermediate but correct correlation values with the sequence similarity, Pfam, and Enzyme classifications when compared to

  15. Integrated analyses of microRNAs demonstrate their widespread influence on gene expression in high-grade serous ovarian carcinoma.

    Science.gov (United States)

    Creighton, Chad J; Hernandez-Herrera, Anadulce; Jacobsen, Anders; Levine, Douglas A; Mankoo, Parminder; Schultz, Nikolaus; Du, Ying; Zhang, Yiqun; Larsson, Erik; Sheridan, Robert; Xiao, Weimin; Spellman, Paul T; Getz, Gad; Wheeler, David A; Perou, Charles M; Gibbs, Richard A; Sander, Chris; Hayes, D Neil; Gunaratne, Preethi H

    2012-01-01

    The Cancer Genome Atlas (TCGA) Network recently comprehensively catalogued the molecular aberrations in 487 high-grade serous ovarian cancers, with much remaining to be elucidated regarding the microRNAs (miRNAs). Here, using TCGA ovarian data, we surveyed the miRNAs, in the context of their predicted gene targets. Integration of miRNA and gene patterns yielded evidence that proximal pairs of miRNAs are processed from polycistronic primary transcripts, and that intronic miRNAs and their host gene mRNAs derive from common transcripts. Patterns of miRNA expression revealed multiple tumor subtypes and a set of 34 miRNAs predictive of overall patient survival. In a global analysis, miRNA:mRNA pairs anti-correlated in expression across tumors showed a higher frequency of in silico predicted target sites in the mRNA 3'-untranslated region (with less frequency observed for coding sequence and 5'-untranslated regions). The miR-29 family and predicted target genes were among the most strongly anti-correlated miRNA:mRNA pairs; over-expression of miR-29a in vitro repressed several anti-correlated genes (including DNMT3A and DNMT3B) and substantially decreased ovarian cancer cell viability. This study establishes miRNAs as having a widespread impact on gene expression programs in ovarian cancer, further strengthening our understanding of miRNA biology as it applies to human cancer. As with gene transcripts, miRNAs exhibit high diversity reflecting the genomic heterogeneity within a clinically homogeneous disease population. Putative miRNA:mRNA interactions, as identified using integrative analysis, can be validated. TCGA data are a valuable resource for the identification of novel tumor suppressive miRNAs in ovarian as well as other cancers.

  16. ONEMercury: Towards Automatic Annotation of Earth Science Metadata

    Science.gov (United States)

    Tuarob, S.; Pouchard, L. C.; Noy, N.; Horsburgh, J. S.; Palanisamy, G.

    2012-12-01

    Earth sciences have become more data-intensive, requiring access to heterogeneous data collected from multiple places, times, and thematic scales. For example, research on climate change may involve exploring and analyzing observational data such as the migration of animals and temperature shifts across the earth, as well as various model-observation inter-comparison studies. Recently, DataONE, a federated data network built to facilitate access to and preservation of environmental and ecological data, has come to exist. ONEMercury has recently been implemented as part of the DataONE project to serve as a portal for discovering and accessing environmental and observational data across the globe. ONEMercury harvests metadata from the data hosted by multiple data repositories and makes it searchable via a common search interface built upon cutting edge search engine technology, allowing users to interact with the system, intelligently filter the search results on the fly, and fetch the data from distributed data sources. Linking data from heterogeneous sources always has a cost. A problem that ONEMercury faces is the different levels of annotation in the harvested metadata records. Poorly annotated records tend to be missed during the search process as they lack meaningful keywords. Furthermore, such records would not be compatible with the advanced search functionality offered by ONEMercury as the interface requires a metadata record be semantically annotated. The explosion of the number of metadata records harvested from an increasing number of data repositories makes it impossible to annotate the harvested records manually, urging the need for a tool capable of automatically annotating poorly curated metadata records. In this paper, we propose a topic-model (TM) based approach for automatic metadata annotation. Our approach mines topics in the set of well annotated records and suggests keywords for poorly annotated records based on topic similarity. We utilize the

  17. Annotated bibliography of structural equation modelling: technical work.

    Science.gov (United States)

    Austin, J T; Wolfle, L M

    1991-05-01

    Researchers must be familiar with a variety of source literature to facilitate the informed use of structural equation modelling. Knowledge can be acquired through the study of an expanding literature found in a diverse set of publishing forums. We propose that structural equation modelling publications can be roughly classified into two groups: (a) technical and (b) substantive applications. Technical materials focus on the procedures rather than substantive conclusions derived from applications. The focus of this article is the former category; included are foundational/major contributions, minor contributions, critical and evaluative reviews, integrations, simulations and computer applications, precursor and historical material, and pedagogical textbooks. After a brief introduction, we annotate 294 articles in the technical category dating back to Sewall Wright (1921).

  18. A Selected Annotated Bibliography on Work Time Options.

    Science.gov (United States)

    Ivantcho, Barbara

    This annotated bibliography is divided into three sections. Section I contains annotations of general publications on work time options. Section II presents resources on flexitime and the compressed work week. In Section III are found resources related to these reduced work time options: permanent part-time employment, job sharing, voluntary…

  19. MicroRNAs play big roles in modulating macrophages response toward mycobacteria infection.

    Science.gov (United States)

    Abdalla, Abualgasim Elgaili; Duan, Xiangke; Deng, Wanyan; Zeng, Jie; Xie, Jianping

    2016-11-01

    Macrophages are crucial player in the defense against multiple intracellular pathogens. Mycobacterium tuberculosis, the causative agent of tuberculosis which inflicted around one third of global population, can replicate and persist within macrophages. MicroRNAs, endogenous, small noncoding RNA, can regulate the expression of macrophages genes required for appropriate signaling. Mycobacteria can manipulate the expression of macrophages microRNAs to subvert cell response for its survival and persistence. This review summarized the progress of microRNAs in mycobacterial pathogenesis. Copyright © 2016 Elsevier B.V. All rights reserved.

  20. RCAS: an RNA centric annotation system for transcriptome-wide regions of interest.

    Science.gov (United States)

    Uyar, Bora; Yusuf, Dilmurat; Wurmus, Ricardo; Rajewsky, Nikolaus; Ohler, Uwe; Akalin, Altuna

    2017-06-02

    In the field of RNA, the technologies for studying the transcriptome have created a tremendous potential for deciphering the puzzles of the RNA biology. Along with the excitement, the unprecedented volume of RNA related omics data is creating great challenges in bioinformatics analyses. Here, we present the RNA Centric Annotation System (RCAS), an R package, which is designed to ease the process of creating gene-centric annotations and analysis for the genomic regions of interest obtained from various RNA-based omics technologies. The design of RCAS is modular, which enables flexible usage and convenient integration with other bioinformatics workflows. RCAS is an R/Bioconductor package but we also created graphical user interfaces including a Galaxy wrapper and a stand-alone web service. The application of RCAS on published datasets shows that RCAS is not only able to reproduce published findings but also helps generate novel knowledge and hypotheses. The meta-gene profiles, gene-centric annotation, motif analysis and gene-set analysis provided by RCAS provide contextual knowledge which is necessary for understanding the functional aspects of different biological events that involve RNAs. In addition, the array of different interfaces and deployment options adds the convenience of use for different levels of users. RCAS is available at http://bioconductor.org/packages/release/bioc/html/RCAS.html and http://rcas.mdc-berlin.de. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Annotated bibliography of the physical data of Rainier Mesa and Yucca Mountain

    International Nuclear Information System (INIS)

    Russell, C.E.

    1988-09-01

    Yucca Mountain, located on and adjacent to the Nevada Test Site (NTS) has been designated as the only site to undergo characterization to determine if it meets the criteria to become the Nation's first high-level nuclear waste repository. During this process, care must be taken to not compromise the site's integrity through excessive testing. In order to supplement the limited data to be gathered at Yucca Mountain, analog areas are to be considered. This annotated bibliography was compiled by the Desert Research Institute to help investigate ways in which Rainier Mesa could either be used as a supplemental repository test site or where existing Rainier Mesa data can be used either to support or refute test results from Yucca Mountain. Rainier Mesa, the location of numerous underground nuclear tests on the NTS, possesses some geologic characteristics similar to those of Yucca Mountain, which makes it a likely candidate for comparison. Almost 500 references regarding geology, hydrology, meteorology, biology, and archaeology were annotated and entered alpha-numerically into the bibliography. These references were categorized into 50 topics which are defined in Section 2 and presented in Section 3. Each reference is categorized as to whether it contains Yucca Mountain data, Rainier Mesa data, or both, and a final category consists of those reports that contain Rainier Mesa data that have already been applied to Yucca Mountain research. The annotated bibliography is presented in Section 4

  2. Sequence- and Structure-Based Functional Annotation and Assessment of Metabolic Transporters in Aspergillus oryzae: A Representative Case Study

    Directory of Open Access Journals (Sweden)

    Nachon Raethong

    2016-01-01

    Full Text Available Aspergillus oryzae is widely used for the industrial production of enzymes. In A. oryzae metabolism, transporters appear to play crucial roles in controlling the flux of molecules for energy generation, nutrients delivery, and waste elimination in the cell. While the A. oryzae genome sequence is available, transporter annotation remains limited and thus the connectivity of metabolic networks is incomplete. In this study, we developed a metabolic annotation strategy to understand the relationship between the sequence, structure, and function for annotation of A. oryzae metabolic transporters. Sequence-based analysis with manual curation showed that 58 genes of 12,096 total genes in the A. oryzae genome encoded metabolic transporters. Under consensus integrative databases, 55 unambiguous metabolic transporter genes were distributed into channels and pores (7 genes, electrochemical potential-driven transporters (33 genes, and primary active transporters (15 genes. To reveal the transporter functional role, a combination of homology modeling and molecular dynamics simulation was implemented to assess the relationship between sequence to structure and structure to function. As in the energy metabolism of A. oryzae, the H+-ATPase encoded by the AO090005000842 gene was selected as a representative case study of multilevel linkage annotation. Our developed strategy can be used for enhancing metabolic network reconstruction.

  3. Sequence- and Structure-Based Functional Annotation and Assessment of Metabolic Transporters in Aspergillus oryzae: A Representative Case Study.

    Science.gov (United States)

    Raethong, Nachon; Wong-Ekkabut, Jirasak; Laoteng, Kobkul; Vongsangnak, Wanwipa

    2016-01-01

    Aspergillus oryzae is widely used for the industrial production of enzymes. In A. oryzae metabolism, transporters appear to play crucial roles in controlling the flux of molecules for energy generation, nutrients delivery, and waste elimination in the cell. While the A. oryzae genome sequence is available, transporter annotation remains limited and thus the connectivity of metabolic networks is incomplete. In this study, we developed a metabolic annotation strategy to understand the relationship between the sequence, structure, and function for annotation of A. oryzae metabolic transporters. Sequence-based analysis with manual curation showed that 58 genes of 12,096 total genes in the A. oryzae genome encoded metabolic transporters. Under consensus integrative databases, 55 unambiguous metabolic transporter genes were distributed into channels and pores (7 genes), electrochemical potential-driven transporters (33 genes), and primary active transporters (15 genes). To reveal the transporter functional role, a combination of homology modeling and molecular dynamics simulation was implemented to assess the relationship between sequence to structure and structure to function. As in the energy metabolism of A. oryzae, the H(+)-ATPase encoded by the AO090005000842 gene was selected as a representative case study of multilevel linkage annotation. Our developed strategy can be used for enhancing metabolic network reconstruction.

  4. The Role of microRNAs in the Biology of Rare Diseases

    Directory of Open Access Journals (Sweden)

    Domenica Taruscio

    2011-10-01

    Full Text Available Rare diseases (RD are characterized by low prevalence and affect not more than five individuals per 10,000 in the European population; they are a large and heterogeneous group of disorders including more than 7,000 conditions and often involve all organs and tissues, with several clinical subtypes within the same disease. Very often information concerning either diagnosis and/or prognosis on many RD is insufficient. microRNAs are a class of small non-coding RNAs that regulate gene expression at the posttranscriptional level by either degrading or blocking translation of messenger RNA targets. Recently, microRNA expression patterns of body fluids underscored their potential as noninvasive biomarkers for various diseases. The role of microRNAs as potential biomarkers has become particularly attractive. The identification of disease-related microRNAs is essential for understanding the pathogenesis of diseases at the molecular level, and is critical for designing specific molecular tools for diagnosis, treatment and prevention. Computational analysis of microRNA-disease associations is an important complementary means for prioritizing microRNAs for further experimental examination. In this article, we explored the added value of miRs as biomarkers in a selected panel of RD hitting different tissues/systems at different life stages, but sharing the need of better biomarkers for diagnostic and prognostic purposes.

  5. An Integrated Architecture for Engineering Problem Solving

    National Research Council Canada - National Science Library

    Pisan, Yusuf

    1998-01-01

    .... This thesis describes the Integrated Problem Solving Architecture (IPSA) that combines qualitative, quantitative and diagrammatic reasoning skills to produce annotated solutions to engineering problems...

  6. At Issue: Academic Integrity, an Annotated Bibliography

    Science.gov (United States)

    Pricer, Wayne F.

    2009-01-01

    Academic integrity is central to the heart of any academic institution, yet the topic is a complex one. This bibliography addresses the subjects of copyright and plagiarism. Resources for exploring common campus copyright and fair use issues seek to answer common, frequently misunderstood questions such as what exactly does "copyright" mean? What…

  7. Prepare-Participate-Connect: Active Learning with Video Annotation

    Science.gov (United States)

    Colasante, Meg; Douglas, Kathy

    2016-01-01

    Annotation of video provides students with the opportunity to view and engage with audiovisual content in an interactive and participatory way rather than in passive-receptive mode. This article discusses research into the use of video annotation in four vocational programs at RMIT University in Melbourne, which allowed students to interact with…

  8. Developing Annotation Solutions for Online Data Driven Learning

    Science.gov (United States)

    Perez-Paredes, Pascual; Alcaraz-Calero, Jose M.

    2009-01-01

    Although "annotation" is a widely-researched topic in Corpus Linguistics (CL), its potential role in Data Driven Learning (DDL) has not been addressed in depth by Foreign Language Teaching (FLT) practitioners. Furthermore, most of the research in the use of DDL methods pays little attention to annotation in the design and implementation…

  9. Re-annotation and re-analysis of the Campylobacter jejuni NCTC11168 genome sequence

    Directory of Open Access Journals (Sweden)

    Dorrell Nick

    2007-06-01

    Full Text Available Abstract Background Campylobacter jejuni is the leading bacterial cause of human gastroenteritis in the developed world. To improve our understanding of this important human pathogen, the C. jejuni NCTC11168 genome was sequenced and published in 2000. The original annotation was a milestone in Campylobacter research, but is outdated. We now describe the complete re-annotation and re-analysis of the C. jejuni NCTC11168 genome using current database information, novel tools and annotation techniques not used during the original annotation. Results Re-annotation was carried out using sequence database searches such as FASTA, along with programs such as TMHMM for additional support. The re-annotation also utilises sequence data from additional Campylobacter strains and species not available during the original annotation. Re-annotation was accompanied by a full literature search that was incorporated into the updated EMBL file [EMBL: AL111168]. The C. jejuni NCTC11168 re-annotation reduced the total number of coding sequences from 1654 to 1643, of which 90.0% have additional information regarding the identification of new motifs and/or relevant literature. Re-annotation has led to 18.2% of coding sequence product functions being revised. Conclusions Major updates were made to genes involved in the biosynthesis of important surface structures such as lipooligosaccharide, capsule and both O- and N-linked glycosylation. This re-annotation will be a key resource for Campylobacter research and will also provide a prototype for the re-annotation and re-interpretation of other bacterial genomes.

  10. Comparison of concept recognizers for building the Open Biomedical Annotator

    Directory of Open Access Journals (Sweden)

    Rubin Daniel

    2009-09-01

    Full Text Available Abstract The National Center for Biomedical Ontology (NCBO is developing a system for automated, ontology-based access to online biomedical resources (Shah NH, et al.: Ontology-driven indexing of public datasets for translational bioinformatics. BMC Bioinformatics 2009, 10(Suppl 2:S1. The system's indexing workflow processes the text metadata of diverse resources such as datasets from GEO and ArrayExpress to annotate and index them with concepts from appropriate ontologies. This indexing requires the use of a concept-recognition tool to identify ontology concepts in the resource's textual metadata. In this paper, we present a comparison of two concept recognizers – NLM's MetaMap and the University of Michigan's Mgrep. We utilize a number of data sources and dictionaries to evaluate the concept recognizers in terms of precision, recall, speed of execution, scalability and customizability. Our evaluations demonstrate that Mgrep has a clear edge over MetaMap for large-scale service oriented applications. Based on our analysis we also suggest areas of potential improvements for Mgrep. We have subsequently used Mgrep to build the Open Biomedical Annotator service. The Annotator service has access to a large dictionary of biomedical terms derived from the United Medical Language System (UMLS and NCBO ontologies. The Annotator also leverages the hierarchical structure of the ontologies and their mappings to expand annotations. The Annotator service is available to the community as a REST Web service for creating ontology-based annotations of their data.

  11. Effects of Reviewing Annotations and Homework Solutions on Math Learning Achievement

    Science.gov (United States)

    Hwang, Wu-Yuin; Chen, Nian-Shing; Shadiev, Rustam; Li, Jin-Sing

    2011-01-01

    Previous studies have demonstrated that making annotations can be a meaningful and useful learning method that promote metacognition and enhance learning achievement. A web-based annotation system, Virtual Pen (VPEN), which provides for the creation and review of annotations and homework solutions, has been developed to foster learning process…

  12. microRNAs: Implications for air pollution research

    International Nuclear Information System (INIS)

    Jardim, Melanie J.

    2011-01-01

    The purpose of this review is to provide an update of the current understanding on the role of microRNAs in mediating genetic responses to air pollutants and to contemplate on how these responses ultimately control susceptibility to ambient air pollution. Morbidity and mortality attributable to air pollution continues to be a growing public health concern worldwide. Despite several studies on the health effects of ambient air pollution, underlying molecular mechanisms of susceptibility and disease remain elusive. In the last several years, special attention has been given to the role of epigenetics in mediating, not only genetic and physiological responses to certain environmental insults, but also in regulating underlying susceptibility to environmental stressors. Epigenetic mechanisms control the expression of gene products, both basally and as a response to a perturbation, without affecting the sequence of DNA itself. These mechanisms include structural regulation of the chromatin structure, such as DNA methylation and histone modifications, and post-transcriptional gene regulation, such as microRNA mediated repression of gene expression. microRNAs are small noncoding RNAs that have been quickly established as key regulators of gene expression. As such, miRNAs have been found to control several cellular processes including apoptosis, proliferation and differentiation. More recently, research has emerged suggesting that changes in the expression of some miRNAs may be critical for mediating biological, and ultimately physiological, responses to air pollutants. Although the study of microRNAs, and epigenetics as a whole, has come quite far in the field of cancer, the understanding of how these mechanisms regulate gene–environment interactions to environmental exposures in everyday life is unclear. This article does not necessarily reflect the views and policies of the US EPA.

  13. microRNAs: Implications for air pollution research

    Energy Technology Data Exchange (ETDEWEB)

    Jardim, Melanie J., E-mail: melaniejardim@gmail.com [Environmental Public Health Division, National Health and Environmental Effects Research Laboratory, United States Environmental Protection Agency, Chapel Hill, NC (United States)

    2011-12-01

    The purpose of this review is to provide an update of the current understanding on the role of microRNAs in mediating genetic responses to air pollutants and to contemplate on how these responses ultimately control susceptibility to ambient air pollution. Morbidity and mortality attributable to air pollution continues to be a growing public health concern worldwide. Despite several studies on the health effects of ambient air pollution, underlying molecular mechanisms of susceptibility and disease remain elusive. In the last several years, special attention has been given to the role of epigenetics in mediating, not only genetic and physiological responses to certain environmental insults, but also in regulating underlying susceptibility to environmental stressors. Epigenetic mechanisms control the expression of gene products, both basally and as a response to a perturbation, without affecting the sequence of DNA itself. These mechanisms include structural regulation of the chromatin structure, such as DNA methylation and histone modifications, and post-transcriptional gene regulation, such as microRNA mediated repression of gene expression. microRNAs are small noncoding RNAs that have been quickly established as key regulators of gene expression. As such, miRNAs have been found to control several cellular processes including apoptosis, proliferation and differentiation. More recently, research has emerged suggesting that changes in the expression of some miRNAs may be critical for mediating biological, and ultimately physiological, responses to air pollutants. Although the study of microRNAs, and epigenetics as a whole, has come quite far in the field of cancer, the understanding of how these mechanisms regulate gene-environment interactions to environmental exposures in everyday life is unclear. This article does not necessarily reflect the views and policies of the US EPA.

  14. MoFi: A Software Tool for Annotating Glycoprotein Mass Spectra by Integrating Hybrid Data from the Intact Protein and Glycopeptide Level.

    Science.gov (United States)

    Skala, Wolfgang; Wohlschlager, Therese; Senn, Stefan; Huber, Gabriel E; Huber, Christian G

    2018-04-18

    Hybrid mass spectrometry (MS) is an emerging technique for characterizing glycoproteins, which typically display pronounced microheterogeneity. Since hybrid MS combines information from different experimental levels, it crucially depends on computational methods. Here, we describe a novel software tool, MoFi, which integrates hybrid MS data to assign glycans and other post-translational modifications (PTMs) in deconvoluted mass spectra of intact proteins. Its two-stage search algorithm first assigns monosaccharide/PTM compositions to each peak and then compiles a hierarchical list of glycan combinations compatible with these compositions. Importantly, the program only includes those combinations which are supported by a glycan library as derived from glycopeptide or released glycan analysis. By applying MoFi to mass spectra of rituximab, ado-trastuzumab emtansine, and recombinant human erythropoietin, we demonstrate how integration of bottom-up data may be used to refine information collected at the intact protein level. Accordingly, our software reveals that a single mass frequently can be explained by a considerable number of glycoforms. Yet, it simultaneously ranks proteoforms according to their probability, based on a score which is calculated from relative glycan abundances. Notably, glycoforms that comprise identical glycans may nevertheless differ in score if those glycans occupy different sites. Hence, MoFi exposes different layers of complexity that are present in the annotation of a glycoprotein mass spectrum.

  15. Specific and Novel microRNAs Are Regulated as Response to Fungal Infection in Human Dendritic Cells

    Science.gov (United States)

    Dix, Andreas; Czakai, Kristin; Leonhardt, Ines; Schäferhoff, Karin; Bonin, Michael; Guthke, Reinhard; Einsele, Hermann; Kurzai, Oliver; Löffler, Jürgen; Linde, Jörg

    2017-01-01

    Within the last two decades, the incidence of invasive fungal infections has been significantly increased. They are characterized by high mortality rates and are often caused by Candida albicans and Aspergillus fumigatus. The increasing number of infections underlines the necessity for additional anti-fungal therapies, which require extended knowledge of gene regulations during fungal infection. MicroRNAs are regulators of important cellular processes, including the immune response. By analyzing their regulation and impact on target genes, novel therapeutic and diagnostic approaches may be developed. Here, we examine the role of microRNAs in human dendritic cells during fungal infection. Dendritic cells represent the bridge between the innate and the adaptive immune systems. Therefore, analysis of gene regulation of dendritic cells is of particular significance. By applying next-generation sequencing of small RNAs, we quantify microRNA expression in monocyte-derived dendritic cells after 6 and 12 h of infection with C. albicans and A. fumigatus as well as treatment with lipopolysaccharides (LPS). We identified 26 microRNAs that are differentially regulated after infection by the fungi or LPS. Three and five of them are specific for fungal infections after 6 and 12 h, respectively. We further validated interactions of miR-132-5p and miR-212-5p with immunological relevant target genes, such as FKBP1B, KLF4, and SPN, on both RNA and protein level. Our results indicate that these microRNAs fine-tune the expression of immune-related target genes during fungal infection. Beyond that, we identified previously undiscovered microRNAs. We validated three novel microRNAs via qRT-PCR. A comparison with known microRNAs revealed possible relations with the miR-378 family and miR-1260a/b for two of them, while the third one features a unique sequence with no resemblance to known microRNAs. In summary, this study analyzes the effect of known microRNAs in dendritic cells during

  16. Clinical applications of microRNAs [v1; ref status: indexed, http://f1000r.es/14d

    Directory of Open Access Journals (Sweden)

    Per Hydbring

    2013-06-01

    Full Text Available MicroRNAs represent a class of small RNAs derived from polymerase II controlled transcriptional regions. The primary transcript forms one or several bulging double stranded hairpins which are processed by Drosha and Dicer into hetero-duplexes. The targeting microRNA strand of the duplex is incorporated into the RNA Induced Silencing Complex from where it silences up to hundreds of mRNA transcript by inducing mRNA degradation or blocking protein translation. Apart from involvement in a variety of biological processes, microRNAs were early recognized for their potential in disease diagnostics and therapeutics. Due to their stability, microRNAs could be used as biomarkers. Currently, there are microRNA panels helping physicians determining the origins of cancer in disseminated tumors. The development of microRNA therapeutics has proved more challenging mainly due to delivery issues. However, one drug is already in clinical trials and several more await entering clinical phases. This review summarizes what has been recognized pre-clinically and clinically on diagnostic microRNAs. In addition, it highlights individual microRNA drugs in running platforms driven by four leading microRNA-therapeutic companies.

  17. A Set of Annotation Interfaces for Alignment of Parallel Corpora

    Directory of Open Access Journals (Sweden)

    Singh Anil Kumar

    2014-09-01

    Full Text Available Annotation interfaces for parallel corpora which fit in well with other tools can be very useful. We describe a set of annotation interfaces which fulfill this criterion. This set includes a sentence alignment interface, two different word or word group alignment interfaces and an initial version of a parallel syntactic annotation alignment interface. These tools can be used for manual alignment, or they can be used to correct automatic alignments. Manual alignment can be performed in combination with certain kinds of linguistic annotation. Most of these interfaces use a representation called the Shakti Standard Format that has been found to be very robust and has been used for large and successful projects. It ties together the different interfaces, so that the data created by them is portable across all tools which support this representation. The existence of a query language for data stored in this representation makes it possible to build tools that allow easy search and modification of annotated parallel data.

  18. "Annotated Lectures": Student-Instructor Interaction in Large-Scale Global Education

    Directory of Open Access Journals (Sweden)

    Roger Diehl

    2009-10-01

    Full Text Available We describe an "Annotated Lectures" system, which will be used in a global virtual teaching and student collaboration event on embodied intelligence presented by the University of Zurich. The lectures will be broadcasted via video-conference to lecture halls of different universities around the globe. Among other collaboration features, an "Annotated Lectures" system will be implemented in a 3D collaborative virtual environment and used by the participating students to make annotations to the video-recorded lectures, which will be sent to and answered by their supervisors, and forwarded to the lecturers in an aggregated way. The "Annotated Lectures" system aims to overcome the issues of limited studentinstructor interaction in large-scale education, and to foster an intercultural and multidisciplinary discourse among students who review the lectures in a group. After presenting the concept of the "Annotated Lectures" system, we discuss a prototype version including a description of the technical components and its expected benefit for large-scale global education.

  19. A novel serum microRNA panel to discriminate benign from malignant ovarian disease.

    LENUS (Irish Health Repository)

    Langhe, Ream

    2015-01-28

    Ovarian cancer is the seventh most common cancer in women and the most frequent cause of gynaecological malignancy-related mortality in women. Currently, no standardized reliable screening test exists. MicroRNA profiling has allowed the identification of signatures associated with diagnosis, prognosis and response to treatment of human tumours. The aim of this study was to determine if a microRNA signature could distinguish between malignant and benign ovarian disease. A training set of 5 serous ovarian carcinomas and 5 benign serous cystadenomas were selected for the initial experiments. The validation set included 20 serous ovarian carcinomas and 20 benign serous cystadenomas. The serum\\/plasma focus microRNA Exiqon panel was used for the training set. For the validation set a pick and mix Exiqon panel, which focuses on microRNAs of interest was used. A panel of 4 microRNAs (let-7i-5p, miR-122, miR-152-5p and miR-25-3p) was significantly down regulated in cancer patients. These microRNAs target WNT signalling, AKT\\/mTOR and TLR-4\\/MyD88, which have previously been found to play a role in ovarian carcinogenesis and chemoresistance. let-7i-5p, miR-122, miR-152-5p and miR-25-3p could act as diagnostic biomarkers in ovarian cancer.

  20. PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3' UTRs and coding sequences.

    Science.gov (United States)

    Šulc, Miroslav; Marín, Ray M; Robins, Harlan S; Vaníček, Jiří

    2015-07-01

    The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3' untranslated regions (3' UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3' UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA-mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA-mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Annotation an effective device for student feedback: a critical review of the literature.

    Science.gov (United States)

    Ball, Elaine C

    2010-05-01

    The paper examines hand-written annotation, its many features, difficulties and strengths as a feedback tool. It extends and clarifies what modest evidence is in the public domain and offers an evaluation of how to use annotation effectively in the support of student feedback [Marshall, C.M., 1998a. The Future of Annotation in a Digital (paper) World. Presented at the 35th Annual GLSLIS Clinic: Successes and Failures of Digital Libraries, June 20-24, University of Illinois at Urbana-Champaign, March 24, pp. 1-20; Marshall, C.M., 1998b. Toward an ecology of hypertext annotation. Hypertext. In: Proceedings of the Ninth ACM Conference on Hypertext and Hypermedia, June 20-24, Pittsburgh Pennsylvania, US, pp. 40-49; Wolfe, J.L., Nuewirth, C.M., 2001. From the margins to the centre: the future of annotation. Journal of Business and Technical Communication, 15(3), 333-371; Diyanni, R., 2002. One Hundred Great Essays. Addison-Wesley, New York; Wolfe, J.L., 2002. Marginal pedagogy: how annotated texts affect writing-from-source texts. Written Communication, 19(2), 297-333; Liu, K., 2006. Annotation as an index to critical writing. Urban Education, 41, 192-207; Feito, A., Donahue, P., 2008. Minding the gap annotation as preparation for discussion. Arts and Humanities in Higher Education, 7(3), 295-307; Ball, E., 2009. A participatory action research study on handwritten annotation feedback and its impact on staff and students. Systemic Practice and Action Research, 22(2), 111-124; Ball, E., Franks, H., McGrath, M., Leigh, J., 2009. Annotation is a valuable tool to enhance learning and assessment in student essays. Nurse Education Today, 29(3), 284-291]. Although a significant number of studies examine annotation, this is largely related to on-line tools and computer mediated communication and not hand-written annotation as comment, phrase or sign written on the student essay to provide critique. Little systematic research has been conducted to consider how this latter form

  2. BioCause: Annotating and analysing causality in the biomedical domain.

    Science.gov (United States)

    Mihăilă, Claudiu; Ohta, Tomoko; Pyysalo, Sampo; Ananiadou, Sophia

    2013-01-16

    Biomedical corpora annotated with event-level information represent an important resource for domain-specific information extraction (IE) systems. However, bio-event annotation alone cannot cater for all the needs of biologists. Unlike work on relation and event extraction, most of which focusses on specific events and named entities, we aim to build a comprehensive resource, covering all statements of causal association present in discourse. Causality lies at the heart of biomedical knowledge, such as diagnosis, pathology or systems biology, and, thus, automatic causality recognition can greatly reduce the human workload by suggesting possible causal connections and aiding in the curation of pathway models. A biomedical text corpus annotated with such relations is, hence, crucial for developing and evaluating biomedical text mining. We have defined an annotation scheme for enriching biomedical domain corpora with causality relations. This schema has subsequently been used to annotate 851 causal relations to form BioCause, a collection of 19 open-access full-text biomedical journal articles belonging to the subdomain of infectious diseases. These documents have been pre-annotated with named entity and event information in the context of previous shared tasks. We report an inter-annotator agreement rate of over 60% for triggers and of over 80% for arguments using an exact match constraint. These increase significantly using a relaxed match setting. Moreover, we analyse and describe the causality relations in BioCause from various points of view. This information can then be leveraged for the training of automatic causality detection systems. Augmenting named entity and event annotations with information about causal discourse relations could benefit the development of more sophisticated IE systems. These will further influence the development of multiple tasks, such as enabling textual inference to detect entailments, discovering new facts and providing new

  3. Dysregulation of serum microRNA-574-3p and its clinical significance in hepatocellular carcinoma.

    Science.gov (United States)

    Shen, Xianjuan; Xue, Yajing; Cong, Hui; Wang, Xudong; Ju, Shaoqing

    2018-07-01

    Objectives To explore microRNA-574-3p expression in serum of patients with hepatocellular carcinoma and investigate correlations between serum microRNA-574-3p expression and the development and prognosis of hepatocellular carcinoma. Design and methods Serum samples were collected from 70 patients with primary hepatocellular carcinoma, 40 patients with cirrhosis and 45 healthy controls. Serum microRNA-574-3p expression levels were detected by real-time quantitative polymerase chain reaction. The linearity, specificity and reproducibility were evaluated. In addition, the diagnostic value of microRNA-574-3p and its correlations with clinicopathologic features were assessed. Results The relative expression of microRNA-574-3p in hepatocellular carcinoma patients, cirrhosis patients and healthy controls was 2.306 (1.801-3.130), 1.362 (0.994-1.665) and 1.263 (0.765-1.723), respectively, indicating that it was significantly higher in hepatocellular carcinoma patients than that in the other two groups ( U = 439.5, 514.5, both P hepatocellular carcinoma patients, the relative expression of microRNA-574-3p was significantly correlated with hepatitis B virus DNA concentration ( r = 0.348, P = 0.022). Compared with healthy control group, AUC ROC of serum microRNA-574-3p in hepatocellular carcinoma group was 0.837 with 95% CI: 0.763-0.910. Combining microRNA-574-3p, AFU and alpha-fetoprotein together, the sensitivity was highest compared with other markers alone or combined. Conclusions The relative expression of serum microRNA-574-3p in hepatocellular carcinoma patients was significantly higher than that in cirrhosis patients and healthy controls, and it may be an important biomarker in the auxiliary diagnosis of hepatocellular carcinoma.

  4. Elevated levels of circulating microRNA-200 family members correlate with serous epithelial ovarian cancer

    Directory of Open Access Journals (Sweden)

    Kan Casina WS

    2012-12-01

    Full Text Available Abstract Background There is a critical need for improved diagnostic markers for high grade serous epithelial ovarian cancer (SEOC. MicroRNAs are stable in the circulation and may have utility as biomarkers of malignancy. We investigated whether levels of serum microRNA could discriminate women with high-grade SEOC from age matched healthy volunteers. Methods To identify microRNA of interest, microRNA expression profiling was performed on 4 SEOC cell lines and normal human ovarian surface epithelial cells. Total RNA was extracted from 500 μL aliquots of serum collected from patients with SEOC (n = 28 and age-matched healthy donors (n = 28. Serum microRNA levels were assessed by quantitative RT-PCR following preamplification. Results microRNA (miR-182, miR-200a, miR-200b and miR-200c were highly overexpressed in the SEOC cell lines relative to normal human ovarian surface epithelial cells and were assessed in RNA extracted from serum as candidate biomarkers. miR-103, miR-92a and miR -638 had relatively invariant expression across all ovarian cell lines, and with small-nucleolar C/D box 48 (RNU48 were assessed in RNA extracted from serum as candidate endogenous normalizers. No correlation between serum levels and age were observed (age range 30-79 years for any of these microRNA or RNU48. Individually, miR-200a, miR-200b and miR-200c normalized to serum volume and miR-103 were significantly higher in serum of the SEOC cohort (P  Conclusions We identified serum microRNAs able to discriminate patients with high grade SEOC from age-matched healthy controls. The addition of these microRNAs to current testing regimes may improve diagnosis for women with SEOC.

  5. Automatic Function Annotations for Hoare Logic

    Directory of Open Access Journals (Sweden)

    Daniel Matichuk

    2012-11-01

    Full Text Available In systems verification we are often concerned with multiple, inter-dependent properties that a program must satisfy. To prove that a program satisfies a given property, the correctness of intermediate states of the program must be characterized. However, this intermediate reasoning is not always phrased such that it can be easily re-used in the proofs of subsequent properties. We introduce a function annotation logic that extends Hoare logic in two important ways: (1 when proving that a function satisfies a Hoare triple, intermediate reasoning is automatically stored as function annotations, and (2 these function annotations can be exploited in future Hoare logic proofs. This reduces duplication of reasoning between the proofs of different properties, whilst serving as a drop-in replacement for traditional Hoare logic to avoid the costly process of proof refactoring. We explain how this was implemented in Isabelle/HOL and applied to an experimental branch of the seL4 microkernel to significantly reduce the size and complexity of existing proofs.

  6. Automatically Annotated Mapping for Indoor Mobile Robot Applications

    DEFF Research Database (Denmark)

    Özkil, Ali Gürcan; Howard, Thomas J.

    2012-01-01

    This paper presents a new and practical method for mapping and annotating indoor environments for mobile robot use. The method makes use of 2D occupancy grid maps for metric representation, and topology maps to indicate the connectivity of the ‘places-of-interests’ in the environment. Novel use...... localization and mapping in topology space, and fuses camera and robot pose estimations to build an automatically annotated global topo-metric map. It is developed as a framework for a hospital service robot and tested in a real hospital. Experiments show that the method is capable of producing globally...... consistent, automatically annotated hybrid metric-topological maps that is needed by mobile service robots....

  7. VAT: a computational framework to functionally annotate variants in personal genomes within a cloud-computing environment.

    Science.gov (United States)

    Habegger, Lukas; Balasubramanian, Suganthi; Chen, David Z; Khurana, Ekta; Sboner, Andrea; Harmanci, Arif; Rozowsky, Joel; Clarke, Declan; Snyder, Michael; Gerstein, Mark

    2012-09-01

    The functional annotation of variants obtained through sequencing projects is generally assumed to be a simple intersection of genomic coordinates with genomic features. However, complexities arise for several reasons, including the differential effects of a variant on alternatively spliced transcripts, as well as the difficulty in assessing the impact of small insertions/deletions and large structural variants. Taking these factors into consideration, we developed the Variant Annotation Tool (VAT) to functionally annotate variants from multiple personal genomes at the transcript level as well as obtain summary statistics across genes and individuals. VAT also allows visualization of the effects of different variants, integrates allele frequencies and genotype data from the underlying individuals and facilitates comparative analysis between different groups of individuals. VAT can either be run through a command-line interface or as a web application. Finally, in order to enable on-demand access and to minimize unnecessary transfers of large data files, VAT can be run as a virtual machine in a cloud-computing environment. VAT is implemented in C and PHP. The VAT web service, Amazon Machine Image, source code and detailed documentation are available at vat.gersteinlab.org.

  8. AGORA : Organellar genome annotation from the amino acid and nucleotide references.

    Science.gov (United States)

    Jung, Jaehee; Kim, Jong Im; Jeong, Young-Sik; Yi, Gangman

    2018-03-29

    Next-generation sequencing (NGS) technologies have led to the accumulation of highthroughput sequence data from various organisms in biology. To apply gene annotation of organellar genomes for various organisms, more optimized tools for functional gene annotation are required. Almost all gene annotation tools are mainly focused on the chloroplast genome of land plants or the mitochondrial genome of animals.We have developed a web application AGORA for the fast, user-friendly, and improved annotations of organellar genomes. AGORA annotates genes based on a BLAST-based homology search and clustering with selected reference sequences from the NCBI database or user-defined uploaded data. AGORA can annotate the functional genes in almost all mitochondrion and plastid genomes of eukaryotes. The gene annotation of a genome with an exon-intron structure within a gene or inverted repeat region is also available. It provides information of start and end positions of each gene, BLAST results compared with the reference sequence, and visualization of gene map by OGDRAW. Users can freely use the software, and the accessible URL is https://bigdata.dongguk.edu/gene_project/AGORA/.The main module of the tool is implemented by the python and php, and the web page is built by the HTML and CSS to support all browsers. gangman@dongguk.edu.

  9. Systematically profiling and annotating long intergenic non-coding RNAs in human embryonic stem cell.

    Science.gov (United States)

    Tang, Xing; Hou, Mei; Ding, Yang; Li, Zhaohui; Ren, Lichen; Gao, Ge

    2013-01-01

    While more and more long intergenic non-coding RNAs (lincRNAs) were identified to take important roles in both maintaining pluripotency and regulating differentiation, how these lincRNAs may define and drive cell fate decisions on a global scale are still mostly elusive. Systematical profiling and comprehensive annotation of embryonic stem cells lincRNAs may not only bring a clearer big picture of these novel regulators but also shed light on their functionalities. Based on multiple RNA-Seq datasets, we systematically identified 300 human embryonic stem cell lincRNAs (hES lincRNAs). Of which, one forth (78 out of 300) hES lincRNAs were further identified to be biasedly expressed in human ES cells. Functional analysis showed that they were preferentially involved in several early-development related biological processes. Comparative genomics analysis further suggested that around half of the identified hES lincRNAs were conserved in mouse. To facilitate further investigation of these hES lincRNAs, we constructed an online portal for biologists to access all their sequences and annotations interactively. In addition to navigation through a genome browse interface, users can also locate lincRNAs through an advanced query interface based on both keywords and expression profiles, and analyze results through multiple tools. By integrating multiple RNA-Seq datasets, we systematically characterized and annotated 300 hES lincRNAs. A full functional web portal is available freely at http://scbrowse.cbi.pku.edu.cn. As the first global profiling and annotating of human embryonic stem cell lincRNAs, this work aims to provide a valuable resource for both experimental biologists and bioinformaticians.

  10. Integrative annotation of 21,037 human genes validated by full-length cDNA clones.

    Directory of Open Access Journals (Sweden)

    Tadashi Imanishi

    2004-06-01

    Full Text Available The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/. It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs, identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA

  11. Control of Drosophila Type I and Type II central brain neuroblast proliferation by bantam microRNA

    DEFF Research Database (Denmark)

    Weng, Ruifen; Cohen, Stephen M

    2015-01-01

    Post-transcriptional regulation of stem cell self-renewal by microRNAs is emerging as an important mechanism controlling tissue homeostasis. Here, we provide evidence that bantam microRNA controls neuroblast number and proliferation in the Drosophila central brain. Bantam also supports proliferat......Post-transcriptional regulation of stem cell self-renewal by microRNAs is emerging as an important mechanism controlling tissue homeostasis. Here, we provide evidence that bantam microRNA controls neuroblast number and proliferation in the Drosophila central brain. Bantam also supports...

  12. Increased expression of microRNA-221 inhibits PAK1 in endothelial progenitor cells and impairs its function via c-Raf/MEK/ERK pathway

    International Nuclear Information System (INIS)

    Zhang, Xiaoping; Mao, Haian; Chen, Jin-yuan; Wen, Shengjun; Li, Dan; Ye, Meng; Lv, Zhongwei

    2013-01-01

    Highlights: ► MicroRNA-221 is upregulated in the endothelial progenitor cells of atherosclerosis patients. ► PAK1 is a direct target of microRNA-221. ► MicroRNA-221 inhibits EPCs proliferation through c-Raf/MEK/ERK pathway. -- Abstract: Coronary artery disease (CAD) is associated with high mortality and occurs via endothelial injury. Endothelial progenitor cells (EPCs) restore the integrity of the endothelium and protect it from atherosclerosis. In this study, we compared the expression of microRNAs (miRNAs) in EPCs in atherosclerosis patients and normal controls. We found that miR-221 expression was significantly up-regulated in patients compared with controls. We predicted and identified p21/Cdc42/Rac1-activated kinase 1 (PAK1) as a novel target of miR-221 in EPCs. We also demonstrated that miR-221 targeted a putative binding site in the 3′UTR of PAK1, and absence of this site was inversely associated with miR-221 expression in EPCs. We confirmed this relationship using a luciferase reporter assay. Furthermore, overexpression of miR-221 in EPCs significantly decreased EPC proliferation, in accordance with the inhibitory effects induced by decreased PAK1. Overall, these findings demonstrate that miR-221 affects the MEK/ERK pathway by targeting PAK1 to inhibit the proliferation of EPCs

  13. An annotated corpus with nanomedicine and pharmacokinetic parameters

    Directory of Open Access Journals (Sweden)

    Lewinski NA

    2017-10-01

    Full Text Available Nastassja A Lewinski,1 Ivan Jimenez,1 Bridget T McInnes2 1Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA, 2Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA Abstract: A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided. Keywords: nanotechnology, informatics, natural language processing, text mining, corpora

  14. Annotated Tsunami bibliography: 1962-1976

    International Nuclear Information System (INIS)

    Pararas-Carayannis, G.; Dong, B.; Farmer, R.

    1982-08-01

    This compilation contains annotated citations to nearly 3000 tsunami-related publications from 1962 to 1976 in English and several other languages. The foreign-language citations have English titles and abstracts

  15. Assessment of community-submitted ontology annotations from a novel database-journal partnership.

    Science.gov (United States)

    Berardini, Tanya Z; Li, Donghui; Muller, Robert; Chetty, Raymond; Ploetz, Larry; Singh, Shanker; Wensel, April; Huala, Eva

    2012-01-01

    As the scientific literature grows, leading to an increasing volume of published experimental data, so does the need to access and analyze this data using computational tools. The most commonly used method to convert published experimental data on gene function into controlled vocabulary annotations relies on a professional curator, employed by a model organism database or a more general resource such as UniProt, to read published articles and compose annotation statements based on the articles' contents. A more cost-effective and scalable approach capable of capturing gene function data across the whole range of biological research organisms in computable form is urgently needed. We have analyzed a set of ontology annotations generated through collaborations between the Arabidopsis Information Resource and several plant science journals. Analysis of the submissions entered using the online submission tool shows that most community annotations were well supported and the ontology terms chosen were at an appropriate level of specificity. Of the 503 individual annotations that were submitted, 97% were approved and community submissions captured 72% of all possible annotations. This new method for capturing experimental results in a computable form provides a cost-effective way to greatly increase the available body of annotations without sacrificing annotation quality. Database URL: www.arabidopsis.org.

  16. Data integration in biological research: an overview.

    Science.gov (United States)

    Lapatas, Vasileios; Stefanidakis, Michalis; Jimenez, Rafael C; Via, Allegra; Schneider, Maria Victoria

    2015-12-01

    Data sharing, integration and annotation are essential to ensure the reproducibility of the analysis and interpretation of the experimental findings. Often these activities are perceived as a role that bioinformaticians and computer scientists have to take with no or little input from the experimental biologist. On the contrary, biological researchers, being the producers and often the end users of such data, have a big role in enabling biological data integration. The quality and usefulness of data integration depend on the existence and adoption of standards, shared formats, and mechanisms that are suitable for biological researchers to submit and annotate the data, so it can be easily searchable, conveniently linked and consequently used for further biological analysis and discovery. Here, we provide background on what is data integration from a computational science point of view, how it has been applied to biological research, which key aspects contributed to its success and future directions.

  17. Identification of Conserved and Novel MicroRNAs during Tail Regeneration in the Mexican Axolotl

    Directory of Open Access Journals (Sweden)

    Micah D. Gearhart

    2015-09-01

    Full Text Available The Mexican axolotl salamander (Ambystoma mexicanum is one member of a select group of vertebrate animals that have retained the amazing ability to regenerate multiple body parts. In addition to being an important model system for regeneration, the axolotl has also contributed extensively to studies of basic development. While many genes known to play key roles during development have now been implicated in various forms of regeneration, much of the regulatory apparatus controlling the underlying molecular circuitry remains unknown. In recent years, microRNAs have been identified as key regulators of gene expression during development, in many diseases and also, increasingly, in regeneration. Here, we have used deep sequencing combined with qRT-PCR to undertake a comprehensive identification of microRNAs involved in regulating regeneration in the axolotl. Specifically, among the microRNAs that we have found to be expressed in axolotl tissues, we have identified 4564 microRNA families known to be widely conserved among vertebrates, as well as 59,811 reads of putative novel microRNAs. These findings support the hypothesis that microRNAs play key roles in managing the precise spatial and temporal patterns of gene expression that ensures the correct regeneration of missing tissues.

  18. MicroRNAs: From Female Fertility, Germ Cells, and Stem Cells to Cancer in Humans

    Directory of Open Access Journals (Sweden)

    Irma Virant-Klun

    2016-01-01

    Full Text Available MicroRNAs are a family of naturally occurring small noncoding RNA molecules that play an important regulatory role in gene expression. They are suggested to regulate a large proportion of protein encoding genes by mediating the translational suppression and posttranscriptional control of gene expression. Recent findings show that microRNAs are emerging as important regulators of cellular differentiation and dedifferentiation, and are deeply involved in developmental processes including human preimplantation development. They keep a balance between pluripotency and differentiation in the embryo and embryonic stem cells. Moreover, it became evident that dysregulation of microRNA expression may play a fundamental role in progression and dissemination of different cancers including ovarian cancer. The interest is still increased by the discovery of exosomes, that is, cell-derived vesicles, which can carry different proteins but also microRNAs between different cells and are involved in cell-to-cell communication. MicroRNAs, together with exosomes, have a great potential to be used for prognosis, therapy, and biomarkers of different diseases including infertility. The aim of this review paper is to summarize the existent knowledge on microRNAs related to female fertility and cancer: from primordial germ cells and ovarian function, germinal stem cells, oocytes, and embryos to embryonic stem cells.

  19. INDIGO – INtegrated Data Warehouse of MIcrobial GenOmes with Examples from the Red Sea Extremophiles

    Science.gov (United States)

    Alam, Intikhab; Antunes, André; Kamau, Allan Anthony; Ba alawi, Wail; Kalkatawi, Manal; Stingl, Ulrich; Bajic, Vladimir B.

    2013-01-01

    Background The next generation sequencing technologies substantially increased the throughput of microbial genome sequencing. To functionally annotate newly sequenced microbial genomes, a variety of experimental and computational methods are used. Integration of information from different sources is a powerful approach to enhance such annotation. Functional analysis of microbial genomes, necessary for downstream experiments, crucially depends on this annotation but it is hampered by the current lack of suitable information integration and exploration systems for microbial genomes. Results We developed a data warehouse system (INDIGO) that enables the integration of annotations for exploration and analysis of newly sequenced microbial genomes. INDIGO offers an opportunity to construct complex queries and combine annotations from multiple sources starting from genomic sequence to protein domain, gene ontology and pathway levels. This data warehouse is aimed at being populated with information from genomes of pure cultures and uncultured single cells of Red Sea bacteria and Archaea. Currently, INDIGO contains information from Salinisphaera shabanensis, Haloplasma contractile, and Halorhabdus tiamatea - extremophiles isolated from deep-sea anoxic brine lakes of the Red Sea. We provide examples of utilizing the system to gain new insights into specific aspects on the unique lifestyle and adaptations of these organisms to extreme environments. Conclusions We developed a data warehouse system, INDIGO, which enables comprehensive integration of information from various resources to be used for annotation, exploration and analysis of microbial genomes. It will be regularly updated and extended with new genomes. It is aimed to serve as a resource dedicated to the Red Sea microbes. In addition, through INDIGO, we provide our Automatic Annotation of Microbial Genomes (AAMG) pipeline. The INDIGO web server is freely available at http://www.cbrc.kaust.edu.sa/indigo. PMID

  20. Annotation-Based Whole Genomic Prediction and Selection

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Do, Duy Ngoc; Janss, Luc

    Genomic selection is widely used in both animal and plant species, however, it is performed with no input from known genomic or biological role of genetic variants and therefore is a black box approach in a genomic era. This study investigated the role of different genomic regions and detected QTLs...... in their contribution to estimated genomic variances and in prediction of genomic breeding values by applying SNP annotation approaches to feed efficiency. Ensembl Variant Predictor (EVP) and Pig QTL database were used as the source of genomic annotation for 60K chip. Genomic prediction was performed using the Bayes...... classes. Predictive accuracy was 0.531, 0.532, 0.302, and 0.344 for DFI, RFI, ADG and BF, respectively. The contribution per SNP to total genomic variance was similar among annotated classes across different traits. Predictive performance of SNP classes did not significantly differ from randomized SNP...