WorldWideScience

Sample records for chlamydomonas annotation jamboree

  1. MEETING: Chlamydomonas Annotation Jamboree - October 2003

    Energy Technology Data Exchange (ETDEWEB)

    Grossman, Arthur R

    2007-04-13

    Shotgun sequencing of the nuclear genome of Chlamydomonas reinhardtii (Chlamydomonas throughout) was performed at an approximate 10X coverage by JGI. Roughly half of the genome is now contained on 26 scaffolds, all of which are at least 1.6 Mb, and the coverage of the genome is ~95%. There are now over 200,000 cDNA sequence reads that we have generated as part of the Chlamydomonas genome project (Grossman, 2003; Shrager et al., 2003; Grossman et al. 2007; Merchant et al., 2007); other sequences have also been generated by the Kasuza sequence group (Asamizu et al., 1999; Asamizu et al., 2000) or individual laboratories that have focused on specific genes. Shrager et al. (2003) placed the reads into distinct contigs (an assemblage of reads with overlapping nucleotide sequences), and contigs that group together as part of the same genes have been designated ACEs (assembly of contigs generated from EST information). All of the reads have also been mapped to the Chlamydomonas nuclear genome and the cDNAs and their corresponding genomic sequences have been reassembled, and the resulting assemblage is called an ACEG (an Assembly of contiguous EST sequences supported by genomic sequence) (Jain et al., 2007). Most of the unique genes or ACEGs are also represented by gene models that have been generated by the Joint Genome Institute (JGI, Walnut Creek, CA). These gene models have been placed onto the DNA scaffolds and are presented as a track on the Chlamydomonas genome browser associated with the genome portal (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html). Ultimately, the meeting grant awarded by DOE has helped enormously in the development of an annotation pipeline (a set of guidelines used in the annotation of genes) and resulted in high quality annotation of over 4,000 genes; the annotators were from both Europe and the USA. Some of the people who led the annotation initiative were Arthur Grossman, Olivier Vallon, and Sabeeha Merchant (with many individual

  2. Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees.

    Science.gov (United States)

    Wang, Qinghua; Arighi, Cecilia N; King, Benjamin L; Polson, Shawn W; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F; Page, Shallee T; Rendino, Marc Farnum; Thomas, William Kelley; Udwary, Daniel W; Wu, Cathy H

    2012-01-01

    Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome.

  3. Community annotation and bioinformatics workforce development in concert—Little Skate Genome Annotation Workshops and Jamborees

    Science.gov (United States)

    Wang, Qinghua; Arighi, Cecilia N.; King, Benjamin L.; Polson, Shawn W.; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F.; Page, Shallee T.; Farnum Rendino, Marc; Thomas, William Kelley; Udwary, Daniel W.; Wu, Cathy H.

    2012-01-01

    Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome. PMID:22434832

  4. National Boy Scout Jamboree

    Science.gov (United States)

    1989-01-01

    This video looks at a NASA sponsored exhibit at the National Boy Scout Jamboree in Fredricksburg, VA. Boy Scouts are shown interacting with NASA researchers and astronauts and touring mockups of Space Station Freedom and Apollo 11. NASA's program to encourage the researchers of tomorrow is detailed.

  5. The Chlamydomonas genome project: a decade on

    Science.gov (United States)

    Blaby, Ian K.; Blaby-Haas, Crysten; Tourasse, Nicolas; Hom, Erik F. Y.; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George; Stanke, Mario; Harris, Elizabeth H.; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S.; Prochnik, Simon

    2014-01-01

    The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis and micronutrient homeostasis. Ten years since its genome project was initiated, an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the “omics” era. Housed at Phytozome, the Joint Genome Institute’s (JGI) plant genomics portal, the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of RNA-Seq data. Here, we present the past, present and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. PMID:24950814

  6. ChlamyCyc: an integrative systems biology database and web-portal for Chlamydomonas reinhardtii.

    Science.gov (United States)

    May, Patrick; Christian, Jan-Ole; Kempa, Stefan; Walther, Dirk

    2009-05-04

    The unicellular green alga Chlamydomonas reinhardtii is an important eukaryotic model organism for the study of photosynthesis and plant growth. In the era of modern high-throughput technologies there is an imperative need to integrate large-scale data sets from high-throughput experimental techniques using computational methods and database resources to provide comprehensive information about the molecular and cellular organization of a single organism. In the framework of the German Systems Biology initiative GoFORSYS, a pathway database and web-portal for Chlamydomonas (ChlamyCyc) was established, which currently features about 250 metabolic pathways with associated genes, enzymes, and compound information. ChlamyCyc was assembled using an integrative approach combining the recently published genome sequence, bioinformatics methods, and experimental data from metabolomics and proteomics experiments. We analyzed and integrated a combination of primary and secondary database resources, such as existing genome annotations from JGI, EST collections, orthology information, and MapMan classification. ChlamyCyc provides a curated and integrated systems biology repository that will enable and assist in systematic studies of fundamental cellular processes in Chlamydomonas. The ChlamyCyc database and web-portal is freely available under http://chlamycyc.mpimp-golm.mpg.de.

  7. Sixth Annual NASA Ames Space Science and Astrobiology Jamboree

    Science.gov (United States)

    Hollingsworth, Jeffery; Howell, Steve; Fonda, Mark; Dateo, Chris; Martinez, Christine M.

    2018-01-01

    Welcome to the Sixth Annual NASA Ames Research Center, Space Science and Astrobiology Jamboree at NASA Ames Research Center (ARC). The Space Science and Astrobiology Division consists of over 60 Civil Servants, with more than 120 Cooperative Agreement Research Scientists, Post-Doctoral Fellows, Science Support Contractors, Visiting Scientists, and many other Research Associates. Within the Division there is engagement in scientific investigations over a breadth of disciplines including Astrobiology, Astrophysics, Exobiology, Exoplanets, Planetary Systems Science, and many more. The Division's personnel support NASA spacecraft missions (current and planned), including SOFIA, K2, MSL, New Horizons, JWST, WFIRST, and others. Our top-notch science research staff is spread amongst three branches in five buildings at ARC. Naturally, it can thus be difficult to remain abreast of what fellow scientific researchers pursue actively, and then what may present and/or offer regarding inter-Branch, intra-Division future collaborative efforts. In organizing this annual jamboree, the goals are to offer a wholesome, one-venue opportunity to sense the active scientific research and spacecraft mission involvement within the Division; and to facilitate communication and collaboration amongst our research scientists. Annually, the Division honors one senior research scientist with a Pollack Lecture, and one early career research scientist with an Outstanding Early Career Space Scientist Lecture. For the Pollack Lecture, the honor is bestowed upon a senior researcher who has made significant contributions within any area of research aligned with space science and/or astrobiology. This year we are pleased to honor Linda Jahnke. With the Early Career Lecture, the honor is bestowed upon an early-career researcher who has substantially demonstrated great promise for significant contributions within space science, astrobiology, and/or, in support of spacecraft missions addressing such

  8. ChlamyCyc: an integrative systems biology database and web-portal for Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Kempa Stefan

    2009-05-01

    Full Text Available Abstract Background The unicellular green alga Chlamydomonas reinhardtii is an important eukaryotic model organism for the study of photosynthesis and plant growth. In the era of modern high-throughput technologies there is an imperative need to integrate large-scale data sets from high-throughput experimental techniques using computational methods and database resources to provide comprehensive information about the molecular and cellular organization of a single organism. Results In the framework of the German Systems Biology initiative GoFORSYS, a pathway database and web-portal for Chlamydomonas (ChlamyCyc was established, which currently features about 250 metabolic pathways with associated genes, enzymes, and compound information. ChlamyCyc was assembled using an integrative approach combining the recently published genome sequence, bioinformatics methods, and experimental data from metabolomics and proteomics experiments. We analyzed and integrated a combination of primary and secondary database resources, such as existing genome annotations from JGI, EST collections, orthology information, and MapMan classification. Conclusion ChlamyCyc provides a curated and integrated systems biology repository that will enable and assist in systematic studies of fundamental cellular processes in Chlamydomonas. The ChlamyCyc database and web-portal is freely available under http://chlamycyc.mpimp-golm.mpg.de.

  9. Identification of an NADP/thioredoxin system in Chlamydomonas reinhardtii

    Science.gov (United States)

    Huppe, H. C.; Picaud, A.; Buchanan, B. B.; Miginiac-Maslow, M.

    1991-01-01

    The protein components of the NADP/thioredoxin system, NADP-thioredoxin reductase (NTR) and thioredoxin h, have been purified and characterized from the green alga, Chlamydomonas reinhardtii. The analysis of this system confirms that photoautotrophic Chlamydomonas cells resemble leaves in having both an NADP- and ferrodoxin-linked thioredoxin redox system. Chlamydomonas thioredoxin h, which is smaller on sodium dodecyl sulfate-polyacrylamide gel electrophoresis than thioredoxin m from the same source, cross-reacted with antisera to thioredoxin h from spinach (Spinacia oleracea L.) and wheat germ (Triticum vulgaris L.) but not with antisera to m or f thioredoxins. In these properties, the thioredoxin h resembled a thioredoxin from Chlamydomonas, designated Ch1, whose sequence was reported recently (P. Decottignies et al., 1991, Eur. J. Biochem. 198, 505-512). The differential reactivity of thioredoxin h with antisera was used to demonstrate that thioredoxin h is enriched outside the chloroplast. The NTR was purified from Chlamydomonas using thioredoxin h from the same source. Similar to its counterpart from other organisms, Chlamydomonas NTR had a subunit size of approx. 36 kDa and was specific for NADPH. Chlamydomonas NTR effectively reduced thioredoxin h from the same source but showed little activity with the other thioredoxins tested, including spinach thioredoxin h and Escherichia coli thioredoxin. Comparison of the reduction of Chlamydomonas thioredoxins m and h by each of the endogenous thioredoxin reductases, NTR and ferredoxin-thioredoxin reductase, revealed a differential specificity of each enzyme for thioredoxin. Thus, NTR showed increased activity with thioredoxin h and ferredoxin-thioredoxin reductase with thioredoxins m and f.

  10. Heat shock induced change in protein ubiquitination in Chlamydomonas

    International Nuclear Information System (INIS)

    Shimogawara, K.; Muto, S.

    1989-01-01

    Ubiquitin was purified from pea (Pisum sativum L.) and its antibody was produced. Western blot analysis showed that the antibody cross-reacted with ubiquitins from a green alga Chlamydomonas reinhardtii, a brown alga Laminaria angustata and a red alga Porphyridium cruentum but not with ubiquitin from a blue-green alga Synechococcus sp. In Chlamydomonas, the antibody also reacted with some ubiquitinated proteins including 28- and 31-kDa polypeptides. The isoelectric points of Chlamydomonas ubiquitin and the 28- and 31-kDa ubiquitinated proteins were 8.0, 8.9 and 10.3, respectively. The ubiquitinated proteins, including the 28- and 31-kDa polypeptides were detected after in vitro ATP-dependent ubiquitination of Chlamydomonas cell extract with l25 I-labeled bovine ubiquitin. Heat treatment of Chlamydomonas cells (>40°C) caused drastic increase of ubiquitinated proteins with high mol wt (>60kDa), and coordinated redistribution or decrease of other ubiquitinated proteins and free ubiquitin. Quantitative analysis revealed that the 28- and 31-kDa ubiquitinated proteins showed different responses against heat stress, i.e. the former being more sensitive than the latter. (author)

  11. Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data

    Directory of Open Access Journals (Sweden)

    Merchant Sabeeha S

    2011-07-01

    Full Text Available Abstract Background Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. Description The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of

  12. Lipidomic Analysis of Chlamydomonas reinhardtii under Nitrogen and Sulfur Deprivation.

    Directory of Open Access Journals (Sweden)

    Dawei Yang

    Full Text Available Chlamydomonas reinhardtii accumulates lipids under complete nutrient starvation conditions while overall growth in biomass stops. In order to better understand biochemical changes under nutrient deprivation that maintain production of algal biomass, we used a lipidomic assay for analyzing the temporal regulation of the composition of complex lipids in C. reinhardtii in response to nitrogen and sulfur deprivation. Using a chip-based nanoelectrospray direct infusion into an ion trap mass spectrometer, we measured a diversity of lipid species reported for C. reinhardtii, including PG phosphatidylglycerols, PI Phosphatidylinositols, MGDG monogalactosyldiacylglycerols, DGDG digalactosyldiacylglycerols, SQDG sulfoquinovosyldiacylglycerols, DGTS homoserine ether lipids and TAG triacylglycerols. Individual lipid species were annotated by matching mass precursors and MS/MS fragmentations to the in-house LipidBlast mass spectral database and MS2Analyzer. Multivariate statistics showed a clear impact on overall lipidomic phenotypes on both the temporal and the nutrition stress level. Homoserine-lipids were found up-regulated at late growth time points and higher cell density, while triacyclglycerols showed opposite regulation of unsaturated and saturated fatty acyl chains under nutritional deprivation.

  13. UV-B Perception and Acclimation in Chlamydomonas reinhardtii[OPEN

    Science.gov (United States)

    Chappuis, Richard; Allorent, Guillaume

    2016-01-01

    Plants perceive UV-B, an intrinsic component of sunlight, via a signaling pathway that is mediated by the photoreceptor UV RESISTANCE LOCUS8 (UVR8) and induces UV-B acclimation. To test whether similar UV-B perception mechanisms exist in the evolutionarily distant green alga Chlamydomonas reinhardtii, we identified Chlamydomonas orthologs of UVR8 and the key signaling factor CONSTITUTIVELY PHOTOMORPHOGENIC1 (COP1). Cr-UVR8 shares sequence and structural similarity to Arabidopsis thaliana UVR8, has conserved tryptophan residues for UV-B photoreception, monomerizes upon UV-B exposure, and interacts with Cr-COP1 in a UV-B-dependent manner. Moreover, Cr-UVR8 can interact with At-COP1 and complement the Arabidopsis uvr8 mutant, demonstrating that it is a functional UV-B photoreceptor. Chlamydomonas shows apparent UV-B acclimation in colony survival and photosynthetic efficiency assays. UV-B exposure, at low levels that induce acclimation, led to broad changes in the Chlamydomonas transcriptome, including in genes related to photosynthesis. Impaired UV-B-induced activation in the Cr-COP1 mutant hit1 indicates that UVR8-COP1 signaling induces transcriptome changes in response to UV-B. Also, hit1 mutants are impaired in UV-B acclimation. Chlamydomonas UV-B acclimation preserved the photosystem II core proteins D1 and D2 under UV-B stress, which mitigated UV-B-induced photoinhibition. These findings highlight the early evolution of UVR8 photoreceptor signaling in the green lineage to induce UV-B acclimation and protection. PMID:27020958

  14. Metabolism of D-lactate and structurally related organic acids in Chlamydomonas reinhardtii

    International Nuclear Information System (INIS)

    Husic, D.W.

    1986-01-01

    During the initial minutes of anaerobiosis, 14 C-labeled D-lactate, derived from the photosynthetic sugar phosphate pool, accumulated in the unicellular green alga, Chlamydomonas reinhardtii. The production of the D-isomer of lactate by algae is in contrast to plant and mammalian cells in which L-lactate is formed. After initial lactate formation, Chlamydomonas exhibits a mixed-acid type fermentation, thereby avoiding lactate accumulation and enabling the cells to tolerate extended periods of anaerobiosis. A pyruvate reductase which catalyzes the formation of D-lactate in Chlamydomonas was partially purified and characterized. Lactate produced anaerobically was metabolized only when Chlamydomonas cells were returned to aerobic conditions, and reoxidation of the D-lactate was apparently catalyzed by a mitochondrial membrane-bound dehydrogenase, rather than by the soluble pyruvate reductase. Mutants of Chlamydomonas, deficient in mitochondrial respiration, were used to demonstrate that lactate metabolism was linked to the mitochondrial electron transport chain. In addition, the oxidation of glycolate, a structural analog of lactate, was also linked to mitochondrial electron transport in vivo

  15. Respiratory-deficient mutants of the unicellular green alga Chlamydomonas: a review.

    Science.gov (United States)

    Salinas, Thalia; Larosa, Véronique; Cardol, Pierre; Maréchal-Drouard, Laurence; Remacle, Claire

    2014-05-01

    Genetic manipulation of the unicellular green alga Chlamydomonas reinhardtii is straightforward. Nuclear genes can be interrupted by insertional mutagenesis or targeted by RNA interference whereas random or site-directed mutagenesis allows the introduction of mutations in the mitochondrial genome. This, combined with a screen that easily allows discriminating respiratory-deficient mutants, makes Chlamydomonas a model system of choice to study mitochondria biology in photosynthetic organisms. Since the first description of Chlamydomonas respiratory-deficient mutants in 1977 by random mutagenesis, many other mutants affected in mitochondrial components have been characterized. These respiratory-deficient mutants increased our knowledge on function and assembly of the respiratory enzyme complexes. More recently some of these mutants allowed the study of mitochondrial gene expression processes poorly understood in Chlamydomonas. In this review, we update the data concerning the respiratory components with a special focus on the assembly factors identified on other organisms. In addition, we make an inventory of different mitochondrial respiratory mutants that are inactivated either on mitochondrial or nuclear genes. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  16. OK, thanks! A new mutualism between Chlamydomonas and methylobacteria facilitates growth on amino acids and peptides.

    Science.gov (United States)

    Calatrava, Victoria; Hom, Erik F Y; Llamas, Ángel; Fernández, Emilio; Galván, Aurora

    2018-04-01

    Nitrogen is a key nutrient for land plants and phytoplankton in terrestrial and aquatic ecosystems. The model alga Chlamydomonas reinhardtii can grow efficiently on several inorganic nitrogen sources (e.g. ammonium, nitrate, nitrite) as well as many amino acids. In this study, we show that Chlamydomonas is unable to use proline, hydroxyproline and peptides that contain these amino acids. However, we discovered that algal growth on these substrates is supported in association with Methylobacterium spp., and that a mutualistic carbon-nitrogen metabolic exchange between Chlamydomonas and Methylobacterium spp. is established. Specifically, the mineralization of these amino acids and peptides by Methylobacterium spp. produces ammonium that can be assimilated by Chlamydomonas, and CO2 photosynthetically fixed by Chlamydomonas yields glycerol that can be assimilated by Methylobacterium. As Chlamydomonas is an algal ancestor to land plants and Methylobacterium is a plant growth-promoting bacterium, this new model of mutualism may facilitate insights into the ecology and evolution of plant-bacterial interactions and design principles of synthetic ecology.

  17. Interactions between marine facultative epiphyte Chlamydomonas sp. (Chlamydomonadales, Chlorophyta) and ceramiaceaen algae (Rhodophyta).

    Science.gov (United States)

    Klochkova, Tatyana A; Cho, Ga Youn; Boo, Sung Min; Chung, Ki Wha; Kim, Song Ja; Kim, Gwang Hoon

    2008-07-01

    Previously unrecorded marine Chlamydomonas that grew epiphytic on ceramiaceaen algae was collected from the western coast of Korea and isolated into a unialgal culture. The isolate was subjected to 18S rDNA phylogenetic analysis as well as ultrastructure and life cycle studies. It had an affinity with the marine Chlamydomonas species and was less related to freshwater/terrestrial representatives of this genus. It had flagella shorter than the cell body two-layered cell wall with striated outer surface and abundant mucilaginous material beneath the innermost layer and no contractile vacuoles. This alga grew faster in mixed cultures with ceramiaceaen algae rather than in any tested unialgal culture condition; the cells looked healthier and zoosporangia and motile flagellated vegetative cells appeared more often. These results suggested that this Chlamydomonas might be a facultative epiphyte benefiting from its hosts. Several ceramiaceaen algae were tested as host plants. Meanwhile, cell deformation or collapse of the whole thallus was caused to Aglaothamnion byssoides, and preliminary study suggested that a substance released from Chlamydomonas caused the response. This is first report on harmful epiphytic interactions between Chlamydomonas species and red ceramiaceaen algae.

  18. Azolla filiculoides Nitrogenase Activity Decrease Induced by Inoculation with Chlamydomonas sp. †

    Science.gov (United States)

    Habte, Mitiku

    1986-01-01

    Experiments were conducted to determine the influence of Chlamydomonas sp. on nitrogen fixation (C2H2 → C2H4) in Azolla filiculoides and on the nitrogen fixation and growth of free-living Anabaena azollae 2B organisms. Inoculation of azolla medium with Chlamydomonas sp. was associated with decreased nitrogenase activity in A. filiculoides and with increases in the density of a fungal population identified as Acremonium sp. Subsequent inoculation of azolla medium with this fungus was also accompanied by a significant decrease in nitrogenase activity of A. filiculoides. However, the extent of depression of nitrogenase activity was significantly higher when azolla medium was inoculated with Chlamydomonas sp. than when it was inoculated with Acremonium sp. Inoculation of nitrogen-free Stanier medium with either Acremonium sp. or Chlamydomonas sp. did not adversely affect the growth or nitrogenase activity of free-living A. azollae. Decreased nitrogenase activity in A. filiculoides is apparently related to the adverse influence of the green alga and the fungus on the macrosymbiont. The mechanisms that might be involved are discussed. PMID:16347211

  19. The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions

    Energy Technology Data Exchange (ETDEWEB)

    Merchant, Sabeeha S

    2007-04-09

    Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.

  20. Improving the optimum yield and growth of Chlamydomonas ...

    African Journals Online (AJOL)

    N.T

    2016-06-08

    Jun 8, 2016 ... genomes such as Chlamydomonas reinhardtii, Chlorella vulgaris, Volvox ..... The potential of micro algae as laboratory tool in cosmetic industries ..... lutein by Chlorella protothecoides at various glucose concentrations in.

  1. The Deep Thioredoxome in Chlamydomonas reinhardtii: New Insights into Redox Regulation.

    Science.gov (United States)

    Pérez-Pérez, María Esther; Mauriès, Adeline; Maes, Alexandre; Tourasse, Nicolas J; Hamon, Marion; Lemaire, Stéphane D; Marchand, Christophe H

    2017-08-07

    Thiol-based redox post-translational modifications have emerged as important mechanisms of signaling and regulation in all organisms, and thioredoxin plays a key role by controlling the thiol-disulfide status of target proteins. Recent redox proteomic studies revealed hundreds of proteins regulated by glutathionylation and nitrosylation in the unicellular green alga Chlamydomonas reinhardtii, while much less is known about the thioredoxin interactome in this organism. By combining qualitative and quantitative proteomic analyses, we have comprehensively investigated the Chlamydomonas thioredoxome and 1188 targets have been identified. They participate in a wide range of metabolic pathways and cellular processes. This study broadens not only the redox regulation to new enzymes involved in well-known thioredoxin-regulated metabolic pathways but also sheds light on cellular processes for which data supporting redox regulation are scarce (aromatic amino acid biosynthesis, nuclear transport, etc). Moreover, we characterized 1052 thioredoxin-dependent regulatory sites and showed that these data constitute a valuable resource for future functional studies in Chlamydomonas. By comparing this thioredoxome with proteomic data for glutathionylation and nitrosylation at the protein and cysteine levels, this work confirms the existence of a complex redox regulation network in Chlamydomonas and provides evidence of a tremendous selectivity of redox post-translational modifications for specific cysteine residues. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.

  2. Homogentisate phytyltransferase from the unicellular green alga Chlamydomonas reinhardtii.

    Science.gov (United States)

    Gálvez-Valdivieso, Gregorio; Cardeñosa, Rosa; Pineda, Manuel; Aguilar, Miguel

    2015-09-01

    Homogentisate phytyltransferase (HPT) (EC 2.5.1.-) catalyzes the first committed step of tocopherol biosynthesis in all photosynthetic organisms. This paper presents the molecular characterization and expression analysis of HPT1 gene, and a study on the accumulation of tocopherols under different environmental conditions in the unicellular green alga Chlamydomonas reinhardtii. The Chlamydomonas HPT1 protein conserves all the prenylphosphate- and divalent cation-binding sites that are found in polyprenyltransferases and all the amino acids that are essential for its catalytic activity. Its hydrophobicity profile confirms that HPT is a membrane-bound protein. Chlamydomonas genomic DNA analysis suggests that HPT is encoded by a single gene, HPT1, whose promoter region contains multiple motifs related to regulation by jasmonate, abscisic acid, low temperature and light, and an ATCTA motif presents in genes involved in tocopherol biosynthesis and some photosynthesis-related genes. Expression analysis revealed that HPT1 is strongly regulated by dark and low-temperature. Under the same treatments, α-tocopherol increased in cultures exposed to darkness or heat, whereas γ-tocopherol did it in low temperature. The regulatory expression pattern of HPT1 and the changes of tocopherol abundance support the idea that different tocopherols play specific functions, and suggest a role for γ-tocopherol in the adaptation to growth under low-temperature. Copyright © 2015 Elsevier GmbH. All rights reserved.

  3. Katanin localization requires triplet microtubules in Chlamydomonas reinhardtii.

    Directory of Open Access Journals (Sweden)

    Jessica M Esparza

    Full Text Available Centrioles and basal bodies are essential for a variety of cellular processes that include the recruitment of proteins to these structures for both centrosomal and ciliary function. This recruitment is compromised when centriole/basal body assembly is defective. Mutations that cause basal body assembly defects confer supersensitivity to Taxol. These include bld2, bld10, bld12, uni3, vfl1, vfl2, and vfl3. Flagellar motility mutants do not confer sensitivity with the exception of mutations in the p60 (pf19 and p80 (pf15 subunits of the microtubule severing protein katanin. We have identified additional pf15 and bld2 (ε-tubulin alleles in screens for Taxol sensitivity. Null pf15 and bld2 alleles are viable and are not essential genes in Chlamydomonas. Analysis of double mutant strains with the pf15-3 and bld2-6 null alleles suggests that basal bodies in Chlamydomonas may recruit additional proteins beyond katanin that affect spindle microtubule stability. The bld2-5 allele is a hypomorphic allele and its phenotype is modulated by nutritional cues. Basal bodies in bld2-5 cells are missing proximal ends. The basal body mutants show aberrant localization of an epitope-tagged p80 subunit of katanin. Unlike IFT proteins, katanin p80 does not localize to the transition fibers of the basal bodies based on an analysis of the uni1 mutant as well as the lack of colocalization of katanin p80 with IFT74. We suggest that the triplet microtubules are likely to play a key role in katanin p80 recruitment to the basal body of Chlamydomonas rather than the transition fibers that are needed for IFT localization.

  4. Improving the optimum yield and growth of Chlamydomonas ...

    African Journals Online (AJOL)

    Chlamydomonas reinhardtii CC125 (wild type) and CW15 (cell wall mutants) were feed up on solid and liquid Tris phosphate (TP) media with various concentrations of acetate, glycerol(10-100 mM) or methanol (0.01-718 mM) and cultivated under phototrophic, mixotrophic and heterotrophic conditions. Use of 10 and 35 ...

  5. Basis of genetic adaptation to heavy metal stress in the acidophilic green alga Chlamydomonas acidophila.

    Science.gov (United States)

    Puente-Sánchez, Fernando; Díaz, Silvia; Penacho, Vanessa; Aguilera, Angeles; Olsson, Sanna

    2018-07-01

    To better understand heavy metal tolerance in Chlamydomonas acidophila, an extremophilic green alga, we assembled its transcriptome and measured transcriptomic expression before and after Cd exposure in this and the neutrophilic model microalga Chlamydomonas reinhardtii. Genes possibly related to heavy metal tolerance and detoxification were identified and analyzed as potential key innovations that enable this species to live in an extremely acid habitat with high levels of heavy metals. In addition we provide a data set of single orthologous genes from eight green algal species as a valuable resource for comparative studies including eukaryotic extremophiles. Our results based on differential gene expression, detection of unique genes and analyses of codon usage all indicate that there are important genetic differences in C. acidophila compared to C. reinhardtii. Several efflux family proteins were identified as candidate key genes for adaptation to acid environments. This study suggests for the first time that exposure to cadmium strongly increases transposon expression in green algae, and that oil biosynthesis genes are induced in Chlamydomonas under heavy metal stress. Finally, the comparison of the transcriptomes of several acidophilic and non-acidophilic algae showed that the Chlamydomonas genus is polyphyletic and that acidophilic algae have distinctive aminoacid usage patterns. Copyright © 2018 Elsevier B.V. All rights reserved.

  6. Synthesizing and salvaging NAD: lessons learned from Chlamydomonas reinhardtii.

    Directory of Open Access Journals (Sweden)

    Huawen Lin

    2010-09-01

    Full Text Available The essential coenzyme nicotinamide adenine dinucleotide (NAD+ plays important roles in metabolic reactions and cell regulation in all organisms. Bacteria, fungi, plants, and animals use different pathways to synthesize NAD+. Our molecular and genetic data demonstrate that in the unicellular green alga Chlamydomonas NAD+ is synthesized from aspartate (de novo synthesis, as in plants, or nicotinamide, as in mammals (salvage synthesis. The de novo pathway requires five different enzymes: L-aspartate oxidase (ASO, quinolinate synthetase (QS, quinolate phosphoribosyltransferase (QPT, nicotinate/nicotinamide mononucleotide adenylyltransferase (NMNAT, and NAD+ synthetase (NS. Sequence similarity searches, gene isolation and sequencing of mutant loci indicate that mutations in each enzyme result in a nicotinamide-requiring mutant phenotype in the previously isolated nic mutants. We rescued the mutant phenotype by the introduction of BAC DNA (nic2-1 and nic13-1 or plasmids with cloned genes (nic1-1 and nic15-1 into the mutants. NMNAT, which is also in the de novo pathway, and nicotinamide phosphoribosyltransferase (NAMPT constitute the nicotinamide-dependent salvage pathway. A mutation in NAMPT (npt1-1 has no obvious growth defect and is not nicotinamide-dependent. However, double mutant strains with the npt1-1 mutation and any of the nic mutations are inviable. When the de novo pathway is inactive, the salvage pathway is essential to Chlamydomonas for the synthesis of NAD+. A homolog of the human SIRT6-like gene, SRT2, is upregulated in the NS mutant, which shows a longer vegetative life span than wild-type cells. Our results suggest that Chlamydomonas is an excellent model system to study NAD+ metabolism and cell longevity.

  7. Rapid induction of lipid droplets in Chlamydomonas reinhardtii and Chlorella vulgaris by Brefeldin A.

    Directory of Open Access Journals (Sweden)

    Sangwoo Kim

    Full Text Available Algal lipids are the focus of intensive research because they are potential sources of biodiesel. However, most algae produce neutral lipids only under stress conditions. Here, we report that treatment with Brefeldin A (BFA, a chemical inducer of ER stress, rapidly triggers lipid droplet (LD formation in two different microalgal species, Chlamydomonas reinhardtii and Chlorella vulgaris. LD staining using Nile red revealed that BFA-treated algal cells exhibited many more fluorescent bodies than control cells. Lipid analyses based on thin layer chromatography and gas chromatography revealed that the additional lipids formed upon BFA treatment were mainly triacylglycerols (TAGs. The increase in TAG accumulation was accompanied by a decrease in the betaine lipid diacylglyceryl N,N,N-trimethylhomoserine (DGTS, a major component of the extraplastidic membrane lipids in Chlamydomonas, suggesting that at least some of the TAGs were assembled from the degradation products of membrane lipids. Interestingly, BFA induced TAG accumulation in the Chlamydomonas cells regardless of the presence or absence of an acetate or nitrogen source in the medium. This effect of BFA in Chlamydomonas cells seems to be due to BFA-induced ER stress, as supported by the induction of three homologs of ER stress marker genes by the drug. Together, these results suggest that ER stress rapidly triggers TAG accumulation in two green microalgae, C. reinhardtii and C. vulgaris. A further investigation of the link between ER stress and TAG synthesis may yield an efficient means of producing biofuel from algae.

  8. An improved ARS2-derived nuclear reporter enhances the efficiency and ease of genetic engineering in Chlamydomonas

    DEFF Research Database (Denmark)

    Specht, Elizabeth A; Nour-Eldin, Hussam Hassan; Hoang, Kevin T D

    2015-01-01

    The model alga Chlamydomonas reinhardtii has been used to pioneer genetic engineering techniques for high-value protein and biofuel production from algae. To date, most studies of transgenic Chlamydomonas have utilized the chloroplast genome due to its ease of engineering, with a sizeable suite o...

  9. The nucleobase cation symporter 1 of Chlamydomonas reinhardtii and that of the evolutionarily distant Arabidopsis thaliana display parallel function and establish a plant-specific solute transport profile.

    Science.gov (United States)

    Schein, Jessica R; Hunt, Kevin A; Minton, Janet A; Schultes, Neil P; Mourad, George S

    2013-09-01

    The single cell alga Chlamydomonas reinhardtii is capable of importing purines as nitrogen sources. An analysis of the annotated C. reinhardtii genome reveals at least three distinct gene families encoding for known nucleobase transporters. In this study the solute transport and binding properties for the lone C. reinhardtii nucleobase cation symporter 1 (CrNCS1) are determined through heterologous expression in Saccharomyces cerevisiae. CrNCS1 acts as a transporter of adenine, guanine, uracil and allantoin, sharing similar - but not identical - solute recognition specificity with the evolutionary distant NCS1 from Arabidopsis thaliana. The results suggest that the solute specificity for plant NCS1 occurred early in plant evolution and are distinct from solute transport specificities of single cell fungal NCS1 proteins. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  10. An efficient protocol for the Agrobacterium-mediated genetic transformation of microalga Chlamydomonas reinhardtii.

    Science.gov (United States)

    Pratheesh, P T; Vineetha, M; Kurup, G Muraleedhara

    2014-06-01

    Algal-based recombinant protein production has gained immense interest in recent years. The development of algal expression system was earlier hindered due to the lack of efficient and cost-effective transformation techniques capable of heterologous gene integration and expression. The recent development of Agrobacterium-mediated genetic transformation method is expected to be the ideal solution for these problems. We have developed an efficient protocol for the Agrobacterium-mediated genetic transformation of microalga Chlamydomonas reinhardtii. Pre-treatment of Agrobacterium in TAP induction medium (pH 5.2) containing 100 μM acetosyringone and 1 mM glycine betaine and infection of Chlamydomonas with the induced Agrobacterium greatly improved transformation frequency. This protocol was found to double the number of transgenic events on selection media compared to that of previous reports. PCR was used successfully to amplify fragments of the hpt and GUS genes from transformed cells, while Southern blot confirmed the integration of GUS gene into the genome of C. reinhardtii. RT-PCR, Northern blot and GUS histochemical analyses confirm GUS gene expression in the transgenic cell lines of Chlamydomonas. This protocol provides a quick, efficient, economical and high-frequency transformation method for microalgae.

  11. Expression of a Synthetic Gene for the Major Cytotoxin (Cyt1Aa of Bacillus thuringiensis subsp. israelensis in the Chloroplast of Wild-Type Chlamydomonas

    Directory of Open Access Journals (Sweden)

    Seongjoon Kang

    2018-05-01

    Full Text Available Chlamydomonas reinhardtii (Chlamydomonas strains that are toxic to mosquito larvae because they express chloroplast transgenes that are based on the mosquitocidal proteins of Bacillus thuringiensis subsp. israelensis (Bti could be very useful in mosquito control. Chlamydomonas has several advantages for this approach, including genetic controls not generally available with industrial algae. The Bti toxin is produced by sporulating bacteria and has been used for mosquito control for >30 years without creating highly resistant mosquito populations. The suite of toxins is four main proteins: three Cry proteins and the cytotoxic Cyt1Aa (27 kDa. Cyt1Aa is not very toxic to mosquitoes by itself, but it prevents the development of resistance. The production of Cyt1Aa in other microbes, however, has been challenging due to its affinity for certain membrane phospholipids. Here we report on the production of recombinant Cyt1Aa (rCyt1A in the chloroplast of photosynthetic Chlamydomonas at levels of at least 0.3% total protein. Live cell bioassays demonstrated toxicity of the rCyt1Aa Chlamydomonas to larvae of Aedes aegypti. We also expressed the chloroplast cyt1Aa gene in a wild-type Chlamydomonas strain (21 gr that can grow on nitrate. These results have implications for developing a Chlamydomonas strain that will be toxic to mosquito larvae but will not induce strongly resistant populations.

  12. Metabolic studies of Hg-203 on chlamydomonas reinhardi

    International Nuclear Information System (INIS)

    Macka, W.; Stehlik, G.; Wihlidal, H.; Washuettl, J.; Bancher, E.

    1977-09-01

    Vegetative cultures of the green algae Chlamydomonas reinhardi WT + in the log-phase reduce mercury(II)-nitrate to elemental mercury which is removed from the cell suspension by the stream of gas bubbling through it. Monomethyl and dimethyl mercury as intermediate metabolic compounds are to be excluded, because none of them could be found in the algae, the nutrient medium or the gas phase. (author)

  13. Estimating the annotation error rate of curated GO database sequence annotations

    Directory of Open Access Journals (Sweden)

    Brown Alfred L

    2007-05-01

    Full Text Available Abstract Background Annotations that describe the function of sequences are enormously important to researchers during laboratory investigations and when making computational inferences. However, there has been little investigation into the data quality of sequence function annotations. Here we have developed a new method of estimating the error rate of curated sequence annotations, and applied this to the Gene Ontology (GO sequence database (GOSeqLite. This method involved artificially adding errors to sequence annotations at known rates, and used regression to model the impact on the precision of annotations based on BLAST matched sequences. Results We estimated the error rate of curated GO sequence annotations in the GOSeqLite database (March 2006 at between 28% and 30%. Annotations made without use of sequence similarity based methods (non-ISS had an estimated error rate of between 13% and 18%. Annotations made with the use of sequence similarity methodology (ISS had an estimated error rate of 49%. Conclusion While the overall error rate is reasonably low, it would be prudent to treat all ISS annotations with caution. Electronic annotators that use ISS annotations as the basis of predictions are likely to have higher false prediction rates, and for this reason designers of these systems should consider avoiding ISS annotations where possible. Electronic annotators that use ISS annotations to make predictions should be viewed sceptically. We recommend that curators thoroughly review ISS annotations before accepting them as valid. Overall, users of curated sequence annotations from the GO database should feel assured that they are using a comparatively high quality source of information.

  14. The Chlamydomonas genome reveals the evolution of key animal and plant functions

    Czech Academy of Sciences Publication Activity Database

    Merchant, S.S.; Prochnik, S. E.; Bišová, Kateřina

    2007-01-01

    Roč. 318, - (2007), s. 245-251 ISSN 0036-8075 Institutional research plan: CEZ:AV0Z50200510 Keywords : chlamydomonas reinhardtii * alga * eukaryotic cell Subject RIV: EE - Microbiology, Virology Impact factor: 26.372, year: 2007

  15. Nonthermal effect of microwave irradiation on nitrite uptake in Chlamydomonas reinhardtii

    International Nuclear Information System (INIS)

    Pedrajas, C.; Cotrino, J.

    1989-01-01

    When cells of the unicellular green alga Chlamydomonas reinhardtii were subjected to microwave irradiation at 2.45 GHz, nitrite uptake kinetics still obeyed the Michaelis-Menten equation, the Km of the process remaining constant, whereas V max increased, which indicates an enhanced nonthermal permeability in irradiated cells. (author)

  16. annot8r: GO, EC and KEGG annotation of EST datasets

    Directory of Open Access Journals (Sweden)

    Schmid Ralf

    2008-04-01

    Full Text Available Abstract Background The expressed sequence tag (EST methodology is an attractive option for the generation of sequence data for species for which no completely sequenced genome is available. The annotation and comparative analysis of such datasets poses a formidable challenge for research groups that do not have the bioinformatics infrastructure of major genome sequencing centres. Therefore, there is a need for user-friendly tools to facilitate the annotation of non-model species EST datasets with well-defined ontologies that enable meaningful cross-species comparisons. To address this, we have developed annot8r, a platform for the rapid annotation of EST datasets with GO-terms, EC-numbers and KEGG-pathways. Results annot8r automatically downloads all files relevant for the annotation process and generates a reference database that stores UniProt entries, their associated Gene Ontology (GO, Enzyme Commission (EC and Kyoto Encyclopaedia of Genes and Genomes (KEGG annotation and additional relevant data. For each of GO, EC and KEGG, annot8r extracts a specific sequence subset from the UniProt dataset based on the information stored in the reference database. These three subsets are then formatted for BLAST searches. The user provides the protein or nucleotide sequences to be annotated and annot8r runs BLAST searches against these three subsets. The BLAST results are parsed and the corresponding annotations retrieved from the reference database. The annotations are saved both as flat files and also in a relational postgreSQL results database to facilitate more advanced searches within the results. annot8r is integrated with the PartiGene suite of EST analysis tools. Conclusion annot8r is a tool that assigns GO, EC and KEGG annotations for data sets resulting from EST sequencing projects both rapidly and efficiently. The benefits of an underlying relational database, flexibility and the ease of use of the program make it ideally suited for non

  17. Development of a forward genetic screen to isolate oil mutants in the green microalga Chlamydomonas reinhardtii.

    Science.gov (United States)

    Cagnon, Caroline; Mirabella, Boris; Nguyen, Hoa Mai; Beyly-Adriano, Audrey; Bouvet, Séverine; Cuiné, Stéphan; Beisson, Fred; Peltier, Gilles; Li-Beisson, Yonghua

    2013-12-02

    Oils produced by microalgae are precursors to biodiesel. To achieve a profitable production of biodiesel from microalgae, identification of factors governing oil synthesis and turnover is desirable. The green microalga Chlamydomonas reinhardtii is amenable to genetic analyses and has recently emerged as a model to study oil metabolism. However, a detailed method to isolate various types of oil mutants that is adapted to Chlamydomonas has not been reported. We describe here a forward genetic approach to isolate mutants altered in oil synthesis and turnover from C. reinhardtii. It consists of a three-step screening procedure: a primary screen by flow cytometry of Nile red stained transformants grown in 96-deep-well plates under three sequential conditions (presence of nitrogen, then absence of nitrogen, followed by oil remobilization); a confirmation step using Nile red stained biological triplicates; and a validation step consisting of the quantification by thin layer chromatography of oil content of selected strains. Thirty-one mutants were isolated by screening 1,800 transformants generated by random insertional mutagenesis (1.7%). Five showed increased oil accumulation under the nitrogen-replete condition and 13 had altered oil content under nitrogen-depletion. All mutants were affected in oil remobilization. This study demonstrates that various types of oil mutants can be isolated in Chlamydomonas based on the method set-up here, including mutants accumulating oil under optimal biomass growth. The strategy conceived and the protocol set-up should be applicable to other microalgal species such as Nannochloropsis and Chlorella, thus serving as a useful tool in Chlamydomonas oil research and algal biotechnology.

  18. Robust Transgene Expression from Bicistronic mRNA in the Green Alga Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Masayuki Onishi

    2016-12-01

    Full Text Available The unicellular green alga Chlamydomonas reinhardtii is a model organism that provides an opportunity to understand the evolution and functional biology of the lineage that includes the land plants, as well as aspects of the fundamental core biology conserved throughout the eukaryotic phylogeny. Although many tools are available to facilitate genetic, molecular biological, biochemical, and cell biological studies in Chlamydomonas, expression of unselected transgenes of interest (GOIs has been challenging. In most methods used previously, the GOI and a selectable marker are expressed from two separate mRNAs, so that their concomitant expression is not guaranteed. In this study, we developed constructs that allow expression of an upstream GOI and downstream selectable marker from a single bicistronic mRNA. Although this approach in other systems has typically required a translation-enhancing element such as an internal ribosome entry site for the downstream marker, we found that a short stretch of unstructured junction sequence was sufficient to obtain adequate expression of the downstream gene, presumably through post-termination reinitiation. With this system, we obtained robust expression of both endogenous and heterologous GOIs, including fluorescent proteins and tagged fusion proteins, in the vast majority of transformants, thus eliminating the need for tedious secondary screening for GOI-expressing transformants. This improved efficiency should greatly facilitate a variety of genetic and cell-biological studies in Chlamydomonas and also enable new applications such as expression-based screens and large-scale production of foreign proteins.

  19. Engineering the chloroplast targeted malarial vaccine antigens in Chlamydomonas starch granules.

    Directory of Open Access Journals (Sweden)

    David Dauvillée

    2010-12-01

    Full Text Available Malaria, an Anopheles-borne parasitic disease, remains a major global health problem causing illness and death that disproportionately affects developing countries. Despite the incidence of malaria, which remains one of the most severe infections of human populations, there is no licensed vaccine against this life-threatening disease. In this context, we decided to explore the expression of Plasmodium vaccine antigens fused to the granule bound starch synthase (GBSS, the major protein associated to the starch matrix in all starch-accumulating plants and algae such as Chlamydomonas reinhardtii.We describe the development of genetically engineered starch granules containing plasmodial vaccine candidate antigens produced in the unicellular green algae Chlamydomonas reinhardtii. We show that the C-terminal domains of proteins from the rodent Plasmodium species, Plasmodium berghei Apical Major Antigen AMA1, or Major Surface Protein MSP1 fused to the algal granule bound starch synthase (GBSS are efficiently expressed and bound to the polysaccharide matrix. Mice were either immunized intraperitoneally with the engineered starch particles and Freund adjuvant, or fed with the engineered particles co-delivered with the mucosal adjuvant, and challenged intraperitoneally with a lethal inoculum of P. Berghei. Both experimental strategies led to a significantly reduced parasitemia with an extension of life span including complete cure for intraperitoneal delivery as assessed by negative blood thin smears. In the case of the starch bound P. falciparum GBSS-MSP1 fusion protein, the immune sera or purified immunoglobulin G of mice immunized with the corresponding starch strongly inhibited in vitro the intra-erythrocytic asexual development of the most human deadly plasmodial species.This novel system paves the way for the production of clinically relevant plasmodial antigens as algal starch-based particles designated herein as amylosomes, demonstrating that

  20. Engineering the chloroplast targeted malarial vaccine antigens in Chlamydomonas starch granules.

    Science.gov (United States)

    Dauvillée, David; Delhaye, Stéphane; Gruyer, Sébastien; Slomianny, Christian; Moretz, Samuel E; d'Hulst, Christophe; Long, Carole A; Ball, Steven G; Tomavo, Stanislas

    2010-12-15

    Malaria, an Anopheles-borne parasitic disease, remains a major global health problem causing illness and death that disproportionately affects developing countries. Despite the incidence of malaria, which remains one of the most severe infections of human populations, there is no licensed vaccine against this life-threatening disease. In this context, we decided to explore the expression of Plasmodium vaccine antigens fused to the granule bound starch synthase (GBSS), the major protein associated to the starch matrix in all starch-accumulating plants and algae such as Chlamydomonas reinhardtii. We describe the development of genetically engineered starch granules containing plasmodial vaccine candidate antigens produced in the unicellular green algae Chlamydomonas reinhardtii. We show that the C-terminal domains of proteins from the rodent Plasmodium species, Plasmodium berghei Apical Major Antigen AMA1, or Major Surface Protein MSP1 fused to the algal granule bound starch synthase (GBSS) are efficiently expressed and bound to the polysaccharide matrix. Mice were either immunized intraperitoneally with the engineered starch particles and Freund adjuvant, or fed with the engineered particles co-delivered with the mucosal adjuvant, and challenged intraperitoneally with a lethal inoculum of P. Berghei. Both experimental strategies led to a significantly reduced parasitemia with an extension of life span including complete cure for intraperitoneal delivery as assessed by negative blood thin smears. In the case of the starch bound P. falciparum GBSS-MSP1 fusion protein, the immune sera or purified immunoglobulin G of mice immunized with the corresponding starch strongly inhibited in vitro the intra-erythrocytic asexual development of the most human deadly plasmodial species. This novel system paves the way for the production of clinically relevant plasmodial antigens as algal starch-based particles designated herein as amylosomes, demonstrating that efficient production

  1. Ubiquitous Annotation Systems

    DEFF Research Database (Denmark)

    Hansen, Frank Allan

    2006-01-01

    Ubiquitous annotation systems allow users to annotate physical places, objects, and persons with digital information. Especially in the field of location based information systems much work has been done to implement adaptive and context-aware systems, but few efforts have focused on the general...... requirements for linking information to objects in both physical and digital space. This paper surveys annotation techniques from open hypermedia systems, Web based annotation systems, and mobile and augmented reality systems to illustrate different approaches to four central challenges ubiquitous annotation...... systems have to deal with: anchoring, structuring, presentation, and authoring. Through a number of examples each challenge is discussed and HyCon, a context-aware hypermedia framework developed at the University of Aarhus, Denmark, is used to illustrate an integrated approach to ubiquitous annotations...

  2. Evidences of oxidative stress during hydrogen photoproduction in sulfur-deprived cultures of Chlamydomonas reinhardtii

    Czech Academy of Sciences Publication Activity Database

    Sáens, M. E.; Bišová, Kateřina; Touloupakis, E.; Faraloni, C.; Dario Di Marzio, W.; Torzillo, G.

    2015-01-01

    Roč. 40, č. 30 (2015), s. 10410-10417 ISSN 0360-3199 Institutional support: RVO:61388971 Keywords : Oxidative stress * Chlamydomonas reinhardtii * H-2 production Subject RIV: EE - Microbiology, Virology Impact factor: 3.205, year: 2015

  3. Toxicity of PAMAM dendrimers to Chlamydomonas reinhardtii

    International Nuclear Information System (INIS)

    Petit, Anne-Noelle; Eullaffroy, Philippe; Debenest, Timothee; Gagne, Francois

    2010-01-01

    In recent decades, a new class of polymeric materials, PAMAM dendrimers, has attracted marked interest owing to their unique nanoscopic architecture and their hopeful perspectives in nanomedicine and therapeutics. However, the potential release of dendrimers into the aquatic environment raises the issue about their toxicity on aquatic organisms. Our investigation sought to estimate the toxicity of cationic PAMAM dendrimers on the green alga, Chlamydomonas reinhardtii. Algal cultures were exposed to different concentrations (0.3-10 mg L -1 ) of low dendrimer generations (G2, G4 and G5) for 72 h. Potential adverse effects on Chlamydomonas were assessed using esterase activity (cell viability), photosynthetic O 2 evolution, pigments content and chlorophyll a fluorescence transient. According to the median inhibitory concentration (IC 50 ) appraised from esterase activity, toxicity on cell viability decreased with dendrimer generation number (2, 3 and 5 mg L -1 for G2, G4 and G5 dendrimers, respectively). Moreover, the three generations of dendrimers did not induce the same changes in the photosynthetic metabolism of the green alga. O 2 evolution was stimulated in cultures exposed to the lowest generations tested (i.e. G2 and G4) whereas no significant effects were observed with G5. In addition, total chlorophyll content was increased after G2 treatment at 2.5 mg L -1 . Finally, G2 and G4 had positive effects on photosystem II (PSII): the amount of active PSII reaction centers, the primary charge separation and the electron transport between Q A and Q B were all increased inducing activation of the photosynthetic electron transport chain. These changes resulted in stimulation of full photosynthetic performance.

  4. Comparison of the resistance of two Chlamydomonas reinhardii strains with different β- and carotene content

    International Nuclear Information System (INIS)

    Gikoshvili, T.I.; Vilenchik, M.M.; Ladygin, V.G.; Kuzin, A.M.

    1989-01-01

    Radiosensitivity of Chlamydomonas reinhardii strain containing considerable amount of ξ-carotene is lower than that of the wild strain. This indicates that ξ-caotene is oneof the natural radioresistance factors

  5. Experimental Definition and Validation of Protein Coding Transcripts in Chlamydomonas reinhardtii

    Energy Technology Data Exchange (ETDEWEB)

    Kourosh Salehi-Ashtiani; Jason A. Papin

    2012-01-13

    Algal fuel sources promise unsurpassed yields in a carbon neutral manner that minimizes resource competition between agriculture and fuel crops. Many challenges must be addressed before algal biofuels can be accepted as a component of the fossil fuel replacement strategy. One significant challenge is that the cost of algal fuel production must become competitive with existing fuel alternatives. Algal biofuel production presents the opportunity to fine-tune microbial metabolic machinery for an optimal blend of biomass constituents and desired fuel molecules. Genome-scale model-driven algal metabolic design promises to facilitate both goals by directing the utilization of metabolites in the complex, interconnected metabolic networks to optimize production of the compounds of interest. Using Chlamydomonas reinhardtii as a model, we developed a systems-level methodology bridging metabolic network reconstruction with annotation and experimental verification of enzyme encoding open reading frames. We reconstructed a genome-scale metabolic network for this alga and devised a novel light-modeling approach that enables quantitative growth prediction for a given light source, resolving wavelength and photon flux. We experimentally verified transcripts accounted for in the network and physiologically validated model function through simulation and generation of new experimental growth data, providing high confidence in network contents and predictive applications. The network offers insight into algal metabolism and potential for genetic engineering and efficient light source design, a pioneering resource for studying light-driven metabolism and quantitative systems biology. Our approach to generate a predictive metabolic model integrated with cloned open reading frames, provides a cost-effective platform to generate metabolic engineering resources. While the generated resources are specific to algal systems, the approach that we have developed is not specific to algae and

  6. Toxicity of PAMAM dendrimers to Chlamydomonas reinhardtii

    Energy Technology Data Exchange (ETDEWEB)

    Petit, Anne-Noelle, E-mail: anne-noelle.petit@ec.gc.ca [Environment Canada, 105 McGill Street, Montreal, Quebec H2Y 2E7 (Canada); Eullaffroy, Philippe [Laboratoire Plantes, Pesticides et Developpement Durable, EA 2069, URVVC, BP 1039, Universite de Reims Champagne-Ardenne, 51687 Reims Cedex 2 (France); Debenest, Timothee; Gagne, Francois [Environment Canada, 105 McGill Street, Montreal, Quebec H2Y 2E7 (Canada)

    2010-10-15

    In recent decades, a new class of polymeric materials, PAMAM dendrimers, has attracted marked interest owing to their unique nanoscopic architecture and their hopeful perspectives in nanomedicine and therapeutics. However, the potential release of dendrimers into the aquatic environment raises the issue about their toxicity on aquatic organisms. Our investigation sought to estimate the toxicity of cationic PAMAM dendrimers on the green alga, Chlamydomonas reinhardtii. Algal cultures were exposed to different concentrations (0.3-10 mg L{sup -1}) of low dendrimer generations (G2, G4 and G5) for 72 h. Potential adverse effects on Chlamydomonas were assessed using esterase activity (cell viability), photosynthetic O{sub 2} evolution, pigments content and chlorophyll a fluorescence transient. According to the median inhibitory concentration (IC{sub 50}) appraised from esterase activity, toxicity on cell viability decreased with dendrimer generation number (2, 3 and 5 mg L{sup -1} for G2, G4 and G5 dendrimers, respectively). Moreover, the three generations of dendrimers did not induce the same changes in the photosynthetic metabolism of the green alga. O{sub 2} evolution was stimulated in cultures exposed to the lowest generations tested (i.e. G2 and G4) whereas no significant effects were observed with G5. In addition, total chlorophyll content was increased after G2 treatment at 2.5 mg L{sup -1}. Finally, G2 and G4 had positive effects on photosystem II (PSII): the amount of active PSII reaction centers, the primary charge separation and the electron transport between Q{sub A} and Q{sub B} were all increased inducing activation of the photosynthetic electron transport chain. These changes resulted in stimulation of full photosynthetic performance.

  7. Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop.

    Science.gov (United States)

    Brister, James Rodney; Bao, Yiming; Kuiken, Carla; Lefkowitz, Elliot J; Le Mercier, Philippe; Leplae, Raphael; Madupu, Ramana; Scheuermann, Richard H; Schobel, Seth; Seto, Donald; Shrivastava, Susmita; Sterk, Peter; Zeng, Qiandong; Klimke, William; Tatusova, Tatiana

    2010-10-01

    Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world's biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.

  8. Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop

    Directory of Open Access Journals (Sweden)

    Qiandong Zeng

    2010-10-01

    Full Text Available Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world’s biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.

  9. Biofiksasi CO2 Oleh Mikroalga Chlamydomonas sp dalam Photobioreaktor Tubular

    Directory of Open Access Journals (Sweden)

    Hadiyanto Hadiyanto

    2014-05-01

    Full Text Available Mikroalga memiliki potensi dalam membiofiksasi CO2 dan dapat dimanfaatkan untuk mengurangi kadar CO2 dalam gas pencemar. Pertumbuhan mikroalga sangat dipengaruhi oleh konsentrasi gas CO2 di dalam gas pencemar. Tujuan penelitian ini adalah untuk mengeetahui kemampuan mikroalga Chlamydomonas sp yang dikultivasi dalam photobioreaktor tubular dalam penyerapan gas CO2 serta untuk mengetahui konsentrasi maksimum gas CO2 dalam umpan untuk memproduksi biomasa mikroalga yang optimal. Percobaan dilakukan dnegan memvariasi laju alir dari 0.03 -0.071 L/menit dan konsentrasi CO2 dalam umpan 10-30%. Hasil penelitian menunjukkan bahwa biomasa mikroalga dapat diproduksi dengan maksimal dengan konsentrasi gas CO2 20% dengan laju alir 0.07 L/min. Semakin tinggi laju alir maka produksi biomasa alga semakin besar. Kecepatan pertumbuhan alga maksimum terjadi pada 0.31 /hari. Pada konsentrasi gas CO2 30%, terjadi substrate inhibition yang disebabkan carbon dalam bentuk ion bicarbonate tidak dapat dikonsumsi lagi di dalam kultur alga. Kata kunci : Mikroalga, chlamydomonas sp, biofiksasi CO2, biogas Abstract Microalgae have a potential for CO2 biofixation and therefore can be used to reduce the CO2 concentration in the gas pollutants. Moreover, microalgae growth is strongly affected by the concentration of CO2 in the exhaust gas pollutants. The objective of this research was to investigate the ability of microalgae Chlamydomonas sp which was cultivated in a tubular photobioreactor for CO2 absorption as well as to determine the maximum concentration of CO2 in the feed gas to obtain optimum microalgae biomass. The experiments were performed by varying the gas flow rate of 0.03 -0.071 L / min and the concentration of CO2 in the feed of 10-30%. The results showed that the maximum biomass of microalgae can be produced with CO2 concentration of 20% vol with a flow rate of 0.07 L / min. The result also showed that increasing the gas flow rate, the greater of the production of

  10. Bioenergetics of growth and lipid production in Chlamydomonas reinhardtii

    International Nuclear Information System (INIS)

    Küçük, Kübra; Tevatia, Rahul; Sorgüven, Esra; Demirel, Yaşar; Özilgen, Mustafa

    2015-01-01

    The study of thermodynamic aspects of the lipid, e.g., raw material for biodiesel, production in microalgae is important, as the non-lipid producing biological activities of the algal cultivation consume part of the solar energy captured during photosynthesis in expense of the exergetic efficiency of the lipid production process. The cultivation of Chlamydomonas reinhardtii (a unicellular biflagellate fresh-water microalga) is modeled as a three-step chemical mechanism representing growth, respiration, and lipid production. Further, the comprehensive thermodynamic analysis of these mechanisms is presented. The cumulative degree of perfection of the cellular proliferation, after excluding the lipid synthesis, fluctuates with no trend around 0.52 ± 0.19. The exergy analysis has indicated that C. reinhardtii prefers to maximize the lipid production when it is difficult to generate new cells. Under batch production of algal biomass, the highest heat and exergy loss per unit biomass production are accountable under the most favorable biological growth conditions, whereas the highest exergetic efficiency of the lipid production accounted under the least favorable growth conditions, which is in line with the previous studies. - Highlights: • Biomass, lipid production and respiration modeled as three-step chemical reaction. • CDP (cumulative degree of perfection) is calculated based on the model. • The CDP of the algae, after excluding the lipids, is about 0.52 ± 0.19. • Chlamydomonas reinhardtii maximized lipid production when it was difficult to grow

  11. Triclosan-induced transcriptional and biochemical alterations in the freshwater green algae Chlamydomonas reinhardtii

    NARCIS (Netherlands)

    Pan, Chang Gui; Peng, Feng-Jiao; Shi, Wen Jun; Hu, Li Xin; Wei, Xiao Dong; Ying, Guang Guo

    2018-01-01

    Triclosan (TCS) is an antibacterial and antifungal agent widely used in personal care products (PCPs). We investigated the effects of TCS (20 μg/L, 100 μg/L and 500 μg/L) on Chlamydomonas reinhardtii by measuring the algal growth, chlorophyll content, lipid peroxidation, and transcription of the

  12. Trophic transfer of gold nanoparticles from Euglena gracilis or Chlamydomonas reinhardtii to Daphnia magna

    International Nuclear Information System (INIS)

    Lee, Woo-Mi; Yoon, Sung-Ji; Shin, Yu-Jin; An, Youn-Joo

    2015-01-01

    Understanding the trophic transfer of nanoparticles (NPs) is important because NPs are small enough to easily penetrate into organisms. In this study, we evaluated the trophic transfer of gold NPs (AuNPs) within the aquatic food chain. We observed AuNPs transfer from 2 species of primary producers (Chlamydomonas reinhardtii or Euglena gracilis) to the primary consumer (Daphnia magna). Also, bioaccumulation of AuNPs in E. gracilis was higher than that in C. reinhardtii. The reasons for the difference in Au accumulation may be the physical structure of these organisms, and the surface area that is available for interaction with NPs. C. reinhardtii has a cell wall that may act as a barrier to the penetration of NPs. The size of E. gracilis is larger than that of C. reinhardtii. This study demonstrates the trophic transfer of AuNPs from a general producer to a consumer in an aquatic environment. - Highlights: • This study evaluated the trophic transfer of AuNPs in an aquatic food chain. • Chlamydomonas reinhardtii and Euglena gracilis were selected as the primary producers. • Daphnia magna was used as the primary consumer. • The bioaccumulation of AuNPs in E. gracilis was higher than that in C. reinhardtii. • AuNPs were transferred from C. reinhardtii and E. gracilis to D. magna. - Gold nanoparticles can transfer from primary producers (Chlamydomonas reinhardtii or Euglena gracilis) to the primary consumer (Daphnia magna) in an aquatic environment

  13. Semi-Semantic Annotation: A guideline for the URDU.KON-TB treebank POS annotation

    Directory of Open Access Journals (Sweden)

    Qaiser ABBAS

    2016-12-01

    Full Text Available This work elaborates the semi-semantic part of speech annotation guidelines for the URDU.KON-TB treebank: an annotated corpus. A hierarchical annotation scheme was designed to label the part of speech and then applied on the corpus. This raw corpus was collected from the Urdu Wikipedia and the Jang newspaper and then annotated with the proposed semi-semantic part of speech labels. The corpus contains text of local & international news, social stories, sports, culture, finance, religion, traveling, etc. This exercise finally contributed a part of speech annotation to the URDU.KON-TB treebank. Twenty-two main part of speech categories are divided into subcategories, which conclude the morphological, and semantical information encoded in it. This article reports the annotation guidelines in major; however, it also briefs the development of the URDU.KON-TB treebank, which includes the raw corpus collection, designing & employment of annotation scheme and finally, its statistical evaluation and results. The guidelines presented as follows, will be useful for linguistic community to annotate the sentences not only for the national language Urdu but for the other indigenous languages like Punjab, Sindhi, Pashto, etc., as well.

  14. Gene silencing of stearoyl-ACP desaturase enhances the stearic acid content in Chlamydomonas reinhardtii

    NARCIS (Netherlands)

    Jaeger, de L.; Springer, J.; Wolbert, E.J.H.; Martens, D.E.; Eggink, G.; Wijffels, R.H.

    2017-01-01

    In this study, stearoyl-ACP desaturase (SAD), the enzyme that converts stearic acid into oleic acid, is silenced by artificial microRNA in the green microalga Chlamydomonas reinhardtii. Two different constructs, which target different positions on the mRNA of stearoyl-ACP desaturase, were tested.

  15. BioAnnote: a software platform for annotating biomedical documents with application in medical learning environments.

    Science.gov (United States)

    López-Fernández, H; Reboiro-Jato, M; Glez-Peña, D; Aparicio, F; Gachet, D; Buenaga, M; Fdez-Riverola, F

    2013-07-01

    Automatic term annotation from biomedical documents and external information linking are becoming a necessary prerequisite in modern computer-aided medical learning systems. In this context, this paper presents BioAnnote, a flexible and extensible open-source platform for automatically annotating biomedical resources. Apart from other valuable features, the software platform includes (i) a rich client enabling users to annotate multiple documents in a user friendly environment, (ii) an extensible and embeddable annotation meta-server allowing for the annotation of documents with local or remote vocabularies and (iii) a simple client/server protocol which facilitates the use of our meta-server from any other third-party application. In addition, BioAnnote implements a powerful scripting engine able to perform advanced batch annotations. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  16. Protocol: methodology for chromatin immunoprecipitation (ChIP in Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Strenkert Daniela

    2011-11-01

    Full Text Available Abstract We report on a detailed chromatin immunoprecipitation (ChIP protocol for the unicellular green alga Chlamydomonas reinhardtii. The protocol is suitable for the analysis of nucleosome occupancy, histone modifications and transcription factor binding sites at the level of mononucleosomes for targeted and genome-wide studies. We describe the optimization of conditions for crosslinking, chromatin fragmentation and antibody titer determination and provide recommendations and an example for the normalization of ChIP results as determined by real-time PCR.

  17. Chado controller: advanced annotation management with a community annotation system.

    Science.gov (United States)

    Guignon, Valentin; Droc, Gaëtan; Alaux, Michael; Baurens, Franc-Christophe; Garsmeur, Olivier; Poiron, Claire; Carver, Tim; Rouard, Mathieu; Bocs, Stéphanie

    2012-04-01

    We developed a controller that is compliant with the Chado database schema, GBrowse and genome annotation-editing tools such as Artemis and Apollo. It enables the management of public and private data, monitors manual annotation (with controlled vocabularies, structural and functional annotation controls) and stores versions of annotation for all modified features. The Chado controller uses PostgreSQL and Perl. The Chado Controller package is available for download at http://www.gnpannot.org/content/chado-controller and runs on any Unix-like operating system, and documentation is available at http://www.gnpannot.org/content/chado-controller-doc The system can be tested using the GNPAnnot Sandbox at http://www.gnpannot.org/content/gnpannot-sandbox-form valentin.guignon@cirad.fr; stephanie.sidibe-bocs@cirad.fr Supplementary data are available at Bioinformatics online.

  18. Biofixation of Carbon dioxide by Chlamydomonas sp. in a Tubular Photobioreactor

    Directory of Open Access Journals (Sweden)

    H Hadiyanto

    2012-02-01

    Full Text Available The biogas production from anaerobic digestion is a potential fuel for power generators application, if biogas can be upgraded to the same standards as fossil natural gas by CO2, H2S, and other non-combustible component removal. Microalgae Chlamydomonas sp. has potency to biofix the carbon dioxide and can be used as an additional food ingredient. The variations of flow rate and carbon dioxide concentration in the process resulting different value of biomass production and carbon dioxide biofixation. Biomass production at 40% carbon dioxide concentration obtained 5.685 gr/dm3 at 10% carbon dioxide concentration obtained 4.892 gr/dm3. The greatest value of carbon dioxide absorption occurs at a 40% concentration amounting to 12.09%. The rate of growth and productivity of microalgae tend to rise in 10% and 20% (%v carbon dioxide concentration, but began started a constant at 30% and 40% (%v carbon dioxide concentration. Biomass production tends to increase in light conditions while a constant in dark conditions. This study used Chlamydomonas sp. as media culture and performed on bubble column and tubular reactor with 6 litres of culture medium at a temperature of 28oC and atmospheric pressure.

  19. Comparative genomics in Chlamydomonas and Plasmodium identifies an ancient nuclear envelope protein family essential for sexual reproduction in protists, fungi, plants, and vertebrates.

    Science.gov (United States)

    Ning, Jue; Otto, Thomas D; Pfander, Claudia; Schwach, Frank; Brochet, Mathieu; Bushell, Ellen; Goulding, David; Sanders, Mandy; Lefebvre, Paul A; Pei, Jimin; Grishin, Nick V; Vanderlaan, Gary; Billker, Oliver; Snell, William J

    2013-05-15

    Fertilization is a crucial yet poorly characterized event in eukaryotes. Our previous discovery that the broadly conserved protein HAP2 (GCS1) functioned in gamete membrane fusion in the unicellular green alga Chlamydomonas and the malaria pathogen Plasmodium led us to exploit the rare biological phenomenon of isogamy in Chlamydomonas in a comparative transcriptomics strategy to uncover additional conserved sexual reproduction genes. All previously identified Chlamydomonas fertilization-essential genes fell into related clusters based on their expression patterns. Out of several conserved genes in a minus gamete cluster, we focused on Cre06.g280600, an ortholog of the fertilization-related Arabidopsis GEX1. Gene disruption, cell biological, and immunolocalization studies show that CrGEX1 functions in nuclear fusion in Chlamydomonas. Moreover, CrGEX1 and its Plasmodium ortholog, PBANKA_113980, are essential for production of viable meiotic progeny in both organisms and thus for mosquito transmission of malaria. Remarkably, we discovered that the genes are members of a large, previously unrecognized family whose first-characterized member, KAR5, is essential for nuclear fusion during yeast sexual reproduction. Our comparative transcriptomics approach provides a new resource for studying sexual development and demonstrates that exploiting the data can lead to the discovery of novel biology that is conserved across distant taxa.

  20. Meta-analysis of crowdsourced data compendia suggests pan-disease transcriptional signatures of autoimmunity [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    William W. Lau

    2016-12-01

    Full Text Available Background: The proliferation of publicly accessible large-scale biological data together with increasing availability of bioinformatics tools have the potential to transform biomedical research. Here we report a crowdsourcing Jamboree that explored whether a team of volunteer biologists without formal bioinformatics training could use OMiCC, a crowdsourcing web platform that facilitates the reuse and (meta- analysis of public gene expression data, to compile and annotate gene expression data, and design comparisons between disease and control sample groups. Methods: The Jamboree focused on several common human autoimmune diseases, including systemic lupus erythematosus (SLE, multiple sclerosis (MS, type I diabetes (DM1, and rheumatoid arthritis (RA, and the corresponding mouse models. Meta-analyses were performed in OMiCC using comparisons constructed by the participants to identify 1 gene expression signatures for each disease (disease versus healthy controls at the gene expression and biological pathway levels, 2 conserved signatures across all diseases within each species (pan-disease signatures, and 3 conserved signatures between species for each disease and across all diseases (cross-species signatures. Results: A large number of differentially expressed genes were identified for each disease based on meta-analysis, with observed overlap among diseases both within and across species. Gene set/pathway enrichment of upregulated genes suggested conserved signatures (e.g., interferon across all human and mouse conditions. Conclusions: Our Jamboree exercise provides evidence that when enabled by appropriate tools, a "crowd" of biologists can work together to accelerate the pace by which the increasingly large amounts of public data can be reused and meta-analyzed for generating and testing hypotheses. Our encouraging experience suggests that a similar crowdsourcing approach can be used to explore other biological questions.

  1. ChlamyCyc - a comprehensive database and web-portal centered on _Chlamydomonas reinhardtii_

    OpenAIRE

    Jan-Ole Christian; Patrick May; Stefan Kempa; Dirk Walther

    2009-01-01

    *Background* - The unicellular green alga _Chlamydomonas reinhardtii_ is an important eukaryotic model organism for the study of photosynthesis and growth, as well as flagella development and other cellular processes. In the era of high-throughput technologies there is an imperative need to integrate large-scale data sets from high-throughput experimental techniques using computational methods and database resources to provide comprehensive information about the whole cellular system of a sin...

  2. Tracking the elusive 5' exonuclease activity of Chlamydomonas reinhardtii RNase J.

    Science.gov (United States)

    Liponska, Anna; Jamalli, Ailar; Kuras, Richard; Suay, Loreto; Garbe, Enrico; Wollman, Francis-André; Laalami, Soumaya; Putzer, Harald

    2018-04-01

    Chlamydomonas RNase J is the first member of this enzyme family that has endo- but no intrinsic 5' exoribonucleolytic activity. This questions its proposed role in chloroplast mRNA maturation. RNA maturation and stability in the chloroplast are controlled by nuclear-encoded ribonucleases and RNA binding proteins. Notably, mRNA 5' end maturation is thought to be achieved by the combined action of a 5' exoribonuclease and specific pentatricopeptide repeat proteins (PPR) that block the progression of the nuclease. In Arabidopsis the 5' exo- and endoribonuclease RNase J has been implicated in this process. Here, we verified the chloroplast localization of the orthologous Chlamydomonas (Cr) RNase J and studied its activity, both in vitro and in vivo in a heterologous B. subtilis system. Our data show that Cr RNase J has endo- but no significant intrinsic 5' exonuclease activity that would be compatible with its proposed role in mRNA maturation. This is the first example of an RNase J ortholog that does not possess a 5' exonuclease activity. A yeast two-hybrid screen revealed a number of potential interaction partners but three of the most promising candidates tested, failed to induce the latent exonuclease activity of Cr RNase J. We still favor the hypothesis that Cr RNase J plays an important role in RNA metabolism, but our findings suggest that it rather acts as an endoribonuclease in the chloroplast.

  3. MixtureTree annotator: a program for automatic colorization and visual annotation of MixtureTree.

    Directory of Open Access Journals (Sweden)

    Shu-Chuan Chen

    Full Text Available The MixtureTree Annotator, written in JAVA, allows the user to automatically color any phylogenetic tree in Newick format generated from any phylogeny reconstruction program and output the Nexus file. By providing the ability to automatically color the tree by sequence name, the MixtureTree Annotator provides a unique advantage over any other programs which perform a similar function. In addition, the MixtureTree Annotator is the only package that can efficiently annotate the output produced by MixtureTree with mutation information and coalescent time information. In order to visualize the resulting output file, a modified version of FigTree is used. Certain popular methods, which lack good built-in visualization tools, for example, MEGA, Mesquite, PHY-FI, TreeView, treeGraph and Geneious, may give results with human errors due to either manually adding colors to each node or with other limitations, for example only using color based on a number, such as branch length, or by taxonomy. In addition to allowing the user to automatically color any given Newick tree by sequence name, the MixtureTree Annotator is the only method that allows the user to automatically annotate the resulting tree created by the MixtureTree program. The MixtureTree Annotator is fast and easy-to-use, while still allowing the user full control over the coloring and annotating process.

  4. Relation between hydrogen production and photosynthesis in the green algae Chlamydomonas reinhardtii

    OpenAIRE

    Basu, Alex

    2015-01-01

    The modernized world is over-consuming low-cost energy sources that strongly contributes to pollution and environmental stress. As a consequence, the interest for environmentally friendly alternatives has increased immensely. One such alternative is the use of solar energy and water as a raw material to produce biohydrogen through the process of photosynthetic water splitting. In this work, the relation between H2-production and photosynthesis in the green algae Chlamydomonas reinhardtii was ...

  5. Refactoring the six-gene photosystem II core in the chloroplast of the green algae Chlamydomonas reinhardtii

    DEFF Research Database (Denmark)

    Gimpel, Javier A.; Nour-Eldin, Hussam Hassan; Scranton, Melissa A.

    2016-01-01

    production, particularly under specific environmental conditions. PSII is a complex multisubunit enzyme with strong interdependence among its components. In this work, we have deleted the six core genes of PSII in the eukaryotic alga Chlamydomonas reinhardtii and refactored them in a single DNA construct...

  6. Evaluating Hierarchical Structure in Music Annotations.

    Science.gov (United States)

    McFee, Brian; Nieto, Oriol; Farbood, Morwaread M; Bello, Juan Pablo

    2017-01-01

    Music exhibits structure at multiple scales, ranging from motifs to large-scale functional components. When inferring the structure of a piece, different listeners may attend to different temporal scales, which can result in disagreements when they describe the same piece. In the field of music informatics research (MIR), it is common to use corpora annotated with structural boundaries at different levels. By quantifying disagreements between multiple annotators, previous research has yielded several insights relevant to the study of music cognition. First, annotators tend to agree when structural boundaries are ambiguous. Second, this ambiguity seems to depend on musical features, time scale, and genre. Furthermore, it is possible to tune current annotation evaluation metrics to better align with these perceptual differences. However, previous work has not directly analyzed the effects of hierarchical structure because the existing methods for comparing structural annotations are designed for "flat" descriptions, and do not readily generalize to hierarchical annotations. In this paper, we extend and generalize previous work on the evaluation of hierarchical descriptions of musical structure. We derive an evaluation metric which can compare hierarchical annotations holistically across multiple levels. sing this metric, we investigate inter-annotator agreement on the multilevel annotations of two different music corpora, investigate the influence of acoustic properties on hierarchical annotations, and evaluate existing hierarchical segmentation algorithms against the distribution of inter-annotator agreement.

  7. Evaluating Hierarchical Structure in Music Annotations

    Directory of Open Access Journals (Sweden)

    Brian McFee

    2017-08-01

    Full Text Available Music exhibits structure at multiple scales, ranging from motifs to large-scale functional components. When inferring the structure of a piece, different listeners may attend to different temporal scales, which can result in disagreements when they describe the same piece. In the field of music informatics research (MIR, it is common to use corpora annotated with structural boundaries at different levels. By quantifying disagreements between multiple annotators, previous research has yielded several insights relevant to the study of music cognition. First, annotators tend to agree when structural boundaries are ambiguous. Second, this ambiguity seems to depend on musical features, time scale, and genre. Furthermore, it is possible to tune current annotation evaluation metrics to better align with these perceptual differences. However, previous work has not directly analyzed the effects of hierarchical structure because the existing methods for comparing structural annotations are designed for “flat” descriptions, and do not readily generalize to hierarchical annotations. In this paper, we extend and generalize previous work on the evaluation of hierarchical descriptions of musical structure. We derive an evaluation metric which can compare hierarchical annotations holistically across multiple levels. sing this metric, we investigate inter-annotator agreement on the multilevel annotations of two different music corpora, investigate the influence of acoustic properties on hierarchical annotations, and evaluate existing hierarchical segmentation algorithms against the distribution of inter-annotator agreement.

  8. Pipeline to upgrade the genome annotations

    Directory of Open Access Journals (Sweden)

    Lijin K. Gopi

    2017-12-01

    Full Text Available Current era of functional genomics is enriched with good quality draft genomes and annotations for many thousands of species and varieties with the support of the advancements in the next generation sequencing technologies (NGS. Around 25,250 genomes, of the organisms from various kingdoms, are submitted in the NCBI genome resource till date. Each of these genomes was annotated using various tools and knowledge-bases that were available during the period of the annotation. It is obvious that these annotations will be improved if the same genome is annotated using improved tools and knowledge-bases. Here we present a new genome annotation pipeline, strengthened with various tools and knowledge-bases that are capable of producing better quality annotations from the consensus of the predictions from different tools. This resource also perform various additional annotations, apart from the usual gene predictions and functional annotations, which involve SSRs, novel repeats, paralogs, proteins with transmembrane helices, signal peptides etc. This new annotation resource is trained to evaluate and integrate all the predictions together to resolve the overlaps and ambiguities of the boundaries. One of the important highlights of this resource is the capability of predicting the phylogenetic relations of the repeats using the evolutionary trace analysis and orthologous gene clusters. We also present a case study, of the pipeline, in which we upgrade the genome annotation of Nelumbo nucifera (sacred lotus. It is demonstrated that this resource is capable of producing an improved annotation for a better understanding of the biology of various organisms.

  9. Reasoning with Annotations of Texts

    OpenAIRE

    Ma , Yue; Lévy , François; Ghimire , Sudeep

    2011-01-01

    International audience; Linguistic and semantic annotations are important features for text-based applications. However, achieving and maintaining a good quality of a set of annotations is known to be a complex task. Many ad hoc approaches have been developed to produce various types of annotations, while comparing those annotations to improve their quality is still rare. In this paper, we propose a framework in which both linguistic and domain information can cooperate to reason with annotat...

  10. Reverse genetics in Chlamydomonas: a platform for isolating insertional mutants

    Directory of Open Access Journals (Sweden)

    de Montaigu Amaury

    2011-07-01

    Full Text Available Abstract A method was developed to identify insertional mutants of Chlamydomonas reinhardtii disrupted for selected target genes. The approach relies on the generation of thousands of transformants followed by PCR-based screenings that allow for identification of strains harboring the introduced marker gene within specific genes of interest. Our results highlight the strengths and limitations of two independent screens that differed in the nature of the marker DNA used (PCR-amplified fragment containing the plasmid-free marker versus entire linearized plasmid with the marker and in the strategies used to maintain and store transformants.

  11. ChlamyCyc: an integrative systems biology database and web-portal for Chlamydomonas reinhardtii

    OpenAIRE

    May, P.; Christian, J.O.; Kempa, S.; Walther, D.

    2009-01-01

    Abstract Background The unicellular green alga Chlamydomonas reinhardtii is an important eukaryotic model organism for the study of photosynthesis and plant growth. In the era of modern high-throughput technologies there is an imperative need to integrate large-scale data sets from high-throughput experimental techniques using computational methods and database resources to provide comprehensive information about the molecular and cellular organization of a single organism. Results In the fra...

  12. Characterization of Chlamydomonas reinhardtii Core Histones by Top-Down Mass Spectrometry Reveals Unique Algae-Specific Variants and Post-Translational Modifications.

    Science.gov (United States)

    Khan, Aliyya; Eikani, Carlo K; Khan, Hana; Iavarone, Anthony T; Pesavento, James J

    2018-01-05

    The unicellular microalga Chlamydomonas reinhardtii has played an instrumental role in the development of many new fields (bioproducts, biofuels, etc.) as well as the advancement of basic science (photosynthetic apparati, flagellar function, etc.). Chlamydomonas' versatility ultimately derives from the genes encoded in its genome and the way that the expression of these genes is regulated, which is largely influenced by a family of DNA binding proteins called histones. We characterize C. reinhardtii core histones, both variants and their post-translational modifications, by chromatographic separation, followed by top-down mass spectrometry (TDMS). Because TDMS has not been previously used to study Chlamydomonas proteins, we show rampant artifactual protein oxidation using established nuclei purification and histone extraction methods. After addressing oxidation, both histones H3 and H4 are found to each have a single polypeptide sequence that is minimally acetylated and methylated. Surprisingly, we uncover a novel monomethylation at lysine 79 on histone H4 present on all observed molecules. Histone H2B and H2A are found to have two and three variants, respectively, and both are minimally modified. This study provides an updated assessment of the core histone proteins in the green alga C. reinhardtii by top-down mass spectrometry and lays the foundation for further investigation of these essential proteins.

  13. Oil accumulation in the model green alga Chlamydomonas reinhardtii: characterization, variability between common laboratory strains and relationship with starch reserves

    Directory of Open Access Journals (Sweden)

    Carrier Patrick

    2011-01-01

    Full Text Available Abstract Background When cultivated under stress conditions, many microalgae species accumulate both starch and oil (triacylglycerols. The model green microalga Chlamydomonas reinhardtii has recently emerged as a model to test genetic engineering or cultivation strategies aiming at increasing lipid yields for biodiesel production. Blocking starch synthesis has been suggested as a way to boost oil accumulation. Here, we characterize the triacylglycerol (TAG accumulation process in Chlamydomonas and quantify TAGs in various wild-type and starchless strains. Results In response to nitrogen deficiency, Chlamydomonas reinhardtii produced TAGs enriched in palmitic, oleic and linoleic acids that accumulated in oil-bodies. Oil synthesis was maximal between 2 and 3 days following nitrogen depletion and reached a plateau around day 5. In the first 48 hours of oil deposition, a ~80% reduction in the major plastidial membrane lipids occurred. Upon nitrogen re-supply, mobilization of TAGs started after starch degradation but was completed within 24 hours. Comparison of oil content in five common laboratory strains (CC124, CC125, cw15, CC1690 and 11-32A revealed a high variability, from 2 μg TAG per million cell in CC124 to 11 μg in 11-32A. Quantification of TAGs on a cell basis in three mutants affected in starch synthesis (cw15sta1-2, cw15sta6 and cw15sta7-1 showed that blocking starch synthesis did not result in TAG over-accumulation compared to their direct progenitor, the arginine auxotroph strain 330. Moreover, no significant correlation was found between cellular oil and starch levels among the twenty wild-type, mutants and complemented strains tested. By contrast, cellular oil content was found to increase steeply with salt concentration in the growth medium. At 100 mM NaCl, oil level similar to nitrogen depletion conditions could be reached in CC124 strain. Conclusion A reference basis for future genetic studies of oil metabolism in Chlamydomonas

  14. Semantic annotation of consumer health questions.

    Science.gov (United States)

    Kilicoglu, Halil; Ben Abacha, Asma; Mrabet, Yassine; Shooshan, Sonya E; Rodriguez, Laritza; Masterton, Kate; Demner-Fushman, Dina

    2018-02-06

    Consumers increasingly use online resources for their health information needs. While current search engines can address these needs to some extent, they generally do not take into account that most health information needs are complex and can only fully be expressed in natural language. Consumer health question answering (QA) systems aim to fill this gap. A major challenge in developing consumer health QA systems is extracting relevant semantic content from the natural language questions (question understanding). To develop effective question understanding tools, question corpora semantically annotated for relevant question elements are needed. In this paper, we present a two-part consumer health question corpus annotated with several semantic categories: named entities, question triggers/types, question frames, and question topic. The first part (CHQA-email) consists of relatively long email requests received by the U.S. National Library of Medicine (NLM) customer service, while the second part (CHQA-web) consists of shorter questions posed to MedlinePlus search engine as queries. Each question has been annotated by two annotators. The annotation methodology is largely the same between the two parts of the corpus; however, we also explain and justify the differences between them. Additionally, we provide information about corpus characteristics, inter-annotator agreement, and our attempts to measure annotation confidence in the absence of adjudication of annotations. The resulting corpus consists of 2614 questions (CHQA-email: 1740, CHQA-web: 874). Problems are the most frequent named entities, while treatment and general information questions are the most common question types. Inter-annotator agreement was generally modest: question types and topics yielded highest agreement, while the agreement for more complex frame annotations was lower. Agreement in CHQA-web was consistently higher than that in CHQA-email. Pairwise inter-annotator agreement proved most

  15. Chlamydomonas Outer Arm Dynein Alters Conformation in Response to Ca2+

    OpenAIRE

    Sakato, Miho; Sakakibara, Hitoshi; King, Stephen M.

    2007-01-01

    We have previously shown that Ca2+ directly activates ATP-sensitive microtubule binding by a Chlamydomonas outer arm dynein subparticle containing the β and γ heavy chains (HCs). The γ HC–associated LC4 light chain is a member of the calmodulin family and binds 1-2 Ca2+ with KCa = 3 × 10−5 M in vitro, suggesting it may act as a Ca2+ sensor for outer arm dynein. Here we investigate interactions between the LC4 light chain and γ HC. Two IQ consensus motifs for binding calmodulin-like proteins a...

  16. Predicting word sense annotation agreement

    DEFF Research Database (Denmark)

    Martinez Alonso, Hector; Johannsen, Anders Trærup; Lopez de Lacalle, Oier

    2015-01-01

    High agreement is a common objective when annotating data for word senses. However, a number of factors make perfect agreement impossible, e.g. the limitations of the sense inventories, the difficulty of the examples or the interpretation preferences of the annotations. Estimating potential...... agreement is thus a relevant task to supplement the evaluation of sense annotations. In this article we propose two methods to predict agreement on word-annotation instances. We experiment with a continuous representation and a three-way discretization of observed agreement. In spite of the difficulty...

  17. Alignment-Annotator web server: rendering and annotating sequence alignments.

    Science.gov (United States)

    Gille, Christoph; Fähling, Michael; Weyand, Birgit; Wieland, Thomas; Gille, Andreas

    2014-07-01

    Alignment-Annotator is a novel web service designed to generate interactive views of annotated nucleotide and amino acid sequence alignments (i) de novo and (ii) embedded in other software. All computations are performed at server side. Interactivity is implemented in HTML5, a language native to web browsers. The alignment is initially displayed using default settings and can be modified with the graphical user interfaces. For example, individual sequences can be reordered or deleted using drag and drop, amino acid color code schemes can be applied and annotations can be added. Annotations can be made manually or imported (BioDAS servers, the UniProt, the Catalytic Site Atlas and the PDB). Some edits take immediate effect while others require server interaction and may take a few seconds to execute. The final alignment document can be downloaded as a zip-archive containing the HTML files. Because of the use of HTML the resulting interactive alignment can be viewed on any platform including Windows, Mac OS X, Linux, Android and iOS in any standard web browser. Importantly, no plugins nor Java are required and therefore Alignment-Anotator represents the first interactive browser-based alignment visualization. http://www.bioinformatics.org/strap/aa/ and http://strap.charite.de/aa/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Enhanced methane production of Chlorella vulgaris and Chlamydomonas reinhardtii by hydrolytic enzymes addition

    International Nuclear Information System (INIS)

    Mahdy, Ahmed; Mendez, Lara; Ballesteros, Mercedes; González-Fernández, Cristina

    2014-01-01

    Highlights: • Methane production of microalgae biomass is hampered by their cell wall. • Pretreatment should be designed in accordance to the microalgae specie. • Fresh Chlamydomonas reinhardtii exhibited high anaerobic biodegradability. • Chlorella vulgaris anaerobic biodegradability was enhanced by 50% using protease pretreatment. - Abstract: The effect of enzymatic hydrolysis on microalgae organic matter solubilisation and methane production was investigated in this study. Even though both biomasses, Chlamydomonas reinhardtii and Chlorella vulgaris, exhibited similar macromolecular distribution, their cell wall composition provided different behaviors. The addition of carbohydrolase (Viscozyme) and protease (Alcalase) resulted in high carbohydrates and protein solubilisation on both biomasses (86–96%). Despite the high carbohydrate solubilisation with the carbohydrolase, methane production was enhanced by 14% for C. vulgaris, while hydrolyzed C. reinhardtii did not show any improvement. The addition of protease to C. reinhardtii increased methane production by 1.17-fold. The low enhancement achieved together with the inherent high biodegradability of this biomass would not justify the cost associated to the enzyme addition. On the other hand, C. vulgaris hydrolyzed with the protease resulted in 86% anaerobic biodegradability compared to 54% of the raw biomass. Therefore, the application of protease prior anaerobic digestion of C. vulgaris could be a promising approach to decrease the energetic input required for cell wall disruption

  19. Filling Knowledge Gaps in Biological Networks: integrating global approaches to understand H2 metabolism in Chlamydomonas reinhardtii - Final Report

    Energy Technology Data Exchange (ETDEWEB)

    Posewitz, Matthew C

    2011-06-30

    The green alga Chlamydomonas reinhardtii (Chlamydomonas) has numerous genes encoding enzymes that function in fermentative pathways. Among these genes, are the [FeFe]-hydrogenases, pyruvate formate lyase, pyruvate ferredoxin oxidoreductase, acetate kinase, and phosphotransacetylase. We have systematically undertaken a series of targeted mutagenesis approaches to disrupt each of these key genes and omics techniques to characterize alterations in metabolic flux. Funds from DE-FG02-07ER64423 were specifically leveraged to generate mutants with disruptions in the genes encoding the [FeFe]-hydrogenases HYDA1 and HYDA2, pyruvate formate lyase (PFL1), and in bifunctional alcohol/aldehyde alcohol dehydrogenase (ADH1). Additionally funds were used to conduct global transcript profiling experiments of wildtype Chlamydomonas cells, as well as of the hydEF-1 mutant, which is unable to make H2 due to a lesion in the [FeFe]-hydrogenase biosynthetic pathway. In the wildtype cells, formate, acetate and ethanol are the dominant fermentation products with traces of CO2 and H2 also being produced. In the hydEF-1 mutant, succinate production is increased to offset the loss of protons as a terminal electron acceptor. In the pfl-1 mutant, lactate offsets the loss of formate production, and in the adh1-1 mutant glycerol is made instead of ethanol. To further probe the system, we generated a double mutant (pfl1-1 adh1) that is unable to synthesize both formate and ethanol. This strain, like the pfl1 mutants, secreted lactate, but also exhibited a significant increase in the levels of extracellular glycerol, acetate, and intracellular reduced sugars, and a decline in dark, fermentative H2 production. Whereas wild-type Chlamydomonas fermentation primarily produces formate and ethanol, the double mutant performs a complete rerouting of the glycolytic carbon to lactate and glycerol. Lastly, transcriptome data have been analysed for both the wildtype and hydEF-1, that correlate with our

  20. Modifying effect of caffeine on lethality and mutability of Chlamydomonas reinhardii cells following UV irradiation

    International Nuclear Information System (INIS)

    Podstavkova, S.; Vlcek, D.; Miadokova, E.

    1983-01-01

    The modifying effect of caffeine was studied using two standard and two UV-sensitive strains of Chlamydomonas reinhardii Dang. Cell survival and mutation frequency was microscopically evaluated on media without caffeine and on media with 1.5 mM of caffeine. The obtained results were indicative of the stimulating effect of caffeine upon survival in all strains. (author)

  1. Carbon allocation and element composition in four Chlamydomonas mutants defective in genes related to the CO2 concentrating mechanism

    Czech Academy of Sciences Publication Activity Database

    Memmola, F.; Mukherjee, B.; Moroney, James V.; Giordano, Mario

    2014-01-01

    Roč. 121, 2-3 (2014), s. 201-211 ISSN 0166-8595 Institutional support: RVO:61388971 Keywords : Chlamydomonas mutants * carbon * carbon dioxide * elemental stoichiometry Subject RIV: EE - Microbiology, Virology Impact factor: 3.502, year: 2014

  2. Objective-guided image annotation.

    Science.gov (United States)

    Mao, Qi; Tsang, Ivor Wai-Hung; Gao, Shenghua

    2013-04-01

    Automatic image annotation, which is usually formulated as a multi-label classification problem, is one of the major tools used to enhance the semantic understanding of web images. Many multimedia applications (e.g., tag-based image retrieval) can greatly benefit from image annotation. However, the insufficient performance of image annotation methods prevents these applications from being practical. On the other hand, specific measures are usually designed to evaluate how well one annotation method performs for a specific objective or application, but most image annotation methods do not consider optimization of these measures, so that they are inevitably trapped into suboptimal performance of these objective-specific measures. To address this issue, we first summarize a variety of objective-guided performance measures under a unified representation. Our analysis reveals that macro-averaging measures are very sensitive to infrequent keywords, and hamming measure is easily affected by skewed distributions. We then propose a unified multi-label learning framework, which directly optimizes a variety of objective-specific measures of multi-label learning tasks. Specifically, we first present a multilayer hierarchical structure of learning hypotheses for multi-label problems based on which a variety of loss functions with respect to objective-guided measures are defined. And then, we formulate these loss functions as relaxed surrogate functions and optimize them by structural SVMs. According to the analysis of various measures and the high time complexity of optimizing micro-averaging measures, in this paper, we focus on example-based measures that are tailor-made for image annotation tasks but are seldom explored in the literature. Experiments show consistency with the formal analysis on two widely used multi-label datasets, and demonstrate the superior performance of our proposed method over state-of-the-art baseline methods in terms of example-based measures on four

  3. Sulphate, more than a nutrient, protects the microalga Chlamydomonas moewusii from cadmium toxicity

    Energy Technology Data Exchange (ETDEWEB)

    Mera, Roi; Torres, Enrique, E-mail: torres@udc.es; Abalde, Julio

    2014-03-01

    Highlights: • Sulphate effect on cadmium toxicity in the microalga Chlamydomonas moewusii Gerloff. • Cadmium increases the sulphur requirements in Chlamydomonas moewusii. • Kinetic coefficients for sulphate utilization and cadmium effect on them. • Sulphate and cadmium influence on the biosynthesis of low-molecular mass thiols. • Cadmium toxicity reduction by sulphate due to higher biosynthesis of thiols. - Abstract: Sulphur is an essential macroelement that plays important roles in living organisms. The thiol rich sulphur compounds, such as cysteine, γ-Glu–Cys, glutathione and phytochelatins participate in the tolerance mechanisms against cadmium toxicity. Plants, algae, yeasts and most prokaryotes cover their demand for reduced sulphur by reduction of inorganic sulphate. The aim of this study was to investigate, using a bifactorial experimental design, the effect of different sulphate concentrations in the nutrient solution on cadmium toxicity in the freshwater microalga Chlamydomonas moewusii. Cell growth, kinetic parameters of sulphate utilization and intracellular concentrations of low-molecular mass thiol compounds were determined. A mathematical model to describe the growth of this microalga based on the effects of sulphate and cadmium was obtained. An ANOVA revealed an interaction between them, 16% of the effect sizes was explained by this interaction. A higher amount of sulphate in the culture medium allowed a higher cadmium tolerance due to an increase in the thiol compound biosynthesis. The amount of low-molecular mass thiol compounds, mainly phytochelatins, synthesized by this microalga was significantly dependent on the sulphate and cadmium concentrations; the higher phytochelatin content was obtained in cultures with 4 mg Cd/L and 1 mM sulphate. The maximum EC{sub 50} value (based on nominal cadmium concentration) reached for this microalga was 4.46 ± 0.42 mg Cd/L when the sulphate concentration added to the culture medium was also 1 m

  4. System-level network analysis of nitrogen starvation and recovery in Chlamydomonas reinhardtii reveals potential new targets for increased lipid accumulation

    Czech Academy of Sciences Publication Activity Database

    Valledor, Luis; Furuhashi, T.; Recuenco-Muňoz, L.; Wienkoop, S.; Weckwerth, W.

    2014-01-01

    Roč. 7, č. 171 (2014), s. 1-17 ISSN 1754-6834 Institutional support: RVO:67179843 Keywords : chlamydomonas reinhardtii * lipid accumulation * nitrogen Subject RIV: EI - Biotechnology ; Bionics Impact factor: 6.044, year: 2014

  5. Concept annotation in the CRAFT corpus.

    Science.gov (United States)

    Bada, Michael; Eckert, Miriam; Evans, Donald; Garcia, Kristin; Shipley, Krista; Sitnikov, Dmitry; Baumgartner, William A; Cohen, K Bretonnel; Verspoor, Karin; Blake, Judith A; Hunter, Lawrence E

    2012-07-09

    Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml.

  6. Essential Requirements for Digital Annotation Systems

    Directory of Open Access Journals (Sweden)

    ADRIANO, C. M.

    2012-06-01

    Full Text Available Digital annotation systems are usually based on partial scenarios and arbitrary requirements. Accidental and essential characteristics are usually mixed in non explicit models. Documents and annotations are linked together accidentally according to the current technology, allowing for the development of disposable prototypes, but not to the support of non-functional requirements such as extensibility, robustness and interactivity. In this paper we perform a careful analysis on the concept of annotation, studying the scenarios supported by digital annotation tools. We also derived essential requirements based on a classification of annotation systems applied to existing tools. The analysis performed and the proposed classification can be applied and extended to other type of collaborative systems.

  7. Making web annotations persistent over time

    Energy Technology Data Exchange (ETDEWEB)

    Sanderson, Robert [Los Alamos National Laboratory; Van De Sompel, Herbert [Los Alamos National Laboratory

    2010-01-01

    As Digital Libraries (DL) become more aligned with the web architecture, their functional components need to be fundamentally rethought in terms of URIs and HTTP. Annotation, a core scholarly activity enabled by many DL solutions, exhibits a clearly unacceptable characteristic when existing models are applied to the web: due to the representations of web resources changing over time, an annotation made about a web resource today may no longer be relevant to the representation that is served from that same resource tomorrow. We assume the existence of archived versions of resources, and combine the temporal features of the emerging Open Annotation data model with the capability offered by the Memento framework that allows seamless navigation from the URI of a resource to archived versions of that resource, and arrive at a solution that provides guarantees regarding the persistence of web annotations over time. More specifically, we provide theoretical solutions and proof-of-concept experimental evaluations for two problems: reconstructing an existing annotation so that the correct archived version is displayed for all resources involved in the annotation, and retrieving all annotations that involve a given archived version of a web resource.

  8. Semantic annotation in biomedicine: the current landscape.

    Science.gov (United States)

    Jovanović, Jelena; Bagheri, Ebrahim

    2017-09-22

    The abundance and unstructured nature of biomedical texts, be it clinical or research content, impose significant challenges for the effective and efficient use of information and knowledge stored in such texts. Annotation of biomedical documents with machine intelligible semantics facilitates advanced, semantics-based text management, curation, indexing, and search. This paper focuses on annotation of biomedical entity mentions with concepts from relevant biomedical knowledge bases such as UMLS. As a result, the meaning of those mentions is unambiguously and explicitly defined, and thus made readily available for automated processing. This process is widely known as semantic annotation, and the tools that perform it are known as semantic annotators.Over the last dozen years, the biomedical research community has invested significant efforts in the development of biomedical semantic annotation technology. Aiming to establish grounds for further developments in this area, we review a selected set of state of the art biomedical semantic annotators, focusing particularly on general purpose annotators, that is, semantic annotation tools that can be customized to work with texts from any area of biomedicine. We also examine potential directions for further improvements of today's annotators which could make them even more capable of meeting the needs of real-world applications. To motivate and encourage further developments in this area, along the suggested and/or related directions, we review existing and potential practical applications and benefits of semantic annotators.

  9. The small molecule fenpropimorph rapidly converts chloroplast membrane lipids to triacylglycerols in Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Hanul eKim

    2015-02-01

    Full Text Available Concern about global warming has prompted an intense interest in developing economical methods of producing biofuels. Microalgae provide a promising platform for biofuel production, because they accumulate high levels of lipids, and do not compete with food or feed sources. However, current methods of producing algal oil involve subjecting the microalgae to stress conditions, such as nitrogen deprivation, and are prohibitively expensive. Here, we report that the fungicide fenpropimorph rapidly causes high levels of neutral lipids to accumulate in Chlamydomonas reinhardtii cells. When treated with fenpropimorph (10 μg mL–1 for 1 h, Chlamydomonas cells accumulated at least four-fold the amount of triacylglycerols (TAGs present in the untreated control cells. Furthermore, the quantity of TAGs present after 1 h of fenpropimorph treatment was over two-fold higher than that formed after 9 days of nitrogen starvation in medium with no acetate supplement. Biochemical analysis of lipids revealed that the accumulated TAGs were derived mainly from chloroplast polar membrane lipids. Such a conversion of chloroplast polar lipids to TAGs is desirable for biodiesel production, because polar lipids are usually removed during the biodiesel production process. Thus, our data exemplified that a cost and time effective method of producing TAGs is possible using fenpropimorph or similar drugs.

  10. Dephosphorylation Pathway of D-myo-Inositol 1,4,5-trisphosphate in the Unicellular Green Alga Chlamydomonas eugametos

    NARCIS (Netherlands)

    Klerk, Hans; Himbergen, John A.J. van; Musgrave, Alan; Haastert, Peter J.M. van; Ende, Herman van den

    In vitro dephosphorylation of D-myo-inositol 1,4,5-trisphosphate [Ins(l,4,5)P-3] by vegetative cells, gametes and zygotes of the green alga Chlamydomonas eugametos was studied using a soluble cell fraction as enzyme source and labelled Ins(1,4,5)P-3 as substrate. This compound was dephosphorylated

  11. Contributions to In Silico Genome Annotation

    KAUST Repository

    Kalkatawi, Manal M.

    2017-11-30

    Genome annotation is an important topic since it provides information for the foundation of downstream genomic and biological research. It is considered as a way of summarizing part of existing knowledge about the genomic characteristics of an organism. Annotating different regions of a genome sequence is known as structural annotation, while identifying functions of these regions is considered as a functional annotation. In silico approaches can facilitate both tasks that otherwise would be difficult and timeconsuming. This study contributes to genome annotation by introducing several novel bioinformatics methods, some based on machine learning (ML) approaches. First, we present Dragon PolyA Spotter (DPS), a method for accurate identification of the polyadenylation signals (PAS) within human genomic DNA sequences. For this, we derived a novel feature-set able to characterize properties of the genomic region surrounding the PAS, enabling development of high accuracy optimized ML predictive models. DPS considerably outperformed the state-of-the-art results. The second contribution concerns developing generic models for structural annotation, i.e., the recognition of different genomic signals and regions (GSR) within eukaryotic DNA. We developed DeepGSR, a systematic framework that facilitates generating ML models to predict GSR with high accuracy. To the best of our knowledge, no available generic and automated method exists for such task that could facilitate the studies of newly sequenced organisms. The prediction module of DeepGSR uses deep learning algorithms to derive highly abstract features that depend mainly on proper data representation and hyperparameters calibration. DeepGSR, which was evaluated on recognition of PAS and translation initiation sites (TIS) in different organisms, yields a simpler and more precise representation of the problem under study, compared to some other hand-tailored models, while producing high accuracy prediction results. Finally

  12. Active learning reduces annotation time for clinical concept extraction.

    Science.gov (United States)

    Kholghi, Mahnoosh; Sitbon, Laurianne; Zuccon, Guido; Nguyen, Anthony

    2017-10-01

    To investigate: (1) the annotation time savings by various active learning query strategies compared to supervised learning and a random sampling baseline, and (2) the benefits of active learning-assisted pre-annotations in accelerating the manual annotation process compared to de novo annotation. There are 73 and 120 discharge summary reports provided by Beth Israel institute in the train and test sets of the concept extraction task in the i2b2/VA 2010 challenge, respectively. The 73 reports were used in user study experiments for manual annotation. First, all sequences within the 73 reports were manually annotated from scratch. Next, active learning models were built to generate pre-annotations for the sequences selected by a query strategy. The annotation/reviewing time per sequence was recorded. The 120 test reports were used to measure the effectiveness of the active learning models. When annotating from scratch, active learning reduced the annotation time up to 35% and 28% compared to a fully supervised approach and a random sampling baseline, respectively. Reviewing active learning-assisted pre-annotations resulted in 20% further reduction of the annotation time when compared to de novo annotation. The number of concepts that require manual annotation is a good indicator of the annotation time for various active learning approaches as demonstrated by high correlation between time rate and concept annotation rate. Active learning has a key role in reducing the time required to manually annotate domain concepts from clinical free text, either when annotating from scratch or reviewing active learning-assisted pre-annotations. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Chlamydomonas reinhardtii: duration of its cell cycle and phases at growth rates affected by light intensity

    Czech Academy of Sciences Publication Activity Database

    Vítová, Milada; Bišová, Kateřina; Umysová, Dáša; Hlavová, Monika; Kawano, S.; Zachleder, Vilém; Čížková, Mária

    2011-01-01

    Roč. 233, č. 1 (2011), s. 75-86 ISSN 0032-0935 R&D Projects: GA AV ČR IAA500200614; GA ČR GA525/09/0102; GA ČR GA204/09/0111 Institutional research plan: CEZ:AV0Z50200510 Keywords : Cell division timing * Cell cycle phases * Chlamydomonas Subject RIV: EE - Microbiology, Virology Impact factor: 3.000, year: 2011

  14. RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes

    Energy Technology Data Exchange (ETDEWEB)

    Brettin, Thomas; Davis, James J.; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Olsen, Gary J.; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D.; Shukla, Maulik; Thomason, James A.; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R.; Xia, Fangfang

    2015-02-10

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.

  15. RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes.

    Science.gov (United States)

    Brettin, Thomas; Davis, James J; Disz, Terry; Edwards, Robert A; Gerdes, Svetlana; Olsen, Gary J; Olson, Robert; Overbeek, Ross; Parrello, Bruce; Pusch, Gordon D; Shukla, Maulik; Thomason, James A; Stevens, Rick; Vonstein, Veronika; Wattam, Alice R; Xia, Fangfang

    2015-02-10

    The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.

  16. The chloroplasts membrane phospholipids of Chlamydomonas reinhardii mutant not forming the Photosystem 2

    International Nuclear Information System (INIS)

    Trusova, V.M.; Ladygin, V.G.; Mezentsev, V.V.; Molchanov, M.I.

    1987-01-01

    Study on a component composition and physical state of photosynthetic membranes of Chlamydomonas chloroplasts of the wild type and mutant A-110 with disturbance of electron transfer chain in the photosystem 2 region permitted to conclude that 170 A diameter particles localized on the internal hydrophobic surface of membrane chips are deleted with respect to phosphatidylglycerin. The results obtained permit to suggest that the formation of protein-lipid complexes containing phosphatidylglycerins is suppressed in mutant A-110 which is not capable of the lamellar system differentation in

  17. Proteomic analysis of isolated chlamydomonas centrioles reveals orthologs of ciliary-disease genes.

    Science.gov (United States)

    Keller, Lani C; Romijn, Edwin P; Zamora, Ivan; Yates, John R; Marshall, Wallace F

    2005-06-21

    The centriole is one of the most enigmatic organelles in the cell. Centrioles are cylindrical, microtubule-based barrels found in the core of the centrosome. Centrioles also act as basal bodies during interphase to nucleate the assembly of cilia and flagella. There are currently only a handful of known centriole proteins. We used mass-spectrometry-based MudPIT (multidimensional protein identification technology) to identify the protein composition of basal bodies (centrioles) isolated from the green alga Chlamydomonas reinhardtii. This analysis detected the majority of known centriole proteins, including centrin, epsilon tubulin, and the cartwheel protein BLD10p. By combining proteomic data with information about gene expression and comparative genomics, we identified 45 cross-validated centriole candidate proteins in two classes. Members of the first class of proteins (BUG1-BUG27) are encoded by genes whose expression correlates with flagellar assembly and which therefore may play a role in ciliogenesis-related functions of basal bodies. Members of the second class (POC1-POC18) are implicated by comparative-genomics and -proteomics studies to be conserved components of the centriole. We confirmed centriolar localization for the human homologs of four candidate proteins. Three of the cross-validated centriole candidate proteins are encoded by orthologs of genes (OFD1, NPHP-4, and PACRG) implicated in mammalian ciliary function and disease, suggesting that oral-facial-digital syndrome and nephronophthisis may involve a dysfunction of centrioles and/or basal bodies. By analyzing isolated Chlamydomonas basal bodies, we have been able to obtain the first reported proteomic analysis of the centriole.

  18. Computer systems for annotation of single molecule fragments

    Science.gov (United States)

    Schwartz, David Charles; Severin, Jessica

    2016-07-19

    There are provided computer systems for visualizing and annotating single molecule images. Annotation systems in accordance with this disclosure allow a user to mark and annotate single molecules of interest and their restriction enzyme cut sites thereby determining the restriction fragments of single nucleic acid molecules. The markings and annotations may be automatically generated by the system in certain embodiments and they may be overlaid translucently onto the single molecule images. An image caching system may be implemented in the computer annotation systems to reduce image processing time. The annotation systems include one or more connectors connecting to one or more databases capable of storing single molecule data as well as other biomedical data. Such diverse array of data can be retrieved and used to validate the markings and annotations. The annotation systems may be implemented and deployed over a computer network. They may be ergonomically optimized to facilitate user interactions.

  19. Image annotation under X Windows

    Science.gov (United States)

    Pothier, Steven

    1991-08-01

    A mechanism for attaching graphic and overlay annotation to multiple bits/pixel imagery while providing levels of performance approaching that of native mode graphics systems is presented. This mechanism isolates programming complexity from the application programmer through software encapsulation under the X Window System. It ensures display accuracy throughout operations on the imagery and annotation including zooms, pans, and modifications of the annotation. Trade-offs that affect speed of display, consumption of memory, and system functionality are explored. The use of resource files to tune the display system is discussed. The mechanism makes use of an abstraction consisting of four parts; a graphics overlay, a dithered overlay, an image overly, and a physical display window. Data structures are maintained that retain the distinction between the four parts so that they can be modified independently, providing system flexibility. A unique technique for associating user color preferences with annotation is introduced. An interface that allows interactive modification of the mapping between image value and color is discussed. A procedure that provides for the colorization of imagery on 8-bit display systems using pixel dithering is explained. Finally, the application of annotation mechanisms to various applications is discussed.

  20. Chlamydomonas IFT25 is dispensable for flagellar assembly but required to export the BBSome from flagella

    Directory of Open Access Journals (Sweden)

    Bin Dong

    2017-11-01

    Full Text Available Intraflagellar transport (IFT particles are composed of polyprotein complexes IFT-A and IFT-B as well as cargo adaptors such as the BBSome. Two IFT-B subunits, IFT25 and IFT27 were found to form a heterodimer, which is essential in exporting the BBSome out of the cilium but not involved in flagellar assembly and cytokinesis in vertebrates. Controversial results were, however, recorded to show that defects in IFT, flagellar assembly and even cytokinesis were caused by IFT27 knockdown in Chlamydomonas reinhardtii. Using C. reinhardtii as a model organism, we report that depletion of IFT25 has no effect on flagellar assembly and does not affect the entry of the BBSome into the flagellum, but IFT25 depletion did impair BBSome movement out of the flagellum, clarifying the evolutionally conserved role of IFT25 in regulating the exit of the BBSome from the flagellum cross species. Interestingly, depletion of IFT25 causes dramatic reduction of IFT27 as expected, which does not cause defects in flagellar assembly and cytokinesis in C. reinhardtii. Our data thus support that Chlamydomonas IFT27, like its vertebrate homologues, is not involved in flagellar assembly and cytokinesis.

  1. Motion lecture annotation system to learn Naginata performances

    Science.gov (United States)

    Kobayashi, Daisuke; Sakamoto, Ryota; Nomura, Yoshihiko

    2013-12-01

    This paper describes a learning assistant system using motion capture data and annotation to teach "Naginata-jutsu" (a skill to practice Japanese halberd) performance. There are some video annotation tools such as YouTube. However these video based tools have only single angle of view. Our approach that uses motion-captured data allows us to view any angle. A lecturer can write annotations related to parts of body. We have made a comparison of effectiveness between the annotation tool of YouTube and the proposed system. The experimental result showed that our system triggered more annotations than the annotation tool of YouTube.

  2. Isolation and proteomic analysis of Chlamydomonas centrioles.

    Science.gov (United States)

    Keller, Lani C; Marshall, Wallace F

    2008-01-01

    Centrioles are barrel-shaped cytoskeletal organelles composed of nine triplet microtubules blades arranged in a pinwheel-shaped array. Centrioles are required for recruitment of pericentriolar material (PCM) during centrosome formation, and they act as basal bodies, which are necessary for the outgrowth of cilia and flagella. Despite being described over a hundred years ago, centrioles are still among the most enigmatic organelles in all of cell biology. To gain molecular insights into the function and assembly of centrioles, we sought to determine the composition of the centriole proteome. Here, we describe a method that allows for the isolation of virtually "naked" centrioles, with little to no obscuring PCM, from the green alga, Chlamydomonas. Proteomic analysis of this material provided evidence that multiple human disease gene products encode protein components of the centriole, including genes involved in Meckel syndrome and Oral-Facial-Digital syndrome. Isolated centrioles can be used in combination with a wide variety of biochemical assays in addition to being utilized as a source for proteomic analysis.

  3. BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal M.

    2015-08-18

    Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON’s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/

  4. BEACON: automated tool for Bacterial GEnome Annotation ComparisON.

    Science.gov (United States)

    Kalkatawi, Manal; Alam, Intikhab; Bajic, Vladimir B

    2015-08-18

    Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON's utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27%, while the number of genes without any function assignment is reduced. We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .

  5. JGI Plant Genomics Gene Annotation Pipeline

    Energy Technology Data Exchange (ETDEWEB)

    Shu, Shengqiang; Rokhsar, Dan; Goodstein, David; Hayes, David; Mitros, Therese

    2014-07-14

    Plant genomes vary in size and are highly complex with a high amount of repeats, genome duplication and tandem duplication. Gene encodes a wealth of information useful in studying organism and it is critical to have high quality and stable gene annotation. Thanks to advancement of sequencing technology, many plant species genomes have been sequenced and transcriptomes are also sequenced. To use these vastly large amounts of sequence data to make gene annotation or re-annotation in a timely fashion, an automatic pipeline is needed. JGI plant genomics gene annotation pipeline, called integrated gene call (IGC), is our effort toward this aim with aid of a RNA-seq transcriptome assembly pipeline. It utilizes several gene predictors based on homolog peptides and transcript ORFs. See Methods for detail. Here we present genome annotation of JGI flagship green plants produced by this pipeline plus Arabidopsis and rice except for chlamy which is done by a third party. The genome annotations of these species and others are used in our gene family build pipeline and accessible via JGI Phytozome portal whose URL and front page snapshot are shown below.

  6. Annotating temporal information in clinical narratives.

    Science.gov (United States)

    Sun, Weiyi; Rumshisky, Anna; Uzuner, Ozlem

    2013-12-01

    Temporal information in clinical narratives plays an important role in patients' diagnosis, treatment and prognosis. In order to represent narrative information accurately, medical natural language processing (MLP) systems need to correctly identify and interpret temporal information. To promote research in this area, the Informatics for Integrating Biology and the Bedside (i2b2) project developed a temporally annotated corpus of clinical narratives. This corpus contains 310 de-identified discharge summaries, with annotations of clinical events, temporal expressions and temporal relations. This paper describes the process followed for the development of this corpus and discusses annotation guideline development, annotation methodology, and corpus quality. Copyright © 2013 Elsevier Inc. All rights reserved.

  7. Facilitating functional annotation of chicken microarray data

    Directory of Open Access Journals (Sweden)

    Gresham Cathy R

    2009-10-01

    Full Text Available Abstract Background Modeling results from chicken microarray studies is challenging for researchers due to little functional annotation associated with these arrays. The Affymetrix GenChip chicken genome array, one of the biggest arrays that serve as a key research tool for the study of chicken functional genomics, is among the few arrays that link gene products to Gene Ontology (GO. However the GO annotation data presented by Affymetrix is incomplete, for example, they do not show references linked to manually annotated functions. In addition, there is no tool that facilitates microarray researchers to directly retrieve functional annotations for their datasets from the annotated arrays. This costs researchers amount of time in searching multiple GO databases for functional information. Results We have improved the breadth of functional annotations of the gene products associated with probesets on the Affymetrix chicken genome array by 45% and the quality of annotation by 14%. We have also identified the most significant diseases and disorders, different types of genes, and known drug targets represented on Affymetrix chicken genome array. To facilitate functional annotation of other arrays and microarray experimental datasets we developed an Array GO Mapper (AGOM tool to help researchers to quickly retrieve corresponding functional information for their dataset. Conclusion Results from this study will directly facilitate annotation of other chicken arrays and microarray experimental datasets. Researchers will be able to quickly model their microarray dataset into more reliable biological functional information by using AGOM tool. The disease, disorders, gene types and drug targets revealed in the study will allow researchers to learn more about how genes function in complex biological systems and may lead to new drug discovery and development of therapies. The GO annotation data generated will be available for public use via AgBase website and

  8. Dictionary-driven protein annotation.

    Science.gov (United States)

    Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

    2002-09-01

    Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were

  9. The effectiveness of annotated (vs. non-annotated) digital pathology slides as a teaching tool during dermatology and pathology residencies.

    Science.gov (United States)

    Marsch, Amanda F; Espiritu, Baltazar; Groth, John; Hutchens, Kelli A

    2014-06-01

    With today's technology, paraffin-embedded, hematoxylin & eosin-stained pathology slides can be scanned to generate high quality virtual slides. Using proprietary software, digital images can also be annotated with arrows, circles and boxes to highlight certain diagnostic features. Previous studies assessing digital microscopy as a teaching tool did not involve the annotation of digital images. The objective of this study was to compare the effectiveness of annotated digital pathology slides versus non-annotated digital pathology slides as a teaching tool during dermatology and pathology residencies. A study group composed of 31 dermatology and pathology residents was asked to complete an online pre-quiz consisting of 20 multiple choice style questions, each associated with a static digital pathology image. After completion, participants were given access to an online tutorial composed of digitally annotated pathology slides and subsequently asked to complete a post-quiz. A control group of 12 residents completed a non-annotated version of the tutorial. Nearly all participants in the study group improved their quiz score, with an average improvement of 17%, versus only 3% (P = 0.005) in the control group. These results support the notion that annotated digital pathology slides are superior to non-annotated slides for the purpose of resident education. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  10. Automatic annotation of head velocity and acceleration in Anvil

    DEFF Research Database (Denmark)

    Jongejan, Bart

    2012-01-01

    We describe an automatic face tracker plugin for the ANVIL annotation tool. The face tracker produces data for velocity and for acceleration in two dimensions. We compare the annotations generated by the face tracking algorithm with independently made manual annotations for head movements....... The annotations are a useful supplement to manual annotations and may help human annotators to quickly and reliably determine onset of head movements and to suggest which kind of head movement is taking place....

  11. Seasonal and diel changes in photosynthetic activity of the snow algae Chlamydomonas nivalis (Chlorophyceae) from Svalbard determined by PAM fluorometry

    Czech Academy of Sciences Publication Activity Database

    Stibal, Marek; Elster, Josef; Šabacká, Marie; Kaštovská, Klára

    2007-01-01

    Roč. 59, - (2007), s. 265-273 ISSN 0168-6496 R&D Projects: GA AV ČR KJB6005409 Institutional research plan: CEZ:AV0Z60050516 Keywords : Chlamydomonas nivalis * photosynthetic activity * PAM fluorometry Subject RIV: EF - Botanics Impact factor: 3.039, year: 2007

  12. Mesotext. Framing and exploring annotations

    NARCIS (Netherlands)

    Boot, P.; Boot, P.; Stronks, E.

    2007-01-01

    From the introduction: Annotation is an important item on the wish list for digital scholarly tools. It is one of John Unsworth’s primitives of scholarship (Unsworth 2000). Especially in linguistics,a number of tools have been developed that facilitate the creation of annotations to source material

  13. Annotating images by mining image search results

    NARCIS (Netherlands)

    Wang, X.J.; Zhang, L.; Li, X.; Ma, W.Y.

    2008-01-01

    Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search

  14. High-yield secretion of recombinant proteins from the microalga Chlamydomonas reinhardtii

    DEFF Research Database (Denmark)

    Ramos Martinez, Erick Miguel; Fimognari, Lorenzo; Sakuragi, Yumiko

    2017-01-01

    Microalga-based biomanufacturing of recombinant proteins is attracting growing attention due to its advantages in safety, metabolic diversity, scalability and sustainability. Secretion of recombinant proteins can accelerate the use of microalgal platforms by allowing post......-translational modifications and easy recovery of products from the culture media. However, currently, the yields of secreted recombinant proteins are low, which hampers the commercial application of this strategy. This study aimed at expanding the genetic tools for enhancing secretion of recombinant proteins in Chlamydomonas...... in the endoplasmic reticulum (ER). Taken together, the results demonstrate the utility of the gametolysin signal sequence and (SP)n glycomodule to promote a more efficient biomanufacturing of microalgae-based recombinant proteins....

  15. WormBase: Annotating many nematode genomes.

    Science.gov (United States)

    Howe, Kevin; Davis, Paul; Paulini, Michael; Tuli, Mary Ann; Williams, Gary; Yook, Karen; Durbin, Richard; Kersey, Paul; Sternberg, Paul W

    2012-01-01

    WormBase (www.wormbase.org) has been serving the scientific community for over 11 years as the central repository for genomic and genetic information for the soil nematode Caenorhabditis elegans. The resource has evolved from its beginnings as a database housing the genomic sequence and genetic and physical maps of a single species, and now represents the breadth and diversity of nematode research, currently serving genome sequence and annotation for around 20 nematodes. In this article, we focus on WormBase's role of genome sequence annotation, describing how we annotate and integrate data from a growing collection of nematode species and strains. We also review our approaches to sequence curation, and discuss the impact on annotation quality of large functional genomics projects such as modENCODE.

  16. Teaching and Learning Communities through Online Annotation

    Science.gov (United States)

    van der Pluijm, B.

    2016-12-01

    What do colleagues do with your assigned textbook? What they say or think about the material? Want students to be more engaged in their learning experience? If so, online materials that complement standard lecture format provide new opportunity through managed, online group annotation that leverages the ubiquity of internet access, while personalizing learning. The concept is illustrated with the new online textbook "Processes in Structural Geology and Tectonics", by Ben van der Pluijm and Stephen Marshak, which offers a platform for sharing of experiences, supplementary materials and approaches, including readings, mathematical applications, exercises, challenge questions, quizzes, alternative explanations, and more. The annotation framework used is Hypothes.is, which offers a free, open platform markup environment for annotation of websites and PDF postings. The annotations can be public, grouped or individualized, as desired, including export access and download of annotations. A teacher group, hosted by a moderator/owner, limits access to members of a user group of teachers, so that its members can use, copy or transcribe annotations for their own lesson material. Likewise, an instructor can host a student group that encourages sharing of observations, questions and answers among students and instructor. Also, the instructor can create one or more closed groups that offers study help and hints to students. Options galore, all of which aim to engage students and to promote greater responsibility for their learning experience. Beyond new capacity, the ability to analyze student annotation supports individual learners and their needs. For example, student notes can be analyzed for key phrases and concepts, and identify misunderstandings, omissions and problems. Also, example annotations can be shared to enhance notetaking skills and to help with studying. Lastly, online annotation allows active application to lecture posted slides, supporting real-time notetaking

  17. Displaying Annotations for Digitised Globes

    Science.gov (United States)

    Gede, Mátyás; Farbinger, Anna

    2018-05-01

    Thanks to the efforts of the various globe digitising projects, nowadays there are plenty of old globes that can be examined as 3D models on the computer screen. These globes usually contain a lot of interesting details that an average observer would not entirely discover for the first time. The authors developed a website that can display annotations for such digitised globes. These annotations help observers of the globe to discover all the important, interesting details. Annotations consist of a plain text title, a HTML formatted descriptive text and a corresponding polygon and are stored in KML format. The website is powered by the Cesium virtual globe engine.

  18. THE DIMENSIONS OF COMPOSITION ANNOTATION.

    Science.gov (United States)

    MCCOLLY, WILLIAM

    ENGLISH TEACHER ANNOTATIONS WERE STUDIED TO DETERMINE THE DIMENSIONS AND PROPERTIES OF THE ENTIRE SYSTEM FOR WRITING CORRECTIONS AND CRITICISMS ON COMPOSITIONS. FOUR SETS OF COMPOSITIONS WERE WRITTEN BY STUDENTS IN GRADES 9 THROUGH 13. TYPESCRIPTS OF THE COMPOSITIONS WERE ANNOTATED BY CLASSROOM ENGLISH TEACHERS. THEN, 32 ENGLISH TEACHERS JUDGED…

  19. Evaluation of three automated genome annotations for Halorhabdus utahensis.

    Directory of Open Access Journals (Sweden)

    Peter Bakke

    2009-07-01

    Full Text Available Genome annotations are accumulating rapidly and depend heavily on automated annotation systems. Many genome centers offer annotation systems but no one has compared their output in a systematic way to determine accuracy and inherent errors. Errors in the annotations are routinely deposited in databases such as NCBI and used to validate subsequent annotation errors. We submitted the genome sequence of halophilic archaeon Halorhabdus utahensis to be analyzed by three genome annotation services. We have examined the output from each service in a variety of ways in order to compare the methodology and effectiveness of the annotations, as well as to explore the genes, pathways, and physiology of the previously unannotated genome. The annotation services differ considerably in gene calls, features, and ease of use. We had to manually identify the origin of replication and the species-specific consensus ribosome-binding site. Additionally, we conducted laboratory experiments to test H. utahensis growth and enzyme activity. Current annotation practices need to improve in order to more accurately reflect a genome's biological potential. We make specific recommendations that could improve the quality of microbial annotation projects.

  20. MimoSA: a system for minimotif annotation

    Directory of Open Access Journals (Sweden)

    Kundeti Vamsi

    2010-06-01

    Full Text Available Abstract Background Minimotifs are short peptide sequences within one protein, which are recognized by other proteins or molecules. While there are now several minimotif databases, they are incomplete. There are reports of many minimotifs in the primary literature, which have yet to be annotated, while entirely novel minimotifs continue to be published on a weekly basis. Our recently proposed function and sequence syntax for minimotifs enables us to build a general tool that will facilitate structured annotation and management of minimotif data from the biomedical literature. Results We have built the MimoSA application for minimotif annotation. The application supports management of the Minimotif Miner database, literature tracking, and annotation of new minimotifs. MimoSA enables the visualization, organization, selection and editing functions of minimotifs and their attributes in the MnM database. For the literature components, Mimosa provides paper status tracking and scoring of papers for annotation through a freely available machine learning approach, which is based on word correlation. The paper scoring algorithm is also available as a separate program, TextMine. Form-driven annotation of minimotif attributes enables entry of new minimotifs into the MnM database. Several supporting features increase the efficiency of annotation. The layered architecture of MimoSA allows for extensibility by separating the functions of paper scoring, minimotif visualization, and database management. MimoSA is readily adaptable to other annotation efforts that manually curate literature into a MySQL database. Conclusions MimoSA is an extensible application that facilitates minimotif annotation and integrates with the Minimotif Miner database. We have built MimoSA as an application that integrates dynamic abstract scoring with a high performance relational model of minimotif syntax. MimoSA's TextMine, an efficient paper-scoring algorithm, can be used to

  1. Medical Services at an International Summer Camp Event Under Hot and Humid Conditions: Experiences From the 23rd World Scout Jamboree, Japan.

    Science.gov (United States)

    Watanabe, Takemasa; Mizutani, Keiji; Iwai, Toshiyasu; Nakashima, Hiroshi

    2018-06-01

    The 23rd World Scout Jamboree (WSJ) was a 10-day summer camp held in Japan in 2015 under hot and humid conditions. The attendees comprised 33,628 people from 155 countries and territories. The aim of this study was to examine the provision of medical services under such conditions and to identify preventive factors for major diseases among long-term campers. Data were obtained from WSJ medical center records and examined to clarify the effects of age, sex, and period on visit frequencies and rates. Medical records from 3215 patients were examined. Daytime temperatures were 31.5±3.2°C and relative humidity was 61±13% (mean±SD). The initial visit rates among scouts and adults were 72.2 and 77.2 per 1000 persons, respectively. No significant age difference was observed in the initial visit rate; however, it was significantly higher among female patients than male patients. Significant differences were also seen in the adjusted odds ratios by age, sex, and period for disease distributions of initial visit frequencies. In addition, a higher initial visit frequency for heat strain-related diseases was seen among the scouts. Initial visit frequencies for heatstroke and/or dehydration increased just after opening day and persisted until closing day. Our findings suggest the importance of taking effective countermeasures against heat strain, fatigue, and unsanitary conditions at the WSJ. Medical services staff should take attendees' age, sex, and period into consideration to prevent heat strain-related diseases during such camps under hot and humid conditions. Copyright © 2018 Wilderness Medical Society. Published by Elsevier Inc. All rights reserved.

  2. Productivity and selective accumulation of carotenoids of the novel extremophile microalga Chlamydomonas acidophila grown with different carbon sources in batch systems

    NARCIS (Netherlands)

    Cuaresma, M.; Casal, C.; Forján, E.; Vílchez, C.

    2011-01-01

    Cultivation of extremophile microorganisms has attracted interest due to their ability to accumulate highvalue compounds. Chlamydomonas acidophila is an acidophile green microalga isolated by our group from Tinto River, an acidic river that flows down from the mining area in Huelva, Spain. This

  3. Annotate-it: a Swiss-knife approach to annotation, analysis and interpretation of single nucleotide variation in human disease.

    Science.gov (United States)

    Sifrim, Alejandro; Van Houdt, Jeroen Kj; Tranchevent, Leon-Charles; Nowakowska, Beata; Sakai, Ryo; Pavlopoulos, Georgios A; Devriendt, Koen; Vermeesch, Joris R; Moreau, Yves; Aerts, Jan

    2012-01-01

    The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org.

  4. Annotated chemical patent corpus: a gold standard for text mining.

    Directory of Open Access Journals (Sweden)

    Saber A Akhondi

    Full Text Available Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.

  5. Dissecting the molecular mechanisms of intraflagellar transport in Chlamydomonas

    DEFF Research Database (Denmark)

    Pedersen, L. B.; Geimer, S.; Rosenbaum, J. L.

    2006-01-01

    Background The assembly and maintenance of eukaryotic cilia and flagella are mediated by intraflagellar transport (IFT), a bidirectional microtubule (MT)-based transport system. The IFT system consists of anterograde (kinesin-2) and retrograde (cDynein1b) motor complexes and IFT particles...... comprising two complexes, A and B. In the current model for IFT, kinesin-2 carries cDynein1b, IFT particles, and axonemal precursors from the flagellar base to the tip, and cDynein1b transports kinesin-2, IFT particles, and axonemal turnover products from the tip back to the base. Most of the components...... of the IFT system have been identified and characterized, but the mechanisms by which these different components are coordinated and regulated at the flagellar base and tip are unclear. Results Using a variety of Chlamydomonas mutants, we confirm that cDynein1b requires kinesin-2 for transport toward the tip...

  6. Characterization of Hydrocortisone Biometabolites and 18S rRNA Gene in Chlamydomonas reinhardtii Cultures

    Directory of Open Access Journals (Sweden)

    Seyed Bagher Mosavi-Azam

    2008-10-01

    Full Text Available A unicellular microalga, Chlamydomonas reinhardtii, was isolated from rice paddy-field soil and water samples and used in the biotransformation of hydrocortisone (1. This strain has not been previously tested for steroid bioconversion. Fermentation was carried out in BG-11 medium supplemented with 0.05% substrate at 25ºC for 14 days of incubation. The products obtained were chromatographically purified and characterized using spectroscopic methods. 11b,17b-Dihydroxyandrost-4-en-3-one (2, 11b-hydroxyandrost-4-en-3,17-dione (3, 11b,17a,20b,21-tetrahydroxypregn-4-en-3-one (4 and prednisolone (5 were the main products of the bioconversion. The observed bioreaction features were the side chain degradation of the substrate to give compounds 2 and 3 and the 20-ketone reduction and 1,2-dehydrogenation affording compounds 4 and 5, respectively. A time course study showed the accumulation of product 2 from the second day of the fermentation and of compounds 3, 4 and 5 from the third day. All the metabolites reached their maximum concentration in seven days. Microalgal 18S rRNA gene was also amplified by PCR. PCR products were sequenced to confirm their authenticity as 18S rRNA gene of microalgae. The result of PCR blasted with other sequenced microalgae in NCBI showed 100% homology to the 18S small subunit rRNA of two Chlamydomonas reinhardtii spp.

  7. Characterization of hydrocortisone biometabolites and 18S rRNA gene in Chlamydomonas reinhardtii cultures.

    Science.gov (United States)

    Ghasemi, Younes; Rasoul-Amini, Sara; Morowvat, Mohammad Hossein; Raee, Mohammad Javad; Ghoshoon, Mohammad Bagher; Nouri, Fatemeh; Negintaji, Narges; Parvizi, Rezvan; Mosavi-Azam, Seyed Bagher

    2008-10-31

    A unicellular microalga, Chlamydomonas reinhardtii, was isolated from rice paddy-field soil and water samples and used in the biotransformation of hydrocortisone (1). This strain has not been previously tested for steroid bioconversion. Fermentation was carried out in BG-11 medium supplemented with 0.05% substrate at 25 degrees C for 14 days of incubation. The products obtained were chromatographically purified and characterized using spectroscopic methods. 11b,17 beta-Dihydroxyandrost-4-en-3-one (2), 11 beta-hydroxyandrost-4-en-3,17-dione (3), 11 beta,17 alpha,20 beta,21-tetrahydroxypregn-4-en-3-one (4) and prednisolone (5) were the main products of the bioconversion. The observed bioreaction features were the side chain degradation of the substrate to give compounds 2 and 3 and the 20-ketone reduction and 1,2-dehydrogenation affording compounds 4 and 5, respectively. A time course study showed the accumulation of product 2 from the second day of the fermentation and of compounds 3, 4 and 5 from the third day. All the metabolites reached their maximum concentration in seven days. Microalgal 18S rRNA gene was also amplified by PCR. PCR products were sequenced to confirm their authenticity as 18S rRNA gene of microalgae. The result of PCR blasted with other sequenced microalgae in NCBI showed 100% homology to the 18S small subunit rRNA of two Chlamydomonas reinhardtii spp.

  8. Chlamydomonas reinhardtii: the model of choice to study mitochondria from unicellular photosynthetic organisms.

    Science.gov (United States)

    Funes, Soledad; Franzén, Lars-Gunnar; González-Halphen, Diego

    2007-01-01

    Chlamydomonas reinhardtii is a model organism to study photosynthesis, cellular division, flagellar biogenesis, and, more recently, mitochondrial function. It has distinct advantages in comparison to higher plants because it is unicellular, haploid, and amenable to tetrad analysis, and its three genomes are subject to specific transformation. It also has the possibility to grow either photoautotrophically or heterotrophically on acetate, making the assembly of the photosynthetic machinery not essential for cell viability. Methods developed allow the isolation of C. reinhardtii mitochondria free of thylakoid contaminants. We review the general procedures used for the biochemical characterization of mitochondria from this green alga.

  9. Diverse Image Annotation

    KAUST Repository

    Wu, Baoyuan

    2017-11-09

    In this work we study the task of image annotation, of which the goal is to describe an image using a few tags. Instead of predicting the full list of tags, here we target for providing a short list of tags under a limited number (e.g., 3), to cover as much information as possible of the image. The tags in such a short list should be representative and diverse. It means they are required to be not only corresponding to the contents of the image, but also be different to each other. To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly. We further explore the semantic hierarchy and synonyms among the candidate tags, and require that two tags in a semantic hierarchy or in a pair of synonyms should not be selected simultaneously. This requirement is then embedded into the sampling algorithm according to the learned conditional DPP model. Besides, we find that traditional metrics for image annotation (e.g., precision, recall and F1 score) only consider the representation, but ignore the diversity. Thus we propose new metrics to evaluate the quality of the selected subset (i.e., the tag list), based on the semantic hierarchy and synonyms. Human study through Amazon Mechanical Turk verifies that the proposed metrics are more close to the humans judgment than traditional metrics. Experiments on two benchmark datasets show that the proposed method can produce more representative and diverse tags, compared with existing image annotation methods.

  10. Diverse Image Annotation

    KAUST Repository

    Wu, Baoyuan; Jia, Fan; Liu, Wei; Ghanem, Bernard

    2017-01-01

    In this work we study the task of image annotation, of which the goal is to describe an image using a few tags. Instead of predicting the full list of tags, here we target for providing a short list of tags under a limited number (e.g., 3), to cover as much information as possible of the image. The tags in such a short list should be representative and diverse. It means they are required to be not only corresponding to the contents of the image, but also be different to each other. To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly. We further explore the semantic hierarchy and synonyms among the candidate tags, and require that two tags in a semantic hierarchy or in a pair of synonyms should not be selected simultaneously. This requirement is then embedded into the sampling algorithm according to the learned conditional DPP model. Besides, we find that traditional metrics for image annotation (e.g., precision, recall and F1 score) only consider the representation, but ignore the diversity. Thus we propose new metrics to evaluate the quality of the selected subset (i.e., the tag list), based on the semantic hierarchy and synonyms. Human study through Amazon Mechanical Turk verifies that the proposed metrics are more close to the humans judgment than traditional metrics. Experiments on two benchmark datasets show that the proposed method can produce more representative and diverse tags, compared with existing image annotation methods.

  11. Annotating individual human genomes.

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A; Topol, Eric J; Schork, Nicholas J

    2011-10-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. Copyright © 2011 Elsevier Inc. All rights reserved.

  12. ANNOTATING INDIVIDUAL HUMAN GENOMES*

    Science.gov (United States)

    Torkamani, Ali; Scott-Van Zeeland, Ashley A.; Topol, Eric J.; Schork, Nicholas J.

    2014-01-01

    Advances in DNA sequencing technologies have made it possible to rapidly, accurately and affordably sequence entire individual human genomes. As impressive as this ability seems, however, it will not likely to amount to much if one cannot extract meaningful information from individual sequence data. Annotating variations within individual genomes and providing information about their biological or phenotypic impact will thus be crucially important in moving individual sequencing projects forward, especially in the context of the clinical use of sequence information. In this paper we consider the various ways in which one might annotate individual sequence variations and point out limitations in the available methods for doing so. It is arguable that, in the foreseeable future, DNA sequencing of individual genomes will become routine for clinical, research, forensic, and personal purposes. We therefore also consider directions and areas for further research in annotating genomic variants. PMID:21839162

  13. Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

    Science.gov (United States)

    Apweiler, R; Gateau, A; Contrino, S; Martin, M J; Junker, V; O'Donovan, C; Lang, F; Mitaritonna, N; Kappus, S; Bairoch, A

    1997-01-01

    SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.

  14. The GATO gene annotation tool for research laboratories

    Directory of Open Access Journals (Sweden)

    A. Fujita

    2005-11-01

    Full Text Available Large-scale genome projects have generated a rapidly increasing number of DNA sequences. Therefore, development of computational methods to rapidly analyze these sequences is essential for progress in genomic research. Here we present an automatic annotation system for preliminary analysis of DNA sequences. The gene annotation tool (GATO is a Bioinformatics pipeline designed to facilitate routine functional annotation and easy access to annotated genes. It was designed in view of the frequent need of genomic researchers to access data pertaining to a common set of genes. In the GATO system, annotation is generated by querying some of the Web-accessible resources and the information is stored in a local database, which keeps a record of all previous annotation results. GATO may be accessed from everywhere through the internet or may be run locally if a large number of sequences are going to be annotated. It is implemented in PHP and Perl and may be run on any suitable Web server. Usually, installation and application of annotation systems require experience and are time consuming, but GATO is simple and practical, allowing anyone with basic skills in informatics to access it without any special training. GATO can be downloaded at [http://mariwork.iq.usp.br/gato/]. Minimum computer free space required is 2 MB.

  15. GSV Annotated Bibliography

    Energy Technology Data Exchange (ETDEWEB)

    Roberts, Randy S. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Pope, Paul A. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Jiang, Ming [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Trucano, Timothy G. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Aragon, Cecilia R. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Ni, Kevin [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Wei, Thomas [Argonne National Lab. (ANL), Argonne, IL (United States); Chilton, Lawrence K. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Bakel, Alan [Argonne National Lab. (ANL), Argonne, IL (United States)

    2010-09-14

    The following annotated bibliography was developed as part of the geospatial algorithm verification and validation (GSV) project for the Simulation, Algorithms and Modeling program of NA-22. Verification and Validation of geospatial image analysis algorithms covers a wide range of technologies. Papers in the bibliography are thus organized into the following five topic areas: Image processing and analysis, usability and validation of geospatial image analysis algorithms, image distance measures, scene modeling and image rendering, and transportation simulation models. Many other papers were studied during the course of the investigation including. The annotations for these articles can be found in the paper "On the verification and validation of geospatial image analysis algorithms".

  16. Solar Tutorial and Annotation Resource (STAR)

    Science.gov (United States)

    Showalter, C.; Rex, R.; Hurlburt, N. E.; Zita, E. J.

    2009-12-01

    We have written a software suite designed to facilitate solar data analysis by scientists, students, and the public, anticipating enormous datasets from future instruments. Our “STAR" suite includes an interactive learning section explaining 15 classes of solar events. Users learn software tools that exploit humans’ superior ability (over computers) to identify many events. Annotation tools include time slice generation to quantify loop oscillations, the interpolation of event shapes using natural cubic splines (for loops, sigmoids, and filaments) and closed cubic splines (for coronal holes). Learning these tools in an environment where examples are provided prepares new users to comfortably utilize annotation software with new data. Upon completion of our tutorial, users are presented with media of various solar events and asked to identify and annotate the images, to test their mastery of the system. Goals of the project include public input into the data analysis of very large datasets from future solar satellites, and increased public interest and knowledge about the Sun. In 2010, the Solar Dynamics Observatory (SDO) will be launched into orbit. SDO’s advancements in solar telescope technology will generate a terabyte per day of high-quality data, requiring innovation in data management. While major projects develop automated feature recognition software, so that computers can complete much of the initial event tagging and analysis, still, that software cannot annotate features such as sigmoids, coronal magnetic loops, coronal dimming, etc., due to large amounts of data concentrated in relatively small areas. Previously, solar physicists manually annotated these features, but with the imminent influx of data it is unrealistic to expect specialized researchers to examine every image that computers cannot fully process. A new approach is needed to efficiently process these data. Providing analysis tools and data access to students and the public have proven

  17. Discovering gene annotations in biomedical text databases

    Directory of Open Access Journals (Sweden)

    Ozsoyoglu Gultekin

    2008-03-01

    Full Text Available Abstract Background Genes and gene products are frequently annotated with Gene Ontology concepts based on the evidence provided in genomics articles. Manually locating and curating information about a genomic entity from the biomedical literature requires vast amounts of human effort. Hence, there is clearly a need forautomated computational tools to annotate the genes and gene products with Gene Ontology concepts by computationally capturing the related knowledge embedded in textual data. Results In this article, we present an automated genomic entity annotation system, GEANN, which extracts information about the characteristics of genes and gene products in article abstracts from PubMed, and translates the discoveredknowledge into Gene Ontology (GO concepts, a widely-used standardized vocabulary of genomic traits. GEANN utilizes textual "extraction patterns", and a semantic matching framework to locate phrases matching to a pattern and produce Gene Ontology annotations for genes and gene products. In our experiments, GEANN has reached to the precision level of 78% at therecall level of 61%. On a select set of Gene Ontology concepts, GEANN either outperforms or is comparable to two other automated annotation studies. Use of WordNet for semantic pattern matching improves the precision and recall by 24% and 15%, respectively, and the improvement due to semantic pattern matching becomes more apparent as the Gene Ontology terms become more general. Conclusion GEANN is useful for two distinct purposes: (i automating the annotation of genomic entities with Gene Ontology concepts, and (ii providing existing annotations with additional "evidence articles" from the literature. The use of textual extraction patterns that are constructed based on the existing annotations achieve high precision. The semantic pattern matching framework provides a more flexible pattern matching scheme with respect to "exactmatching" with the advantage of locating approximate

  18. Annotating the human genome with Disease Ontology

    Science.gov (United States)

    Osborne, John D; Flatow, Jared; Holko, Michelle; Lin, Simon M; Kibbe, Warren A; Zhu, Lihua (Julie); Danila, Maria I; Feng, Gang; Chisholm, Rex L

    2009-01-01

    Background The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Results We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. Conclusion The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome. PMID:19594883

  19. MIPS bacterial genomes functional annotation benchmark dataset.

    Science.gov (United States)

    Tetko, Igor V; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Fobo, Gisela; Ruepp, Andreas; Antonov, Alexey V; Surmeli, Dimitrij; Mewes, Hans-Wernen

    2005-05-15

    Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. BFAB is available at http://mips.gsf.de/proj/bfab

  20. Annotating non-coding regions of the genome.

    Science.gov (United States)

    Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

    2010-08-01

    Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.

  1. Systems Theory and Communication. Annotated Bibliography.

    Science.gov (United States)

    Covington, William G., Jr.

    This annotated bibliography presents annotations of 31 books and journal articles dealing with systems theory and its relation to organizational communication, marketing, information theory, and cybernetics. Materials were published between 1963 and 1992 and are listed alphabetically by author. (RS)

  2. The surplus value of semantic annotations

    NARCIS (Netherlands)

    Marx, M.

    2010-01-01

    We compare the costs of semantic annotation of textual documents to its benefits for information processing tasks. Semantic annotation can improve the performance of retrieval tasks and facilitates an improved search experience through faceted search, focused retrieval, better document summaries,

  3. Annotation-based enrichment of Digital Objects using open-source frameworks

    Directory of Open Access Journals (Sweden)

    Marcus Emmanuel Barnes

    2017-07-01

    Full Text Available The W3C Web Annotation Data Model, Protocol, and Vocabulary unify approaches to annotations across the web, enabling their aggregation, discovery and persistence over time. In addition, new javascript libraries provide the ability for users to annotate multi-format content. In this paper, we describe how we have leveraged these developments to provide annotation features alongside Islandora’s existing preservation, access, and management capabilities. We also discuss our experience developing with the Web Annotation Model as an open web architecture standard, as well as our approach to integrating mature external annotation libraries. The resulting software (the Web Annotation Utility Module for Islandora accommodates annotation across multiple formats. This solution can be used in various digital scholarship contexts.

  4. A cost-effective approach to produce 15N-labelled amino acids employing Chlamydomonas reinhardtii CC503.

    Science.gov (United States)

    Nicolás Carcelén, Jesús; Marchante-Gayón, Juan Manuel; González, Pablo Rodríguez; Valledor, Luis; Cañal, María Jesús; Alonso, José Ignacio García

    2017-08-18

    The use of enriched stable isotopes is of outstanding importance in chemical metrology as it allows the application of isotope dilution mass spectrometry (IDMS). Primary methods based on IDMS ensure the quality of the analytical measurements and traceability of the results to the international system of units. However, the synthesis of isotopically labelled molecules from enriched stable isotopes is an expensive and a difficult task. Either chemical and biochemical methods to produce labelled molecules have been proposed, but so far, few cost-effective methods have been described. The aim of this study was to use the microalgae Chlamydomonas reinhardtii to produce, at laboratory scale, 15 N-labelled amino acids with a high isotopic enrichment. To do that, a culture media containing 15 NH 4 Cl was used. No kinetic isotope effect (KIE) was observed. The labelled proteins biosynthesized by the microorganism were extracted from the biomass and the 15 N-labelled amino acids were obtained after a protein hydrolysis with HCl. The use of the wall deficient strain CC503 cw92 mt+ is fit for purpose, as it only assimilates ammonia as nitrogen source, avoiding isotope contamination with nitrogen from the atmosphere or the reagents used in the culture medium, and enhancing the protein extraction efficiency compared to cell-walled wild type Chlamydomonas. The isotopic enrichment of the labelled amino acids was calculated from their isotopic composition measured by gas chromatography mass spectrometry (GC-MS). The average isotopic enrichment for the 16 amino acids characterized was 99.56 ± 0.05% and the concentration of the amino acids in the hydrolysate ranged from 18 to 90 µg/mL. Previously reported biochemical methods to produce isotopically labelled proteins have been applied in the fields of proteomics and fluxomics. For these approaches, low amounts of products are required and the isotopic enrichment of the molecules has never been properly determined. So far, only 13

  5. Effect of mutagen combined action on Chlamydomonas Reinhardtii cells. I

    International Nuclear Information System (INIS)

    Vlcek, D.; Podstavkova, S.; Dubovsky, J.

    1978-01-01

    The effect was investigated of single and combined actions of alkylnitrosourea derivatives (N-methyl-N-nitrosourea and N-ethyl-N-nitrosourea) and UV-radiation on the survival of cells of Chlamydomonas reinhardtii algae in dependence on the sequence of application of mutagens and on the given conditions of cultivation following mutagen activity. In particular, the single phases were investigated of the total lethal effect, i.e., the death of cells before division and their death after division. The most pronounced changes in dependence on the sequence of application of mutagens and on the given conditions of cultivation were noted in cell death before division. In dependence on the sequence of application of mutagens, the effect of the combined action on the survival of cells changed from an additive (alkylnitrosourea + UV-radiation) to a protective effect (UV-radiation + alkylnitrosourea). (author)

  6. Current and future trends in marine image annotation software

    Science.gov (United States)

    Gomes-Pereira, Jose Nuno; Auger, Vincent; Beisiegel, Kolja; Benjamin, Robert; Bergmann, Melanie; Bowden, David; Buhl-Mortensen, Pal; De Leo, Fabio C.; Dionísio, Gisela; Durden, Jennifer M.; Edwards, Luke; Friedman, Ariell; Greinert, Jens; Jacobsen-Stout, Nancy; Lerner, Steve; Leslie, Murray; Nattkemper, Tim W.; Sameoto, Jessica A.; Schoening, Timm; Schouten, Ronald; Seager, James; Singh, Hanumant; Soubigou, Olivier; Tojeira, Inês; van den Beld, Inge; Dias, Frederico; Tempera, Fernando; Santos, Ricardo S.

    2016-12-01

    Given the need to describe, analyze and index large quantities of marine imagery data for exploration and monitoring activities, a range of specialized image annotation tools have been developed worldwide. Image annotation - the process of transposing objects or events represented in a video or still image to the semantic level, may involve human interactions and computer-assisted solutions. Marine image annotation software (MIAS) have enabled over 500 publications to date. We review the functioning, application trends and developments, by comparing general and advanced features of 23 different tools utilized in underwater image analysis. MIAS requiring human input are basically a graphical user interface, with a video player or image browser that recognizes a specific time code or image code, allowing to log events in a time-stamped (and/or geo-referenced) manner. MIAS differ from similar software by the capability of integrating data associated to video collection, the most simple being the position coordinates of the video recording platform. MIAS have three main characteristics: annotating events in real time, posteriorly to annotation and interact with a database. These range from simple annotation interfaces, to full onboard data management systems, with a variety of toolboxes. Advanced packages allow to input and display data from multiple sensors or multiple annotators via intranet or internet. Posterior human-mediated annotation often include tools for data display and image analysis, e.g. length, area, image segmentation, point count; and in a few cases the possibility of browsing and editing previous dive logs or to analyze the annotations. The interaction with a database allows the automatic integration of annotations from different surveys, repeated annotation and collaborative annotation of shared datasets, browsing and querying of data. Progress in the field of automated annotation is mostly in post processing, for stable platforms or still images

  7. PANNZER2: a rapid functional annotation web server.

    Science.gov (United States)

    Törönen, Petri; Medlar, Alan; Holm, Liisa

    2018-05-08

    The unprecedented growth of high-throughput sequencing has led to an ever-widening annotation gap in protein databases. While computational prediction methods are available to make up the shortfall, a majority of public web servers are hindered by practical limitations and poor performance. Here, we introduce PANNZER2 (Protein ANNotation with Z-scoRE), a fast functional annotation web server that provides both Gene Ontology (GO) annotations and free text description predictions. PANNZER2 uses SANSparallel to perform high-performance homology searches, making bulk annotation based on sequence similarity practical. PANNZER2 can output GO annotations from multiple scoring functions, enabling users to see which predictions are robust across predictors. Finally, PANNZER2 predictions scored within the top 10 methods for molecular function and biological process in the CAFA2 NK-full benchmark. The PANNZER2 web server is updated on a monthly schedule and is accessible at http://ekhidna2.biocenter.helsinki.fi/sanspanz/. The source code is available under the GNU Public Licence v3.

  8. The Chlamydomonas cell wall and its constituent glycoproteins analyzed by the quick-freeze, deep-etch technique

    OpenAIRE

    1985-01-01

    Using the quick-freeze, deep-etch technique, we have analyzed the structure of the intact cell wall of Chlamydomonas reinhardi, and have visualized its component glycoproteins after mechanical shearing and after depolymerization induced by perchlorate or by the wall-disrupting agent, autolysin. The intact wall has previously been shown in a thin- section study (Roberts, K., M. Gurney-Smith, and G. J. Hills, 1972, J. Ultrastruct. Res. 40:599-613) to consist of a discrete central triplet bisect...

  9. MitoBamAnnotator: A web-based tool for detecting and annotating heteroplasmy in human mitochondrial DNA sequences.

    Science.gov (United States)

    Zhidkov, Ilia; Nagar, Tal; Mishmar, Dan; Rubin, Eitan

    2011-11-01

    The use of Next-Generation Sequencing of mitochondrial DNA is becoming widespread in biological and clinical research. This, in turn, creates a need for a convenient tool that detects and analyzes heteroplasmy. Here we present MitoBamAnnotator, a user friendly web-based tool that allows maximum flexibility and control in heteroplasmy research. MitoBamAnnotator provides the user with a comprehensively annotated overview of mitochondrial genetic variation, allowing for an in-depth analysis with no prior knowledge in programming. Copyright © 2011 Elsevier B.V. and Mitochondria Research Society. All rights reserved. All rights reserved.

  10. Deletion of CGLD1 Impairs PSII and Increases Singlet Oxygen Tolerance of Green Alga Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Jiale Xing

    2017-12-01

    Full Text Available The green alga Chlamydomonas reinhardtii is a key model organism for studying photosynthesis and oxidative stress in unicellular eukaryotes. Using a forward genetics approach, we have identified and characterized a mutant x32, which lacks a predicted protein named CGLD1 (Conserved in Green Lineage and Diatom 1 in GreenCut2, under normal and stress conditions. We show that loss of CGLD1 resulted in minimal photoautotrophic growth and PSII activity in the organism. We observed reduced amount of PSII complex and core subunits in the x32 mutant based on blue-native (BN/PAGE and immunoblot analysis. Moreover, x32 exhibited increased sensitivity to high-light stress and altered tolerance to different reactive oxygenic species (ROS stress treatments, i.e., decreased resistance to H2O2/or tert-Butyl hydroperoxide (t-BOOH and increased tolerance to neutral red (NR and rose bengal (RB that induce the formation of singlet oxygen, respectively. Further analysis via quantitative real-time PCR (qRT-PCR indicated that the increased singlet-oxygen tolerance of x32 was largely correlated with up-regulated gene expression of glutathione-S-transferases (GST. The phenotypical and physiological implications revealed from our experiments highlight the important roles of CGLD1 in maintaining structure and function of PSII as well as in protection of Chlamydomonas under photo-oxidative stress conditions.

  11. Correction of the Caulobacter crescentus NA1000 genome annotation.

    Directory of Open Access Journals (Sweden)

    Bert Ely

    Full Text Available Bacterial genome annotations are accumulating rapidly in the GenBank database and the use of automated annotation technologies to create these annotations has become the norm. However, these automated methods commonly result in a small, but significant percentage of genome annotation errors. To improve accuracy and reliability, we analyzed the Caulobacter crescentus NA1000 genome utilizing computer programs Artemis and MICheck to manually examine the third codon position GC content, alignment to a third codon position GC frame plot peak, and matches in the GenBank database. We identified 11 new genes, modified the start site of 113 genes, and changed the reading frame of 38 genes that had been incorrectly annotated. Furthermore, our manual method of identifying protein-coding genes allowed us to remove 112 non-coding regions that had been designated as coding regions. The improved NA1000 genome annotation resulted in a reduction in the use of rare codons since noncoding regions with atypical codon usage were removed from the annotation and 49 new coding regions were added to the annotation. Thus, a more accurate codon usage table was generated as well. These results demonstrate that a comparison of the location of peaks third codon position GC content to the location of protein coding regions could be used to verify the annotation of any genome that has a GC content that is greater than 60%.

  12. Annotation of regular polysemy and underspecification

    DEFF Research Database (Denmark)

    Martínez Alonso, Héctor; Pedersen, Bolette Sandford; Bel, Núria

    2013-01-01

    We present the result of an annotation task on regular polysemy for a series of seman- tic classes or dot types in English, Dan- ish and Spanish. This article describes the annotation process, the results in terms of inter-encoder agreement, and the sense distributions obtained with two methods...

  13. PCAS – a precomputed proteome annotation database resource

    Directory of Open Access Journals (Sweden)

    Luo Jingchu

    2003-11-01

    Full Text Available Abstract Background Many model proteomes or "complete" sets of proteins of given organisms are now publicly available. Much effort has been invested in computational annotation of those "draft" proteomes. Motif or domain based algorithms play a pivotal role in functional classification of proteins. Employing most available computational algorithms, mainly motif or domain recognition algorithms, we set up to develop an online proteome annotation system with integrated proteome annotation data to complement existing resources. Results We report here the development of PCAS (ProteinCentric Annotation System as an online resource of pre-computed proteome annotation data. We applied most available motif or domain databases and their analysis methods, including hmmpfam search of HMMs in Pfam, SMART and TIGRFAM, RPS-PSIBLAST search of PSSMs in CDD, pfscan of PROSITE patterns and profiles, as well as PSI-BLAST search of SUPERFAMILY PSSMs. In addition, signal peptide and TM are predicted using SignalP and TMHMM respectively. We mapped SUPERFAMILY and COGs to InterPro, so the motif or domain databases are integrated through InterPro. PCAS displays table summaries of pre-computed data and a graphical presentation of motifs or domains relative to the protein. As of now, PCAS contains human IPI, mouse IPI, and rat IPI, A. thaliana, C. elegans, D. melanogaster, S. cerevisiae, and S. pombe proteome. PCAS is available at http://pak.cbi.pku.edu.cn/proteome/gca.php Conclusion PCAS gives better annotation coverage for model proteomes by employing a wider collection of available algorithms. Besides presenting the most confident annotation data, PCAS also allows customized query so users can inspect statistically less significant boundary information as well. Therefore, besides providing general annotation information, PCAS could be used as a discovery platform. We plan to update PCAS twice a year. We will upgrade PCAS when new proteome annotation algorithms

  14. A semi-automatic annotation tool for cooking video

    Science.gov (United States)

    Bianco, Simone; Ciocca, Gianluigi; Napoletano, Paolo; Schettini, Raimondo; Margherita, Roberto; Marini, Gianluca; Gianforme, Giorgio; Pantaleo, Giuseppe

    2013-03-01

    In order to create a cooking assistant application to guide the users in the preparation of the dishes relevant to their profile diets and food preferences, it is necessary to accurately annotate the video recipes, identifying and tracking the foods of the cook. These videos present particular annotation challenges such as frequent occlusions, food appearance changes, etc. Manually annotate the videos is a time-consuming, tedious and error-prone task. Fully automatic tools that integrate computer vision algorithms to extract and identify the elements of interest are not error free, and false positive and false negative detections need to be corrected in a post-processing stage. We present an interactive, semi-automatic tool for the annotation of cooking videos that integrates computer vision techniques under the supervision of the user. The annotation accuracy is increased with respect to completely automatic tools and the human effort is reduced with respect to completely manual ones. The performance and usability of the proposed tool are evaluated on the basis of the time and effort required to annotate the same video sequences.

  15. Experiments with crowdsourced re-annotation of a POS tagging data set

    DEFF Research Database (Denmark)

    Hovy, Dirk; Plank, Barbara; Søgaard, Anders

    2014-01-01

    Crowdsourcing lets us collect multiple annotations for an item from several annotators. Typically, these are annotations for non-sequential classification tasks. While there has been some work on crowdsourcing named entity annotations, researchers have assumed that syntactic tasks such as part......-of-speech (POS) tagging cannot be crowdsourced. This paper shows that workers can actually annotate sequential data almost as well as experts. Further, we show that the models learned from crowdsourced annotations fare as well as the models learned from expert annotations in downstream tasks....

  16. MPEG-7 based video annotation and browsing

    Science.gov (United States)

    Hoeynck, Michael; Auweiler, Thorsten; Wellhausen, Jens

    2003-11-01

    The huge amount of multimedia data produced worldwide requires annotation in order to enable universal content access and to provide content-based search-and-retrieval functionalities. Since manual video annotation can be time consuming, automatic annotation systems are required. We review recent approaches to content-based indexing and annotation of videos for different kind of sports and describe our approach to automatic annotation of equestrian sports videos. We especially concentrate on MPEG-7 based feature extraction and content description, where we apply different visual descriptors for cut detection. Further, we extract the temporal positions of single obstacles on the course by analyzing MPEG-7 edge information. Having determined single shot positions as well as the visual highlights, the information is jointly stored with meta-textual information in an MPEG-7 description scheme. Based on this information, we generate content summaries which can be utilized in a user-interface in order to provide content-based access to the video stream, but further for media browsing on a streaming server.

  17. Development of phytase-expressing chlamydomonas reinhardtii for monogastric animal nutrition.

    Science.gov (United States)

    Erpel, Fernanda; Restovic, Franko; Arce-Johnson, Patricio

    2016-03-12

    In plant-derived animal feedstuffs, nearly 80 % of the total phosphorus content is stored as phytate. However, phytate is poorly digested by monogastric animals such as poultry, swine and fish, as they lack the hydrolytic enzyme phytase; hence it is regarded as a nutritionally inactive compound from a phosphate bioavailability point of view. In addition, it also chelates important dietary minerals and essential amino acids. Therefore, dietary supplementation with bioavailable phosphate and exogenous phytases are required to achieve optimal animal growth. In order to simplify the obtaining and application processes, we developed a phytase expressing cell-wall deficient Chlamydomonas reinhardtii strain. In this work, we developed a transgenic microalgae expressing a fungal phytase to be used as a food supplement for monogastric animals. A codon optimized Aspergillus niger PhyA E228K phytase (mE228K) with improved performance at pH 3.5 was transformed into the plastid genome of Chlamydomonas reinhardtii in order to achieve optimal expression. We engineered a plastid-specific construction harboring the mE228K gene, which allowed us to obtain high expression level lines with measurable in vitro phytase activity. Both wild-type and cell-wall deficient strains were selected, as the latter is a suitable model for animal digestion. The enzymatic activity of the mE228K expressing lines were approximately 5 phytase units per gram of dry biomass at pH 3.5 and 37 °C, similar to physiological conditions and economically competitive for use in commercial activities. A reference basis for the future biotechnological application of microalgae is provided in this work. A cell-wall deficient transgenic microalgae with phytase activity at gastrointestinal pH and temperature and suitable for pellet formation was developed. Moreover, the associated microalgae biomass costs of this strain would be between US$5 and US$60 per ton of feedstuff, similar to the US$2 per ton of feedstuffs

  18. Ground Truth Annotation in T Analyst

    DEFF Research Database (Denmark)

    2015-01-01

    This video shows how to annotate the ground truth tracks in the thermal videos. The ground truth tracks are produced to be able to compare them to tracks obtained from a Computer Vision tracking approach. The program used for annotation is T-Analyst, which is developed by Aliaksei Laureshyn, Ph...

  19. Effect of mutagen combined action on Chlamydomonas reinhardtii cells. II

    International Nuclear Information System (INIS)

    Podstavkova, S.; Vlcek, D.; Dubovsky, J.

    1978-01-01

    The effect of UV radiation and UV radiation combined with alkylnitrosourea derivatives (N-methyl-N-nitrosourea and N-ethyl-N-nitrosourea) was observed on survival of cells of the algae Chlamydomonas reinhardtii. In particular, single parts were evaluated of the overall lethal effect - dying of cells before division and dying of cells after division. It was found that the combined action of low doses of UV radiation and alkylnitrosoureas result in a pronounced protective effect which manifests itself by a higher frequency of surviving cells than was that effected by the action of alkylnitrosoureas alone. As a result of combined action with higher doses of UV radiation this effect is lost, and the resultant values will come close to the theoretically anticipated values. This gradual transition from a protective to an additive effect mainly manifests itself by changes in the proportion of cells dying before division. (author)

  20. Propagating annotations of molecular networks using in silico fragmentation.

    Science.gov (United States)

    da Silva, Ricardo R; Wang, Mingxun; Nothias, Louis-Félix; van der Hooft, Justin J J; Caraballo-Rodríguez, Andrés Mauricio; Fox, Evan; Balunas, Marcy J; Klassen, Jonathan L; Lopes, Norberto Peporine; Dorrestein, Pieter C

    2018-04-18

    The annotation of small molecules is one of the most challenging and important steps in untargeted mass spectrometry analysis, as most of our biological interpretations rely on structural annotations. Molecular networking has emerged as a structured way to organize and mine data from untargeted tandem mass spectrometry (MS/MS) experiments and has been widely applied to propagate annotations. However, propagation is done through manual inspection of MS/MS spectra connected in the spectral networks and is only possible when a reference library spectrum is available. One of the alternative approaches used to annotate an unknown fragmentation mass spectrum is through the use of in silico predictions. One of the challenges of in silico annotation is the uncertainty around the correct structure among the predicted candidate lists. Here we show how molecular networking can be used to improve the accuracy of in silico predictions through propagation of structural annotations, even when there is no match to a MS/MS spectrum in spectral libraries. This is accomplished through creating a network consensus of re-ranked structural candidates using the molecular network topology and structural similarity to improve in silico annotations. The Network Annotation Propagation (NAP) tool is accessible through the GNPS web-platform https://gnps.ucsd.edu/ProteoSAFe/static/gnps-theoretical.jsp.

  1. Gene calling and bacterial genome annotation with BG7.

    Science.gov (United States)

    Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

    2015-01-01

    New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related with biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is sequence error tolerant maintaining their capabilities for the annotation of highly fragmented genomes or for annotating mixed sequences coming from several genomes (as those obtained through metagenomics samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).

  2. Annotation of the Evaluative Language in a Dependency Treebank

    Directory of Open Access Journals (Sweden)

    Šindlerová Jana

    2017-12-01

    Full Text Available In the paper, we present our efforts to annotate evaluative language in the Prague Dependency Treebank 2.0. The project is a follow-up of the series of annotations of small plaintext corpora. It uses automatic identification of potentially evaluative nodes through mapping a Czech subjectivity lexicon to syntactically annotated data. These nodes are then manually checked by an annotator and either dismissed as standing in a non-evaluative context, or confirmed as evaluative. In the latter case, information about the polarity orientation, the source and target of evaluation is added by the annotator. The annotations unveiled several advantages and disadvantages of the chosen framework. The advantages involve more structured and easy-to-handle environment for the annotator, visibility of syntactic patterning of the evaluative state, effective solving of discontinuous structures or a new perspective on the influence of good/bad news. The disadvantages include little capability of treating cases with evaluation spread among more syntactically connected nodes at once, little capability of treating metaphorical expressions, or disregarding the effects of negation and intensification in the current scheme.

  3. The caBIG annotation and image Markup project.

    Science.gov (United States)

    Channin, David S; Mongkolwat, Pattanasak; Kleper, Vladimir; Sepukar, Kastubh; Rubin, Daniel L

    2010-04-01

    Image annotation and markup are at the core of medical interpretation in both the clinical and the research setting. Digital medical images are managed with the DICOM standard format. While DICOM contains a large amount of meta-data about whom, where, and how the image was acquired, DICOM says little about the content or meaning of the pixel data. An image annotation is the explanatory or descriptive information about the pixel data of an image that is generated by a human or machine observer. An image markup is the graphical symbols placed over the image to depict an annotation. While DICOM is the standard for medical image acquisition, manipulation, transmission, storage, and display, there are no standards for image annotation and markup. Many systems expect annotation to be reported verbally, while markups are stored in graphical overlays or proprietary formats. This makes it difficult to extract and compute with both of them. The goal of the Annotation and Image Markup (AIM) project is to develop a mechanism, for modeling, capturing, and serializing image annotation and markup data that can be adopted as a standard by the medical imaging community. The AIM project produces both human- and machine-readable artifacts. This paper describes the AIM information model, schemas, software libraries, and tools so as to prepare researchers and developers for their use of AIM.

  4. Interoperable Multimedia Annotation and Retrieval for the Tourism Sector

    NARCIS (Netherlands)

    Chatzitoulousis, Antonios; Efraimidis, Pavlos S.; Athanasiadis, I.N.

    2015-01-01

    The Atlas Metadata System (AMS) employs semantic web annotation techniques in order to create an interoperable information annotation and retrieval platform for the tourism sector. AMS adopts state-of-the-art metadata vocabularies, annotation techniques and semantic web technologies.

  5. Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.

    Science.gov (United States)

    Cohen, K Bretonnel; Lanfranchi, Arrick; Choi, Miji Joo-Young; Bada, Michael; Baumgartner, William A; Panteleyeva, Natalya; Verspoor, Karin; Palmer, Martha; Hunter, Lawrence E

    2017-08-17

    Coreference resolution is the task of finding strings in text that have the same referent as other strings. Failures of coreference resolution are a common cause of false negatives in information extraction from the scientific literature. In order to better understand the nature of the phenomenon of coreference in biomedical publications and to increase performance on the task, we annotated the Colorado Richly Annotated Full Text (CRAFT) corpus with coreference relations. The corpus was manually annotated with coreference relations, including identity and appositives for all coreferring base noun phrases. The OntoNotes annotation guidelines, with minor adaptations, were used. Interannotator agreement ranges from 0.480 (entity-based CEAF) to 0.858 (Class-B3), depending on the metric that is used to assess it. The resulting corpus adds nearly 30,000 annotations to the previous release of the CRAFT corpus. Differences from related projects include a much broader definition of markables, connection to extensive annotation of several domain-relevant semantic classes, and connection to complete syntactic annotation. Tool performance was benchmarked on the data. A publicly available out-of-the-box, general-domain coreference resolution system achieved an F-measure of 0.14 (B3), while a simple domain-adapted rule-based system achieved an F-measure of 0.42. An ensemble of the two reached F of 0.46. Following the IDENTITY chains in the data would add 106,263 additional named entities in the full 97-paper corpus, for an increase of 76% percent in the semantic classes of the eight ontologies that have been annotated in earlier versions of the CRAFT corpus. The project produced a large data set for further investigation of coreference and coreference resolution in the scientific literature. The work raised issues in the phenomenon of reference in this domain and genre, and the paper proposes that many mentions that would be considered generic in the general domain are not

  6. A Novel Approach to Semantic and Coreference Annotation at LLNL

    Energy Technology Data Exchange (ETDEWEB)

    Firpo, M

    2005-02-04

    A case is made for the importance of high quality semantic and coreference annotation. The challenges of providing such annotation are described. Asperger's Syndrome is introduced, and the connections are drawn between the needs of text annotation and the abilities of persons with Asperger's Syndrome to meet those needs. Finally, a pilot program is recommended wherein semantic annotation is performed by people with Asperger's Syndrome. The primary points embodied in this paper are as follows: (1) Document annotation is essential to the Natural Language Processing (NLP) projects at Lawrence Livermore National Laboratory (LLNL); (2) LLNL does not currently have a system in place to meet its need for text annotation; (3) Text annotation is challenging for a variety of reasons, many related to its very rote nature; (4) Persons with Asperger's Syndrome are particularly skilled at rote verbal tasks, and behavioral experts agree that they would excel at text annotation; and (6) A pilot study is recommend in which two to three people with Asperger's Syndrome annotate documents and then the quality and throughput of their work is evaluated relative to that of their neuro-typical peers.

  7. Gene Ontology annotation of the rice blast fungus, Magnaporthe oryzae

    Directory of Open Access Journals (Sweden)

    Deng Jixin

    2009-02-01

    Full Text Available Abstract Background Magnaporthe oryzae, the causal agent of blast disease of rice, is the most destructive disease of rice worldwide. The genome of this fungal pathogen has been sequenced and an automated annotation has recently been updated to Version 6 http://www.broad.mit.edu/annotation/genome/magnaporthe_grisea/MultiDownloads.html. However, a comprehensive manual curation remains to be performed. Gene Ontology (GO annotation is a valuable means of assigning functional information using standardized vocabulary. We report an overview of the GO annotation for Version 5 of M. oryzae genome assembly. Methods A similarity-based (i.e., computational GO annotation with manual review was conducted, which was then integrated with a literature-based GO annotation with computational assistance. For similarity-based GO annotation a stringent reciprocal best hits method was used to identify similarity between predicted proteins of M. oryzae and GO proteins from multiple organisms with published associations to GO terms. Significant alignment pairs were manually reviewed. Functional assignments were further cross-validated with manually reviewed data, conserved domains, or data determined by wet lab experiments. Additionally, biological appropriateness of the functional assignments was manually checked. Results In total, 6,286 proteins received GO term assignment via the homology-based annotation, including 2,870 hypothetical proteins. Literature-based experimental evidence, such as microarray, MPSS, T-DNA insertion mutation, or gene knockout mutation, resulted in 2,810 proteins being annotated with GO terms. Of these, 1,673 proteins were annotated with new terms developed for Plant-Associated Microbe Gene Ontology (PAMGO. In addition, 67 experiment-determined secreted proteins were annotated with PAMGO terms. Integration of the two data sets resulted in 7,412 proteins (57% being annotated with 1,957 distinct and specific GO terms. Unannotated proteins

  8. Systems Biology of Lipid Body Formation in the Green Alga Chlamydomonas reinhardtii

    Energy Technology Data Exchange (ETDEWEB)

    Goodenough, Ursula [Washington Univ., St. Louis, MO (United States)

    2017-11-10

    The project aimed to deepen our understanding of alga triacylglycerol (TAG) production to undergird explorations of using algal TAG as a source of biodiesel fuel. Our published contributions included the following: 1) Development of a rapid assay for TAG in algal cultures which was widely distributed to the algal community. 2) A comprehensive transcriptome analysis of the development of the ultra-high-TAG “obese” phenotype In Chlamydomonas reinhardtii. 3) A comprehensive biochemical and ultrastructural analysis of the cell wall of Nannochloropsis gaditana, whose walls render it both growth-hardy and difficult to rupture for TAG recovery. A manuscript in preparation considers the autophagy response in C. reinhardtii and its entrance into stationary phase, both having an impact on TAG production.

  9. Combined evidence annotation of transposable elements in genome sequences.

    Directory of Open Access Journals (Sweden)

    Hadi Quesneville

    2005-07-01

    Full Text Available Transposable elements (TEs are mobile, repetitive sequences that make up significant fractions of metazoan genomes. Despite their near ubiquity and importance in genome and chromosome biology, most efforts to annotate TEs in genome sequences rely on the results of a single computational program, RepeatMasker. In contrast, recent advances in gene annotation indicate that high-quality gene models can be produced from combining multiple independent sources of computational evidence. To elevate the quality of TE annotations to a level comparable to that of gene models, we have developed a combined evidence-model TE annotation pipeline, analogous to systems used for gene annotation, by integrating results from multiple homology-based and de novo TE identification methods. As proof of principle, we have annotated "TE models" in Drosophila melanogaster Release 4 genomic sequences using the combined computational evidence derived from RepeatMasker, BLASTER, TBLASTX, all-by-all BLASTN, RECON, TE-HMM and the previous Release 3.1 annotation. Our system is designed for use with the Apollo genome annotation tool, allowing automatic results to be curated manually to produce reliable annotations. The euchromatic TE fraction of D. melanogaster is now estimated at 5.3% (cf. 3.86% in Release 3.1, and we found a substantially higher number of TEs (n = 6,013 than previously identified (n = 1,572. Most of the new TEs derive from small fragments of a few hundred nucleotides long and highly abundant families not previously annotated (e.g., INE-1. We also estimated that 518 TE copies (8.6% are inserted into at least one other TE, forming a nest of elements. The pipeline allows rapid and thorough annotation of even the most complex TE models, including highly deleted and/or nested elements such as those often found in heterochromatic sequences. Our pipeline can be easily adapted to other genome sequences, such as those of the D. melanogaster heterochromatin or other

  10. NoGOA: predicting noisy GO annotations using evidences and sparse representation.

    Science.gov (United States)

    Yu, Guoxian; Lu, Chang; Wang, Jun

    2017-07-21

    Gene Ontology (GO) is a community effort to represent functional features of gene products. GO annotations (GOA) provide functional associations between GO terms and gene products. Due to resources limitation, only a small portion of annotations are manually checked by curators, and the others are electronically inferred. Although quality control techniques have been applied to ensure the quality of annotations, the community consistently report that there are still considerable noisy (or incorrect) annotations. Given the wide application of annotations, however, how to identify noisy annotations is an important but yet seldom studied open problem. We introduce a novel approach called NoGOA to predict noisy annotations. NoGOA applies sparse representation on the gene-term association matrix to reduce the impact of noisy annotations, and takes advantage of sparse representation coefficients to measure the semantic similarity between genes. Secondly, it preliminarily predicts noisy annotations of a gene based on aggregated votes from semantic neighborhood genes of that gene. Next, NoGOA estimates the ratio of noisy annotations for each evidence code based on direct annotations in GOA files archived on different periods, and then weights entries of the association matrix via estimated ratios and propagates weights to ancestors of direct annotations using GO hierarchy. Finally, it integrates evidence-weighted association matrix and aggregated votes to predict noisy annotations. Experiments on archived GOA files of six model species (H. sapiens, A. thaliana, S. cerevisiae, G. gallus, B. Taurus and M. musculus) demonstrate that NoGOA achieves significantly better results than other related methods and removing noisy annotations improves the performance of gene function prediction. The comparative study justifies the effectiveness of integrating evidence codes with sparse representation for predicting noisy GO annotations. Codes and datasets are available at http://mlda.swu.edu.cn/codes.php?name=NoGOA .

  11. Plann: A command-line application for annotating plastome sequences.

    Science.gov (United States)

    Huang, Daisie I; Cronk, Quentin C B

    2015-08-01

    Plann automates the process of annotating a plastome sequence in GenBank format for either downstream processing or for GenBank submission by annotating a new plastome based on a similar, well-annotated plastome. Plann is a Perl script to be executed on the command line. Plann compares a new plastome sequence to the features annotated in a reference plastome and then shifts the intervals of any matching features to the locations in the new plastome. Plann's output can be used in the National Center for Biotechnology Information's tbl2asn to create a Sequin file for GenBank submission. Unlike Web-based annotation packages, Plann is a locally executable script that will accurately annotate a plastome sequence to a locally specified reference plastome. Because it executes from the command line, it is ready to use in other software pipelines and can be easily rerun as a draft plastome is improved.

  12. Semantator: annotating clinical narratives with semantic web ontologies.

    Science.gov (United States)

    Song, Dezhao; Chute, Christopher G; Tao, Cui

    2012-01-01

    To facilitate clinical research, clinical data needs to be stored in a machine processable and understandable way. Manual annotating clinical data is time consuming. Automatic approaches (e.g., Natural Language Processing systems) have been adopted to convert such data into structured formats; however, the quality of such automatically extracted data may not always be satisfying. In this paper, we propose Semantator, a semi-automatic tool for document annotation with Semantic Web ontologies. With a loaded free text document and an ontology, Semantator supports the creation/deletion of ontology instances for any document fragment, linking/disconnecting instances with the properties in the ontology, and also enables automatic annotation by connecting to the NCBO annotator and cTAKES. By representing annotations in Semantic Web standards, Semantator supports reasoning based upon the underlying semantics of the owl:disjointWith and owl:equivalentClass predicates. We present discussions based on user experiences of using Semantator.

  13. A revised mineral nutrient supplement increases biomass and growth rate in Chlamydomonas reinhardtii.

    Science.gov (United States)

    Kropat, Janette; Hong-Hermesdorf, Anne; Casero, David; Ent, Petr; Castruita, Madeli; Pellegrini, Matteo; Merchant, Sabeeha S; Malasarn, Davin

    2011-06-01

    Interest in exploiting algae as a biofuel source and the role of inorganic nutrient deficiency in inducing triacylglyceride (TAG) accumulation in cells necessitates a strategy to efficiently formulate species-specific culture media that can easily be manipulated. Using the reference organism Chlamydomonas reinhardtii, we tested the hypothesis that modeling trace element supplements after the cellular ionome would result in optimized cell growth. We determined the trace metal content of several commonly used Chlamydomonas strains in various culture conditions and developed a revised trace element solution to parallel these measurements. Comparison of cells growing in the revised supplement versus a traditional trace element solution revealed faster growth rates and higher maximum cell densities with the revised recipe. RNA-seq analysis of cultures growing in the traditional versus revised medium suggest that the variation in transcriptomes was smaller than that found between different wild-type strains grown in traditional Hutner's supplement. Visual observation did not reveal defects in cell motility or mating efficiency in the new supplement. Ni²⁺-inducible expression from the CYC6 promoter remained a useful tool, albeit with an increased requirement for Ni²⁺ because of the introduction of an EDTA buffer system in the revised medium. Other advantages include more facile preparation of trace element stock solutions, a reduction in total chemical use, a more consistent batch-to-batch formulation and long-term stability (tested up to 5 years). Under the new growth regime, we analyzed cells growing under different macro- and micronutrient deficiencies. TAG accumulation in N deficiency is comparable in the new medium. Fe and Zn deficiency also induced TAG accumulation, as suggested by Nile Red staining. This approach can be used to efficiently optimize culture conditions for other algal species to improve growth and to assay cell physiology. © 2011 The Authors

  14. A Chlamydomonas-derived Human Papillomavirus 16 E7 vaccine induces specific tumor protection.

    Directory of Open Access Journals (Sweden)

    Olivia C Demurtas

    Full Text Available The E7 protein of the Human Papillomavirus (HPV type 16, being involved in malignant cellular transformation, represents a key antigen for developing therapeutic vaccines against HPV-related lesions and cancers. Recombinant production of this vaccine antigen in an active form and in compliance with good manufacturing practices (GMP plays a crucial role for developing effective vaccines. E7-based therapeutic vaccines produced in plants have been shown to be active in tumor regression and protection in pre-clinical models. However, some drawbacks of in whole-plant vaccine production encouraged us to explore the production of the E7-based therapeutic vaccine in Chlamydomonas reinhardtii, an organism easy to grow and transform and fully amenable to GMP guidelines.An expression cassette encoding E7GGG, a mutated, attenuated form of the E7 oncoprotein, alone or as a fusion with affinity tags (His6 or FLAG, under the control of the C. reinhardtii chloroplast psbD 5' UTR and the psbA 3' UTR, was introduced into the C. reinhardtii chloroplast genome by homologous recombination. The protein was mostly soluble and reached 0.12% of total soluble proteins. Affinity purification was optimized and performed for both tagged forms. Induction of specific anti-E7 IgGs and E7-specific T-cell proliferation were detected in C57BL/6 mice vaccinated with total Chlamydomonas extract and with affinity-purified protein. High levels of tumor protection were achieved after challenge with a tumor cell line expressing the E7 protein.The C. reinhardtii chloroplast is a suitable expression system for the production of the E7GGG protein, in a soluble, immunogenic form. The production in contained and sterile conditions highlights the potential of microalgae as alternative platforms for the production of vaccines for human uses.

  15. Using single cell cultivation system for on-chip monitoring of the interdivision timer in Chlamydomonas reinhardtii cell cycle

    Directory of Open Access Journals (Sweden)

    Soloviev Mikhail

    2010-09-01

    Full Text Available Abstract Regulation of cell cycle progression in changing environments is vital for cell survival and maintenance, and different regulation mechanisms based on cell size and cell cycle time have been proposed. To determine the mechanism of cell cycle regulation in the unicellular green algae Chlamydomonas reinhardtii, we developed an on-chip single-cell cultivation system that allows for the strict control of the extracellular environment. We divided the Chlamydomonas cell cycle into interdivision and division phases on the basis of changes in cell size and found that, regardless of the amount of photosynthetically active radiation (PAR and the extent of illumination, the length of the interdivision phase was inversely proportional to the rate of increase of cell volume. Their product remains constant indicating the existence of an 'interdivision timer'. The length of the division phase, in contrast, remained nearly constant. Cells cultivated under light-dark-light conditions did not divide unless they had grown to twice their initial volume during the first light period. This indicates the existence of a 'commitment sizer'. The ratio of the cell volume at the beginning of the division phase to the initial cell volume determined the number of daughter cells, indicating the existence of a 'mitotic sizer'.

  16. Ontology modularization to improve semantic medical image annotation.

    Science.gov (United States)

    Wennerberg, Pinar; Schulz, Klaus; Buitelaar, Paul

    2011-02-01

    Searching for medical images and patient reports is a significant challenge in a clinical setting. The contents of such documents are often not described in sufficient detail thus making it difficult to utilize the inherent wealth of information contained within them. Semantic image annotation addresses this problem by describing the contents of images and reports using medical ontologies. Medical images and patient reports are then linked to each other through common annotations. Subsequently, search algorithms can more effectively find related sets of documents on the basis of these semantic descriptions. A prerequisite to realizing such a semantic search engine is that the data contained within should have been previously annotated with concepts from medical ontologies. One major challenge in this regard is the size and complexity of medical ontologies as annotation sources. Manual annotation is particularly time consuming labor intensive in a clinical environment. In this article we propose an approach to reducing the size of clinical ontologies for more efficient manual image and text annotation. More precisely, our goal is to identify smaller fragments of a large anatomy ontology that are relevant for annotating medical images from patients suffering from lymphoma. Our work is in the area of ontology modularization, which is a recent and active field of research. We describe our approach, methods and data set in detail and we discuss our results. Copyright © 2010 Elsevier Inc. All rights reserved.

  17. [Prescription annotations in Welfare Pharmacy].

    Science.gov (United States)

    Han, Yi

    2018-03-01

    Welfare Pharmacy contains medical formulas documented by the government and official prescriptions used by the official pharmacy in the pharmaceutical process. In the last years of Southern Song Dynasty, anonyms gave a lot of prescription annotations, made textual researches for the name, source, composition and origin of the prescriptions, and supplemented important historical data of medical cases and researched historical facts. The annotations of Welfare Pharmacy gathered the essence of medical theory, and can be used as precious materials to correctly understand the syndrome differentiation, compatibility regularity and clinical application of prescriptions. This article deeply investigated the style and form of the prescription annotations in Welfare Pharmacy, the name of prescriptions and the evolution of terminology, the major functions of the prescriptions, processing methods, instructions for taking medicine and taboos of prescriptions, the medical cases and clinical efficacy of prescriptions, the backgrounds, sources, composition and cultural meanings of prescriptions, proposed that the prescription annotations played an active role in the textual dissemination, patent medicine production and clinical diagnosis and treatment of Welfare Pharmacy. This not only helps understand the changes in the names and terms of traditional Chinese medicines in Welfare Pharmacy, but also provides the basis for understanding the knowledge sources, compatibility regularity, important drug innovations and clinical medications of prescriptions in Welfare Pharmacy. Copyright© by the Chinese Pharmaceutical Association.

  18. A framework for annotating human genome in disease context.

    Science.gov (United States)

    Xu, Wei; Wang, Huisong; Cheng, Wenqing; Fu, Dong; Xia, Tian; Kibbe, Warren A; Lin, Simon M

    2012-01-01

    Identification of gene-disease association is crucial to understanding disease mechanism. A rapid increase in biomedical literatures, led by advances of genome-scale technologies, poses challenge for manually-curated-based annotation databases to characterize gene-disease associations effectively and timely. We propose an automatic method-The Disease Ontology Annotation Framework (DOAF) to provide a comprehensive annotation of the human genome using the computable Disease Ontology (DO), the NCBO Annotator service and NCBI Gene Reference Into Function (GeneRIF). DOAF can keep the resulting knowledgebase current by periodically executing automatic pipeline to re-annotate the human genome using the latest DO and GeneRIF releases at any frequency such as daily or monthly. Further, DOAF provides a computable and programmable environment which enables large-scale and integrative analysis by working with external analytic software or online service platforms. A user-friendly web interface (doa.nubic.northwestern.edu) is implemented to allow users to efficiently query, download, and view disease annotations and the underlying evidences.

  19. Effect of red and blue light on the timing of cyclin-dependent kinase activity and the timing of cell division in Chlamydomonas reinhardtii

    Czech Academy of Sciences Publication Activity Database

    Oldenhof, H.; Bišová, Kateřina; Ende, H.; Zachleder, Vilém

    2004-01-01

    Roč. 42, - (2004), s. 341-348 ISSN 0981-9428 R&D Projects: GA ČR GA204/02/1438 Institutional research plan: CEZ:AV0Z5020903 Keywords : blue light * chlamydomonas reingardtii * cell cycle Subject RIV: EA - Cell Biology Impact factor: 1.414, year: 2004

  20. Annotating abstract pronominal anaphora in the DAD project

    DEFF Research Database (Denmark)

    Navarretta, Costanza; Olsen, Sussi Anni

    2008-01-01

    n this paper we present an extension of the MATE/GNOME annotation scheme for anaphora (Poesio 2004) which accounts for abstract anaphora in Danish and Italian. By abstract anaphora it is here meant pronouns whose linguistic antecedents are verbal phrases, clauses and discourse segments. The exten......n this paper we present an extension of the MATE/GNOME annotation scheme for anaphora (Poesio 2004) which accounts for abstract anaphora in Danish and Italian. By abstract anaphora it is here meant pronouns whose linguistic antecedents are verbal phrases, clauses and discourse segments....... The extended scheme, which we call the DAD annotation scheme, allows to annotate information about abstract anaphora which is important to investigate their use, see Webber (1988), Gundel et al. (2003), Navarretta (2004) and which can influence their automatic treatment. Intercoder agreement scores obtained...... by applying the DAD annotation scheme on texts and dialogues in the two languages are given and show that th information proposed in the scheme can be recognised in a reliable way....

  1. Annotated bibliography

    International Nuclear Information System (INIS)

    1997-08-01

    Under a cooperative agreement with the U.S. Department of Energy's Office of Science and Technology, Waste Policy Institute (WPI) is conducting a five-year research project to develop a research-based approach for integrating communication products in stakeholder involvement related to innovative technology. As part of the research, WPI developed this annotated bibliography which contains almost 100 citations of articles/books/resources involving topics related to communication and public involvement aspects of deploying innovative cleanup technology. To compile the bibliography, WPI performed on-line literature searches (e.g., Dialog, International Association of Business Communicators Public Relations Society of America, Chemical Manufacturers Association, etc.), consulted past years proceedings of major environmental waste cleanup conferences (e.g., Waste Management), networked with professional colleagues and DOE sites to gather reports or case studies, and received input during the August 1996 Research Design Team meeting held to discuss the project's research methodology. Articles were selected for annotation based upon their perceived usefulness to the broad range of public involvement and communication practitioners

  2. Supporting Keyword Search for Image Retrieval with Integration of Probabilistic Annotation

    Directory of Open Access Journals (Sweden)

    Tie Hua Zhou

    2015-05-01

    Full Text Available The ever-increasing quantities of digital photo resources are annotated with enriching vocabularies to form semantic annotations. Photo-sharing social networks have boosted the need for efficient and intuitive querying to respond to user requirements in large-scale image collections. In order to help users formulate efficient and effective image retrieval, we present a novel integration of a probabilistic model based on keyword query architecture that models the probability distribution of image annotations: allowing users to obtain satisfactory results from image retrieval via the integration of multiple annotations. We focus on the annotation integration step in order to specify the meaning of each image annotation, thus leading to the most representative annotations of the intent of a keyword search. For this demonstration, we show how a probabilistic model has been integrated to semantic annotations to allow users to intuitively define explicit and precise keyword queries in order to retrieve satisfactory image results distributed in heterogeneous large data sources. Our experiments on SBU (collected by Stony Brook University database show that (i our integrated annotation contains higher quality representatives and semantic matches; and (ii the results indicating annotation integration can indeed improve image search result quality.

  3. Flocculation of Chlamydomonas reinhardtii with Different Phenotypic Traits by Metal Cations and High pH

    Directory of Open Access Journals (Sweden)

    Jianhua Fan

    2017-11-01

    Full Text Available Concentrating algal cells by flocculation as a prelude to centrifugation could significantly reduce the energy and cost of harvesting the algae. However, how variation in phenotypic traits such as cell surface features, cell size and motility alter the efficiency of metal cation and pH-induced flocculation is not well understood. Our results demonstrate that both wild-type and cell wall-deficient strains of the green unicellular alga Chlamydomonas reinhardtii efficiently flocculate (>90% at an elevated pH of the medium (pH 11 upon the addition of divalent cations such as calcium and magnesium (>5 mM. The trivalent ferric cation (at 10 mM proved to be essential for promoting flocculation under weak alkaline conditions (pH ∼8.5, with a maximum efficiency that exceeded 95 and 85% for wild-type CC1690 and the cell wall-deficient sta6 mutant, respectively. Near complete flocculation could be achieved using a combination of 5 mM calcium and a pH >11, while the medium recovered following cell removal could be re-cycled without affecting algal growth rates. Moreover, the absence of starch in the cell had little overall impact on flocculation efficiency. These findings contribute to our understanding of flocculation in different Chlamydomonas strains and have implications with respect to inexpensive methods for harvesting algae with different phenotypic traits. Additional research on the conditions (e.g., pH and metal ions used for efficient flocculation of diverse algal groups with diverse characteristics, at both small and large scale, will help establish inexpensive procedures for harvesting cell biomass.

  4. Quick Pad Tagger : An Efficient Graphical User Interface for Building Annotated Corpora with Multiple Annotation Layers

    OpenAIRE

    Marc Schreiber; Kai Barkschat; Bodo Kraft; Albert Zundorf

    2015-01-01

    More and more domain specific applications in the internet make use of Natural Language Processing (NLP) tools (e. g. Information Extraction systems). The output quality of these applications relies on the output quality of the used NLP tools. Often, the quality can be increased by annotating a domain specific corpus. However, annotating a corpus is a time consuming and exhaustive task. To reduce the annota tion time we present...

  5. Effect of temperature and light intensity on growth and photosynthetic activity of Chlamydomonas Reinhardtii

    International Nuclear Information System (INIS)

    Alfonsel, M.; Fernandez Gonzalez, J.

    1986-01-01

    The effect of five temperatures (15, 20, 25, 30 and 35 0 C) and two levels of illumination on growth and photosynthetic activity of Chlamydomonas reinhardtii has been studied. The growth of the cultures was evaluated by optical density. Photosynthetic activity has been carried out studying either the assimilation rate of CO 2 labelled with C 14 or the oxygen evolution by means of polarographic measurements. The maximum photosynthetic rate has been obtained at 25 0 C for the lower lavel of illumination (2400 lux) and at 35 0 C for the higher one (13200 lux). These results suggest an interacton of temperature and illumination on photosynthetic activity. (author)

  6. Supplementary Material for: BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal M.; Alam, Intikhab; Bajic, Vladimir B.

    2015-01-01

    Abstract Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACONâ s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .

  7. Extending eScience Provenance with User-Submitted Semantic Annotations

    Science.gov (United States)

    Michaelis, J.; Zednik, S.; West, P.; Fox, P. A.; McGuinness, D. L.

    2010-12-01

    eScience based systems generate provenance of their data products, related to such things as: data processing, data collection conditions, expert evaluation, and data product quality. Recent advances in web-based technology offer users the possibility of making annotations to both data products and steps in accompanying provenance traces, thereby expanding the utility of such provenance for others. These contributing users may have varying backgrounds, ranging from system experts to outside domain experts to citizen scientists. Furthermore, such users may wish to make varying types of annotations - ranging from documenting the purpose of a provenance step to raising concerns about the quality of data dependencies. Semantic Web technologies allow for such kinds of rich annotations to be made to provenance through the use of ontology vocabularies for (i) organizing provenance, and (ii) organizing user/annotation classifications. Furthermore, through Linked Data practices, Semantic linkages may be made from provenance steps to external data of interest. A desire for Semantically-annotated provenance has been motivated by data management issues in the Mauna Loa Solar Observatory’s (MLSO) Advanced Coronal Observing System (ACOS). In ACOS, photomoeter-based readings are taken of solar activity and subsequently processed into final data products consumable by end users. At intermediate stages of ACOS processing, factors such as evaluations by human experts and weather conditions are logged, which could impact data product quality. If such factors are linked via user-submitted annotations to provenance, it could be significantly beneficial for other users. Likewise, the background of a user could impact the credibility of their annotations. For example, an annotation made by a citizen scientist describing the purpose of a provenance step may not be as reliable as a similar annotation made by an ACOS project member. For this work, we have developed a software package that

  8. Harnessing Collaborative Annotations on Online Formative Assessments

    Science.gov (United States)

    Lin, Jian-Wei; Lai, Yuan-Cheng

    2013-01-01

    This paper harnesses collaborative annotations by students as learning feedback on online formative assessments to improve the learning achievements of students. Through the developed Web platform, students can conduct formative assessments, collaboratively annotate, and review historical records in a convenient way, while teachers can generate…

  9. Crowdsourcing and annotating NER for Twitter #drift

    DEFF Research Database (Denmark)

    Fromreide, Hege; Hovy, Dirk; Søgaard, Anders

    2014-01-01

    We present two new NER datasets for Twitter; a manually annotated set of 1,467 tweets (kappa=0.942) and a set of 2,975 expert-corrected, crowdsourced NER annotated tweets from the dataset described in Finin et al. (2010). In our experiments with these datasets, we observe two important points: (a......) language drift on Twitter is significant, and while off-the-shelf systems have been reported to perform well on in-sample data, they often perform poorly on new samples of tweets, (b) state-of-the-art performance across various datasets can beobtained from crowdsourced annotations, making it more feasible...

  10. Uptake pf 203Hg++ and sup(115M)Cd++ by growing chlamydomonas reinhardi under different conditions

    International Nuclear Information System (INIS)

    Macka, W.; Stehlik, G.; Whilidal, H.; Washuettl, J.; Bancher, E.

    1977-09-01

    The uptake of labelled mercury and cadmium ions by living and dead cells Chlamydomonas reinhardi WT + was measured at 25 deg C in minimal and optimal conditions (dark/nitrogen or light/air, respectively). In each case incorporation was completely independent of external energy. Living and dead cells incorporate almost the same amount of the added heavy metal ions; after about 4 to 8 hours saturation was obtained. Furthermore the distribution of mercury in the system alga/culture medium/gas phase and the amount of mercury and cadmium adsorbed by cells were studied. (author)

  11. SNAD: sequence name annotation-based designer

    Directory of Open Access Journals (Sweden)

    Gorbalenya Alexander E

    2009-08-01

    Full Text Available Abstract Background A growing diversity of biological data is tagged with unique identifiers (UIDs associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a tedious exercise prone to mistakes and omissions. Results Here we introduce SNAD (Sequence Name Annotation-based Designer that mediates automatic conversion of sequence UIDs (associated with multiple alignment or phylogenetic tree, or supplied as plain text list into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. Conclusion A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.

  12. An open annotation ontology for science on web 3.0.

    Science.gov (United States)

    Ciccarese, Paolo; Ocana, Marco; Garcia Castro, Leyla Jael; Das, Sudeshna; Clark, Tim

    2011-05-17

    There is currently a gap between the rich and expressive collection of published biomedical ontologies, and the natural language expression of biomedical papers consumed on a daily basis by scientific researchers. The purpose of this paper is to provide an open, shareable structure for dynamic integration of biomedical domain ontologies with the scientific document, in the form of an Annotation Ontology (AO), thus closing this gap and enabling application of formal biomedical ontologies directly to the literature as it emerges. Initial requirements for AO were elicited by analysis of integration needs between biomedical web communities, and of needs for representing and integrating results of biomedical text mining. Analysis of strengths and weaknesses of previous efforts in this area was also performed. A series of increasingly refined annotation tools were then developed along with a metadata model in OWL, and deployed for feedback and additional requirements the ontology to users at a major pharmaceutical company and a major academic center. Further requirements and critiques of the model were also elicited through discussions with many colleagues and incorporated into the work. This paper presents Annotation Ontology (AO), an open ontology in OWL-DL for annotating scientific documents on the web. AO supports both human and algorithmic content annotation. It enables "stand-off" or independent metadata anchored to specific positions in a web document by any one of several methods. In AO, the document may be annotated but is not required to be under update control of the annotator. AO contains a provenance model to support versioning, and a set model for specifying groups and containers of annotation. AO is freely available under open source license at http://purl.org/ao/, and extensive documentation including screencasts is available on AO's Google Code page: http://code.google.com/p/annotation-ontology/ . The Annotation Ontology meets critical requirements for

  13. ACID: annotation of cassette and integron data

    Directory of Open Access Journals (Sweden)

    Stokes Harold W

    2009-04-01

    Full Text Available Abstract Background Although integrons and their associated gene cassettes are present in ~10% of bacteria and can represent up to 3% of the genome in which they are found, very few have been properly identified and annotated in public databases. These genetic elements have been overlooked in comparison to other vectors that facilitate lateral gene transfer between microorganisms. Description By automating the identification of integron integrase genes and of the non-coding cassette-associated attC recombination sites, we were able to assemble a database containing all publicly available sequence information regarding these genetic elements. Specialists manually curated the database and this information was used to improve the automated detection and annotation of integrons and their encoded gene cassettes. ACID (annotation of cassette and integron data can be searched using a range of queries and the data can be downloaded in a number of formats. Users can readily annotate their own data and integrate it into ACID using the tools provided. Conclusion ACID is a community resource providing easy access to annotations of integrons and making tools available to detect them in novel sequence data. ACID also hosts a forum to prompt integron-related discussion, which can hopefully lead to a more universal definition of this genetic element.

  14. Use of Annotations for Component and Framework Interoperability

    Science.gov (United States)

    David, O.; Lloyd, W.; Carlson, J.; Leavesley, G. H.; Geter, F.

    2009-12-01

    The popular programming languages Java and C# provide annotations, a form of meta-data construct. Software frameworks for web integration, web services, database access, and unit testing now take advantage of annotations to reduce the complexity of APIs and the quantity of integration code between the application and framework infrastructure. Adopting annotation features in frameworks has been observed to lead to cleaner and leaner application code. The USDA Object Modeling System (OMS) version 3.0 fully embraces the annotation approach and additionally defines a meta-data standard for components and models. In version 3.0 framework/model integration previously accomplished using API calls is now achieved using descriptive annotations. This enables the framework to provide additional functionality non-invasively such as implicit multithreading, and auto-documenting capabilities while achieving a significant reduction in the size of the model source code. Using a non-invasive methodology leads to models and modeling components with only minimal dependencies on the modeling framework. Since models and modeling components are not directly bound to framework by the use of specific APIs and/or data types they can more easily be reused both within the framework as well as outside of it. To study the effectiveness of an annotation based framework approach with other modeling frameworks, a framework-invasiveness study was conducted to evaluate the effects of framework design on model code quality. A monthly water balance model was implemented across several modeling frameworks and several software metrics were collected. The metrics selected were measures of non-invasive design methods for modeling frameworks from a software engineering perspective. It appears that the use of annotations positively impacts several software quality measures. In a next step, the PRMS model was implemented in OMS 3.0 and is currently being implemented for water supply forecasting in the

  15. Creating Gaze Annotations in Head Mounted Displays

    DEFF Research Database (Denmark)

    Mardanbeigi, Diako; Qvarfordt, Pernilla

    2015-01-01

    To facilitate distributed communication in mobile settings, we developed GazeNote for creating and sharing gaze annotations in head mounted displays (HMDs). With gaze annotations it possible to point out objects of interest within an image and add a verbal description. To create an annota- tion...

  16. Ontological Annotation with WordNet

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob; Hohimer, Ryan E.; White, Amanda M.

    2006-06-06

    Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

  17. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  18. Chlamydomonas DYX1C1/PF23 is essential for axonemal assembly and proper morphology of inner dynein arms.

    Directory of Open Access Journals (Sweden)

    Ryosuke Yamamoto

    2017-09-01

    Full Text Available Cytoplasmic assembly of ciliary dyneins, a process known as preassembly, requires numerous non-dynein proteins, but the identities and functions of these proteins are not fully elucidated. Here, we show that the classical Chlamydomonas motility mutant pf23 is defective in the Chlamydomonas homolog of DYX1C1. The pf23 mutant has a 494 bp deletion in the DYX1C1 gene and expresses a shorter DYX1C1 protein in the cytoplasm. Structural analyses, using cryo-ET, reveal that pf23 axonemes lack most of the inner dynein arms. Spectral counting confirms that DYX1C1 is essential for the assembly of the majority of ciliary inner dynein arms (IDA as well as a fraction of the outer dynein arms (ODA. A C-terminal truncation of DYX1C1 shows a reduction in a subset of these ciliary IDAs. Sucrose gradients of cytoplasmic extracts show that preassembled ciliary dyneins are reduced compared to wild-type, which suggests an important role in dynein complex stability. The role of PF23/DYX1C1 remains unknown, but we suggest that DYX1C1 could provide a scaffold for macromolecular assembly.

  19. Functional annotation of hierarchical modularity.

    Directory of Open Access Journals (Sweden)

    Kanchana Padmanabhan

    Full Text Available In biological networks of molecular interactions in a cell, network motifs that are biologically relevant are also functionally coherent, or form functional modules. These functionally coherent modules combine in a hierarchical manner into larger, less cohesive subsystems, thus revealing one of the essential design principles of system-level cellular organization and function-hierarchical modularity. Arguably, hierarchical modularity has not been explicitly taken into consideration by most, if not all, functional annotation systems. As a result, the existing methods would often fail to assign a statistically significant functional coherence score to biologically relevant molecular machines. We developed a methodology for hierarchical functional annotation. Given the hierarchical taxonomy of functional concepts (e.g., Gene Ontology and the association of individual genes or proteins with these concepts (e.g., GO terms, our method will assign a Hierarchical Modularity Score (HMS to each node in the hierarchy of functional modules; the HMS score and its p-value measure functional coherence of each module in the hierarchy. While existing methods annotate each module with a set of "enriched" functional terms in a bag of genes, our complementary method provides the hierarchical functional annotation of the modules and their hierarchically organized components. A hierarchical organization of functional modules often comes as a bi-product of cluster analysis of gene expression data or protein interaction data. Otherwise, our method will automatically build such a hierarchy by directly incorporating the functional taxonomy information into the hierarchy search process and by allowing multi-functional genes to be part of more than one component in the hierarchy. In addition, its underlying HMS scoring metric ensures that functional specificity of the terms across different levels of the hierarchical taxonomy is properly treated. We have evaluated our

  20. Fluid Annotations in a Open World

    DEFF Research Database (Denmark)

    Zellweger, Polle Trescott; Bouvin, Niels Olof; Jehøj, Henning

    2001-01-01

    Fluid Documents use animated typographical changes to provide a novel and appealing user experience for hypertext browsing and for viewing document annotations in context. This paper describes an effort to broaden the utility of Fluid Documents by using the open hypermedia Arakne Environment to l...... to layer fluid annotations and links on top of abitrary HTML pages on the World Wide Web. Changes to both Fluid Documents and Arakne are required....

  1. Physiological and Biochemical characterization of Chlamydomonas sp. the Hydrogen Production's Strain

    International Nuclear Information System (INIS)

    Chader, S.; Belhamel, M.; H Hacene

    2006-01-01

    The hydrogen produced by biological way became, one of the most interesting subjects of research relating to development the energy system starting from renewable sources. This study describes the closed relation between the physiological behaviour, biochemical and rate of gases produced by Chlamydomonas sp. strain AT14, isolated in the area of Touat (the Sahara Algerian) and cultivated in a toric photo-bioreactor. A considerable growth was noted, where the concentration of the biomass double in only two days after incubation. The micro-algal cells present a 100% of viability, which relocate has satisfactory behaviour in the toric engine. In addition, the displacement water level in the system of measurement implies has gas production (0.1 ml) in coordination with the anaerobic period of the reactional enclosure. The yield of this way of hydrogen production is depending on the species used, the light intensity, and the conditions of culture. (authors)

  2. Cell growth and protein synthesis of unicellular green alga Chlamydomonas in heavy water

    International Nuclear Information System (INIS)

    Ishida, M.R.

    1983-01-01

    The effects of heavy water on the cell growth and protein synthesis of the photoautotrophically growing Chlamydomonas cells were studied. The growth rate of the cells is inversely proportional to the concentrations of heavy water. The cells can barely live in 90% heavy water, but they die in 99.85% heavy water within a few days. Incorporation of 14 Cleucine into cells is markedly stimulated by heavy water of various concentrations between 30 and 99.85% in the case of the short time incubation. Contrary to this, in the long time incubation as several days, heavy water inhibits the protein synthesis. Such two modes of the protein synthetic activities are dependent upon the incubation time of the cells grown photoautotrophically in the heavy water media. (author)

  3. Taxonomic identity and physiological ecology of Chlamydomonas hedleyi sp. nov. , algal flagellate symbiont from the foraminifer Archaias angulatus

    Energy Technology Data Exchange (ETDEWEB)

    Lee, J J; Crockett, L J; Hagen, J; Stone, R J

    1975-12-31

    The fine structure of the symbiotic alga isolated from the foraminiferan Archaias angulatus (Fichtel et Moll) DeMontfort is typical of the Chlorophyceae of the volvocalean and chlorococcalean lines. Spherical non-motile cells, 10--14 ..mu..m in diameter, characterize the dominant life cycle phase. Long oval motile forms with truncated apices are present 3--5 days after transfer to fresh medium. The pyrenoids are embedded anteriorly in the singly bilobed chloroplast and are surrounded by a sheath of starch platelets. In spite of the non-motile state of cells in older cultures (which is perhaps a reflection of its normally symbiotic condition), the alga is identified as a species of the volvocalean genus Chlamydomonas and is named C. hedleyi sp. nov. The symbiont has no vitamin or organic requirements but growth is increased threefold in the presence of thiamine, and twofold in the presence of 1 ..mu..m glutamic acid, histidine and methionine. Urea was the best nitrogen source tested. Purines and pyrimidines did not serve as nitrogen sources. Chlamydomonas hedleyi grows well in a salinity range of 6- greater than 52 per thousand and a pH range of 6--8.5. 7.04 x 10/sup -7/ M carbon h/sup -1/ g/sup -1/ was fixed by the symbiont, 57 percent being released into the medium as a chromatographically homogeneous organic molecule provisionally identified as mannitol.

  4. Model and Interoperability using Meta Data Annotations

    Science.gov (United States)

    David, O.

    2011-12-01

    Software frameworks and architectures are in need for meta data to efficiently support model integration. Modelers have to know the context of a model, often stepping into modeling semantics and auxiliary information usually not provided in a concise structure and universal format, consumable by a range of (modeling) tools. XML often seems the obvious solution for capturing meta data, but its wide adoption to facilitate model interoperability is limited by XML schema fragmentation, complexity, and verbosity outside of a data-automation process. Ontologies seem to overcome those shortcomings, however the practical significance of their use remains to be demonstrated. OMS version 3 took a different approach for meta data representation. The fundamental building block of a modular model in OMS is a software component representing a single physical process, calibration method, or data access approach. Here, programing language features known as Annotations or Attributes were adopted. Within other (non-modeling) frameworks it has been observed that annotations lead to cleaner and leaner application code. Framework-supported model integration, traditionally accomplished using Application Programming Interfaces (API) calls is now achieved using descriptive code annotations. Fully annotated components for various hydrological and Ag-system models now provide information directly for (i) model assembly and building, (ii) data flow analysis for implicit multi-threading or visualization, (iii) automated and comprehensive model documentation of component dependencies, physical data properties, (iv) automated model and component testing, calibration, and optimization, and (v) automated audit-traceability to account for all model resources leading to a particular simulation result. Such a non-invasive methodology leads to models and modeling components with only minimal dependencies on the modeling framework but a strong reference to its originating code. Since models and

  5. Black English Annotations for Elementary Reading Programs.

    Science.gov (United States)

    Prasad, Sandre

    This report describes a program that uses annotations in the teacher's editions of existing reading programs to indicate the characteristics of black English that may interfere with the reading process of black children. The first part of the report provides a rationale for the annotation approach, explaining that the discrepancy between written…

  6. Special Issue: Annotated Bibliography for Volumes XIX-XXXII.

    Science.gov (United States)

    Pullin, Richard A.

    1998-01-01

    This annotated bibliography lists 310 articles from the "Journal of Cooperative Education" from Volumes XIX-XXXII, 1983-1997. Annotations are presented in the order they appear in the journal; author and subject indexes are provided. (JOW)

  7. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    Directory of Open Access Journals (Sweden)

    Gustavo Arango-Argoty

    Full Text Available Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/, which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.

  8. MetaStorm: A Public Resource for Customizable Metagenomics Annotation.

    Science.gov (United States)

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.

  9. MIPS: analysis and annotation of genome information in 2007.

    Science.gov (United States)

    Mewes, H W; Dietmann, S; Frishman, D; Gregory, R; Mannhaupt, G; Mayer, K F X; Münsterkötter, M; Ruepp, A; Spannagl, M; Stümpflen, V; Rattei, T

    2008-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. To cope with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).

  10. MetaStorm: A Public Resource for Customizable Metagenomics Annotation

    Science.gov (United States)

    Arango-Argoty, Gustavo; Singh, Gargi; Heath, Lenwood S.; Pruden, Amy; Xiao, Weidong; Zhang, Liqing

    2016-01-01

    Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution. PMID:27632579

  11. Improving Microbial Genome Annotations in an Integrated Database Context

    Science.gov (United States)

    Chen, I-Min A.; Markowitz, Victor M.; Chu, Ken; Anderson, Iain; Mavromatis, Konstantinos; Kyrpides, Nikos C.; Ivanova, Natalia N.

    2013-01-01

    Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG) family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/. PMID:23424620

  12. Improving microbial genome annotations in an integrated database context.

    Directory of Open Access Journals (Sweden)

    I-Min A Chen

    Full Text Available Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/.

  13. Response of Chlamydomonas reinhardtii to naphthenic acid exposure

    Energy Technology Data Exchange (ETDEWEB)

    Goff, K.; Wilson, K. [Saskatchewan Univ., Saskatoon, SK (Canada); Headley, J. [Environment Canada, Ottawa, ON (Canada)

    2010-07-01

    This study examined the feasibility of using a model organism for the algal bioremediation of oil sands process water (OSPW), a highly toxic mixture of sediments, bitumen, ions, and organic and inorganic compounds. Naphthenic acids (NAs) are a contaminant class of particular concern. Bioremediation techniques may mitigate toxicity of OSPW in general, and NAs in particular. Although most studies on the biodegradation of NAs focus on the role of bacteria, fungi, and emergent macrophytes, studies have indicated that algae may also play a key role through direct degradation, biosequestration, or photosynthetic aeration of waters to promote other biological reactions. Chlamydomonas frigida is of particular interest, but no cultures are currently available. Therefore, this study used C. reinhardtii, a well-characterized model organism, to begin analysis of potential algal bioremediation of OSPW. Cultures of C. reinhardtii were grown heterotrophically in nutrient media spiked with a dilution series of NAs. Culture densities were measured to compile growth curves over time, changes in rate of growth, and survivability. Negative ion electrospray mass spectrometry was used to determine the concentration of NAs in solution in relation to growth rate and culture density. The study determined the tolerance of C. reinhardtii to NAs. A mechanism for this tolerance was then proposed.

  14. Histones of Chlamydomonas reinhardtii. Synthesis, acetylation, and methylation

    International Nuclear Information System (INIS)

    Waterborg, J.H.; Robertson, A.J.; Tatar, D.L.; Borza, C.M.; Davie, J.R.

    1995-01-01

    Histones of the green alga Chlamydomonas reinhardtii were prepared by a new method and fractionated by reversed-phase high-performance liquid chromatography. Acid-urea-Triton gel analysis and tritiated acetate labeling demonstrated high levels of steady-state acetylation for the single histone H3 protein, in contrast to low levels on histones H4 and H2B. Twenty percent of histone H3 is subject to dynamic acetylation with, on average, three acetylated lysine residues per protein molecule. Histone synthesis in light-dark-synchronized cultures was biphasic with pattern differences between two histone H1 variants, between two H2A variants, and between H2B and ubiquitinated H2B. Automated protein sequence analysis of histone H3 demonstrated a site-specific pattern of steady-state acetylation between 7 and 17% at five of the six amino-terminal lysines and of monomethylation between 5 and 81% at five of the eight amino-terminal lysines in a pattern that may limit dynamic acetylation. An algal histone H3 sequence was confirmed by protein sequencing with a since threonine as residue 28 instead of the serine(28)-alanine(29) sequence, present in all other known plant and animal H3 histones

  15. The Bologna Annotation Resource (BAR 3.0): improving protein functional annotation.

    Science.gov (United States)

    Profiti, Giuseppe; Martelli, Pier Luigi; Casadio, Rita

    2017-07-03

    BAR 3.0 updates our server BAR (Bologna Annotation Resource) for predicting protein structural and functional features from sequence. We increase data volume, query capabilities and information conveyed to the user. The core of BAR 3.0 is a graph-based clustering procedure of UniProtKB sequences, following strict pairwise similarity criteria (sequence identity ≥40% with alignment coverage ≥90%). Each cluster contains the available annotation downloaded from UniProtKB, GO, PFAM and PDB. After statistical validation, GO terms and PFAM domains are cluster-specific and annotate new sequences entering the cluster after satisfying similarity constraints. BAR 3.0 includes 28 869 663 sequences in 1 361 773 clusters, of which 22.2% (22 241 661 sequences) and 47.4% (24 555 055 sequences) have at least one validated GO term and one PFAM domain, respectively. 1.4% of the clusters (36% of all sequences) include PDB structures and the cluster is associated to a hidden Markov model that allows building template-target alignment suitable for structural modeling. Some other 3 399 026 sequences are singletons. BAR 3.0 offers an improved search interface, allowing queries by UniProtKB-accession, Fasta sequence, GO-term, PFAM-domain, organism, PDB and ligand/s. When evaluated on the CAFA2 targets, BAR 3.0 largely outperforms our previous version and scores among state-of-the-art methods. BAR 3.0 is publicly available and accessible at http://bar.biocomp.unibo.it/bar3. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. BAT: An open-source, web-based audio events annotation tool

    OpenAIRE

    Blai Meléndez-Catalan, Emilio Molina, Emilia Gómez

    2017-01-01

    In this paper we present BAT (BMAT Annotation Tool), an open-source, web-based tool for the manual annotation of events in audio recordings developed at BMAT (Barcelona Music and Audio Technologies). The main feature of the tool is that it provides an easy way to annotate the salience of simultaneous sound sources. Additionally, it allows to define multiple ontologies to adapt to multiple tasks and offers the possibility to cross-annotate audio data. Moreover, it is easy to install and deploy...

  17. Annotating images by mining image search results.

    Science.gov (United States)

    Wang, Xin-Jing; Zhang, Lei; Li, Xirong; Ma, Wei-Ying

    2008-11-01

    Although it has been studied for years by the computer vision and machine learning communities, image annotation is still far from practical. In this paper, we propose a novel attempt at model-free image annotation, which is a data-driven approach that annotates images by mining their search results. Some 2.4 million images with their surrounding text are collected from a few photo forums to support this approach. The entire process is formulated in a divide-and-conquer framework where a query keyword is provided along with the uncaptioned image to improve both the effectiveness and efficiency. This is helpful when the collected data set is not dense everywhere. In this sense, our approach contains three steps: 1) the search process to discover visually and semantically similar search results, 2) the mining process to identify salient terms from textual descriptions of the search results, and 3) the annotation rejection process to filter out noisy terms yielded by Step 2. To ensure real-time annotation, two key techniques are leveraged-one is to map the high-dimensional image visual features into hash codes, the other is to implement it as a distributed system, of which the search and mining processes are provided as Web services. As a typical result, the entire process finishes in less than 1 second. Since no training data set is required, our approach enables annotating with unlimited vocabulary and is highly scalable and robust to outliers. Experimental results on both real Web images and a benchmark image data set show the effectiveness and efficiency of the proposed algorithm. It is also worth noting that, although the entire approach is illustrated within the divide-and conquer framework, a query keyword is not crucial to our current implementation. We provide experimental results to prove this.

  18. Collaborative Paper-Based Annotation of Lecture Slides

    Science.gov (United States)

    Steimle, Jurgen; Brdiczka, Oliver; Muhlhauser, Max

    2009-01-01

    In a study of notetaking in university courses, we found that the large majority of students prefer paper to computer-based media like Tablet PCs for taking notes and making annotations. Based on this finding, we developed CoScribe, a concept and system which supports students in making collaborative handwritten annotations on printed lecture…

  19. Music journals in South Africa 1854-2010: an annotated bibliography

    African Journals Online (AJOL)

    Music journals in South Africa 1854-2010: an annotated bibliography. ... The article focuses on presenting an annotated bibliography of music journalism in South Africa from as early as 1854 until 2010. Most of ... Key words: annotated bibliography, electronic journals, music journals, periodicals, South African music history ...

  20. An annotated corpus with nanomedicine and pharmacokinetic parameters.

    Science.gov (United States)

    Lewinski, Nastassja A; Jimenez, Ivan; McInnes, Bridget T

    2017-01-01

    A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration's Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided.

  1. Plann: A command-line application for annotating plastome sequences1

    Science.gov (United States)

    Huang, Daisie I.; Cronk, Quentin C. B.

    2015-01-01

    Premise of the study: Plann automates the process of annotating a plastome sequence in GenBank format for either downstream processing or for GenBank submission by annotating a new plastome based on a similar, well-annotated plastome. Methods and Results: Plann is a Perl script to be executed on the command line. Plann compares a new plastome sequence to the features annotated in a reference plastome and then shifts the intervals of any matching features to the locations in the new plastome. Plann’s output can be used in the National Center for Biotechnology Information’s tbl2asn to create a Sequin file for GenBank submission. Conclusions: Unlike Web-based annotation packages, Plann is a locally executable script that will accurately annotate a plastome sequence to a locally specified reference plastome. Because it executes from the command line, it is ready to use in other software pipelines and can be easily rerun as a draft plastome is improved. PMID:26312193

  2. Evaluation of web-based annotation of ophthalmic images for multicentric clinical trials.

    Science.gov (United States)

    Chalam, K V; Jain, P; Shah, V A; Shah, Gaurav Y

    2006-06-01

    An Internet browser-based annotation system can be used to identify and describe features in digitalized retinal images, in multicentric clinical trials, in real time. In this web-based annotation system, the user employs a mouse to draw and create annotations on a transparent layer, that encapsulates the observations and interpretations of a specific image. Multiple annotation layers may be overlaid on a single image. These layers may correspond to annotations by different users on the same image or annotations of a temporal sequence of images of a disease process, over a period of time. In addition, geometrical properties of annotated figures may be computed and measured. The annotations are stored in a central repository database on a server, which can be retrieved by multiple users in real time. This system facilitates objective evaluation of digital images and comparison of double-blind readings of digital photographs, with an identifiable audit trail. Annotation of ophthalmic images allowed clinically feasible and useful interpretation to track properties of an area of fundus pathology. This provided an objective method to monitor properties of pathologies over time, an essential component of multicentric clinical trials. The annotation system also allowed users to view stereoscopic images that are stereo pairs. This web-based annotation system is useful and valuable in monitoring patient care, in multicentric clinical trials, telemedicine, teaching and routine clinical settings.

  3. Essential Annotation Schema for Ecology (EASE)-A framework supporting the efficient data annotation and faceted navigation in ecology.

    Science.gov (United States)

    Pfaff, Claas-Thido; Eichenberg, David; Liebergesell, Mario; König-Ries, Birgitta; Wirth, Christian

    2017-01-01

    Ecology has become a data intensive science over the last decades which often relies on the reuse of data in cross-experimental analyses. However, finding data which qualifies for the reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only) is the most widely used search strategy although it is known to be inaccurate. Faceted navigation is providing a filter mechanism which is based on fine granular metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters which allows to refine and improve the results towards more relevance. We developed a framework for the efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines.

  4. Essential Annotation Schema for Ecology (EASE-A framework supporting the efficient data annotation and faceted navigation in ecology.

    Directory of Open Access Journals (Sweden)

    Claas-Thido Pfaff

    Full Text Available Ecology has become a data intensive science over the last decades which often relies on the reuse of data in cross-experimental analyses. However, finding data which qualifies for the reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only is the most widely used search strategy although it is known to be inaccurate. Faceted navigation is providing a filter mechanism which is based on fine granular metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters which allows to refine and improve the results towards more relevance. We developed a framework for the efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines.

  5. Annotating Logical Forms for EHR Questions.

    Science.gov (United States)

    Roberts, Kirk; Demner-Fushman, Dina

    2016-05-01

    This paper discusses the creation of a semantically annotated corpus of questions about patient data in electronic health records (EHRs). The goal is to provide the training data necessary for semantic parsers to automatically convert EHR questions into a structured query. A layered annotation strategy is used which mirrors a typical natural language processing (NLP) pipeline. First, questions are syntactically analyzed to identify multi-part questions. Second, medical concepts are recognized and normalized to a clinical ontology. Finally, logical forms are created using a lambda calculus representation. We use a corpus of 446 questions asking for patient-specific information. From these, 468 specific questions are found containing 259 unique medical concepts and requiring 53 unique predicates to represent the logical forms. We further present detailed characteristics of the corpus, including inter-annotator agreement results, and describe the challenges automatic NLP systems will face on this task.

  6. Managing and Querying Image Annotation and Markup in XML.

    Science.gov (United States)

    Wang, Fusheng; Pan, Tony; Sharma, Ashish; Saltz, Joel

    2010-01-01

    Proprietary approaches for representing annotations and image markup are serious barriers for researchers to share image data and knowledge. The Annotation and Image Markup (AIM) project is developing a standard based information model for image annotation and markup in health care and clinical trial environments. The complex hierarchical structures of AIM data model pose new challenges for managing such data in terms of performance and support of complex queries. In this paper, we present our work on managing AIM data through a native XML approach, and supporting complex image and annotation queries through native extension of XQuery language. Through integration with xService, AIM databases can now be conveniently shared through caGrid.

  7. Managing and Querying Image Annotation and Markup in XML

    Science.gov (United States)

    Wang, Fusheng; Pan, Tony; Sharma, Ashish; Saltz, Joel

    2010-01-01

    Proprietary approaches for representing annotations and image markup are serious barriers for researchers to share image data and knowledge. The Annotation and Image Markup (AIM) project is developing a standard based information model for image annotation and markup in health care and clinical trial environments. The complex hierarchical structures of AIM data model pose new challenges for managing such data in terms of performance and support of complex queries. In this paper, we present our work on managing AIM data through a native XML approach, and supporting complex image and annotation queries through native extension of XQuery language. Through integration with xService, AIM databases can now be conveniently shared through caGrid. PMID:21218167

  8. Expressed Peptide Tags: An additional layer of data for genome annotation

    Energy Technology Data Exchange (ETDEWEB)

    Savidor, Alon [ORNL; Donahoo, Ryan S [ORNL; Hurtado-Gonzales, Oscar [University of Tennessee, Knoxville (UTK); Verberkmoes, Nathan C [ORNL; Shah, Manesh B [ORNL; Lamour, Kurt H [ORNL; McDonald, W Hayes [ORNL

    2006-01-01

    While genome sequencing is becoming ever more routine, genome annotation remains a challenging process. Identification of the coding sequences within the genomic milieu presents a tremendous challenge, especially for eukaryotes with their complex gene architectures. Here we present a method to assist the annotation process through the use of proteomic data and bioinformatics. Mass spectra of digested protein preparations of the organism of interest were acquired and searched against a protein database created by a six frame translation of the genome. The identified peptides were mapped back to the genome, compared to the current annotation, and then categorized as supporting or extending the current genome annotation. We named the classified peptides Expressed Peptide Tags (EPTs). The well annotated bacterium Rhodopseudomonas palustris was used as a control for the method and showed high degree of correlation between EPT mapping and the current annotation, with 86% of the EPTs confirming existing gene calls and less than 1% of the EPTs expanding on the current annotation. The eukaryotic plant pathogens Phytophthora ramorum and Phytophthora sojae, whose genomes have been recently sequenced and are much less well annotated, were also subjected to this method. A series of algorithmic steps were taken to increase the confidence of EPT identification for these organisms, including generation of smaller sub-databases to be searched against, and definition of EPT criteria that accommodates the more complex eukaryotic gene architecture. As expected, the analysis of the Phytophthora species showed less correlation between EPT mapping and their current annotation. While ~77% of Phytophthora EPTs supported the current annotation, a portion of them (7.2% and 12.6% for P. ramorum and P. sojae, respectively) suggested modification to current gene calls or identified novel genes that were missed by the current genome annotation of these organisms.

  9. Fuzzy Emotional Semantic Analysis and Automated Annotation of Scene Images

    Directory of Open Access Journals (Sweden)

    Jianfang Cao

    2015-01-01

    Full Text Available With the advances in electronic and imaging techniques, the production of digital images has rapidly increased, and the extraction and automated annotation of emotional semantics implied by images have become issues that must be urgently addressed. To better simulate human subjectivity and ambiguity for understanding scene images, the current study proposes an emotional semantic annotation method for scene images based on fuzzy set theory. A fuzzy membership degree was calculated to describe the emotional degree of a scene image and was implemented using the Adaboost algorithm and a back-propagation (BP neural network. The automated annotation method was trained and tested using scene images from the SUN Database. The annotation results were then compared with those based on artificial annotation. Our method showed an annotation accuracy rate of 91.2% for basic emotional values and 82.4% after extended emotional values were added, which correspond to increases of 5.5% and 8.9%, respectively, compared with the results from using a single BP neural network algorithm. Furthermore, the retrieval accuracy rate based on our method reached approximately 89%. This study attempts to lay a solid foundation for the automated emotional semantic annotation of more types of images and therefore is of practical significance.

  10. Cellulose degradation and assimilation by the unicellular phototrophic eukaryote Chlamydomonas reinhardtii.

    Science.gov (United States)

    Blifernez-Klassen, Olga; Klassen, Viktor; Doebbe, Anja; Kersting, Klaudia; Grimm, Philipp; Wobbe, Lutz; Kruse, Olaf

    2012-01-01

    Plants convert sunlight to biomass, which is primarily composed of lignocellulose, the most abundant natural biopolymer and a potential feedstock for fuel and chemical production. Cellulose assimilation has so far only been described for heterotrophic organisms that rely on photosynthetically active primary producers of organic compounds. Among phototrophs, the unicellular green microalga Chlamydomonas reinhardtii is widely known as one of the best established model organisms. It occupies many habitats, including aquatic and soil ecosystems. This ubiquity underscores the versatile metabolic properties of this microorganism. Here we present yet another paradigm of adaptation for C. reinhardtii, highlighting its photoheterotrophic ability to utilize cellulose for growth in the absence of other carbon sources. When grown under CO(2)-limiting conditions in the light, secretion of endo-β-1,4-glucanases by the cell causes digestion of exogenous cellulose, followed by cellobiose uptake and assimilation. Phototrophic microbes like C. reinhardtii may thus serve as biocatalysts for cellulosic biofuel production.

  11. Multiview Hessian regularization for image annotation.

    Science.gov (United States)

    Liu, Weifeng; Tao, Dacheng

    2013-07-01

    The rapid development of computer hardware and Internet technology makes large scale data dependent models computationally tractable, and opens a bright avenue for annotating images through innovative machine learning algorithms. Semisupervised learning (SSL) therefore received intensive attention in recent years and was successfully deployed in image annotation. One representative work in SSL is Laplacian regularization (LR), which smoothes the conditional distribution for classification along the manifold encoded in the graph Laplacian, however, it is observed that LR biases the classification function toward a constant function that possibly results in poor generalization. In addition, LR is developed to handle uniformly distributed data (or single-view data), although instances or objects, such as images and videos, are usually represented by multiview features, such as color, shape, and texture. In this paper, we present multiview Hessian regularization (mHR) to address the above two problems in LR-based image annotation. In particular, mHR optimally combines multiple HR, each of which is obtained from a particular view of instances, and steers the classification function that varies linearly along the data manifold. We apply mHR to kernel least squares and support vector machines as two examples for image annotation. Extensive experiments on the PASCAL VOC'07 dataset validate the effectiveness of mHR by comparing it with baseline algorithms, including LR and HR.

  12. Ten steps to get started in Genome Assembly and Annotation

    Science.gov (United States)

    Dominguez Del Angel, Victoria; Hjerde, Erik; Sterck, Lieven; Capella-Gutierrez, Salvadors; Notredame, Cederic; Vinnere Pettersson, Olga; Amselem, Joelle; Bouri, Laurent; Bocs, Stephanie; Klopp, Christophe; Gibrat, Jean-Francois; Vlasova, Anna; Leskosek, Brane L.; Soler, Lucile; Binzer-Panchal, Mahesh; Lantz, Henrik

    2018-01-01

    As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR). PMID:29568489

  13. Sharing Map Annotations in Small Groups: X Marks the Spot

    Science.gov (United States)

    Congleton, Ben; Cerretani, Jacqueline; Newman, Mark W.; Ackerman, Mark S.

    Advances in location-sensing technology, coupled with an increasingly pervasive wireless Internet, have made it possible (and increasingly easy) to access and share information with context of one’s geospatial location. We conducted a four-phase study, with 27 students, to explore the practices surrounding the creation, interpretation and sharing of map annotations in specific social contexts. We found that annotation authors consider multiple factors when deciding how to annotate maps, including the perceived utility to the audience and how their contributions will reflect on the image they project to others. Consumers of annotations value the novelty of information, but must be convinced of the author’s credibility. In this paper we describe our study, present the results, and discuss implications for the design of software for sharing map annotations.

  14. Annotation-based feature extraction from sets of SBML models.

    Science.gov (United States)

    Alm, Rebekka; Waltemath, Dagmar; Wolfien, Markus; Wolkenhauer, Olaf; Henkel, Ron

    2015-01-01

    Model repositories such as BioModels Database provide computational models of biological systems for the scientific community. These models contain rich semantic annotations that link model entities to concepts in well-established bio-ontologies such as Gene Ontology. Consequently, thematically similar models are likely to share similar annotations. Based on this assumption, we argue that semantic annotations are a suitable tool to characterize sets of models. These characteristics improve model classification, allow to identify additional features for model retrieval tasks, and enable the comparison of sets of models. In this paper we discuss four methods for annotation-based feature extraction from model sets. We tested all methods on sets of models in SBML format which were composed from BioModels Database. To characterize each of these sets, we analyzed and extracted concepts from three frequently used ontologies, namely Gene Ontology, ChEBI and SBO. We find that three out of the methods are suitable to determine characteristic features for arbitrary sets of models: The selected features vary depending on the underlying model set, and they are also specific to the chosen model set. We show that the identified features map on concepts that are higher up in the hierarchy of the ontologies than the concepts used for model annotations. Our analysis also reveals that the information content of concepts in ontologies and their usage for model annotation do not correlate. Annotation-based feature extraction enables the comparison of model sets, as opposed to existing methods for model-to-keyword comparison, or model-to-model comparison.

  15. Roadmap for annotating transposable elements in eukaryote genomes.

    Science.gov (United States)

    Permal, Emmanuelle; Flutre, Timothée; Quesneville, Hadi

    2012-01-01

    Current high-throughput techniques have made it feasible to sequence even the genomes of non-model organisms. However, the annotation process now represents a bottleneck to genome analysis, especially when dealing with transposable elements (TE). Combined approaches, using both de novo and knowledge-based methods to detect TEs, are likely to produce reasonably comprehensive and sensitive results. This chapter provides a roadmap for researchers involved in genome projects to address this issue. At each step of the TE annotation process, from the identification of TE families to the annotation of TE copies, we outline the tools and good practices to be used.

  16. Database resources for the tuberculosis community.

    Science.gov (United States)

    Lew, Jocelyne M; Mao, Chunhong; Shukla, Maulik; Warren, Andrew; Will, Rebecca; Kuznetsov, Dmitry; Xenarios, Ioannis; Robertson, Brian D; Gordon, Stephen V; Schnappinger, Dirk; Cole, Stewart T; Sobral, Bruno

    2013-01-01

    Access to online repositories for genomic and associated "-omics" datasets is now an essential part of everyday research activity. It is important therefore that the Tuberculosis community is aware of the databases and tools available to them online, as well as for the database hosts to know what the needs of the research community are. One of the goals of the Tuberculosis Annotation Jamboree, held in Washington DC on March 7th-8th 2012, was therefore to provide an overview of the current status of three key Tuberculosis resources, TubercuList (tuberculist.epfl.ch), TB Database (www.tbdb.org), and Pathosystems Resource Integration Center (PATRIC, www.patricbrc.org). Here we summarize some key updates and upcoming features in TubercuList, and provide an overview of the PATRIC site and its online tools for pathogen RNA-Seq analysis. Copyright © 2012 Elsevier Ltd. All rights reserved.

  17. The Effects of Multimedia Annotations on Iranian EFL Learners’ L2 Vocabulary Learning

    Directory of Open Access Journals (Sweden)

    Saeideh Ahangari

    2010-05-01

    Full Text Available In our modern technological world, Computer-Assisted Language learning (CALL is a new realm towards learning a language in general, and learning L2 vocabulary in particular. It is assumed that the use of multimedia annotations promotes language learners’ vocabulary acquisition. Therefore, this study set out to investigate the effects of different multimedia annotations (still picture annotations, dynamic picture annotations, and written annotations on L2 vocabulary learning. To fulfill this objective, the researchers selected sixty four EFL learners as the participants of this study. The participants were randomly assigned to one of the four groups: a control group that received no annotations and three experimental groups that received:  still picture annotations, dynamic picture annotations, and written annotations. Each participant was required to take a pre-test. A vocabulary post- test was also designed and administered to the participants in order to assess the efficacy of each annotation. First for each group a paired t-test was conducted between their pre and post test scores in order to observe their improvement; then through an ANCOVA test the performance of four groups was compared. The results showed that using multimedia annotations resulted in a significant difference in the participants’ vocabulary learning. Based on the results of the present study, multimedia annotations are suggested as a vocabulary teaching strategy.

  18. Evaluating Functional Annotations of Enzymes Using the Gene Ontology.

    Science.gov (United States)

    Holliday, Gemma L; Davidson, Rebecca; Akiva, Eyal; Babbitt, Patricia C

    2017-01-01

    The Gene Ontology (GO) (Ashburner et al., Nat Genet 25(1):25-29, 2000) is a powerful tool in the informatics arsenal of methods for evaluating annotations in a protein dataset. From identifying the nearest well annotated homologue of a protein of interest to predicting where misannotation has occurred to knowing how confident you can be in the annotations assigned to those proteins is critical. In this chapter we explore what makes an enzyme unique and how we can use GO to infer aspects of protein function based on sequence similarity. These can range from identification of misannotation or other errors in a predicted function to accurate function prediction for an enzyme of entirely unknown function. Although GO annotation applies to any gene products, we focus here a describing our approach for hierarchical classification of enzymes in the Structure-Function Linkage Database (SFLD) (Akiva et al., Nucleic Acids Res 42(Database issue):D521-530, 2014) as a guide for informed utilisation of annotation transfer based on GO terms.

  19. AutoFACT: An Automatic Functional Annotation and Classification Tool

    Directory of Open Access Journals (Sweden)

    Lang B Franz

    2005-06-01

    Full Text Available Abstract Background Assignment of function to new molecular sequence data is an essential step in genomics projects. The usual process involves similarity searches of a given sequence against one or more databases, an arduous process for large datasets. Results We present AutoFACT, a fully automated and customizable annotation tool that assigns biologically informative functions to a sequence. Key features of this tool are that it (1 analyzes nucleotide and protein sequence data; (2 determines the most informative functional description by combining multiple BLAST reports from several user-selected databases; (3 assigns putative metabolic pathways, functional classes, enzyme classes, GeneOntology terms and locus names; and (4 generates output in HTML, text and GFF formats for the user's convenience. We have compared AutoFACT to four well-established annotation pipelines. The error rate of functional annotation is estimated to be only between 1–2%. Comparison of AutoFACT to the traditional top-BLAST-hit annotation method shows that our procedure increases the number of functionally informative annotations by approximately 50%. Conclusion AutoFACT will serve as a useful annotation tool for smaller sequencing groups lacking dedicated bioinformatics staff. It is implemented in PERL and runs on LINUX/UNIX platforms. AutoFACT is available at http://megasun.bch.umontreal.ca/Software/AutoFACT.htm.

  20. LeARN: a platform for detecting, clustering and annotating non-coding RNAs

    Directory of Open Access Journals (Sweden)

    Schiex Thomas

    2008-01-01

    Full Text Available Abstract Background In the last decade, sequencing projects have led to the development of a number of annotation systems dedicated to the structural and functional annotation of protein-coding genes. These annotation systems manage the annotation of the non-protein coding genes (ncRNAs in a very crude way, allowing neither the edition of the secondary structures nor the clustering of ncRNA genes into families which are crucial for appropriate annotation of these molecules. Results LeARN is a flexible software package which handles the complete process of ncRNA annotation by integrating the layers of automatic detection and human curation. Conclusion This software provides the infrastructure to deal properly with ncRNAs in the framework of any annotation project. It fills the gap between existing prediction software, that detect independent ncRNA occurrences, and public ncRNA repositories, that do not offer the flexibility and interactivity required for annotation projects. The software is freely available from the download section of the website http://bioinfo.genopole-toulouse.prd.fr/LeARN

  1. Essential Annotation Schema for Ecology (EASE)—A framework supporting the efficient data annotation and faceted navigation in ecology

    Science.gov (United States)

    Eichenberg, David; Liebergesell, Mario; König-Ries, Birgitta; Wirth, Christian

    2017-01-01

    Ecology has become a data intensive science over the last decades which often relies on the reuse of data in cross-experimental analyses. However, finding data which qualifies for the reuse in a specific context can be challenging. It requires good quality metadata and annotations as well as efficient search strategies. To date, full text search (often on the metadata only) is the most widely used search strategy although it is known to be inaccurate. Faceted navigation is providing a filter mechanism which is based on fine granular metadata, categorizing search objects along numeric and categorical parameters relevant for their discovery. Selecting from these parameters during a full text search creates a system of filters which allows to refine and improve the results towards more relevance. We developed a framework for the efficient annotation and faceted navigation in ecology. It consists of an XML schema for storing the annotation of search objects and is accompanied by a vocabulary focused on ecology to support the annotation process. The framework consolidates ideas which originate from widely accepted metadata standards, textbooks, scientific literature, and vocabularies as well as from expert knowledge contributed by researchers from ecology and adjacent disciplines. PMID:29023519

  2. MicroScope: a platform for microbial genome annotation and comparative genomics.

    Science.gov (United States)

    Vallenet, D; Engelen, S; Mornico, D; Cruveiller, S; Fleury, L; Lajus, A; Rouy, Z; Roche, D; Salvignol, G; Scarpelli, C; Médigue, C

    2009-01-01

    The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope's rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of

  3. The mechanism of photosystem-II inactivation during sulphur deprivation-induced H2 production in Chlamydomonas reinhardtii.

    Science.gov (United States)

    Nagy, Valéria; Vidal-Meireles, André; Podmaniczki, Anna; Szentmihályi, Klára; Rákhely, Gábor; Zsigmond, Laura; Kovács, László; Tóth, Szilvia Z

    2018-05-01

    Sulphur limitation may restrain cell growth and viability. In the green alga Chlamydomonas reinhardtii, sulphur limitation may induce H 2 production lasting for several days, which can be exploited as a renewable energy source. Sulphur limitation causes a large number of physiological changes, including the inactivation of photosystem II (PSII), leading to the establishment of hypoxia, essential for the increase in hydrogenase expression and activity. The inactivation of PSII has long been assumed to be caused by the sulphur-limited turnover of its reaction center protein PsbA. Here we reinvestigated this issue in detail and show that: (i) upon transferring Chlamydomonas cells to sulphur-free media, the cellular sulphur content decreases only by about 25%; (ii) as demonstrated by lincomycin treatments, PsbA has a significant turnover, and other photosynthetic subunits, namely RbcL and CP43, are degraded more rapidly than PsbA. On the other hand, sulphur limitation imposes oxidative stress early on, most probably involving the formation of singlet oxygen in PSII, which leads to an increase in the expression of GDP-L-galactose phosphorylase, playing an essential role in ascorbate biosynthesis. When accumulated to the millimolar concentration range, ascorbate may inactivate the oxygen-evolving complex and provide electrons to PSII, albeit at a low rate. In the absence of a functional donor side and sufficient electron transport, PSII reaction centers are inactivated and degraded. We therefore demonstrate that the inactivation of PSII is a complex and multistep process, which may serve to mitigate the damaging effects of sulphur limitation. © 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.

  4. Automating Ontological Annotation with WordNet

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.; Chappell, Alan R.; Whitney, Paul D.; Posse, Christian; Paulson, Patrick R.; Baddeley, Bob L.; Hohimer, Ryan E.; White, Amanda M.

    2006-01-22

    Semantic Web applications require robust and accurate annotation tools that are capable of automating the assignment of ontological classes to words in naturally occurring text (ontological annotation). Most current ontologies do not include rich lexical databases and are therefore not easily integrated with word sense disambiguation algorithms that are needed to automate ontological annotation. WordNet provides a potentially ideal solution to this problem as it offers a highly structured lexical conceptual representation that has been extensively used to develop word sense disambiguation algorithms. However, WordNet has not been designed as an ontology, and while it can be easily turned into one, the result of doing this would present users with serious practical limitations due to the great number of concepts (synonym sets) it contains. Moreover, mapping WordNet to an existing ontology may be difficult and requires substantial labor. We propose to overcome these limitations by developing an analytical platform that (1) provides a WordNet-based ontology offering a manageable and yet comprehensive set of concept classes, (2) leverages the lexical richness of WordNet to give an extensive characterization of concept class in terms of lexical instances, and (3) integrates a class recognition algorithm that automates the assignment of concept classes to words in naturally occurring text. The ensuing framework makes available an ontological annotation platform that can be effectively integrated with intelligence analysis systems to facilitate evidence marshaling and sustain the creation and validation of inference models.

  5. ONEMercury: Towards Automatic Annotation of Earth Science Metadata

    Science.gov (United States)

    Tuarob, S.; Pouchard, L. C.; Noy, N.; Horsburgh, J. S.; Palanisamy, G.

    2012-12-01

    Earth sciences have become more data-intensive, requiring access to heterogeneous data collected from multiple places, times, and thematic scales. For example, research on climate change may involve exploring and analyzing observational data such as the migration of animals and temperature shifts across the earth, as well as various model-observation inter-comparison studies. Recently, DataONE, a federated data network built to facilitate access to and preservation of environmental and ecological data, has come to exist. ONEMercury has recently been implemented as part of the DataONE project to serve as a portal for discovering and accessing environmental and observational data across the globe. ONEMercury harvests metadata from the data hosted by multiple data repositories and makes it searchable via a common search interface built upon cutting edge search engine technology, allowing users to interact with the system, intelligently filter the search results on the fly, and fetch the data from distributed data sources. Linking data from heterogeneous sources always has a cost. A problem that ONEMercury faces is the different levels of annotation in the harvested metadata records. Poorly annotated records tend to be missed during the search process as they lack meaningful keywords. Furthermore, such records would not be compatible with the advanced search functionality offered by ONEMercury as the interface requires a metadata record be semantically annotated. The explosion of the number of metadata records harvested from an increasing number of data repositories makes it impossible to annotate the harvested records manually, urging the need for a tool capable of automatically annotating poorly curated metadata records. In this paper, we propose a topic-model (TM) based approach for automatic metadata annotation. Our approach mines topics in the set of well annotated records and suggests keywords for poorly annotated records based on topic similarity. We utilize the

  6. Effect of Temperature and light intensity on growth and Photosynthetic Activity of Chlamydomonas reinhard II; Efecto de la temperatura e intensidad luminosa sobre el crecimiento y actividad fotosintetica del alga Chlamydomonas Reinhardt II

    Energy Technology Data Exchange (ETDEWEB)

    Alfonsel Jaen, M; Fernandez Gonzalez, J

    1985-07-01

    The effect of five temperatures (15,20,25,30 and 35 degree centigree) and two levels of illumination on growth and photosynthetic activity of Chlamydomonas reinhard II has been studied. The growth of the cultures was evaluated by optical density. Photosynthetic activity has been carried out studying either the assimilation rate of C0{sub 2} labelled with C-14 or the oxygen evolution by means of polarographic measurements. The maximum photosynthetic rate has been obtained at 25 degree centigree for the lower level of illumination (2400 lux) and at 35 degree centigree for the higher one (13200 lux) and at 35 degree centigree for the higher ono (13200 lux). These results suggest an interaction of temperature and illumination on photosynthetic activity. (Author) 37 refs.

  7. A Selected Annotated Bibliography on Work Time Options.

    Science.gov (United States)

    Ivantcho, Barbara

    This annotated bibliography is divided into three sections. Section I contains annotations of general publications on work time options. Section II presents resources on flexitime and the compressed work week. In Section III are found resources related to these reduced work time options: permanent part-time employment, job sharing, voluntary…

  8. Prepare-Participate-Connect: Active Learning with Video Annotation

    Science.gov (United States)

    Colasante, Meg; Douglas, Kathy

    2016-01-01

    Annotation of video provides students with the opportunity to view and engage with audiovisual content in an interactive and participatory way rather than in passive-receptive mode. This article discusses research into the use of video annotation in four vocational programs at RMIT University in Melbourne, which allowed students to interact with…

  9. Developing Annotation Solutions for Online Data Driven Learning

    Science.gov (United States)

    Perez-Paredes, Pascual; Alcaraz-Calero, Jose M.

    2009-01-01

    Although "annotation" is a widely-researched topic in Corpus Linguistics (CL), its potential role in Data Driven Learning (DDL) has not been addressed in depth by Foreign Language Teaching (FLT) practitioners. Furthermore, most of the research in the use of DDL methods pays little attention to annotation in the design and implementation…

  10. Re-annotation and re-analysis of the Campylobacter jejuni NCTC11168 genome sequence

    Directory of Open Access Journals (Sweden)

    Dorrell Nick

    2007-06-01

    Full Text Available Abstract Background Campylobacter jejuni is the leading bacterial cause of human gastroenteritis in the developed world. To improve our understanding of this important human pathogen, the C. jejuni NCTC11168 genome was sequenced and published in 2000. The original annotation was a milestone in Campylobacter research, but is outdated. We now describe the complete re-annotation and re-analysis of the C. jejuni NCTC11168 genome using current database information, novel tools and annotation techniques not used during the original annotation. Results Re-annotation was carried out using sequence database searches such as FASTA, along with programs such as TMHMM for additional support. The re-annotation also utilises sequence data from additional Campylobacter strains and species not available during the original annotation. Re-annotation was accompanied by a full literature search that was incorporated into the updated EMBL file [EMBL: AL111168]. The C. jejuni NCTC11168 re-annotation reduced the total number of coding sequences from 1654 to 1643, of which 90.0% have additional information regarding the identification of new motifs and/or relevant literature. Re-annotation has led to 18.2% of coding sequence product functions being revised. Conclusions Major updates were made to genes involved in the biosynthesis of important surface structures such as lipooligosaccharide, capsule and both O- and N-linked glycosylation. This re-annotation will be a key resource for Campylobacter research and will also provide a prototype for the re-annotation and re-interpretation of other bacterial genomes.

  11. Comparison of concept recognizers for building the Open Biomedical Annotator

    Directory of Open Access Journals (Sweden)

    Rubin Daniel

    2009-09-01

    Full Text Available Abstract The National Center for Biomedical Ontology (NCBO is developing a system for automated, ontology-based access to online biomedical resources (Shah NH, et al.: Ontology-driven indexing of public datasets for translational bioinformatics. BMC Bioinformatics 2009, 10(Suppl 2:S1. The system's indexing workflow processes the text metadata of diverse resources such as datasets from GEO and ArrayExpress to annotate and index them with concepts from appropriate ontologies. This indexing requires the use of a concept-recognition tool to identify ontology concepts in the resource's textual metadata. In this paper, we present a comparison of two concept recognizers – NLM's MetaMap and the University of Michigan's Mgrep. We utilize a number of data sources and dictionaries to evaluate the concept recognizers in terms of precision, recall, speed of execution, scalability and customizability. Our evaluations demonstrate that Mgrep has a clear edge over MetaMap for large-scale service oriented applications. Based on our analysis we also suggest areas of potential improvements for Mgrep. We have subsequently used Mgrep to build the Open Biomedical Annotator service. The Annotator service has access to a large dictionary of biomedical terms derived from the United Medical Language System (UMLS and NCBO ontologies. The Annotator also leverages the hierarchical structure of the ontologies and their mappings to expand annotations. The Annotator service is available to the community as a REST Web service for creating ontology-based annotations of their data.

  12. Effects of Reviewing Annotations and Homework Solutions on Math Learning Achievement

    Science.gov (United States)

    Hwang, Wu-Yuin; Chen, Nian-Shing; Shadiev, Rustam; Li, Jin-Sing

    2011-01-01

    Previous studies have demonstrated that making annotations can be a meaningful and useful learning method that promote metacognition and enhance learning achievement. A web-based annotation system, Virtual Pen (VPEN), which provides for the creation and review of annotations and homework solutions, has been developed to foster learning process…

  13. A Set of Annotation Interfaces for Alignment of Parallel Corpora

    Directory of Open Access Journals (Sweden)

    Singh Anil Kumar

    2014-09-01

    Full Text Available Annotation interfaces for parallel corpora which fit in well with other tools can be very useful. We describe a set of annotation interfaces which fulfill this criterion. This set includes a sentence alignment interface, two different word or word group alignment interfaces and an initial version of a parallel syntactic annotation alignment interface. These tools can be used for manual alignment, or they can be used to correct automatic alignments. Manual alignment can be performed in combination with certain kinds of linguistic annotation. Most of these interfaces use a representation called the Shakti Standard Format that has been found to be very robust and has been used for large and successful projects. It ties together the different interfaces, so that the data created by them is portable across all tools which support this representation. The existence of a query language for data stored in this representation makes it possible to build tools that allow easy search and modification of annotated parallel data.

  14. LocusTrack: Integrated visualization of GWAS results and genomic annotation.

    Science.gov (United States)

    Cuellar-Partida, Gabriel; Renteria, Miguel E; MacGregor, Stuart

    2015-01-01

    Genome-wide association studies (GWAS) are an important tool for the mapping of complex traits and diseases. Visual inspection of genomic annotations may be used to generate insights into the biological mechanisms underlying GWAS-identified loci. We developed LocusTrack, a web-based application that annotates and creates plots of regional GWAS results and incorporates user-specified tracks that display annotations such as linkage disequilibrium (LD), phylogenetic conservation, chromatin state, and other genomic and regulatory elements. Currently, LocusTrack can integrate annotation tracks from the UCSC genome-browser as well as from any tracks provided by the user. LocusTrack is an easy-to-use application and can be accessed at the following URL: http://gump.qimr.edu.au/general/gabrieC/LocusTrack/. Users can upload and manage GWAS results and select from and/or provide annotation tracks using simple and intuitive menus. LocusTrack scripts and associated data can be downloaded from the website and run locally.

  15. "Annotated Lectures": Student-Instructor Interaction in Large-Scale Global Education

    Directory of Open Access Journals (Sweden)

    Roger Diehl

    2009-10-01

    Full Text Available We describe an "Annotated Lectures" system, which will be used in a global virtual teaching and student collaboration event on embodied intelligence presented by the University of Zurich. The lectures will be broadcasted via video-conference to lecture halls of different universities around the globe. Among other collaboration features, an "Annotated Lectures" system will be implemented in a 3D collaborative virtual environment and used by the participating students to make annotations to the video-recorded lectures, which will be sent to and answered by their supervisors, and forwarded to the lecturers in an aggregated way. The "Annotated Lectures" system aims to overcome the issues of limited studentinstructor interaction in large-scale education, and to foster an intercultural and multidisciplinary discourse among students who review the lectures in a group. After presenting the concept of the "Annotated Lectures" system, we discuss a prototype version including a description of the technical components and its expected benefit for large-scale global education.

  16. Annotation an effective device for student feedback: a critical review of the literature.

    Science.gov (United States)

    Ball, Elaine C

    2010-05-01

    The paper examines hand-written annotation, its many features, difficulties and strengths as a feedback tool. It extends and clarifies what modest evidence is in the public domain and offers an evaluation of how to use annotation effectively in the support of student feedback [Marshall, C.M., 1998a. The Future of Annotation in a Digital (paper) World. Presented at the 35th Annual GLSLIS Clinic: Successes and Failures of Digital Libraries, June 20-24, University of Illinois at Urbana-Champaign, March 24, pp. 1-20; Marshall, C.M., 1998b. Toward an ecology of hypertext annotation. Hypertext. In: Proceedings of the Ninth ACM Conference on Hypertext and Hypermedia, June 20-24, Pittsburgh Pennsylvania, US, pp. 40-49; Wolfe, J.L., Nuewirth, C.M., 2001. From the margins to the centre: the future of annotation. Journal of Business and Technical Communication, 15(3), 333-371; Diyanni, R., 2002. One Hundred Great Essays. Addison-Wesley, New York; Wolfe, J.L., 2002. Marginal pedagogy: how annotated texts affect writing-from-source texts. Written Communication, 19(2), 297-333; Liu, K., 2006. Annotation as an index to critical writing. Urban Education, 41, 192-207; Feito, A., Donahue, P., 2008. Minding the gap annotation as preparation for discussion. Arts and Humanities in Higher Education, 7(3), 295-307; Ball, E., 2009. A participatory action research study on handwritten annotation feedback and its impact on staff and students. Systemic Practice and Action Research, 22(2), 111-124; Ball, E., Franks, H., McGrath, M., Leigh, J., 2009. Annotation is a valuable tool to enhance learning and assessment in student essays. Nurse Education Today, 29(3), 284-291]. Although a significant number of studies examine annotation, this is largely related to on-line tools and computer mediated communication and not hand-written annotation as comment, phrase or sign written on the student essay to provide critique. Little systematic research has been conducted to consider how this latter form

  17. BioCause: Annotating and analysing causality in the biomedical domain.

    Science.gov (United States)

    Mihăilă, Claudiu; Ohta, Tomoko; Pyysalo, Sampo; Ananiadou, Sophia

    2013-01-16

    Biomedical corpora annotated with event-level information represent an important resource for domain-specific information extraction (IE) systems. However, bio-event annotation alone cannot cater for all the needs of biologists. Unlike work on relation and event extraction, most of which focusses on specific events and named entities, we aim to build a comprehensive resource, covering all statements of causal association present in discourse. Causality lies at the heart of biomedical knowledge, such as diagnosis, pathology or systems biology, and, thus, automatic causality recognition can greatly reduce the human workload by suggesting possible causal connections and aiding in the curation of pathway models. A biomedical text corpus annotated with such relations is, hence, crucial for developing and evaluating biomedical text mining. We have defined an annotation scheme for enriching biomedical domain corpora with causality relations. This schema has subsequently been used to annotate 851 causal relations to form BioCause, a collection of 19 open-access full-text biomedical journal articles belonging to the subdomain of infectious diseases. These documents have been pre-annotated with named entity and event information in the context of previous shared tasks. We report an inter-annotator agreement rate of over 60% for triggers and of over 80% for arguments using an exact match constraint. These increase significantly using a relaxed match setting. Moreover, we analyse and describe the causality relations in BioCause from various points of view. This information can then be leveraged for the training of automatic causality detection systems. Augmenting named entity and event annotations with information about causal discourse relations could benefit the development of more sophisticated IE systems. These will further influence the development of multiple tasks, such as enabling textual inference to detect entailments, discovering new facts and providing new

  18. Automatic Function Annotations for Hoare Logic

    Directory of Open Access Journals (Sweden)

    Daniel Matichuk

    2012-11-01

    Full Text Available In systems verification we are often concerned with multiple, inter-dependent properties that a program must satisfy. To prove that a program satisfies a given property, the correctness of intermediate states of the program must be characterized. However, this intermediate reasoning is not always phrased such that it can be easily re-used in the proofs of subsequent properties. We introduce a function annotation logic that extends Hoare logic in two important ways: (1 when proving that a function satisfies a Hoare triple, intermediate reasoning is automatically stored as function annotations, and (2 these function annotations can be exploited in future Hoare logic proofs. This reduces duplication of reasoning between the proofs of different properties, whilst serving as a drop-in replacement for traditional Hoare logic to avoid the costly process of proof refactoring. We explain how this was implemented in Isabelle/HOL and applied to an experimental branch of the seL4 microkernel to significantly reduce the size and complexity of existing proofs.

  19. Automatically Annotated Mapping for Indoor Mobile Robot Applications

    DEFF Research Database (Denmark)

    Özkil, Ali Gürcan; Howard, Thomas J.

    2012-01-01

    This paper presents a new and practical method for mapping and annotating indoor environments for mobile robot use. The method makes use of 2D occupancy grid maps for metric representation, and topology maps to indicate the connectivity of the ‘places-of-interests’ in the environment. Novel use...... localization and mapping in topology space, and fuses camera and robot pose estimations to build an automatically annotated global topo-metric map. It is developed as a framework for a hospital service robot and tested in a real hospital. Experiments show that the method is capable of producing globally...... consistent, automatically annotated hybrid metric-topological maps that is needed by mobile service robots....

  20. AGORA : Organellar genome annotation from the amino acid and nucleotide references.

    Science.gov (United States)

    Jung, Jaehee; Kim, Jong Im; Jeong, Young-Sik; Yi, Gangman

    2018-03-29

    Next-generation sequencing (NGS) technologies have led to the accumulation of highthroughput sequence data from various organisms in biology. To apply gene annotation of organellar genomes for various organisms, more optimized tools for functional gene annotation are required. Almost all gene annotation tools are mainly focused on the chloroplast genome of land plants or the mitochondrial genome of animals.We have developed a web application AGORA for the fast, user-friendly, and improved annotations of organellar genomes. AGORA annotates genes based on a BLAST-based homology search and clustering with selected reference sequences from the NCBI database or user-defined uploaded data. AGORA can annotate the functional genes in almost all mitochondrion and plastid genomes of eukaryotes. The gene annotation of a genome with an exon-intron structure within a gene or inverted repeat region is also available. It provides information of start and end positions of each gene, BLAST results compared with the reference sequence, and visualization of gene map by OGDRAW. Users can freely use the software, and the accessible URL is https://bigdata.dongguk.edu/gene_project/AGORA/.The main module of the tool is implemented by the python and php, and the web page is built by the HTML and CSS to support all browsers. gangman@dongguk.edu.

  1. Metabolic acclimation to excess light intensity in Chlamydomonas reinhardtii.

    Science.gov (United States)

    Davis, Maria C; Fiehn, Oliver; Durnford, Dion G

    2013-07-01

    There are several well-described acclimation responses to excess light in green algae but the effect on metabolism has not been thoroughly investigated. This study examines the metabolic changes during photoacclimation to high-light (HL) stress in Chlamydomonas reinhardtii using nuclear magnetic resonance and mass spectrometry. Using principal component analysis, a clear metabolic response to HL intensity was observed on global metabolite pools, with major changes in the levels of amino acids and related nitrogen metabolites. Amino acid pools increased during short-term photoacclimation, but were especially prominent in HL-acclimated cultures. Unexpectedly, we observed an increase in mitochondrial metabolism through downstream photorespiratory pathways. The expression of two genes encoding key enzymes in the photorespiratory pathway, glycolate dehydrogenase and malate synthase, were highly responsive to the HL stress. We propose that this pathway contributes to metabolite pools involved in nitrogen assimilation and may play a direct role in photoacclimation. Our results suggest that primary and secondary metabolism is highly pliable and plays a critical role in coping with the energetic imbalance during HL exposure and a necessary adjustment to support an increased growth rate that is an effective energy sink for the excess reducing power generated during HL stress. © 2013 John Wiley & Sons Ltd.

  2. An annotated corpus with nanomedicine and pharmacokinetic parameters

    Directory of Open Access Journals (Sweden)

    Lewinski NA

    2017-10-01

    Full Text Available Nastassja A Lewinski,1 Ivan Jimenez,1 Bridget T McInnes2 1Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA, 2Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA Abstract: A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided. Keywords: nanotechnology, informatics, natural language processing, text mining, corpora

  3. Annotated Tsunami bibliography: 1962-1976

    International Nuclear Information System (INIS)

    Pararas-Carayannis, G.; Dong, B.; Farmer, R.

    1982-08-01

    This compilation contains annotated citations to nearly 3000 tsunami-related publications from 1962 to 1976 in English and several other languages. The foreign-language citations have English titles and abstracts

  4. Assessment of community-submitted ontology annotations from a novel database-journal partnership.

    Science.gov (United States)

    Berardini, Tanya Z; Li, Donghui; Muller, Robert; Chetty, Raymond; Ploetz, Larry; Singh, Shanker; Wensel, April; Huala, Eva

    2012-01-01

    As the scientific literature grows, leading to an increasing volume of published experimental data, so does the need to access and analyze this data using computational tools. The most commonly used method to convert published experimental data on gene function into controlled vocabulary annotations relies on a professional curator, employed by a model organism database or a more general resource such as UniProt, to read published articles and compose annotation statements based on the articles' contents. A more cost-effective and scalable approach capable of capturing gene function data across the whole range of biological research organisms in computable form is urgently needed. We have analyzed a set of ontology annotations generated through collaborations between the Arabidopsis Information Resource and several plant science journals. Analysis of the submissions entered using the online submission tool shows that most community annotations were well supported and the ontology terms chosen were at an appropriate level of specificity. Of the 503 individual annotations that were submitted, 97% were approved and community submissions captured 72% of all possible annotations. This new method for capturing experimental results in a computable form provides a cost-effective way to greatly increase the available body of annotations without sacrificing annotation quality. Database URL: www.arabidopsis.org.

  5. Radiocesium bioaccumulation in freshwater plankton: Influences of cation concentrations (K+ and Na+) on direct uptake of 137Cs in Chlamydomonas, Scenedesmus and Daphnia. Food-chain transfer of 137Cs from Chlamydomonas to Daphnia at different K+ concentrations

    International Nuclear Information System (INIS)

    Hagstroem, J.

    2002-01-01

    The influences of cation concentrations (K + and Na + ) on radiocesium ( 137 Cs) bioaccumulation in two freshwater phytoplankton species (Scenedesmus quadricauda and Chlamydomonas noctigama) were systematically investigated in batch-cultures monitored during two weeks. Both species were cultured at 9 μE M -2 s -1 constant illumination at 20 deg. C. The exponential growth phase lasted for more than 100 hours (μ ≅ 0.02 h -1 for C. noctigama and 0.03 h -1 for S, quadricauda). Over cation concentration ranges encountered in natural fresh waters ([K + ] from 0.1 μM to 3 mM, [Na + ] from 20 μM to 3 mM), a more than three order of magnitude variation was found for both intake rate and observed bioconcentration factors (BCF) at apparent steady-state (from less than 10 3 to 10 6 L (kg C) -1 ). For both species, the major effector on BCF and uptake rate was external [K + ], which was inversely proportional to these parameters over wide ranges (1-1000 μM for S. quadricauda and 0.1 to 300 μM for C. noctigama). At concentrations above these ranges K + still reduced 137 Cs bio-uptake, but less effectively. A minor influence of external [Na + ] on 137 Cs bioaccumulation was indicated for S. quadricauda, whereas no such influence was significant for C. noctigama. A biphasic pattern for 137 Cs bioaccumulation was discovered in C. noctigama. A rapid 'quasi-steady state' with an effective equilibration time of less than 100 hours was approached during the exponential growth phase. A surge in the uptake occurred when exponential growth ceased, and this pattern was consistent over the range 30 μM to 1.4 mM external [K + ]. Since depletion of external [K + ] was not detected for these treatments, this pattern can only be explained if there are at least two different cellular compartments involved. Although less certain, a second steady-state BCF appeared within two weeks, which seems to be up to one order of magnitude higher than the first. Microcosm experiments with the

  6. Crystallization and preliminary X-ray characterization of full-length Chlamydomonas reinhardtii centrin

    International Nuclear Information System (INIS)

    Alfaro, Elisa; Valle Sosa, Liliana del; Sanoguet, Zuleika; Pastrana-Ríos, Belinda; Schreiter, Eric R.

    2008-01-01

    C. reinhardtii centrin, an EF-hand calcium-binding protein localized to the microtubule-organizing center of eukaryotic organisms, has been crystallized in the presence of the model peptide melittin. X-ray diffraction data were collected to 2.2 Å resolution. Chlamydomonas reinhardtii centrin is a member of the EF-hand calcium-binding superfamily. It is found in the basal body complex and is important for flagellar motility. Like other members of the EF-hand family, centrin interacts with and modulates the function of other proteins in a calcium-dependent manner. To understand how C. reinhardtii centrin interacts with its protein targets, it has been crystallized in the presence of the model peptide melittin and X-ray diffraction data have been collected to 2.2 Å resolution. The crystals are orthorhombic, with unit-cell parameters a = 52.1, b = 114.4, c = 34.8 Å, and are likely to belong to space group P2 1 2 1 2

  7. Inhomogeneous distribution of Chlamydomonas in a cylindrical container with a bubble plume

    Science.gov (United States)

    Nonaka, Yuki; Kikuchi, Kenji; Numayama-Tsuruta, Keiko; Kage, Azusa; Ueno, Hironori; Ishikawa, Takuji

    2016-01-01

    ABSTRACT Swimming microalgae show various taxes, such as phototaxis and gravitaxis, which sometimes result in the formation of a cell-rich layer or a patch in a suspension. Despite intensive studies on the effects of shear flow and turbulence on the inhomogeneous distribution of microalgae, the effect of a bubble plume has remained unclear. In this study, we used Chlamydomonas as model microalgae, and investigated the spatial distribution of cells in a cylindrical container with a bubble plume. The results illustrate that cells become inhomogeneously distributed in the suspension due to their motility and photo-responses. A vortical ring distribution was observed below the free surface when the bubble flow rate was sufficiently small. We performed a scaling analysis on the length scale of the vortical ring, which captured the main features of the experimental results. These findings are important in understanding transport phenomena in a microalgae suspension with a bubble plume. PMID:26787679

  8. Annotation-Based Whole Genomic Prediction and Selection

    DEFF Research Database (Denmark)

    Kadarmideen, Haja; Do, Duy Ngoc; Janss, Luc

    Genomic selection is widely used in both animal and plant species, however, it is performed with no input from known genomic or biological role of genetic variants and therefore is a black box approach in a genomic era. This study investigated the role of different genomic regions and detected QTLs...... in their contribution to estimated genomic variances and in prediction of genomic breeding values by applying SNP annotation approaches to feed efficiency. Ensembl Variant Predictor (EVP) and Pig QTL database were used as the source of genomic annotation for 60K chip. Genomic prediction was performed using the Bayes...... classes. Predictive accuracy was 0.531, 0.532, 0.302, and 0.344 for DFI, RFI, ADG and BF, respectively. The contribution per SNP to total genomic variance was similar among annotated classes across different traits. Predictive performance of SNP classes did not significantly differ from randomized SNP...

  9. Separation Options for Phosphorylated Osteopontin from Transgenic Microalgae Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Ayswarya Ravi

    2018-02-01

    Full Text Available Correct folding and post-translational modifications are vital for therapeutic proteins to elicit their biological functions. Osteopontin (OPN, a bone regenerative protein present in a range of mammalian cells, is an acidic phosphoprotein with multiple potential phosphorylation sites. In this study, the ability of unicellular microalgae, Chlamydomonas reinhardtii, to produce phosphorylated recombinant OPN in its chloroplast is investigated. This study further explores the impact of phosphorylation and expression from a “plant-like” algae on separation of OPN. Chromatography resins ceramic hydroxyapatite (CHT and Gallium-immobilized metal affinity chromatography (Ga-IMAC were assessed for their binding specificity to phosphoproteins. Non-phosphorylated recombinant OPN expressed in E. coli was used to compare the specificity of interaction of the resins to phosphorylated OPN. We observed that CHT binds OPN by multimodal interactions and was better able to distinguish phosphorylated proteins in the presence of 250 mM NaCl. Ga-IMAC interaction with OPN was not selective to phosphorylation, irrespective of salt, as the resin bound OPN from both algal and bacterial sources. Anion exchange chromatography proved an efficient capture method to partially separate major phosphorylated host cell protein impurities such as Rubisco from OPN.

  10. Chlamydomonas as a model for biofuels and bio-products production.

    Science.gov (United States)

    Scranton, Melissa A; Ostrand, Joseph T; Fields, Francis J; Mayfield, Stephen P

    2015-05-01

    Developing renewable energy sources is critical to maintaining the economic growth of the planet while protecting the environment. First generation biofuels focused on food crops like corn and sugarcane for ethanol production, and soybean and palm for biodiesel production. Second generation biofuels based on cellulosic ethanol produced from terrestrial plants, has received extensive funding and recently pilot facilities have been commissioned, but to date output of fuels from these sources has fallen well short of what is needed. Recent research and pilot demonstrations have highlighted the potential of algae as one of the most promising sources of sustainable liquid transportation fuels. Algae have also been established as unique biofactories for industrial, therapeutic, and nutraceutical co-products. Chlamydomonas reinhardtii's long established role in the field of basic research in green algae has paved the way for understanding algal metabolism and developing genetic engineering protocols. These tools are now being utilized in C. reinhardtii and in other algal species for the development of strains to maximize biofuels and bio-products yields from the lab to the field. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

  11. Creating New Medical Ontologies for Image Annotation A Case Study

    CERN Document Server

    Stanescu, Liana; Brezovan, Marius; Mihai, Cristian Gabriel

    2012-01-01

    Creating New Medical Ontologies for Image Annotation focuses on the problem of the medical images automatic annotation process, which is solved in an original manner by the authors. All the steps of this process are described in detail with algorithms, experiments and results. The original algorithms proposed by authors are compared with other efficient similar algorithms. In addition, the authors treat the problem of creating ontologies in an automatic way, starting from Medical Subject Headings (MESH). They have presented some efficient and relevant annotation models and also the basics of the annotation model used by the proposed system: Cross Media Relevance Models. Based on a text query the system will retrieve the images that contain objects described by the keywords.

  12. Elucidating high-dimensional cancer hallmark annotation via enriched ontology.

    Science.gov (United States)

    Yan, Shankai; Wong, Ka-Chun

    2017-09-01

    Cancer hallmark annotation is a promising technique that could discover novel knowledge about cancer from the biomedical literature. The automated annotation of cancer hallmarks could reveal relevant cancer transformation processes in the literature or extract the articles that correspond to the cancer hallmark of interest. It acts as a complementary approach that can retrieve knowledge from massive text information, advancing numerous focused studies in cancer research. Nonetheless, the high-dimensional nature of cancer hallmark annotation imposes a unique challenge. To address the curse of dimensionality, we compared multiple cancer hallmark annotation methods on 1580 PubMed abstracts. Based on the insights, a novel approach, UDT-RF, which makes use of ontological features is proposed. It expands the feature space via the Medical Subject Headings (MeSH) ontology graph and utilizes novel feature selections for elucidating the high-dimensional cancer hallmark annotation space. To demonstrate its effectiveness, state-of-the-art methods are compared and evaluated by a multitude of performance metrics, revealing the full performance spectrum on the full set of cancer hallmarks. Several case studies are conducted, demonstrating how the proposed approach could reveal novel insights into cancers. https://github.com/cskyan/chmannot. Copyright © 2017 Elsevier Inc. All rights reserved.

  13. Rfam: annotating families of non-coding RNA sequences.

    Science.gov (United States)

    Daub, Jennifer; Eberhardt, Ruth Y; Tate, John G; Burge, Sarah W

    2015-01-01

    The primary task of the Rfam database is to collate experimentally validated noncoding RNA (ncRNA) sequences from the published literature and facilitate the prediction and annotation of new homologues in novel nucleotide sequences. We group homologous ncRNA sequences into "families" and related families are further grouped into "clans." We collate and manually curate data cross-references for these families from other databases and external resources. Our Web site offers researchers a simple interface to Rfam and provides tools with which to annotate their own sequences using our covariance models (CMs), through our tools for searching, browsing, and downloading information on Rfam families. In this chapter, we will work through examples of annotating a query sequence, collating family information, and searching for data.

  14. Intra-species sequence comparisons for annotating genomes

    Energy Technology Data Exchange (ETDEWEB)

    Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

    2004-07-15

    Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals amplified, resequenced and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study has been submitted to GenBank under accession nos. AY667278-AY667407.

  15. EFFECT OF TREATED DOMESTIC WASTEWATER USED AS CULTURE MEDIUM ON THE GROWTH AND PRODUCTIVITY OF Chlamydomonas sp. STRAIN ISOLATED FROM LANDFILL LEACHATE

    Directory of Open Access Journals (Sweden)

    Fábio de Farias Neves

    2013-07-01

    Full Text Available Microalgae have been culturing to fix carbon and produce biofuels from the biomass. However, it is important to develop low cost strategies for microalgae production in orther to make it a viable alternative of renewable energy. The present research studied the effect of treated wastewater used as an alternative culture medium for growth and productivity of a Chlamydomonas sp. strain isolated from landfills leachate of a treatment pond located in Southern Brazil. Three culture media were evaluated, the control consisted of synthetic TAP medium, other, consisting of 50% TAP medium and 50% wastewater, and another consisting of 100% wastewater. The growth parameters do not have significant difference among the three culture media. Also, productivity do not have significant difference among the cultures with TAP medium and with 100% wastewater, resulting in dry weight values of 1,4±0,14g/L and 1,3±0,19g/L respectively. The culture with 50% TAP medium and 50% wastewater showed the highest productivity, showing an average dry weight value of 1,7±0,07g/L. The results indicate that treated wastewater can be used as an alternative culture medium for Chlamydomonas sp. strain without negative effects on growth and productivity, and possible leading to a decrease in production costs.

  16. Experimental-confirmation and functional-annotation of predicted proteins in the chicken genome

    Directory of Open Access Journals (Sweden)

    McCarthy Fiona M

    2007-11-01

    Full Text Available Abstract Background The chicken genome was sequenced because of its phylogenetic position as a non-mammalian vertebrate, its use as a biomedical model especially to study embryology and development, its role as a source of human disease organisms and its importance as the major source of animal derived food protein. However, genomic sequence data is, in itself, of limited value; generally it is not equivalent to understanding biological function. The benefit of having a genome sequence is that it provides a basis for functional genomics. However, the sequence data currently available is poorly structurally and functionally annotated and many genes do not have standard nomenclature assigned. Results We analysed eight chicken tissues and improved the chicken genome structural annotation by providing experimental support for the in vivo expression of 7,809 computationally predicted proteins, including 30 chicken proteins that were only electronically predicted or hypothetical translations in human. To improve functional annotation (based on Gene Ontology, we mapped these identified proteins to their human and mouse orthologs and used this orthology to transfer Gene Ontology (GO functional annotations to the chicken proteins. The 8,213 orthology-based GO annotations that we produced represent an 8% increase in currently available chicken GO annotations. Orthologous chicken products were also assigned standardized nomenclature based on current chicken nomenclature guidelines. Conclusion We demonstrate the utility of high-throughput expression proteomics for rapid experimental structural annotation of a newly sequenced eukaryote genome. These experimentally-supported predicted proteins were further annotated by assigning the proteins with standardized nomenclature and functional annotation. This method is widely applicable to a diverse range of species. Moreover, information from one genome can be used to improve the annotation of other genomes and

  17. Automatically annotating topics in transcripts of patient-provider interactions via machine learning.

    Science.gov (United States)

    Wallace, Byron C; Laws, M Barton; Small, Kevin; Wilson, Ira B; Trikalinos, Thomas A

    2014-05-01

    Annotated patient-provider encounters can provide important insights into clinical communication, ultimately suggesting how it might be improved to effect better health outcomes. But annotating outpatient transcripts with Roter or General Medical Interaction Analysis System (GMIAS) codes is expensive, limiting the scope of such analyses. We propose automatically annotating transcripts of patient-provider interactions with topic codes via machine learning. We use a conditional random field (CRF) to model utterance topic probabilities. The model accounts for the sequential structure of conversations and the words comprising utterances. We assess predictive performance via 10-fold cross-validation over GMIAS-annotated transcripts of 360 outpatient visits (>230,000 utterances). We then use automated in place of manual annotations to reproduce an analysis of 116 additional visits from a randomized trial that used GMIAS to assess the efficacy of an intervention aimed at improving communication around antiretroviral (ARV) adherence. With respect to 6 topic codes, the CRF achieved a mean pairwise kappa compared with human annotators of 0.49 (range: 0.47-0.53) and a mean overall accuracy of 0.64 (range: 0.62-0.66). With respect to the RCT reanalysis, results using automated annotations agreed with those obtained using manual ones. According to the manual annotations, the median number of ARV-related utterances without and with the intervention was 49.5 versus 76, respectively (paired sign test P = 0.07). When automated annotations were used, the respective numbers were 39 versus 55 (P = 0.04). While moderately accurate, the predicted annotations are far from perfect. Conversational topics are intermediate outcomes, and their utility is still being researched. This foray into automated topic inference suggests that machine learning methods can classify utterances comprising patient-provider interactions into clinically relevant topics with reasonable accuracy.

  18. Effect of Temperature and light intensity on growth and Photosynthetic Activity of Chlamydomonas reinhard II

    International Nuclear Information System (INIS)

    Alfonsel Jaen, M.; Fernandez Gonzalez, J.

    1985-01-01

    The effect of five temperatures (15,20,25,30 and 35 degree centigree) and two levels of illumination on growth and photosynthetic activity of Chlamydomonas reinhard II has been studied. The growth of the cultures was evaluated by optical density. Photosynthetic activity has been carried out studying either the assimilation rate of C0 2 labelled with C-14 or the oxygen evolution by means of polarographic measurements. The maximum photosynthetic rate has been obtained at 25 degree centigree for the lower level of illumination (2400 lux) and at 35 degree centigree for the higher one (13200 lux) and at 35 degree centigree for the higher ono (13200 lux). These results suggest an interaction of temperature and illumination on photosynthetic activity. (Author) 37 refs

  19. Annotating risk factors for heart disease in clinical narratives for diabetic patients.

    Science.gov (United States)

    Stubbs, Amber; Uzuner, Özlem

    2015-12-01

    The 2014 i2b2/UTHealth natural language processing shared task featured a track focused on identifying risk factors for heart disease (specifically, Cardiac Artery Disease) in clinical narratives. For this track, we used a "light" annotation paradigm to annotate a set of 1304 longitudinal medical records describing 296 patients for risk factors and the times they were present. We designed the annotation task for this track with the goal of balancing annotation load and time with quality, so as to generate a gold standard corpus that can benefit a clinically-relevant task. We applied light annotation procedures and determined the gold standard using majority voting. On average, the agreement of annotators with the gold standard was above 0.95, indicating high reliability. The resulting document-level annotations generated for each record in each longitudinal EMR in this corpus provide information that can support studies of progression of heart disease risk factors in the included patients over time. These annotations were used in the Risk Factor track of the 2014 i2b2/UTHealth shared task. Participating systems achieved a mean micro-averaged F1 measure of 0.815 and a maximum F1 measure of 0.928 for identifying these risk factors in patient records. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Annotating Emotions in Meetings

    NARCIS (Netherlands)

    Reidsma, Dennis; Heylen, Dirk K.J.; Ordelman, Roeland J.F.

    We present the results of two trials testing procedures for the annotation of emotion and mental state of the AMI corpus. The first procedure is an adaptation of the FeelTrace method, focusing on a continuous labelling of emotion dimensions. The second method is centered around more discrete

  1. Microtask crowdsourcing for disease mention annotation in PubMed abstracts.

    Science.gov (United States)

    Good, Benjamin M; Nanis, Max; Wu, Chunlei; Su, Andrew I

    2015-01-01

    Identifying concepts and relationships in biomedical text enables knowledge to be applied in computational analyses. Many biological natural language processing (BioNLP) projects attempt to address this challenge, but the state of the art still leaves much room for improvement. Progress in BioNLP research depends on large, annotated corpora for evaluating information extraction systems and training machine learning models. Traditionally, such corpora are created by small numbers of expert annotators often working over extended periods of time. Recent studies have shown that workers on microtask crowdsourcing platforms such as Amazon's Mechanical Turk (AMT) can, in aggregate, generate high-quality annotations of biomedical text. Here, we investigated the use of the AMT in capturing disease mentions in PubMed abstracts. We used the NCBI Disease corpus as a gold standard for refining and benchmarking our crowdsourcing protocol. After several iterations, we arrived at a protocol that reproduced the annotations of the 593 documents in the 'training set' of this gold standard with an overall F measure of 0.872 (precision 0.862, recall 0.883). The output can also be tuned to optimize for precision (max = 0.984 when recall = 0.269) or recall (max = 0.980 when precision = 0.436). Each document was completed by 15 workers, and their annotations were merged based on a simple voting method. In total 145 workers combined to complete all 593 documents in the span of 9 days at a cost of $.066 per abstract per worker. The quality of the annotations, as judged with the F measure, increases with the number of workers assigned to each task; however minimal performance gains were observed beyond 8 workers per task. These results add further evidence that microtask crowdsourcing can be a valuable tool for generating well-annotated corpora in BioNLP. Data produced for this analysis are available at http://figshare.com/articles/Disease_Mention_Annotation_with_Mechanical_Turk/1126402.

  2. Establishing Chlamydomonas reinhardtii as an industrial biotechnology host.

    Science.gov (United States)

    Scaife, Mark A; Nguyen, Ginnie T D T; Rico, Juan; Lambert, Devinn; Helliwell, Katherine E; Smith, Alison G

    2015-05-01

    Microalgae constitute a diverse group of eukaryotic unicellular organisms that are of interest for pure and applied research. Owing to their natural synthesis of value-added natural products microalgae are emerging as a source of sustainable chemical compounds, proteins and metabolites, including but not limited to those that could replace compounds currently made from fossil fuels. For the model microalga, Chlamydomonas reinhardtii, this has prompted a period of rapid development so that this organism is poised for exploitation as an industrial biotechnology platform. The question now is how best to achieve this? Highly advanced industrial biotechnology systems using bacteria and yeasts were established in a classical metabolic engineering manner over several decades. However, the advent of advanced molecular tools and the rise of synthetic biology provide an opportunity to expedite the development of C. reinhardtii as an industrial biotechnology platform, avoiding the process of incremental improvement. In this review we describe the current status of genetic manipulation of C. reinhardtii for metabolic engineering. We then introduce several concepts that underpin synthetic biology, and show how generic parts are identified and used in a standard manner to achieve predictable outputs. Based on this we suggest that the development of C. reinhardtii as an industrial biotechnology platform can be achieved more efficiently through adoption of a synthetic biology approach. © 2015 The Authors The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.

  3. Consumer energy research: an annotated bibliography. Vol. 3

    Energy Technology Data Exchange (ETDEWEB)

    Anderson, D.C.; McDougall, G.H.G.

    1983-04-01

    This annotated bibliography attempts to provide a comprehensive package of existing information in consumer related energy research. A concentrated effort was made to collect unpublished material as well as material from journals and other sources, including governments, utilities research institutes and private firms. A deliberate effort was made to include agencies outside North America. For the most part the bibliography is limited to annotations of empiracal studies. However, it includes a number of descriptive reports which appear to make a significant contribution to understanding consumers and energy use. The format of the annotations displays the author, date of publication, title and source of the study. Annotations of empirical studies are divided into four parts: objectives, methods, variables and findings/implications. Care was taken to provide a reasonable amount of detail in the annotations to enable the reader to understand the methodology, the results and the degree to which the implications fo the study can be generalized to other situations. Studies are arranged alphabetically by author. The content of the studies reviewed is classified in a series of tables which are intended to provide a summary of sources, types and foci of the various studies. These tables are intended to aid researchers interested in specific topics to locate those studies most relevant to their work. The studies are categorized using a number of different classification criteria, for example, methodology used, type of energy form, type of policy initiative, and type of consumer activity. A general overview of the studies is also presented. 17 tabs.

  4. Image annotation based on positive-negative instances learning

    Science.gov (United States)

    Zhang, Kai; Hu, Jiwei; Liu, Quan; Lou, Ping

    2017-07-01

    Automatic image annotation is now a tough task in computer vision, the main sense of this tech is to deal with managing the massive image on the Internet and assisting intelligent retrieval. This paper designs a new image annotation model based on visual bag of words, using the low level features like color and texture information as well as mid-level feature as SIFT, and mixture the pic2pic, label2pic and label2label correlation to measure the correlation degree of labels and images. We aim to prune the specific features for each single label and formalize the annotation task as a learning process base on Positive-Negative Instances Learning. Experiments are performed using the Corel5K Dataset, and provide a quite promising result when comparing with other existing methods.

  5. An Annotated Dataset of 14 Meat Images

    DEFF Research Database (Denmark)

    Stegmann, Mikkel Bille

    2002-01-01

    This note describes a dataset consisting of 14 annotated images of meat. Points of correspondence are placed on each image. As such, the dataset can be readily used for building statistical models of shape. Further, format specifications and terms of use are given.......This note describes a dataset consisting of 14 annotated images of meat. Points of correspondence are placed on each image. As such, the dataset can be readily used for building statistical models of shape. Further, format specifications and terms of use are given....

  6. Graph-based sequence annotation using a data integration approach

    Directory of Open Access Journals (Sweden)

    Pesch Robert

    2008-06-01

    Full Text Available The automated annotation of data from high throughput sequencing and genomics experiments is a significant challenge for bioinformatics. Most current approaches rely on sequential pipelines of gene finding and gene function prediction methods that annotate a gene with information from different reference data sources. Each function prediction method contributes evidence supporting a functional assignment. Such approaches generally ignore the links between the information in the reference datasets. These links, however, are valuable for assessing the plausibility of a function assignment and can be used to evaluate the confidence in a prediction. We are working towards a novel annotation system that uses the network of information supporting the function assignment to enrich the annotation process for use by expert curators and predicting the function of previously unannotated genes. In this paper we describe our success in the first stages of this development. We present the data integration steps that are needed to create the core database of integrated reference databases (UniProt, PFAM, PDB, GO and the pathway database Ara- Cyc which has been established in the ONDEX data integration system. We also present a comparison between different methods for integration of GO terms as part of the function assignment pipeline and discuss the consequences of this analysis for improving the accuracy of gene function annotation.

  7. EST-PAC a web package for EST annotation and protein sequence prediction

    Directory of Open Access Journals (Sweden)

    Strahm Yvan

    2006-10-01

    Full Text Available Abstract With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1 searching local or remote biological databases for sequence similarities using Blast services, 2 predicting protein coding sequence from EST data and, 3 annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics.

  8. An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets.

    Science.gov (United States)

    Hosseini, Parsa; Tremblay, Arianne; Matthews, Benjamin F; Alkharouf, Nadim W

    2010-07-02

    The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data

  9. BEACON: automated tool for Bacterial GEnome Annotation ComparisON

    KAUST Repository

    Kalkatawi, Manal M.; Alam, Intikhab; Bajic, Vladimir B.

    2015-01-01

    We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/

  10. Optimization of the C11-BODIPY581/591 Dye for the Determination of Lipid Oxidation in Chlamydomonas reinhardtii by Flow Cytometry

    OpenAIRE

    CHELONI Giulia

    2013-01-01

    Lipid oxidation is a recognized end point for the study of oxidative stress and is an important parameter to describe the mode of micropollutant action on aquatic microorganisms. Therefore the development of quick and reliable methodologies probing the oxidative stress and damage in living cells is highly sought. In the present proof of concept work we examined the potential of the fluorescent dye C11 BODIPY591/581 to probe lipid oxidation in the green microalga Chlamydomonas reinhardtii. C11...

  11. Automatic medical image annotation and keyword-based image retrieval using relevance feedback.

    Science.gov (United States)

    Ko, Byoung Chul; Lee, JiHyeon; Nam, Jae-Yeal

    2012-08-01

    This paper presents novel multiple keywords annotation for medical images, keyword-based medical image retrieval, and relevance feedback method for image retrieval for enhancing image retrieval performance. For semantic keyword annotation, this study proposes a novel medical image classification method combining local wavelet-based center symmetric-local binary patterns with random forests. For keyword-based image retrieval, our retrieval system use the confidence score that is assigned to each annotated keyword by combining probabilities of random forests with predefined body relation graph. To overcome the limitation of keyword-based image retrieval, we combine our image retrieval system with relevance feedback mechanism based on visual feature and pattern classifier. Compared with other annotation and relevance feedback algorithms, the proposed method shows both improved annotation performance and accurate retrieval results.

  12. Annotating the Function of the Human Genome with Gene Ontology and Disease Ontology.

    Science.gov (United States)

    Hu, Yang; Zhou, Wenyang; Ren, Jun; Dong, Lixiang; Wang, Yadong; Jin, Shuilin; Cheng, Liang

    2016-01-01

    Increasing evidences indicated that function annotation of human genome in molecular level and phenotype level is very important for systematic analysis of genes. In this study, we presented a framework named Gene2Function to annotate Gene Reference into Functions (GeneRIFs), in which each functional description of GeneRIFs could be annotated by a text mining tool Open Biomedical Annotator (OBA), and each Entrez gene could be mapped to Human Genome Organisation Gene Nomenclature Committee (HGNC) gene symbol. After annotating all the records about human genes of GeneRIFs, 288,869 associations between 13,148 mRNAs and 7,182 terms, 9,496 associations between 948 microRNAs and 533 terms, and 901 associations between 139 long noncoding RNAs (lncRNAs) and 297 terms were obtained as a comprehensive annotation resource of human genome. High consistency of term frequency of individual gene (Pearson correlation = 0.6401, p = 2.2e - 16) and gene frequency of individual term (Pearson correlation = 0.1298, p = 3.686e - 14) in GeneRIFs and GOA shows our annotation resource is very reliable.

  13. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.

    Science.gov (United States)

    Holt, Carson; Yandell, Mark

    2011-12-22

    Second-generation sequencing technologies are precipitating major shifts with regards to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation, because unlike first-generation projects, there are no pre-existing 'gold-standard' gene-models with which to train gene-finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies. We present MAKER2, a genome annotation and data management tool designed for second-generation genome projects. MAKER2 is a multi-threaded, parallelized application that can process second-generation datasets of virtually any size. We show that MAKER2 can produce accurate annotations for novel genomes where training-data are limited, of low quality or even non-existent. MAKER2 also provides an easy means to use mRNA-seq data to improve annotation quality; and it can use these data to update legacy annotations, significantly improving their quality. We also show that MAKER2 can evaluate the quality of genome annotations, and identify and prioritize problematic annotations for manual review. MAKER2 is the first annotation engine specifically designed for second-generation genome projects. MAKER2 scales to datasets of any size, requires little in the way of training data, and can use mRNA-seq data to improve annotation quality. It can also update and manage legacy genome annotation datasets.

  14. Annotation: The Savant Syndrome

    Science.gov (United States)

    Heaton, Pamela; Wallace, Gregory L.

    2004-01-01

    Background: Whilst interest has focused on the origin and nature of the savant syndrome for over a century, it is only within the past two decades that empirical group studies have been carried out. Methods: The following annotation briefly reviews relevant research and also attempts to address outstanding issues in this research area.…

  15. Analysis of LYSA-calculus with explicit confidentiality annotations

    DEFF Research Database (Denmark)

    Gao, Han; Nielson, Hanne Riis

    2006-01-01

    Recently there has been an increased research interest in applying process calculi in the verification of cryptographic protocols due to their ability to formally model protocols. This work presents LYSA with explicit confidentiality annotations for indicating the expected behavior of target...... malicious activities performed by attackers as specified by the confidentiality annotations. The proposed analysis approach is fully automatic without the need of human intervention and has been applied successfully to a number of protocols....

  16. First generation annotations for the fathead minnow (Pimephales promelas) genome

    Science.gov (United States)

    Ab initio gene prediction and evidence alignment were used to produce the first annotations for the fathead minnow SOAPdenovo genome assembly. Additionally, a genome browser hosted at genome.setac.org provides simplified access to the annotation data in context with fathead minno...

  17. Extracting Cross-Ontology Weighted Association Rules from Gene Ontology Annotations.

    Science.gov (United States)

    Agapito, Giuseppe; Milano, Marianna; Guzzi, Pietro Hiram; Cannataro, Mario

    2016-01-01

    Gene Ontology (GO) is a structured repository of concepts (GO Terms) that are associated to one or more gene products through a process referred to as annotation. The analysis of annotated data is an important opportunity for bioinformatics. There are different approaches of analysis, among those, the use of association rules (AR) which provides useful knowledge, discovering biologically relevant associations between terms of GO, not previously known. In a previous work, we introduced GO-WAR (Gene Ontology-based Weighted Association Rules), a methodology for extracting weighted association rules from ontology-based annotated datasets. We here adapt the GO-WAR algorithm to mine cross-ontology association rules, i.e., rules that involve GO terms present in the three sub-ontologies of GO. We conduct a deep performance evaluation of GO-WAR by mining publicly available GO annotated datasets, showing how GO-WAR outperforms current state of the art approaches.

  18. Copper excess-induced large reversible and small irreversible adaptations in a population of Chlamydomonas reinhardtii CW15 (Chlorophyta

    Directory of Open Access Journals (Sweden)

    Bartosz Pluciński

    2018-03-01

    Full Text Available Two Chlamydomonas reinhardtii CW15 populations modified by an excess of copper in growth medium were obtained: a “Cu” population that was continuously grown under the selection pressure of 5 µM Cu2+ (for at least 48 weeks and the “Re” population, where a relatively short (9 week exposure to elevated copper, necessary for acquiring tolerance, was followed by a prolonged period (at least 39 weeks of cultivation at a normal (0.25 µM copper concentration. Cells of the Cu population were able to multiply at a Cu2+ concentration 16 times higher than that of the control population at a normal light intensity and at a Cu2+ concentration 64 times higher when cultivated in dim light. The potential quantum yield of photosystem II (FV/FM ratio under copper stress was also significantly higher for the Cu population than for Re and control populations. The Re population showed only residual tolerance towards the elevated concentration of copper, which is revealed by an FV/FM ratio slightly higher than in the control population under Cu2+ stress in dim light or in darkness. We postulate that in the Chlamydomonas populations studied in this paper, at least two mechanisms of copper tolerance operate. The first mechanism is maintained during cultivation at a standard copper concentration and seems to be connected with photosynthetic apparatus. This mechanism, however, has only low adaptive value under excess of copper. The other mechanism, with a much higher adaptive value, is probably connected with Cu2+ homeostasis at the cellular level, but is lost during cultivation at a normal copper concentration.

  19. Jannovar: a java library for exome annotation.

    Science.gov (United States)

    Jäger, Marten; Wang, Kai; Bauer, Sebastian; Smedley, Damian; Krawitz, Peter; Robinson, Peter N

    2014-05-01

    Transcript-based annotation and pedigree analysis are two basic steps in the computational analysis of whole-exome sequencing experiments in genetic diagnostics and disease-gene discovery projects. Here, we present Jannovar, a stand-alone Java application as well as a Java library designed to be used in larger software frameworks for exome and genome analysis. Jannovar uses an interval tree to identify all transcripts affected by a given variant, and provides Human Genome Variation Society-compliant annotations both for variants affecting coding sequences and splice junctions as well as untranslated regions and noncoding RNA transcripts. Jannovar can also perform family-based pedigree analysis with Variant Call Format (VCF) files with data from members of a family segregating a Mendelian disorder. Using a desktop computer, Jannovar requires a few seconds to annotate a typical VCF file with exome data. Jannovar is freely available under the BSD2 license. Source code as well as the Java application and library file can be downloaded from http://compbio.charite.de (with tutorial) and https://github.com/charite/jannovar. © 2014 WILEY PERIODICALS, INC.

  20. Graph-based sequence annotation using a data integration approach.

    Science.gov (United States)

    Pesch, Robert; Lysenko, Artem; Hindle, Matthew; Hassani-Pak, Keywan; Thiele, Ralf; Rawlings, Christopher; Köhler, Jacob; Taubert, Jan

    2008-08-25

    The automated annotation of data from high throughput sequencing and genomics experiments is a significant challenge for bioinformatics. Most current approaches rely on sequential pipelines of gene finding and gene function prediction methods that annotate a gene with information from different reference data sources. Each function prediction method contributes evidence supporting a functional assignment. Such approaches generally ignore the links between the information in the reference datasets. These links, however, are valuable for assessing the plausibility of a function assignment and can be used to evaluate the confidence in a prediction. We are working towards a novel annotation system that uses the network of information supporting the function assignment to enrich the annotation process for use by expert curators and predicting the function of previously unannotated genes. In this paper we describe our success in the first stages of this development. We present the data integration steps that are needed to create the core database of integrated reference databases (UniProt, PFAM, PDB, GO and the pathway database Ara-Cyc) which has been established in the ONDEX data integration system. We also present a comparison between different methods for integration of GO terms as part of the function assignment pipeline and discuss the consequences of this analysis for improving the accuracy of gene function annotation. The methods and algorithms presented in this publication are an integral part of the ONDEX system which is freely available from http://ondex.sf.net/.

  1. Linking Disparate Datasets of the Earth Sciences with the SemantEco Annotator

    Science.gov (United States)

    Seyed, P.; Chastain, K.; McGuinness, D. L.

    2013-12-01

    Use of Semantic Web technologies for data management in the Earth sciences (and beyond) has great potential but is still in its early stages, since the challenges of translating data into a more explicit or semantic form for immediate use within applications has not been fully addressed. In this abstract we help address this challenge by introducing the SemantEco Annotator, which enables anyone, regardless of expertise, to semantically annotate tabular Earth Science data and translate it into linked data format, while applying the logic inherent in community-standard vocabularies to guide the process. The Annotator was conceived under a desire to unify dataset content from a variety of sources under common vocabularies, for use in semantically-enabled web applications. Our current use case employs linked data generated by the Annotator for use in the SemantEco environment, which utilizes semantics to help users explore, search, and visualize water or air quality measurement and species occurrence data through a map-based interface. The generated data can also be used immediately to facilitate discovery and search capabilities within 'big data' environments. The Annotator provides a method for taking information about a dataset, that may only be known to its maintainers, and making it explicit, in a uniform and machine-readable fashion, such that a person or information system can more easily interpret the underlying structure and meaning. Its primary mechanism is to enable a user to formally describe how columns of a tabular dataset relate and/or describe entities. For example, if a user identifies columns for latitude and longitude coordinates, we can infer the data refers to a point that can be plotted on a map. Further, it can be made explicit that measurements of 'nitrate' and 'NO3-' are of the same entity through vocabulary assignments, thus more easily utilizing data sets that use different nomenclatures. The Annotator provides an extensive and searchable

  2. A chloroplast pathway for the de novo biosynthesis of triacylglycerol in Chlamydomonas reinhardtii

    Energy Technology Data Exchange (ETDEWEB)

    Fan, J.; Xu, C.; Andre, C.

    2011-06-23

    Neutral lipid metabolism has been extensively studied in yeast, plants and mammals. In contrast, little information is available regarding the biochemical pathway, enzymes and regulatory factors involved in the biosynthesis of triacylglycerol (TAG) in microalgae. In the conventional TAG biosynthetic pathway widely accepted for yeast, plants and mammals, TAG is assembled in the endoplasmic reticulum (ER) from its immediate precursor diacylglycerol (DAG) made by ER-specific acyltransferases, and is deposited exclusively in lipid droplets in the cytosol. Here, we demonstrated that the unicellular microalga Chlamydomonas reinhardtii employs a distinct pathway that uses DAG derived almost exclusively from the chloroplast to produce TAG. This unique TAG biosynthesis pathway is largely dependent on de novo fatty acid synthesis, and the TAG formed in this pathway is stored in lipid droplets in both the chloroplast and the cytosol. These findings have wide implications for understanding TAG biosynthesis and storage and other areas of lipid metabolism in microalgae and other organisms.

  3. High-level 13C-enrichment of random and synchronous populations of Chlamydomonas reinhardii

    International Nuclear Information System (INIS)

    Price, R.L.; Crissman, H.A.; Martin, J.C.; Kollman, V.H.

    1975-01-01

    The alga Chlamydomonas reinhardii was grown in suspension culture at high levels of 13 C-enrichment (98 mol percent) both in synchronous and random populations for the purpose of investigating possible macro- and ultrastructural changes in the cell as induced by essentially total carbon replacement. The algae, grown in spinner flasks, were analyzed using a newly developed multiparameter flow-system technique applied to characterizing various algal genera. The versatility of this technique provides for measuring and processing several cell characteristics simultaneously and separating cells according to selected combinations of parameters. In these studies, cell volume (by Coulter aperture) and DNA and chlorophyll content were determined simultaneously. Cell ultrastructure was examined at various levels of isotope enrichment and time periods by electron microscopy. The data presented for synchronous growth of this organism demonstrate the absence of biological effects (considering the parameters measured) due to the almost total replacement of cellular 12 C with 13 C. Interpretational problems encountered when looking for biological effects on random populations are discussed

  4. Experimental evolution of an alternating uni- and multicellular life cycle in Chlamydomonas reinhardtii

    Science.gov (United States)

    Ratcliff, William C.; Herron, Matthew D.; Howell, Kathryn; Pentz, Jennifer T.; Rosenzweig, Frank; Travisano, Michael

    2013-01-01

    The transition to multicellularity enabled the evolution of large, complex organisms, but early steps in this transition remain poorly understood. Here we show that multicellular complexity, including development from a single cell, can evolve rapidly in a unicellular organism that has never had a multicellular ancestor. We subject the alga Chlamydomonas reinhardtii to conditions that favour multicellularity, resulting in the evolution of a multicellular life cycle in which clusters reproduce via motile unicellular propagules. While a single-cell genetic bottleneck during ontogeny is widely regarded as an adaptation to limit among-cell conflict, its appearance very early in this transition suggests that it did not evolve for this purpose. Instead, we find that unicellular propagules are adaptive even in the absence of intercellular conflict, maximizing cluster-level fecundity. These results demonstrate that the unicellular bottleneck, a trait essential for evolving multicellular complexity, can arise rapidly via co-option of the ancestral unicellular form. PMID:24193369

  5. Annotation of phenotypic diversity: decoupling data curation and ontology curation using Phenex.

    Science.gov (United States)

    Balhoff, James P; Dahdul, Wasila M; Dececchi, T Alexander; Lapp, Hilmar; Mabee, Paula M; Vision, Todd J

    2014-01-01

    Phenex (http://phenex.phenoscape.org/) is a desktop application for semantically annotating the phenotypic character matrix datasets common in evolutionary biology. Since its initial publication, we have added new features that address several major bottlenecks in the efficiency of the phenotype curation process: allowing curators during the data curation phase to provisionally request terms that are not yet available from a relevant ontology; supporting quality control against annotation guidelines to reduce later manual review and revision; and enabling the sharing of files for collaboration among curators. We decoupled data annotation from ontology development by creating an Ontology Request Broker (ORB) within Phenex. Curators can use the ORB to request a provisional term for use in data annotation; the provisional term can be automatically replaced with a permanent identifier once the term is added to an ontology. We added a set of annotation consistency checks to prevent common curation errors, reducing the need for later correction. We facilitated collaborative editing by improving the reliability of Phenex when used with online folder sharing services, via file change monitoring and continual autosave. With the addition of these new features, and in particular the Ontology Request Broker, Phenex users have been able to focus more effectively on data annotation. Phenoscape curators using Phenex have reported a smoother annotation workflow, with much reduced interruptions from ontology maintenance and file management issues.

  6. Ultraviolet radiation and the snow alga Chlamydomonas nivalis (Bauer) Wille.

    Science.gov (United States)

    Gorton, Holly L; Vogelmann, Thomas C

    2003-06-01

    Aplanospores of Chlamydomonas nivalis are frequently found in high-altitude, persistent snowfields where they are photosynthetically active despite cold temperatures and high levels of visible and ultraviolet (UV) radiation. The goals of this work were to characterize the UV environment of the cells in the snow and to investigate the existence and localization of screening compounds that might prevent UV damage. UV irradiance decreased precipitously in snow, with UV radiation of wavelengths 280-315 nm and UV radiation of wavelengths 315-400 nm dropping to 50% of incident levels in the top 1 and 2 cm, respectively. Isolated cell walls exhibited UV absorbance, possibly by sporopollenin, but this absorbance was weak in images of broken or plasmolyzed cells observed through a UV microscope. The cells also contained UV-absorbing cytoplasmic compounds, with the extrachloroplastic carotenoid astaxanthin providing most of the screening. Additional screening compound(s) soluble in aqueous methanol with an absorption maximum at 335 nm played a minor role. Thus, cells are protected against potentially high levels of UV radiation by the snow itself when they live several centimeters beneath the surface, and they rely on cellular screening compounds, chiefly astaxanthin, when located near the surface where UV fluxes are high.

  7. Automated evaluation of annotators for museum collections using subjective login

    NARCIS (Netherlands)

    Ceolin, D.; Nottamkandath, A.; Fokkink, W.J.; Dimitrakos, Th.; Moona, R.; Patel, Dh.; Harrison McKnight, D.

    2012-01-01

    Museums are rapidly digitizing their collections, and face a huge challenge to annotate every digitized artifact in store. Therefore they are opening up their archives for receiving annotations from experts world-wide. This paper presents an architecture for choosing the most eligible set of

  8. Annotation of the protein coding regions of the equine genome

    DEFF Research Database (Denmark)

    Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.

    2015-01-01

    Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced m...... and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross...

  9. Systems-Wide Analysis of Acclimation Responses to Long-Term Heat Stress and Recovery in the Photosynthetic Model Organism Chlamydomonas reinhardtii[W][OPEN

    Science.gov (United States)

    Hemme, Dorothea; Veyel, Daniel; Mühlhaus, Timo; Sommer, Frederik; Jüppner, Jessica; Unger, Ann-Katrin; Sandmann, Michael; Fehrle, Ines; Schönfelder, Stephanie; Steup, Martin; Geimer, Stefan; Kopka, Joachim; Giavalisco, Patrick; Schroda, Michael

    2014-01-01

    We applied a top-down systems biology approach to understand how Chlamydomonas reinhardtii acclimates to long-term heat stress (HS) and recovers from it. For this, we shifted cells from 25 to 42°C for 24 h and back to 25°C for ≥8 h and monitored abundances of 1856 proteins/protein groups, 99 polar and 185 lipophilic metabolites, and cytological and photosynthesis parameters. Our data indicate that acclimation of Chlamydomonas to long-term HS consists of a temporally ordered, orchestrated implementation of response elements at various system levels. These comprise (1) cell cycle arrest; (2) catabolism of larger molecules to generate compounds with roles in stress protection; (3) accumulation of molecular chaperones to restore protein homeostasis together with compatible solutes; (4) redirection of photosynthetic energy and reducing power from the Calvin cycle to the de novo synthesis of saturated fatty acids to replace polyunsaturated ones in membrane lipids, which are deposited in lipid bodies; and (5) when sinks for photosynthetic energy and reducing power are depleted, resumption of Calvin cycle activity associated with increased photorespiration, accumulation of reactive oxygen species scavengers, and throttling of linear electron flow by antenna uncoupling. During recovery from HS, cells appear to focus on processes allowing rapid resumption of growth rather than restoring pre-HS conditions. PMID:25415976

  10. Genome Annotation and Transcriptomics of Oil-Producing Algae

    Science.gov (United States)

    2015-03-16

    AFRL-OSR-VA-TR-2015-0103 GENOME ANNOTATION AND TRANSCRIPTOMICS OF OIL-PRODUCING ALGAE Sabeeha Merchant UNIVERSITY OF CALIFORNIA LOS ANGELES Final...2010 To 12-31-2014 4. TITLE AND SUBTITLE GENOME ANNOTATION AND TRANSCRIPTOMICS OF OIL-PRODUCING ALGAE 5a. CONTRACT NUMBER FA9550-10-1-0095 5b...NOTES 14. ABSTRACT Most algae accumulate triacylglycerols (TAGs) when they are starved for essential nutrients like N, S, P (or Si in the case of some

  11. Annotating smart environment sensor data for activity learning.

    Science.gov (United States)

    Szewcyzk, S; Dwan, K; Minor, B; Swedlove, B; Cook, D

    2009-01-01

    The pervasive sensing technologies found in smart homes offer unprecedented opportunities for providing health monitoring and assistance to individuals experiencing difficulties living independently at home. In order to monitor the functional health of smart home residents, we need to design technologies that recognize and track the activities that people perform at home. Machine learning techniques can perform this task, but the software algorithms rely upon large amounts of sample data that is correctly labeled with the corresponding activity. Labeling, or annotating, sensor data with the corresponding activity can be time consuming, may require input from the smart home resident, and is often inaccurate. Therefore, in this paper we investigate four alternative mechanisms for annotating sensor data with a corresponding activity label. We evaluate the alternative methods along the dimensions of annotation time, resident burden, and accuracy using sensor data collected in a real smart apartment.

  12. Annotation Method (AM): SE7_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE7_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  13. Annotation Method (AM): SE36_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE36_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  14. Annotation Method (AM): SE14_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE14_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  15. Annotation Method (AM): SE33_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE33_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  16. Annotation Method (AM): SE12_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE12_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  17. Annotation Method (AM): SE20_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE20_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  18. Annotation Method (AM): SE2_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE2_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  19. Annotation Method (AM): SE28_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE28_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  20. Annotation Method (AM): SE11_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE11_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  1. Annotation Method (AM): SE17_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE17_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  2. Annotation Method (AM): SE10_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE10_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  3. Annotation Method (AM): SE4_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE4_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  4. Annotation Method (AM): SE9_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE9_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  5. Annotation Method (AM): SE3_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE3_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  6. Annotation Method (AM): SE25_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE25_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  7. Annotation Method (AM): SE30_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE30_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  8. Annotation Method (AM): SE16_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE16_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  9. Annotation Method (AM): SE29_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE29_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  10. Annotation Method (AM): SE35_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE35_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  11. Annotation Method (AM): SE6_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE6_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  12. Annotation Method (AM): SE1_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE1_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  13. Annotation Method (AM): SE8_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE8_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  14. Annotation Method (AM): SE13_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE13_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  15. Annotation Method (AM): SE26_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE26_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  16. Annotation Method (AM): SE27_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE27_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  17. Annotation Method (AM): SE34_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE34_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  18. Annotation Method (AM): SE5_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available base search. Peaks with no hit to these databases are then selected to secondary se...arch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are ma...SE5_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary data

  19. Annotation Method (AM): SE15_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE15_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  20. Annotation Method (AM): SE31_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE31_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  1. Annotation Method (AM): SE32_AM1 [Metabolonote[Archive

    Lifescience Database Archive (English)

    Full Text Available abase search. Peaks with no hit to these databases are then selected to secondary s...earch using exactMassDB and Pep1000 databases. After the database search processes, each database hits are m...SE32_AM1 PowerGet annotation A1 In annotation process, KEGG, KNApSAcK and LipidMAPS are used for primary dat

  2. The TOR Signaling Network in the Model Unicellular Green Alga Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    María Esther Pérez-Pérez

    2017-07-01

    Full Text Available Cell growth is tightly coupled to nutrient availability. The target of rapamycin (TOR kinase transmits nutritional and environmental cues to the cellular growth machinery. TOR functions in two distinct multiprotein complexes, termed TOR complex 1 (TORC1 and TOR complex 2 (TORC2. While the structure and functions of TORC1 are highly conserved in all eukaryotes, including algae and plants, TORC2 core proteins seem to be missing in photosynthetic organisms. TORC1 controls cell growth by promoting anabolic processes, including protein synthesis and ribosome biogenesis, and inhibiting catabolic processes such as autophagy. Recent studies identified rapamycin-sensitive TORC1 signaling regulating cell growth, autophagy, lipid metabolism, and central metabolic pathways in the model unicellular green alga Chlamydomonas reinhardtii. The central role that microalgae play in global biomass production, together with the high biotechnological potential of these organisms in biofuel production, has drawn attention to the study of proteins that regulate cell growth such as the TOR kinase. In this review we discuss the recent progress on TOR signaling in algae.

  3. The TOR Signaling Network in the Model Unicellular Green Alga Chlamydomonas reinhardtii.

    Science.gov (United States)

    Pérez-Pérez, María Esther; Couso, Inmaculada; Crespo, José L

    2017-07-12

    Cell growth is tightly coupled to nutrient availability. The target of rapamycin (TOR) kinase transmits nutritional and environmental cues to the cellular growth machinery. TOR functions in two distinct multiprotein complexes, termed TOR complex 1 (TORC1) and TOR complex 2 (TORC2). While the structure and functions of TORC1 are highly conserved in all eukaryotes, including algae and plants, TORC2 core proteins seem to be missing in photosynthetic organisms. TORC1 controls cell growth by promoting anabolic processes, including protein synthesis and ribosome biogenesis, and inhibiting catabolic processes such as autophagy. Recent studies identified rapamycin-sensitive TORC1 signaling regulating cell growth, autophagy, lipid metabolism, and central metabolic pathways in the model unicellular green alga Chlamydomonas reinhardtii . The central role that microalgae play in global biomass production, together with the high biotechnological potential of these organisms in biofuel production, has drawn attention to the study of proteins that regulate cell growth such as the TOR kinase. In this review we discuss the recent progress on TOR signaling in algae.

  4. Hydrogen production by Chlamydomonas reinhardtii under light driven sulfur deprived condition

    Energy Technology Data Exchange (ETDEWEB)

    Vijayaraghavan, Krishnan; Karthik, Rajendran [Biotechnology Research Division, Department of Biotechnology, Prathyusha Institute of Technology and Management, Aranvoyalkuppam, Thiruvallur District 602025, Tamil Nadu (India); Kamala Nalini, S.P. [Department of Biotechnology, Vel Group of Educational Institutions, Avadi, Alamadhi Road, Chennai 600062, Tamil Nadu (India)

    2009-10-15

    This article explores the possibility of demonstrating sustainable photohydrogen production using Chlamydomonas reinhardtii when grown in sulfur deprived photoautotrophic condition. The hydrogen evolving capability of the algal species was monitored based on alternating light and dark period. Investigation was carried out during the day time in order to exploit the solar energy for meeting the demand of the light period. The results showed that when the reactor was operated at varying photoperiod namely 2, 3 and 4 h of alternating light and dark period, the gas generation was found to be 32 {+-} 4, 63 {+-} 7 and 52 {+-} 5 mL/h, while the corresponding hydrogen content was 47, 86 and 87% respectively. Functional components of hydrogen generation reaction centers were also analyzed, which showed that the PS(I) reaction centers were involved in hydrogen production pathway, as the light absorption by PS(I) was prerequisite for hydrogen generation under sulfur deprived photoautotrophic condition. The findings showed a higher gas yield and hydrogen content under dark period, whereas under light period the gas content was below detectable level for hydrogen due to the reversible hydrogenase reaction. (author)

  5. Software for computing and annotating genomic ranges.

    Science.gov (United States)

    Lawrence, Michael; Huber, Wolfgang; Pagès, Hervé; Aboyoun, Patrick; Carlson, Marc; Gentleman, Robert; Morgan, Martin T; Carey, Vincent J

    2013-01-01

    We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.

  6. FALDO: a semantic standard for describing the location of nucleotide and protein feature annotation.

    Science.gov (United States)

    Bolleman, Jerven T; Mungall, Christopher J; Strozzi, Francesco; Baran, Joachim; Dumontier, Michel; Bonnal, Raoul J P; Buels, Robert; Hoehndorf, Robert; Fujisawa, Takatomo; Katayama, Toshiaki; Cock, Peter J A

    2016-06-13

    Nucleotide and protein sequence feature annotations are essential to understand biology on the genomic, transcriptomic, and proteomic level. Using Semantic Web technologies to query biological annotations, there was no standard that described this potentially complex location information as subject-predicate-object triples. We have developed an ontology, the Feature Annotation Location Description Ontology (FALDO), to describe the positions of annotated features on linear and circular sequences. FALDO can be used to describe nucleotide features in sequence records, protein annotations, and glycan binding sites, among other features in coordinate systems of the aforementioned "omics" areas. Using the same data format to represent sequence positions that are independent of file formats allows us to integrate sequence data from multiple sources and data types. The genome browser JBrowse is used to demonstrate accessing multiple SPARQL endpoints to display genomic feature annotations, as well as protein annotations from UniProt mapped to genomic locations. Our ontology allows users to uniformly describe - and potentially merge - sequence annotations from multiple sources. Data sources using FALDO can prospectively be retrieved using federalised SPARQL queries against public SPARQL endpoints and/or local private triple stores.

  7. EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation.

    Science.gov (United States)

    Pafilis, Evangelos; Buttigieg, Pier Luigi; Ferrell, Barbra; Pereira, Emiliano; Schnetzer, Julia; Arvanitidis, Christos; Jensen, Lars Juhl

    2016-01-01

    The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15-25% and helps curators to detect terms that would otherwise have been missed. Database URL: https://extract.hcmr.gr/. © The Author(s) 2016. Published by Oxford University Press.

  8. miRBase: integrating microRNA annotation and deep-sequencing data.

    Science.gov (United States)

    Kozomara, Ana; Griffiths-Jones, Sam

    2011-01-01

    miRBase is the primary online repository for all microRNA sequences and annotation. The current release (miRBase 16) contains over 15,000 microRNA gene loci in over 140 species, and over 17,000 distinct mature microRNA sequences. Deep-sequencing technologies have delivered a sharp rise in the rate of novel microRNA discovery. We have mapped reads from short RNA deep-sequencing experiments to microRNAs in miRBase and developed web interfaces to view these mappings. The user can view all read data associated with a given microRNA annotation, filter reads by experiment and count, and search for microRNAs by tissue- and stage-specific expression. These data can be used as a proxy for relative expression levels of microRNA sequences, provide detailed evidence for microRNA annotations and alternative isoforms of mature microRNAs, and allow us to revisit previous annotations. miRBase is available online at: http://www.mirbase.org/.

  9. Online Metacognitive Strategies, Hypermedia Annotations, and Motivation on Hypertext Comprehension

    Science.gov (United States)

    Shang, Hui-Fang

    2016-01-01

    This study examined the effect of online metacognitive strategies, hypermedia annotations, and motivation on reading comprehension in a Taiwanese hypertext environment. A path analysis model was proposed based on the assumption that if English as a foreign language learners frequently use online metacognitive strategies and hypermedia annotations,…

  10. Annotating with Propp's Morphology of the Folktale: Reproducibility and Trainability

    NARCIS (Netherlands)

    Fisseni, B.; Kurji, A.; Löwe, B.

    2014-01-01

    We continue the study of the reproducibility of Propp’s annotations from Bod et al. (2012). We present four experiments in which test subjects were taught Propp’s annotation system; we conclude that Propp’s system needs a significant amount of training, but that with sufficient time investment, it

  11. Annotation of nerve cord transcriptome in earthworm Eisenia fetida

    Directory of Open Access Journals (Sweden)

    Vasanthakumar Ponesakki

    2017-12-01

    Full Text Available In annelid worms, the nerve cord serves as a crucial organ to control the sensory and behavioral physiology. The inadequate genome resource of earthworms has prioritized the comprehensive analysis of their transcriptome dataset to monitor the genes express in the nerve cord and predict their role in the neurotransmission and sensory perception of the species. The present study focuses on identifying the potential transcripts and predicting their functional features by annotating the transcriptome dataset of nerve cord tissues prepared by Gong et al., 2010 from the earthworm Eisenia fetida. Totally 9762 transcripts were successfully annotated against the NCBI nr database using the BLASTX algorithm and among them 7680 transcripts were assigned to a total of 44,354 GO terms. The conserve domain analysis indicated the over representation of P-loop NTPase domain and calcium binding EF-hand domain. The COG functional annotation classified 5860 transcript sequences into 25 functional categories. Further, 4502 contig sequences were found to map with 124 KEGG pathways. The annotated contig dataset exhibited 22 crucial neuropeptides having considerable matches to the marine annelid Platynereis dumerilii, suggesting their possible role in neurotransmission and neuromodulation. In addition, 108 human stem cell marker homologs were identified including the crucial epigenetic regulators, transcriptional repressors and cell cycle regulators, which may contribute to the neuronal and segmental regeneration. The complete functional annotation of this nerve cord transcriptome can be further utilized to interpret genetic and molecular mechanisms associated with neuronal development, nervous system regeneration and nerve cord function.

  12. A topic modeling approach for web service annotation

    Directory of Open Access Journals (Sweden)

    Leandro Ordóñez-Ante

    2014-06-01

    Full Text Available The actual implementation of semantic-based mechanisms for service retrieval has been restricted, given the resource-intensive procedure involved in the formal specification of services, which generally comprises associating semantic annotations to their documentation sources. Typically, developer performs such a procedure by hand, requiring specialized knowledge on models for semantic description of services (e.g. OWL-S, WSMO, SAWSDL, as well as formal specifications of knowledge. Thus, this semantic-based service description procedure turns out to be a cumbersome and error-prone task. This paper introduces a proposal for service annotation, based on processing web service documentation for extracting information regarding its offered capabilities. By uncovering the hidden semantic structure of such information through statistical analysis techniques, we are able to associate meaningful annotations to the services operations/resources, while grouping those operations into non-exclusive semantic related categories. This research paper belongs to the TelComp 2.0 project, which Colciencas and University of Cauca founded in cooperation.

  13. Tagging like Humans: Diverse and Distinct Image Annotation

    KAUST Repository

    Wu, Baoyuan

    2018-03-31

    In this work we propose a new automatic image annotation model, dubbed {\\\\bf diverse and distinct image annotation} (D2IA). The generative model D2IA is inspired by the ensemble of human annotations, which create semantically relevant, yet distinct and diverse tags. In D2IA, we generate a relevant and distinct tag subset, in which the tags are relevant to the image contents and semantically distinct to each other, using sequential sampling from a determinantal point process (DPP) model. Multiple such tag subsets that cover diverse semantic aspects or diverse semantic levels of the image contents are generated by randomly perturbing the DPP sampling process. We leverage a generative adversarial network (GAN) model to train D2IA. Extensive experiments including quantitative and qualitative comparisons, as well as human subject studies, on two benchmark datasets demonstrate that the proposed model can produce more diverse and distinct tags than the state-of-the-arts.

  14. An Atlas of annotations of Hydra vulgaris transcriptome.

    Science.gov (United States)

    Evangelista, Daniela; Tripathi, Kumar Parijat; Guarracino, Mario Rosario

    2016-09-22

    RNA sequencing takes advantage of the Next Generation Sequencing (NGS) technologies for analyzing RNA transcript counts with an excellent accuracy. Trying to interpret this huge amount of data in biological information is still a key issue, reason for which the creation of web-resources useful for their analysis is highly desiderable. Starting from a previous work, Transcriptator, we present the Atlas of Hydra's vulgaris, an extensible web tool in which its complete transcriptome is annotated. In order to provide to the users an advantageous resource that include the whole functional annotated transcriptome of Hydra vulgaris water polyp, we implemented the Atlas web-tool contains 31.988 accesible and downloadable transcripts of this non-reference model organism. Atlas, as a freely available resource, can be considered a valuable tool to rapidly retrieve functional annotation for transcripts differentially expressed in Hydra vulgaris exposed to the distinct experimental treatments. WEB RESOURCE URL: http://www-labgtp.na.icar.cnr.it/Atlas .

  15. GSV Annotated Bibliography

    Energy Technology Data Exchange (ETDEWEB)

    Roberts, Randy S. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Pope, Paul A. [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Jiang, Ming [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Trucano, Timothy G. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Aragon, Cecilia R. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Ni, Kevin [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Wei, Thomas [Argonne National Lab. (ANL), Argonne, IL (United States); Chilton, Lawrence K. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Bakel, Alan [Argonne National Lab. (ANL), Argonne, IL (United States)

    2011-06-14

    The following annotated bibliography was developed as part of the Geospatial Algorithm Veri cation and Validation (GSV) project for the Simulation, Algorithms and Modeling program of NA-22. Veri cation and Validation of geospatial image analysis algorithms covers a wide range of technologies. Papers in the bibliography are thus organized into the following ve topic areas: Image processing and analysis, usability and validation of geospatial image analysis algorithms, image distance measures, scene modeling and image rendering, and transportation simulation models.

  16. The integration of a metadata generation framework in a music annotation workflow

    OpenAIRE

    Corthaut, Nik; Lippens, Stefaan; Govaerts, Sten; Duval, Erik; Martens, Jean-Pierre

    2009-01-01

    In the MuziK project we try to automate the typically hard task of annotating music files manually. This annotation is used for music recommendation and for automated playlist creation. The music experts of Aristo Music (http://www.aristomusic.com) defined the data fields. High quality annotations are required since the results, playlists, are used in commercial live settings and the cost of a wrong selection is high [1].

  17. Automatic annotation of lecture videos for multimedia driven pedagogical platforms

    Directory of Open Access Journals (Sweden)

    Ali Shariq Imran

    2016-12-01

    Full Text Available Today’s eLearning websites are heavily loaded with multimedia contents, which are often unstructured, unedited, unsynchronized, and lack inter-links among different multimedia components. Hyperlinking different media modality may provide a solution for quick navigation and easy retrieval of pedagogical content in media driven eLearning websites. In addition, finding meta-data information to describe and annotate media content in eLearning platforms is challenging, laborious, prone to errors, and time-consuming task. Thus annotations for multimedia especially of lecture videos became an important part of video learning objects. To address this issue, this paper proposes three major contributions namely, automated video annotation, the 3-Dimensional (3D tag clouds, and the hyper interactive presenter (HIP eLearning platform. Combining existing state-of-the-art SIFT together with tag cloud, a novel approach for automatic lecture video annotation for the HIP is proposed. New video annotations are implemented automatically providing the needed random access in lecture videos within the platform, and a 3D tag cloud is proposed as a new way of user interaction mechanism. A preliminary study of the usefulness of the system has been carried out, and the initial results suggest that 70% of the students opted for using HIP as their preferred eLearning platform at Gjøvik University College (GUC.

  18. DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication.

    Science.gov (United States)

    Tanizawa, Yasuhiro; Fujisawa, Takatomo; Nakamura, Yasukazu

    2018-03-15

    We developed a prokaryotic genome annotation pipeline, DFAST, that also supports genome submission to public sequence databases. DFAST was originally started as an on-line annotation server, and to date, over 7000 jobs have been processed since its first launch in 2016. Here, we present a newly implemented background annotation engine for DFAST, which is also available as a standalone command-line program. The new engine can annotate a typical-sized bacterial genome within 10 min, with rich information such as pseudogenes, translation exceptions and orthologous gene assignment between given reference genomes. In addition, the modular framework of DFAST allows users to customize the annotation workflow easily and will also facilitate extensions for new functions and incorporation of new tools in the future. The software is implemented in Python 3 and runs in both Python 2.7 and 3.4-on Macintosh and Linux systems. It is freely available at https://github.com/nigyta/dfast_core/under the GPLv3 license with external binaries bundled in the software distribution. An on-line version is also available at https://dfast.nig.ac.jp/. yn@nig.ac.jp. Supplementary data are available at Bioinformatics online.

  19. Nickel and low CO2-controlled motility in Chlamydomonas through complementation of a paralyzed flagella mutant with chemically regulated promoters

    Directory of Open Access Journals (Sweden)

    Rosenbaum Joel L

    2011-01-01

    Full Text Available Abstract Background Chlamydomonas reinhardtii is a model system for the biology of unicellular green algae. Chemically regulated promoters, such as the nickel-inducible CYC6 or the low CO2-inducible CAH1 promoter, may prove useful for expressing, at precise times during its cell cycle, proteins with relevant biological functions, or complementing mutants in genes encoding such proteins. To this date, this has not been reported for the above promoters. Results We fused the CYC6 and CAH1 promoters to an HA-tagged RSP3 gene, encoding a protein of the flagellar radial spoke complex. The constructs were used for chemically regulated complementation of the pf14 mutant, carrying an ochre mutation in the RSP3 gene. 7 to 8% of the transformants showed cells with restored motility after induction with nickel or transfer to low CO2 conditions, but not in non-inducing conditions. Maximum complementation (5% motile cells was reached with very different kinetics (5-6 hours for CAH1, 48 hours for CYC6. The two inducible promoters drive much lower levels of RSP3 protein expression than the constitutive PSAD promoter, which shows almost complete rescue of motility. Conclusions To our knowledge, this is the first example of the use of the CYC6 or CAH1 promoters to perform a chemically regulated complementation of a Chlamydomonas mutant. Based on our data, the CYC6 and CAH1 promoters should be capable of fully complementing mutants in genes whose products exert their biological activity at low concentrations.

  20. Combining rules, background knowledge and change patterns to maintain semantic annotations.

    Science.gov (United States)

    Cardoso, Silvio Domingos; Chantal, Reynaud-Delaître; Da Silveira, Marcos; Pruski, Cédric

    2017-01-01

    Knowledge Organization Systems (KOS) play a key role in enriching biomedical information in order to make it machine-understandable and shareable. This is done by annotating medical documents, or more specifically, associating concept labels from KOS with pieces of digital information, e.g., images or texts. However, the dynamic nature of KOS may impact the annotations, thus creating a mismatch between the evolved concept and the associated information. To solve this problem, methods to maintain the quality of the annotations are required. In this paper, we define a framework based on rules, background knowledge and change patterns to drive the annotation adaption process. We evaluate experimentally the proposed approach in realistic cases-studies and demonstrate the overall performance of our approach in different KOS considering the precision, recall, F1-score and AUC value of the system.

  1. Phylogenetic molecular function annotation

    International Nuclear Information System (INIS)

    Engelhardt, Barbara E; Jordan, Michael I; Repo, Susanna T; Brenner, Steven E

    2009-01-01

    It is now easier to discover thousands of protein sequences in a new microbial genome than it is to biochemically characterize the specific activity of a single protein of unknown function. The molecular functions of protein sequences have typically been predicted using homology-based computational methods, which rely on the principle that homologous proteins share a similar function. However, some protein families include groups of proteins with different molecular functions. A phylogenetic approach for predicting molecular function (sometimes called 'phylogenomics') is an effective means to predict protein molecular function. These methods incorporate functional evidence from all members of a family that have functional characterizations using the evolutionary history of the protein family to make robust predictions for the uncharacterized proteins. However, they are often difficult to apply on a genome-wide scale because of the time-consuming step of reconstructing the phylogenies of each protein to be annotated. Our automated approach for function annotation using phylogeny, the SIFTER (Statistical Inference of Function Through Evolutionary Relationships) methodology, uses a statistical graphical model to compute the probabilities of molecular functions for unannotated proteins. Our benchmark tests showed that SIFTER provides accurate functional predictions on various protein families, outperforming other available methods.

  2. Annotation of rule-based models with formal semantics to enable creation, analysis, reuse and visualization

    Science.gov (United States)

    Misirli, Goksel; Cavaliere, Matteo; Waites, William; Pocock, Matthew; Madsen, Curtis; Gilfellon, Owen; Honorato-Zimmer, Ricardo; Zuliani, Paolo; Danos, Vincent; Wipat, Anil

    2016-01-01

    Motivation: Biological systems are complex and challenging to model and therefore model reuse is highly desirable. To promote model reuse, models should include both information about the specifics of simulations and the underlying biology in the form of metadata. The availability of computationally tractable metadata is especially important for the effective automated interpretation and processing of models. Metadata are typically represented as machine-readable annotations which enhance programmatic access to information about models. Rule-based languages have emerged as a modelling framework to represent the complexity of biological systems. Annotation approaches have been widely used for reaction-based formalisms such as SBML. However, rule-based languages still lack a rich annotation framework to add semantic information, such as machine-readable descriptions, to the components of a model. Results: We present an annotation framework and guidelines for annotating rule-based models, encoded in the commonly used Kappa and BioNetGen languages. We adapt widely adopted annotation approaches to rule-based models. We initially propose a syntax to store machine-readable annotations and describe a mapping between rule-based modelling entities, such as agents and rules, and their annotations. We then describe an ontology to both annotate these models and capture the information contained therein, and demonstrate annotating these models using examples. Finally, we present a proof of concept tool for extracting annotations from a model that can be queried and analyzed in a uniform way. The uniform representation of the annotations can be used to facilitate the creation, analysis, reuse and visualization of rule-based models. Although examples are given, using specific implementations the proposed techniques can be applied to rule-based models in general. Availability and implementation: The annotation ontology for rule-based models can be found at http

  3. Protein Annotators' Assistant: A Novel Application of Information Retrieval Techniques.

    Science.gov (United States)

    Wise, Michael J.

    2000-01-01

    Protein Annotators' Assistant (PAA) is a software system which assists protein annotators in assigning functions to newly sequenced proteins. PAA employs a number of information retrieval techniques in a novel setting and is thus related to text categorization, where multiple categories may be suggested, except that in this case none of the…

  4. Crowdsourcing Twitter annotations to identify first-hand experiences of prescription drug use.

    Science.gov (United States)

    Alvaro, Nestor; Conway, Mike; Doan, Son; Lofi, Christoph; Overington, John; Collier, Nigel

    2015-12-01

    Self-reported patient data has been shown to be a valuable knowledge source for post-market pharmacovigilance. In this paper we propose using the popular micro-blogging service Twitter to gather evidence about adverse drug reactions (ADRs) after firstly having identified micro-blog messages (also know as "tweets") that report first-hand experience. In order to achieve this goal we explore machine learning with data crowdsourced from laymen annotators. With the help of lay annotators recruited from CrowdFlower we manually annotated 1548 tweets containing keywords related to two kinds of drugs: SSRIs (eg. Paroxetine), and cognitive enhancers (eg. Ritalin). Our results show that inter-annotator agreement (Fleiss' kappa) for crowdsourcing ranks in moderate agreement with a pair of experienced annotators (Spearman's Rho=0.471). We utilized the gold standard annotations from CrowdFlower for automatically training a range of supervised machine learning models to recognize first-hand experience. F-Score values are reported for 6 of these techniques with the Bayesian Generalized Linear Model being the best (F-Score=0.64 and Informedness=0.43) when combined with a selected set of features obtained by using information gain criteria. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  5. Semi-supervised learning based probabilistic latent semantic analysis for automatic image annotation

    Institute of Scientific and Technical Information of China (English)

    Tian Dongping

    2017-01-01

    In recent years, multimedia annotation problem has been attracting significant research attention in multimedia and computer vision areas, especially for automatic image annotation, whose purpose is to provide an efficient and effective searching environment for users to query their images more easily.In this paper, a semi-supervised learning based probabilistic latent semantic analysis ( PL-SA) model for automatic image annotation is presenred.Since it' s often hard to obtain or create la-beled images in large quantities while unlabeled ones are easier to collect, a transductive support vector machine ( TSVM) is exploited to enhance the quality of the training image data.Then, differ-ent image features with different magnitudes will result in different performance for automatic image annotation.To this end, a Gaussian normalization method is utilized to normalize different features extracted from effective image regions segmented by the normalized cuts algorithm so as to reserve the intrinsic content of images as complete as possible.Finally, a PLSA model with asymmetric mo-dalities is constructed based on the expectation maximization( EM) algorithm to predict a candidate set of annotations with confidence scores.Extensive experiments on the general-purpose Corel5k dataset demonstrate that the proposed model can significantly improve performance of traditional PL-SA for the task of automatic image annotation.

  6. Multi-Label Classification Based on Low Rank Representation for Image Annotation

    Directory of Open Access Journals (Sweden)

    Qiaoyu Tan

    2017-01-01

    Full Text Available Annotating remote sensing images is a challenging task for its labor demanding annotation process and requirement of expert knowledge, especially when images can be annotated with multiple semantic concepts (or labels. To automatically annotate these multi-label images, we introduce an approach called Multi-Label Classification based on Low Rank Representation (MLC-LRR. MLC-LRR firstly utilizes low rank representation in the feature space of images to compute the low rank constrained coefficient matrix, then it adapts the coefficient matrix to define a feature-based graph and to capture the global relationships between images. Next, it utilizes low rank representation in the label space of labeled images to construct a semantic graph. Finally, these two graphs are exploited to train a graph-based multi-label classifier. To validate the performance of MLC-LRR against other related graph-based multi-label methods in annotating images, we conduct experiments on a public available multi-label remote sensing images (Land Cover. We perform additional experiments on five real-world multi-label image datasets to further investigate the performance of MLC-LRR. Empirical study demonstrates that MLC-LRR achieves better performance on annotating images than these comparing methods across various evaluation criteria; it also can effectively exploit global structure and label correlations of multi-label images.

  7. Software for computing and annotating genomic ranges.

    Directory of Open Access Journals (Sweden)

    Michael Lawrence

    Full Text Available We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.

  8. Consumer energy research: an annotated bibliography

    Energy Technology Data Exchange (ETDEWEB)

    Anderson, C.D.; McDougall, G.H.G.

    1980-01-01

    This document is an updated and expanded version of an earlier annotated bibliography by Dr. C. Dennis Anderson and Carman Cullen (A Review and Annotation of Energy Research on Consumers, March 1978). It is the final draft of the major report that will be published in English and French and made publicly available through the Consumer Research and Evaluation Branch of Consumer and Corporate Affairs, Canada. Two agencies granting permission to include some of their energy abstracts are the Rand Corporation and the DOE Technical Information Center. The bibliography consists mainly of empirical studies, including surveys and experiments. It also includes a number of descriptive and econometric studies that utilize secondary data. Many of the studies provide summaries of research is specific areas, and point out directions for future research efforts. 14 tables.

  9. Public Relations: Selected, Annotated Bibliography.

    Science.gov (United States)

    Demo, Penny

    Designed for students and practitioners of public relations (PR), this annotated bibliography focuses on recent journal articles and ERIC documents. The 34 citations include the following: (1) surveys of public relations professionals on career-related education; (2) literature reviews of research on measurement and evaluation of PR and…

  10. Persuasion: A Selected, Annotated Bibliography.

    Science.gov (United States)

    McDermott, Steven T.

    Designed to reflect the diversity of approaches to persuasion, this annotated bibliography cites materials selected for their contribution to that diversity as well as for being relatively current and/or especially significant representatives of particular approaches. The bibliography starts with a list of 17 general textbooks on approaches to…

  11. Annotating spatio-temporal datasets for meaningful analysis in the Web

    Science.gov (United States)

    Stasch, Christoph; Pebesma, Edzer; Scheider, Simon

    2014-05-01

    More and more environmental datasets that vary in space and time are available in the Web. This comes along with an advantage of using the data for other purposes than originally foreseen, but also with the danger that users may apply inappropriate analysis procedures due to lack of important assumptions made during the data collection process. In order to guide towards a meaningful (statistical) analysis of spatio-temporal datasets available in the Web, we have developed a Higher-Order-Logic formalism that captures some relevant assumptions in our previous work [1]. It allows to proof on meaningful spatial prediction and aggregation in a semi-automated fashion. In this poster presentation, we will present a concept for annotating spatio-temporal datasets available in the Web with concepts defined in our formalism. Therefore, we have defined a subset of the formalism as a Web Ontology Language (OWL) pattern. It allows capturing the distinction between the different spatio-temporal variable types, i.e. point patterns, fields, lattices and trajectories, that in turn determine whether a particular dataset can be interpolated or aggregated in a meaningful way using a certain procedure. The actual annotations that link spatio-temporal datasets with the concepts in the ontology pattern are provided as Linked Data. In order to allow data producers to add the annotations to their datasets, we have implemented a Web portal that uses a triple store at the backend to store the annotations and to make them available in the Linked Data cloud. Furthermore, we have implemented functions in the statistical environment R to retrieve the RDF annotations and, based on these annotations, to support a stronger typing of spatio-temporal datatypes guiding towards a meaningful analysis in R. [1] Stasch, C., Scheider, S., Pebesma, E., Kuhn, W. (2014): "Meaningful spatial prediction and aggregation", Environmental Modelling & Software, 51, 149-165.

  12. Automatic Annotation Method on Learners' Opinions in Case Method Discussion

    Science.gov (United States)

    Samejima, Masaki; Hisakane, Daichi; Komoda, Norihisa

    2015-01-01

    Purpose: The purpose of this paper is to annotate an attribute of a problem, a solution or no annotation on learners' opinions automatically for supporting the learners' discussion without a facilitator. The case method aims at discussing problems and solutions in a target case. However, the learners miss discussing some of problems and solutions.…

  13. High-performance web services for querying gene and variant annotation.

    Science.gov (United States)

    Xin, Jiwen; Mark, Adam; Afrasiabi, Cyrus; Tsueng, Ginger; Juchler, Moritz; Gopal, Nikhil; Stupp, Gregory S; Putman, Timothy E; Ainscough, Benjamin J; Griffith, Obi L; Torkamani, Ali; Whetzel, Patricia L; Mungall, Christopher J; Mooney, Sean D; Su, Andrew I; Wu, Chunlei

    2016-05-06

    Efficient tools for data management and integration are essential for many aspects of high-throughput biology. In particular, annotations of genes and human genetic variants are commonly used but highly fragmented across many resources. Here, we describe MyGene.info and MyVariant.info, high-performance web services for querying gene and variant annotation information. These web services are currently accessed more than three million times permonth. They also demonstrate a generalizable cloud-based model for organizing and querying biological annotation information. MyGene.info and MyVariant.info are provided as high-performance web services, accessible at http://mygene.info and http://myvariant.info . Both are offered free of charge to the research community.

  14. Modeling multiple time series annotations as noisy distortions of the ground truth: An Expectation-Maximization approach.

    Science.gov (United States)

    Gupta, Rahul; Audhkhasi, Kartik; Jacokes, Zach; Rozga, Agata; Narayanan, Shrikanth

    2018-01-01

    Studies of time-continuous human behavioral phenomena often rely on ratings from multiple annotators. Since the ground truth of the target construct is often latent, the standard practice is to use ad-hoc metrics (such as averaging annotator ratings). Despite being easy to compute, such metrics may not provide accurate representations of the underlying construct. In this paper, we present a novel method for modeling multiple time series annotations over a continuous variable that computes the ground truth by modeling annotator specific distortions. We condition the ground truth on a set of features extracted from the data and further assume that the annotators provide their ratings as modification of the ground truth, with each annotator having specific distortion tendencies. We train the model using an Expectation-Maximization based algorithm and evaluate it on a study involving natural interaction between a child and a psychologist, to predict confidence ratings of the children's smiles. We compare and analyze the model against two baselines where: (i) the ground truth in considered to be framewise mean of ratings from various annotators and, (ii) each annotator is assumed to bear a distinct time delay in annotation and their annotations are aligned before computing the framewise mean.

  15. AUTHOR’S ANNOTATION AS A MANIFESTATION OF THE COMPOSER’S CREATIVE CONCEPTION

    Directory of Open Access Journals (Sweden)

    CIOBANU GHENADIE

    2015-06-01

    Full Text Available Annotation to his own musical works is considered by the author as a form of analysis of these opuses. Designed to provide answers about works, these comments facilitate the perception of contemporary music by performers and the audience. The composer examines various forms of annotations basing himself on their goals and the context of use, and compares them to other genres with informative function, such as the interview, analytical essay, memoirs, personal diary, etc. The article illustrated some possible forms of annotations. Besides a purely informative character of the annotation, the author notes in the conclusions the value of genuine professional analysis, providing a wide circle of listeners and experts with a brief exegetical approach to his musical works.

  16. Annotated Bibliography of MMPI Research Among College Populations: 1962-1970.

    Science.gov (United States)

    Cornish, Richard D.

    The MMPI continues to be the focus of a large quantity of research. This article offers an aid to persons working with college student populations by annotating recent MMPI research relating to college populations. A total of 49 articles (each categorized in terms of content into one of 10 sections or subsections) were annotated. The Validity of…

  17. Screening of Chlamydomonas reinhardtii Populations with Single-Cell Resolution by Using a High-Throughput Microscale Sample Preparation for Matrix-Assisted Laser Desorption Ionization Mass Spectrometry.

    Science.gov (United States)

    Krismer, Jasmin; Sobek, Jens; Steinhoff, Robert F; Fagerer, Stephan R; Pabst, Martin; Zenobi, Renato

    2015-08-15

    The consequences of cellular heterogeneity, such as biocide persistence, can only be tackled by studying each individual in a cell population. Fluorescent tags provide tools for the high-throughput analysis of genomes, RNA transcripts, or proteins on the single-cell level. However, the analysis of lower-molecular-weight compounds that elude tagging is still a great challenge. Here, we describe a novel high-throughput microscale sample preparation technique for single cells that allows a mass spectrum to be obtained for each individual cell within a microbial population. The approach presented includes spotting Chlamydomonas reinhardtii cells, using a noncontact microarrayer, onto a specialized slide and controlled lysis of cells separated on the slide. Throughout the sample preparation, analytes were traced and individual steps optimized using autofluorescence detection of chlorophyll. The lysates of isolated cells are subjected to a direct, label-free analysis using matrix-assisted laser desorption ionization mass spectrometry. Thus, we were able to differentiate individual cells of two Chlamydomonas reinhardtii strains based on single-cell mass spectra. Furthermore, we showed that only population profiles with real single-cell resolution render a nondistorted picture of the phenotypes contained in a population. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  18. Kinetic modeling of light limitation and sulfur deprivation effects in the induction of hydrogen production with Chlamydomonas reinhardtii: Part I. Model development and parameter identification.

    Science.gov (United States)

    Fouchard, Swanny; Pruvost, Jérémy; Degrenne, Benoit; Titica, Mariana; Legrand, Jack

    2009-01-01

    Chlamydomonas reinhardtii is a green microalga capable of turning its metabolism towards H2 production under specific conditions. However this H2 production, narrowly linked to the photosynthetic process, results from complex metabolic reactions highly dependent on the environmental conditions of the cells. A kinetic model has been developed to relate culture evolution from standard photosynthetic growth to H2 producing cells. It represents transition in sulfur-deprived conditions, known to lead to H2 production in Chlamydomonas reinhardtii, and the two main processes then induced which are an over-accumulation of intracellular starch and a progressive reduction of PSII activity for anoxia achievement. Because these phenomena are directly linked to the photosynthetic growth, two kinetic models were associated, the first (one) introducing light dependency (Haldane type model associated to a radiative light transfer model), the second (one) making growth a function of available sulfur amount under extracellular and intracellular forms (Droop formulation). The model parameters identification was realized from experimental data obtained with especially designed experiments and a sensitivity analysis of the model to its parameters was also conducted. Model behavior was finally studied showing interdependency between light transfer conditions, photosynthetic growth, sulfate uptake, photosynthetic activity and O2 release, during transition from oxygenic growth to anoxic H2 production conditions.

  19. ePNK Applications and Annotations

    DEFF Research Database (Denmark)

    Kindler, Ekkart

    2017-01-01

    newapplicationsfor the ePNK and, in particular, visualizing the result of an application in the graphical editor of the ePNK by singannotations, and interacting with the end user using these annotations. In this paper, we give an overview of the concepts of ePNK applications by discussing the implementation...

  20. Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements

    Directory of Open Access Journals (Sweden)

    Danuta Roszko

    2015-06-01

    Full Text Available Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements In the article the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT formed for the idea of Polish-Lithuanian theoretical contrastive studies, a Polish-Lithuanian electronic dictionary, and as help for a sworn translator. The semantic annotation being brought into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies, and also proves helpful in translation work.

  1. Ultraviolet modification of Chlamydomonas reinhardtii for carbon capture

    Directory of Open Access Journals (Sweden)

    Gopal NS

    2016-04-01

    Full Text Available Nikhil S Gopal,1 K Sudhakar2 1The Lawrenceville School, Lawrenceville, NJ, USA; 2Bioenergy Laboratory, Malauna Azad National Institute of Technology, Bhopal, India Purpose: Carbon dioxide (CO2 levels have been rising rapidly. Algae are single-cell organisms with highly efficient CO2 uptake mechanisms. Algae yield two to ten times more biomass versus terrestrial plants and can grow nearly anywhere. Large scale CO2 sequestration is not yet sustainable due to high amounts of nitrogen (N and phosphate (P needed to grow algae in media. Methods: Mutant strains of Chlamydomonas reinhardtii were created using ultraviolet light (2.2–3 K J/m2 and natural selection using media with 20%–80% lower N and P compared to standard Sueoka's high salt medium. Strains were selected based upon growth in media concentrations varying from 20% to 80% less N/P compared to control. Biomass was compared to wild-type control (CC-125 using direct counts, optical density dry weight, and mean doubling time. Results: Mean doubling time was 20 and 25 hours in the low N and N/P strains, respectively (vs 66 hours in control. Using direct counts, growth rates of mutant strains of low N and N/P cultures were not statistically different from control (P=0.37 and 0.70, respectively. Conclusion: Two new strains of algae, as well as wild-type control, were able to grow while using 20%–40% less N and P. Ultraviolet light-based modification of algae is an inexpensive and alternative option to genetic engineering techniques. This technique might make larger scale biosequestration possible. Keywords: biosequestration, ultraviolet, carbon sequestration, carbon capture, algae

  2. A Novel Quality Measure and Correction Procedure for the Annotation of Microbial Translation Initiation Sites.

    Directory of Open Access Journals (Sweden)

    Lex Overmars

    Full Text Available The identification of translation initiation sites (TISs constitutes an important aspect of sequence-based genome analysis. An erroneous TIS annotation can impair the identification of regulatory elements and N-terminal signal peptides, and also may flaw the determination of descent, for any particular gene. We have formulated a reference-free method to score the TIS annotation quality. The method is based on a comparison of the observed and expected distribution of all TISs in a particular genome given prior gene-calling. We have assessed the TIS annotations for all available NCBI RefSeq microbial genomes and found that approximately 87% is of appropriate quality, whereas 13% needs substantial improvement. We have analyzed a number of factors that could affect TIS annotation quality such as GC-content, taxonomy, the fraction of genes with a Shine-Dalgarno sequence and the year of publication. The analysis showed that only the first factor has a clear effect. We have then formulated a straightforward Principle Component Analysis-based TIS identification strategy to self-organize and score potential TISs. The strategy is independent of reference data and a priori calculations. A representative set of 277 genomes was subjected to the analysis and we found a clear increase in TIS annotation quality for the genomes with a low quality score. The PCA-based annotation was also compared with annotation with the current tool of reference, Prodigal. The comparison for the model genome of Escherichia coli K12 showed that both methods supplement each other and that prediction agreement can be used as an indicator of a correct TIS annotation. Importantly, the data suggest that the addition of a PCA-based strategy to a Prodigal prediction can be used to 'flag' TIS annotations for re-evaluation and in addition can be used to evaluate a given annotation in case a Prodigal annotation is lacking.

  3. Using Nonexperts for Annotating Pharmacokinetic Drug-Drug Interaction Mentions in Product Labeling: A Feasibility Study.

    Science.gov (United States)

    Hochheiser, Harry; Ning, Yifan; Hernandez, Andres; Horn, John R; Jacobson, Rebecca; Boyce, Richard D

    2016-04-11

    Because vital details of potential pharmacokinetic drug-drug interactions are often described in free-text structured product labels, manual curation is a necessary but expensive step in the development of electronic drug-drug interaction information resources. The use of nonexperts to annotate potential drug-drug interaction (PDDI) mentions in drug product label annotation may be a means of lessening the burden of manual curation. Our goal was to explore the practicality of using nonexpert participants to annotate drug-drug interaction descriptions from structured product labels. By presenting annotation tasks to both pharmacy experts and relatively naïve participants, we hoped to demonstrate the feasibility of using nonexpert annotators for drug-drug information annotation. We were also interested in exploring whether and to what extent natural language processing (NLP) preannotation helped improve task completion time, accuracy, and subjective satisfaction. Two experts and 4 nonexperts were asked to annotate 208 structured product label sections under 4 conditions completed sequentially: (1) no NLP assistance, (2) preannotation of drug mentions, (3) preannotation of drug mentions and PDDIs, and (4) a repeat of the no-annotation condition. Results were evaluated within the 2 groups and relative to an existing gold standard. Participants were asked to provide reports on the time required to complete tasks and their perceptions of task difficulty. One of the experts and 3 of the nonexperts completed all tasks. Annotation results from the nonexpert group were relatively strong in every scenario and better than the performance of the NLP pipeline. The expert and 2 of the nonexperts were able to complete most tasks in less than 3 hours. Usability perceptions were generally positive (3.67 for expert, mean of 3.33 for nonexperts). The results suggest that nonexpert annotation might be a feasible option for comprehensive labeling of annotated PDDIs across a broader

  4. SeqAnt: A web service to rapidly identify and annotate DNA sequence variations

    Directory of Open Access Journals (Sweden)

    Patel Viren

    2010-09-01

    Full Text Available Abstract Background The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research. Results SeqAnt (Sequence Annotator is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds. Conclusion SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.

  5. A multi-ontology approach to annotate scientific documents based on a modularization technique.

    Science.gov (United States)

    Gomes, Priscilla Corrêa E Castro; Moura, Ana Maria de Carvalho; Cavalcanti, Maria Cláudia

    2015-12-01

    Scientific text annotation has become an important task for biomedical scientists. Nowadays, there is an increasing need for the development of intelligent systems to support new scientific findings. Public databases available on the Web provide useful data, but much more useful information is only accessible in scientific texts. Text annotation may help as it relies on the use of ontologies to maintain annotations based on a uniform vocabulary. However, it is difficult to use an ontology, especially those that cover a large domain. In addition, since scientific texts explore multiple domains, which are covered by distinct ontologies, it becomes even more difficult to deal with such task. Moreover, there are dozens of ontologies in the biomedical area, and they are usually big in terms of the number of concepts. It is in this context that ontology modularization can be useful. This work presents an approach to annotate scientific documents using modules of different ontologies, which are built according to a module extraction technique. The main idea is to analyze a set of single-ontology annotations on a text to find out the user interests. Based on these annotations a set of modules are extracted from a set of distinct ontologies, and are made available for the user, for complementary annotation. The reduced size and focus of the extracted modules tend to facilitate the annotation task. An experiment was conducted to evaluate this approach, with the participation of a bioinformatician specialist of the Laboratory of Peptides and Proteins of the IOC/Fiocruz, who was interested in discovering new drug targets aiming at the combat of tropical diseases. Copyright © 2015 Elsevier Inc. All rights reserved.

  6. An Informally Annotated Bibliography of Sociolinguistics.

    Science.gov (United States)

    Tannen, Deborah

    This annotated bibliography of sociolinguistics is divided into the following sections: speech events, ethnography of speaking and anthropological approaches to analysis of conversation; discourse analysis (including analysis of conversation and narrative), ethnomethodology and nonverbal communication; sociolinguistics; pragmatics (including…

  7. The use of semantic similarity measures for optimally integrating heterogeneous Gene Ontology data from large scale annotation pipelines

    Directory of Open Access Journals (Sweden)

    Gaston K Mazandu

    2014-08-01

    Full Text Available With the advancement of new high throughput sequencing technologies, there has been an increase in the number of genome sequencing projects worldwide, which has yielded complete genome sequences of human, animals and plants. Subsequently, several labs have focused on genome annotation, consisting of assigning functions to gene products, mostly using Gene Ontology (GO terms. As a consequence, there is an increased heterogeneity in annotations across genomes due to different approaches used by different pipelines to infer these annotations and also due to the nature of the GO structure itself. This makes a curator's task difficult, even if they adhere to the established guidelines for assessing these protein annotations. Here we develop a genome-scale approach for integrating GO annotations from different pipelines using semantic similarity measures. We used this approach to identify inconsistencies and similarities in functional annotations between orthologs of human and Drosophila melanogaster, to assess the quality of GO annotations derived from InterPro2GO mappings compared to manually annotated GO annotations for the Drosophila melanogaster proteome from a FlyBase dataset and human, and to filter GO annotation data for these proteomes. Results obtained indicate that an efficient integration of GO annotations eliminates redundancy up to 27.08 and 22.32% in the Drosophila melanogaster and human GO annotation datasets, respectively. Furthermore, we identified lack of and missing annotations for some orthologs, and annotation mismatches between InterPro2GO and manual pipelines in these two proteomes, thus requiring further curation. This simplifies and facilitates tasks of curators in assessing protein annotations, reduces redundancy and eliminates inconsistencies in large annotation datasets for ease of comparative functional genomics.

  8. Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks

    Directory of Open Access Journals (Sweden)

    Mazo Ilya

    2007-07-01

    Full Text Available Abstract Background Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as its annotations, are believed to manifest themselves through topology of the networks of inter-proteins interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having the general biological significance, such demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interactions datasets. Results We developed a method for automatic extraction of protein functional annotation from scientific text based on the Natural Language Processing (NLP technology. For the protein annotation extracted from the entire PubMed, we evaluated the precision and recall rates, and compared the performance of the automatic extraction technology to that of manual curation used in public Gene Ontology (GO annotation. In the second part of our presentation, we reported a large-scale investigation into the correspondence between communities in the literature-based protein networks and GO annotation groups of functionally related proteins. We found a comprehensive two-way match: proteins within biological annotation groups form significantly denser linked network clusters than expected by chance and, conversely, densely linked network communities exhibit a pronounced non-random overlap with GO groups. We also expanded the publicly available GO biological process annotation using the relations extracted by our NLP technology

  9. Recognition of Learner's Personality Traits through Digital Annotations in Distance Learning

    Science.gov (United States)

    Omheni, Nizar; Kalboussi, Anis; Mazhoud, Omar; Kacem, Ahmed Hadj

    2017-01-01

    Researchers in distance education are interested in observing and modelling of learner's personality profile, and adapting their learning experiences accordingly. When learners read and interact with their reading materials, they do unselfconscious activities like annotation which may be a key feature of their personalities. Annotation activity…

  10. Metab2MeSH: annotating compounds with medical subject headings.

    Science.gov (United States)

    Sartor, Maureen A; Ade, Alex; Wright, Zach; States, David; Omenn, Gilbert S; Athey, Brian; Karnovsky, Alla

    2012-05-15

    Progress in high-throughput genomic technologies has led to the development of a variety of resources that link genes to functional information contained in the biomedical literature. However, tools attempting to link small molecules to normal and diseased physiology and published data relevant to biologists and clinical investigators, are still lacking. With metabolomics rapidly emerging as a new omics field, the task of annotating small molecule metabolites becomes highly relevant. Our tool Metab2MeSH uses a statistical approach to reliably and automatically annotate compounds with concepts defined in Medical Subject Headings, and the National Library of Medicine's controlled vocabulary for biomedical concepts. These annotations provide links from compounds to biomedical literature and complement existing resources such as PubChem and the Human Metabolome Database.

  11. Annotating breast cancer microarray samples using ontologies

    Science.gov (United States)

    Liu, Hongfang; Li, Xin; Yoon, Victoria; Clarke, Robert

    2008-01-01

    As the most common cancer among women, breast cancer results from the accumulation of mutations in essential genes. Recent advance in high-throughput gene expression microarray technology has inspired researchers to use the technology to assist breast cancer diagnosis, prognosis, and treatment prediction. However, the high dimensionality of microarray experiments and public access of data from many experiments have caused inconsistencies which initiated the development of controlled terminologies and ontologies for annotating microarray experiments, such as the standard microarray Gene Expression Data (MGED) ontology (MO). In this paper, we developed BCM-CO, an ontology tailored specifically for indexing clinical annotations of breast cancer microarray samples from the NCI Thesaurus. Our research showed that the coverage of NCI Thesaurus is very limited with respect to i) terms used by researchers to describe breast cancer histology (covering 22 out of 48 histology terms); ii) breast cancer cell lines (covering one out of 12 cell lines); and iii) classes corresponding to the breast cancer grading and staging. By incorporating a wider range of those terms into BCM-CO, we were able to indexed breast cancer microarray samples from GEO using BCM-CO and MGED ontology and developed a prototype system with web interface that allows the retrieval of microarray data based on the ontology annotations. PMID:18999108

  12. An omics based assessment of cadmium toxicity in the green alga Chlamydomonas reinhardtii

    Energy Technology Data Exchange (ETDEWEB)

    Jamers, An; Blust, Ronny; De Coen, Wim [Laboratory for Ecophysiology, Biochemistry and Toxicology, Department of Biology, University of Antwerp, Groenenborgerlaan 171, 2020 Antwerp (Belgium); Griffin, Julian L. [Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 2QA (United Kingdom); Jones, Oliver A.H., E-mail: oliver.jones@rmit.edu.au [School of Applied Sciences, RMIT University, GPO Box 2476, Melbourne, VIC 3001 (Australia)

    2013-01-15

    The effects of cadmium were assessed in the freshwater alga Chlamydomonas reinhardtii. Algae were exposed to concentrations of 0, 8.1 or 114.8 {mu}M of cadmium and growth rates, gene transcription and metabolite profiles were examined after 48 and 72 h of exposure. In algae exposed to 8.1 {mu}M Cd, several genes were differentially transcribed after 48 h but no adverse growth related effects were detected. A transient effect on both gene transcription patterns and metabolite profiles could be discerned after 48 h of exposure but the majority of these changes disappeared after 72 h. In contrast, all effects were more pronounced at the 114.8 {mu}M cadmium exposure. Here growth was clearly reduced and transcription of a large number of genes involved in oxidative stress defense mechanisms was differentially increased. Metabolites involved in the glutathione synthesis pathway (an important antioxidant defense) were also affected but the effects of cadmium were found to be more pronounced at the transcript level than in the metabolome, suggesting that the former exhibits greater sensitivity toward cadmium exposure.

  13. The two parallel photocycles of the Chlamydomonas sensory photoreceptor histidine kinase rhodopsin 1.

    Science.gov (United States)

    Luck, Meike; Hegemann, Peter

    2017-10-01

    Histidine kinase rhodopsins (HKRs) belong to a class of unexplored sensory photoreceptors that share a similar modular architecture. The light sensing rhodopsin domain is covalently linked to signal-transducing modules and in some cases to a C-terminal guanylyl-cyclase effector. In spite of their wide distribution in unicellular organisms, very little is known about their physiological role and mechanistic functioning. We investigated the photochemical properties of the recombinant rhodopsin-fragment of Cr-HKR1 originating from Chlamydomonas reinhardtii. Our spectroscopic studies revealed an unusual thermal stability of the photoproducts with the deprotonated retinal Schiff base (RSB). Upon UV-irradiation these Rh-UV states with maximal absorbance in the UVA-region (Rh-UV) photochemically convert to stable blue light absorbing rhodopsin (Rh-Bl) with protonated chromophore. The heterogeneity of the sample is based on two parallel photocycles with the chromophore in C 15 =N-syn- or -anti-configuration. This report represents an attempt to decipher the underlying reaction schemes and interconversions of the two coexisting photocycles. Copyright © 2017 Elsevier GmbH. All rights reserved.

  14. The effect of caffeine on repair in chlamydomonas reinhardtii. Pt. 1

    International Nuclear Information System (INIS)

    Rosen, H.; Rehn, M.M.; Johnson, B.A.

    1980-01-01

    The effect of caffeine on repair was studied in the green alga Chlamydomonas reinhardtii. Treatment of UV-irradiated wild-type (UVS + ) cells with a sublethal level of caffeine caused a significant increase in survival compared to untreated UV-irradiated cells. Caffeine did not affect survival in the repair-deficient strain UVSE1, which is deficient in repair of UV-induced damage carried out by enzymes associated with recombination during meiosis. A significant increase in survival in the presence of caffeine was observed in the repair-deficient strain UVSE4 in which recombination during meiosis is not affected. Treatment of zygotes homozygous for UVS + , UVSE1, or UVSE4 with sublethal levels of caffeine caused marked increases in recombination frequency in UVS + and UVSE4 zygotes and no increase in recombination in UVSE1 zygotes. These results indicate that caffeine increases recombination in normal strains. Increased opportunity for recombination caused by caffeine would not result in increased recombination frequency in the UVSE1 strain, assuming limited-recombination enzyme activity in this strain. The observed increase in survival following UV-irradiation in the presence of caffeine in strains having normal recombination would therefore be associated with a caffeine-induced increase in opportunities for recombination repair. (orig.)

  15. Combining gene prediction methods to improve metagenomic gene annotation

    Directory of Open Access Journals (Sweden)

    Rosen Gail L

    2011-01-01

    Full Text Available Abstract Background Traditional gene annotation methods rely on characteristics that may not be available in short reads generated from next generation technology, resulting in suboptimal performance for metagenomic (environmental samples. Therefore, in recent years, new programs have been developed that optimize performance on short reads. In this work, we benchmark three metagenomic gene prediction programs and combine their predictions to improve metagenomic read gene annotation. Results We not only analyze the programs' performance at different read-lengths like similar studies, but also separate different types of reads, including intra- and intergenic regions, for analysis. The main deficiencies are in the algorithms' ability to predict non-coding regions and gene edges, resulting in more false-positives and false-negatives than desired. In fact, the specificities of the algorithms are notably worse than the sensitivities. By combining the programs' predictions, we show significant improvement in specificity at minimal cost to sensitivity, resulting in 4% improvement in accuracy for 100 bp reads with ~1% improvement in accuracy for 200 bp reads and above. To correctly annotate the start and stop of the genes, we find that a consensus of all the predictors performs best for shorter read lengths while a unanimous agreement is better for longer read lengths, boosting annotation accuracy by 1-8%. We also demonstrate use of the classifier combinations on a real dataset. Conclusions To optimize the performance for both prediction and annotation accuracies, we conclude that the consensus of all methods (or a majority vote is the best for reads 400 bp and shorter, while using the intersection of GeneMark and Orphelia predictions is the best for reads 500 bp and longer. We demonstrate that most methods predict over 80% coding (including partially coding reads on a real human gut sample sequenced by Illumina technology.

  16. NegGOA: negative GO annotations selection using ontology structure.

    Science.gov (United States)

    Fu, Guangyuan; Wang, Jun; Yang, Bo; Yu, Guoxian

    2016-10-01

    Predicting the biological functions of proteins is one of the key challenges in the post-genomic era. Computational models have demonstrated the utility of applying machine learning methods to predict protein function. Most prediction methods explicitly require a set of negative examples-proteins that are known not carrying out a particular function. However, Gene Ontology (GO) almost always only provides the knowledge that proteins carry out a particular function, and functional annotations of proteins are incomplete. GO structurally organizes more than tens of thousands GO terms and a protein is annotated with several (or dozens) of these terms. For these reasons, the negative examples of a protein can greatly help distinguishing true positive examples of the protein from such a large candidate GO space. In this paper, we present a novel approach (called NegGOA) to select negative examples. Specifically, NegGOA takes advantage of the ontology structure, available annotations and potentiality of additional annotations of a protein to choose negative examples of the protein. We compare NegGOA with other negative examples selection algorithms and find that NegGOA produces much fewer false negatives than them. We incorporate the selected negative examples into an efficient function prediction model to predict the functions of proteins in Yeast, Human, Mouse and Fly. NegGOA also demonstrates improved accuracy than these comparing algorithms across various evaluation metrics. In addition, NegGOA is less suffered from incomplete annotations of proteins than these comparing methods. The Matlab and R codes are available at https://sites.google.com/site/guoxian85/neggoa gxyu@swu.edu.cn Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. Guidelines for visualizing and annotating rule-based models†

    Science.gov (United States)

    Chylek, Lily A.; Hu, Bin; Blinov, Michael L.; Emonet, Thierry; Faeder, James R.; Goldstein, Byron; Gutenkunst, Ryan N.; Haugh, Jason M.; Lipniacki, Tomasz; Posner, Richard G.; Yang, Jin; Hlavacek, William S.

    2011-01-01

    Rule-based modeling provides a means to represent cell signaling systems in a way that captures site-specific details of molecular interactions. For rule-based models to be more widely understood and (re)used, conventions for model visualization and annotation are needed. We have developed the concepts of an extended contact map and a model guide for illustrating and annotating rule-based models. An extended contact map represents the scope of a model by providing an illustration of each molecule, molecular component, direct physical interaction, post-translational modification, and enzyme-substrate relationship considered in a model. A map can also illustrate allosteric effects, structural relationships among molecular components, and compartmental locations of molecules. A model guide associates elements of a contact map with annotation and elements of an underlying model, which may be fully or partially specified. A guide can also serve to document the biological knowledge upon which a model is based. We provide examples of a map and guide for a published rule-based model that characterizes early events in IgE receptor (FcεRI) signaling. We also provide examples of how to visualize a variety of processes that are common in cell signaling systems but not considered in the example model, such as ubiquitination. An extended contact map and an associated guide can document knowledge of a cell signaling system in a form that is visual as well as executable. As a tool for model annotation, a map and guide can communicate the content of a model clearly and with precision, even for large models. PMID:21647530

  18. Guidelines for visualizing and annotating rule-based models.

    Science.gov (United States)

    Chylek, Lily A; Hu, Bin; Blinov, Michael L; Emonet, Thierry; Faeder, James R; Goldstein, Byron; Gutenkunst, Ryan N; Haugh, Jason M; Lipniacki, Tomasz; Posner, Richard G; Yang, Jin; Hlavacek, William S

    2011-10-01

    Rule-based modeling provides a means to represent cell signaling systems in a way that captures site-specific details of molecular interactions. For rule-based models to be more widely understood and (re)used, conventions for model visualization and annotation are needed. We have developed the concepts of an extended contact map and a model guide for illustrating and annotating rule-based models. An extended contact map represents the scope of a model by providing an illustration of each molecule, molecular component, direct physical interaction, post-translational modification, and enzyme-substrate relationship considered in a model. A map can also illustrate allosteric effects, structural relationships among molecular components, and compartmental locations of molecules. A model guide associates elements of a contact map with annotation and elements of an underlying model, which may be fully or partially specified. A guide can also serve to document the biological knowledge upon which a model is based. We provide examples of a map and guide for a published rule-based model that characterizes early events in IgE receptor (FcεRI) signaling. We also provide examples of how to visualize a variety of processes that are common in cell signaling systems but not considered in the example model, such as ubiquitination. An extended contact map and an associated guide can document knowledge of a cell signaling system in a form that is visual as well as executable. As a tool for model annotation, a map and guide can communicate the content of a model clearly and with precision, even for large models.

  19. Efficient expression of green fluorescent protein (GFP) mediated by a chimeric promoter in Chlamydomonas reinhardtii

    Science.gov (United States)

    Wu, Jinxia; Hu, Zhangli; Wang, Chaogang; Li, Shuangfei; Lei, Anping

    2008-08-01

    To improve the expression efficiency of exogenous genes in Chlamydomonas reinhardtii, a high efficient expression vector was constructed. Green fluorescent protein (GFP) was expressed in C. reinhardtii under the control of promoters: RBCS2 and HSP70A-RBCS2. Efficiency of transformation and expression were compared between two transgenic algae: RBCS2 mediated strain Tran-I and HSP70A-RBCS2 mediated strain Tran-II. Results show that HSP70A-RBCS2 could improve greatly the transformation efficiency by approximately eightfold of RBCS2, and the expression efficiency of GFP in Tran-II was at least double of that in Tran-I. In addition, a threefold increase of GFP in Tran-II was induced by heat shock at 40°C. All of the results demonstrated that HSP70A-RBCS2 was more efficient than RBCS2 in expressing exogenous gene in C. reinhardtii.

  20. Learning pathology using collaborative vs. individual annotation of whole slide images: a mixed methods trial.

    Science.gov (United States)

    Sahota, Michael; Leung, Betty; Dowdell, Stephanie; Velan, Gary M

    2016-12-12

    Students in biomedical disciplines require understanding of normal and abnormal microscopic appearances of human tissues (histology and histopathology). For this purpose, practical classes in these disciplines typically use virtual microscopy, viewing digitised whole slide images in web browsers. To enhance engagement, tools have been developed to enable individual or collaborative annotation of whole slide images within web browsers. To date, there have been no studies that have critically compared the impact on learning of individual and collaborative annotations on whole slide images. Junior and senior students engaged in Pathology practical classes within Medical Science and Medicine programs participated in cross-over trials of individual and collaborative annotation activities. Students' understanding of microscopic morphology was compared using timed online quizzes, while students' perceptions of learning were evaluated using an online questionnaire. For senior medical students, collaborative annotation of whole slide images was superior for understanding key microscopic features when compared to individual annotation; whilst being at least equivalent to individual annotation for junior medical science students. Across cohorts, students agreed that the annotation activities provided a user-friendly learning environment that met their flexible learning needs, improved efficiency, provided useful feedback, and helped them to set learning priorities. Importantly, these activities were also perceived to enhance motivation and improve understanding. Collaborative annotation improves understanding of microscopic morphology for students with sufficient background understanding of the discipline. These findings have implications for the deployment of annotation activities in biomedical curricula, and potentially for postgraduate training in Anatomical Pathology.

  1. Evaluation of Three Automated Genome Annotations for Halorhabdus utahensis

    DEFF Research Database (Denmark)

    Bakke, Peter; Carney, Nick; DeLoache, Will

    2009-01-01

    in databases such as NCBI and used to validate subsequent annotation errors. We submitted the genome sequence of halophilic archaeon Halorhabdus utahensis to be analyzed by three genome annotation services. We have examined the output from each service in a variety of ways in order to compare the methodology...

  2. Feeling Expression Using Avatars and Its Consistency for Subjective Annotation

    Science.gov (United States)

    Ito, Fuyuko; Sasaki, Yasunari; Hiroyasu, Tomoyuki; Miki, Mitsunori

    Consumer Generated Media(CGM) is growing rapidly and the amount of content is increasing. However, it is often difficult for users to extract important contents and the existence of contents recording their experiences can easily be forgotten. As there are no methods or systems to indicate the subjective value of the contents or ways to reuse them, subjective annotation appending subjectivity, such as feelings and intentions, to contents is needed. Representation of subjectivity depends on not only verbal expression, but also nonverbal expression. Linguistically expressed annotation, typified by collaborative tagging in social bookmarking systems, has come into widespread use, but there is no system of nonverbally expressed annotation on the web. We propose the utilization of controllable avatars as a means of nonverbal expression of subjectivity, and confirmed the consistency of feelings elicited by avatars over time for an individual and in a group. In addition, we compared the expressiveness and ease of subjective annotation between collaborative tagging and controllable avatars. The result indicates that the feelings evoked by avatars are consistent in both cases, and using controllable avatars is easier than collaborative tagging for representing feelings elicited by contents that do not express meaning, such as photos.

  3. Chemical annotation of small and peptide-like molecules at the Protein Data Bank

    Science.gov (United States)

    Young, Jasmine Y.; Feng, Zukang; Dimitropoulos, Dimitris; Sala, Raul; Westbrook, John; Zhuravleva, Marina; Shao, Chenghua; Quesada, Martha; Peisach, Ezra; Berman, Helen M.

    2013-01-01

    Over the past decade, the number of polymers and their complexes with small molecules in the Protein Data Bank archive (PDB) has continued to increase significantly. To support scientific advancements and ensure the best quality and completeness of the data files over the next 10 years and beyond, the Worldwide PDB partnership that manages the PDB archive is developing a new deposition and annotation system. This system focuses on efficient data capture across all supported experimental methods. The new deposition and annotation system is composed of four major modules that together support all of the processing requirements for a PDB entry. In this article, we describe one such module called the Chemical Component Annotation Tool. This tool uses information from both the Chemical Component Dictionary and Biologically Interesting molecule Reference Dictionary to aid in annotation. Benchmark studies have shown that the Chemical Component Annotation Tool provides significant improvements in processing efficiency and data quality. Database URL: http://wwpdb.org PMID:24291661

  4. ANNOTATION SUPPORTED OCCLUDED OBJECT TRACKING

    Directory of Open Access Journals (Sweden)

    Devinder Kumar

    2012-08-01

    Full Text Available Tracking occluded objects at different depths has become as extremely important component of study for any video sequence having wide applications in object tracking, scene recognition, coding, editing the videos and mosaicking. The paper studies the ability of annotation to track the occluded object based on pyramids with variation in depth further establishing a threshold at which the ability of the system to track the occluded object fails. Image annotation is applied on 3 similar video sequences varying in depth. In the experiment, one bike occludes the other at a depth of 60cm, 80cm and 100cm respectively. Another experiment is performed on tracking humans with similar depth to authenticate the results. The paper also computes the frame by frame error incurred by the system, supported by detailed simulations. This system can be effectively used to analyze the error in motion tracking and further correcting the error leading to flawless tracking. This can be of great interest to computer scientists while designing surveillance systems etc.

  5. Gene coexpression network analysis as a source of functional annotation for rice genes.

    Directory of Open Access Journals (Sweden)

    Kevin L Childs

    Full Text Available With the existence of large publicly available plant gene expression data sets, many groups have undertaken data analyses to construct gene coexpression networks and functionally annotate genes. Often, a large compendium of unrelated or condition-independent expression data is used to construct gene networks. Condition-dependent expression experiments consisting of well-defined conditions/treatments have also been used to create coexpression networks to help examine particular biological processes. Gene networks derived from either condition-dependent or condition-independent data can be difficult to interpret if a large number of genes and connections are present. However, algorithms exist to identify modules of highly connected and biologically relevant genes within coexpression networks. In this study, we have used publicly available rice (Oryza sativa gene expression data to create gene coexpression networks using both condition-dependent and condition-independent data and have identified gene modules within these networks using the Weighted Gene Coexpression Network Analysis method. We compared the number of genes assigned to modules and the biological interpretability of gene coexpression modules to assess the utility of condition-dependent and condition-independent gene coexpression networks. For the purpose of providing functional annotation to rice genes, we found that gene modules identified by coexpression analysis of condition-dependent gene expression experiments to be more useful than gene modules identified by analysis of a condition-independent data set. We have incorporated our results into the MSU Rice Genome Annotation Project database as additional expression-based annotation for 13,537 genes, 2,980 of which lack a functional annotation description. These results provide two new types of functional annotation for our database. Genes in modules are now associated with groups of genes that constitute a collective functional

  6. Laughter annotations in conversational speech corpora - possibilities and limitations for phonetic analysis

    NARCIS (Netherlands)

    Truong, Khiet Phuong; Trouvain, Jürgen

    Existing laughter annotations provided with several publicly available conversational speech corpora (both multiparty and dyadic conversations) were investigated and compared. We discuss the possibilities and limitations of these rather coarse and shallow laughter annotations. There are definition

  7. BIOCAT: a pattern recognition platform for customizable biological image classification and annotation.

    Science.gov (United States)

    Zhou, Jie; Lamichhane, Santosh; Sterne, Gabriella; Ye, Bing; Peng, Hanchuan

    2013-10-04

    Pattern recognition algorithms are useful in bioimage informatics applications such as quantifying cellular and subcellular objects, annotating gene expressions, and classifying phenotypes. To provide effective and efficient image classification and annotation for the ever-increasing microscopic images, it is desirable to have tools that can combine and compare various algorithms, and build customizable solution for different biological problems. However, current tools often offer a limited solution in generating user-friendly and extensible tools for annotating higher dimensional images that correspond to multiple complicated categories. We develop the BIOimage Classification and Annotation Tool (BIOCAT). It is able to apply pattern recognition algorithms to two- and three-dimensional biological image sets as well as regions of interest (ROIs) in individual images for automatic classification and annotation. We also propose a 3D anisotropic wavelet feature extractor for extracting textural features from 3D images with xy-z resolution disparity. The extractor is one of the about 20 built-in algorithms of feature extractors, selectors and classifiers in BIOCAT. The algorithms are modularized so that they can be "chained" in a customizable way to form adaptive solution for various problems, and the plugin-based extensibility gives the tool an open architecture to incorporate future algorithms. We have applied BIOCAT to classification and annotation of images and ROIs of different properties with applications in cell biology and neuroscience. BIOCAT provides a user-friendly, portable platform for pattern recognition based biological image classification of two- and three- dimensional images and ROIs. We show, via diverse case studies, that different algorithms and their combinations have different suitability for various problems. The customizability of BIOCAT is thus expected to be useful for providing effective and efficient solutions for a variety of biological

  8. PANDA: pathway and annotation explorer for visualizing and interpreting gene-centric data.

    Science.gov (United States)

    Hart, Steven N; Moore, Raymond M; Zimmermann, Michael T; Oliver, Gavin R; Egan, Jan B; Bryce, Alan H; Kocher, Jean-Pierre A

    2015-01-01

    Objective. Bringing together genomics, transcriptomics, proteomics, and other -omics technologies is an important step towards developing highly personalized medicine. However, instrumentation has advances far beyond expectations and now we are able to generate data faster than it can be interpreted. Materials and Methods. We have developed PANDA (Pathway AND Annotation) Explorer, a visualization tool that integrates gene-level annotation in the context of biological pathways to help interpret complex data from disparate sources. PANDA is a web-based application that displays data in the context of well-studied pathways like KEGG, BioCarta, and PharmGKB. PANDA represents data/annotations as icons in the graph while maintaining the other data elements (i.e., other columns for the table of annotations). Custom pathways from underrepresented diseases can be imported when existing data sources are inadequate. PANDA also allows sharing annotations among collaborators. Results. In our first use case, we show how easy it is to view supplemental data from a manuscript in the context of a user's own data. Another use-case is provided describing how PANDA was leveraged to design a treatment strategy from the somatic variants found in the tumor of a patient with metastatic sarcomatoid renal cell carcinoma. Conclusion. PANDA facilitates the interpretation of gene-centric annotations by visually integrating this information with context of biological pathways. The application can be downloaded or used directly from our website: http://bioinformaticstools.mayo.edu/research/panda-viewer/.

  9. Application of whole slide image markup and annotation for pathologist knowledge capture.

    Science.gov (United States)

    Campbell, Walter S; Foster, Kirk W; Hinrichs, Steven H

    2013-01-01

    The ability to transfer image markup and annotation data from one scanned image of a slide to a newly acquired image of the same slide within a single vendor platform was investigated. The goal was to study the ability to use image markup and annotation data files as a mechanism to capture and retain pathologist knowledge without retaining the entire whole slide image (WSI) file. Accepted mathematical principles were investigated as a method to overcome variations in scans of the same glass slide and to accurately associate image markup and annotation data across different WSI of the same glass slide. Trilateration was used to link fixed points within the image and slide to the placement of markups and annotations of the image in a metadata file. Variation in markup and annotation placement between WSI of the same glass slide was reduced from over 80 μ to less than 4 μ in the x-axis and from 17 μ to 6 μ in the y-axis (P < 0.025). This methodology allows for the creation of a highly reproducible image library of histopathology images and interpretations for educational and research use.

  10. Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data

    Directory of Open Access Journals (Sweden)

    Tu Kang

    2007-06-01

    Full Text Available Abstract Background The wide use of Affymetrix microarray in broadened fields of biological research has made the probeset annotation an important issue. Standard Affymetrix probeset annotation is at gene level, i.e. a probeset is precisely linked to a gene, and probeset intensity is interpreted as gene expression. The increased knowledge that one gene may have multiple transcript variants clearly brings up the necessity of updating this gene-level annotation to a refined transcript-level. Results Through performing rigorous alignments of the Affymetrix probe sequences against a comprehensive pool of currently available transcript sequences, and further linking the probesets to the International Protein Index, we generated transcript-level or protein-level annotation tables for two popular Affymetrix expression arrays, Mouse Genome 430A 2.0 Array and Human Genome U133A Array. Application of our new annotations in re-examining existing expression data sets shows increased expression consistency among synonymous probesets and strengthened expression correlation between interacting proteins. Conclusion By refining the standard Affymetrix annotation of microarray probesets from the gene level to the transcript level and protein level, one can achieve a more reliable interpretation of their experimental data, which may lead to discovery of more profound regulatory mechanism.

  11. Shewanella oneidensis: a new and efficient System for Expression and Maturation of heterologous [Fe-Fe] Hydrogenase from Chlamydomonas reinhardtii

    Directory of Open Access Journals (Sweden)

    Sybirna Kateryna

    2008-09-01

    Full Text Available Abstract Background The eukaryotic green alga, Chlamydomonas reinhardtii, produces H2 under anaerobic conditions, in a reaction catalysed by a [Fe-Fe] hydrogenase HydA1. For further biochemical and biophysical studies a suitable expression system of this enzyme should be found to overcome its weak expression in the host organism. Two heterologous expression systems used up to now have several advantages. However they are not free from some drawbacks. In this work we use bacterium Shewanella oneidensis as a new and efficient system for expression and maturation of HydA1 from Chlamydomonas reinhardtii. Results Based on codon usage bias and hydrogenase maturation ability, the bacterium S. oneidensis, which possesses putative [Fe-Fe] and [Ni-Fe] hydrogenase operons, was selected as the best potential host for C. reinhardtii [Fe-Fe] hydrogenase expression. Hydrogen formation by S. oneidensis strain AS52 (ΔhydAΔhyaB transformed with a plasmid bearing CrHydA1 and grown in the presence of six different substrates for anaerobic respiration was determined. A significant increase in hydrogen evolution was observed for cells grown in the presence of trimethylamine oxide, dimethylsulfoxide and disodium thiosulfate, showing that the system of S. oneidensis is efficient for heterologous expression of algal [Fe-Fe] hydrogenase. Conclusion In the present work a new efficient system for heterologous expression and maturation of C. reinhardtii hydrogenase has been developed. HydA1 of C. reinhardtii was purified and shown to contain 6 Fe atoms/molecule of protein, as expected. Using DMSO, TMAO or thiosulfate as substrates for anaerobic respiration during the cell growth, 0.4 – 0.5 mg l-1(OD600 = 1 of catalytically active HydA1 was obtained with hydrogen evolution rate of ~700 μmol H2 mg-1 min-1.

  12. Rubisco mutants of Chlamydomonas reinhardtii enhance photosynthetic hydrogen production.

    Science.gov (United States)

    Pinto, T S; Malcata, F X; Arrabaça, J D; Silva, J M; Spreitzer, R J; Esquível, M G

    2013-06-01

    Molecular hydrogen (H2) is an ideal fuel characterized by high enthalpy change and lack of greenhouse effects. This biofuel can be released by microalgae via reduction of protons to molecular hydrogen catalyzed by hydrogenases. The main competitor for the reducing power required by the hydrogenases is the Calvin cycle, and rubisco plays a key role therein. Engineered Chlamydomonas with reduced rubisco levels, activity and stability was used as the basis of this research effort aimed at increasing hydrogen production. Biochemical monitoring in such metabolically engineered mutant cells proceeded in Tris/acetate/phosphate culture medium with S-depletion or repletion, both under hypoxia. Photosynthetic activity, maximum photochemical efficiency, chlorophyll and protein levels were all measured. In addition, expression of rubisco, hydrogenase, D1 and Lhcb were investigated, and H2 was quantified. At the beginning of the experiments, rubisco increased followed by intense degradation. Lhcb proteins exhibited monomeric isoforms during the first 24 to 48 h, and D1 displayed sensitivity under S-depletion. Rubisco mutants exhibited a significant decrease in O2 evolution compared with the control. Although the S-depleted medium was much more suitable than its complete counterpart for H2 production, hydrogen release was observed also in sealed S-repleted cultures of rubisco mutated cells under low-moderate light conditions. In particular, the rubisco mutant Y67A accounted for 10-15-fold higher hydrogen production than the wild type under the same conditions and also displayed divergent metabolic parameters. These results indicate that rubisco is a promising target for improving hydrogen production rates in engineered microalgae.

  13. Sensor Control And Film Annotation For Long Range, Standoff Reconnaissance

    Science.gov (United States)

    Schmidt, Thomas G.; Peters, Owen L.; Post, Lawrence H.

    1984-12-01

    This paper describes a Reconnaissance Data Annotation System that incorporates off-the-shelf technology and system designs providing a high degree of adaptability and interoperability to satisfy future reconnaissance data requirements. The history of data annotation for reconnaissance is reviewed in order to provide the base from which future developments can be assessed and technical risks minimized. The system described will accommodate new developments in recording head assemblies and the incorporation of advanced cameras of both the film and electro-optical type. Use of microprocessor control and digital bus inter-face form the central design philosophy. For long range, high altitude, standoff missions, the Data Annotation System computes the projected latitude and longitude of central target position from aircraft position and attitude. This complements the use of longer ranges and high altitudes for reconnaissance missions.

  14. Integrating UIMA annotators in a web-based text processing framework.

    Science.gov (United States)

    Chen, Xiang; Arnold, Corey W

    2013-01-01

    The Unstructured Information Management Architecture (UIMA) [1] framework is a growing platform for natural language processing (NLP) applications. However, such applications may be difficult for non-technical users deploy. This project presents a web-based framework that wraps UIMA-based annotator systems into a graphical user interface for researchers and clinicians, and a web service for developers. An annotator that extracts data elements from lung cancer radiology reports is presented to illustrate the use of the system. Annotation results from the web system can be exported to multiple formats for users to utilize in other aspects of their research and workflow. This project demonstrates the benefits of a lay-user interface for complex NLP applications. Efforts such as this can lead to increased interest and support for NLP work in the clinical domain.

  15. Processing sequence annotation data using the Lua programming language.

    Science.gov (United States)

    Ueno, Yutaka; Arita, Masanori; Kumagai, Toshitaka; Asai, Kiyoshi

    2003-01-01

    The data processing language in a graphical software tool that manages sequence annotation data from genome databases should provide flexible functions for the tasks in molecular biology research. Among currently available languages we adopted the Lua programming language. It fulfills our requirements to perform computational tasks for sequence map layouts, i.e. the handling of data containers, symbolic reference to data, and a simple programming syntax. Upon importing a foreign file, the original data are first decomposed in the Lua language while maintaining the original data schema. The converted data are parsed by the Lua interpreter and the contents are stored in our data warehouse. Then, portions of annotations are selected and arranged into our catalog format to be depicted on the sequence map. Our sequence visualization program was successfully implemented, embedding the Lua language for processing of annotation data and layout script. The program is available at http://staff.aist.go.jp/yutaka.ueno/guppy/.

  16. A Linked Data-Based Collaborative Annotation System for Increasing Learning Achievements

    Science.gov (United States)

    Zarzour, Hafed; Sellami, Mokhtar

    2017-01-01

    With the emergence of the Web 2.0, collaborative annotation practices have become more mature in the field of learning. In this context, several recent studies have shown the powerful effects of the integration of annotation mechanism in learning process. However, most of these studies provide poor support for semantically structured resources,…

  17. Assessment of disease named entity recognition on a corpus of annotated sentences.

    Science.gov (United States)

    Jimeno, Antonio; Jimenez-Ruiz, Ernesto; Lee, Vivian; Gaudan, Sylvain; Berlanga, Rafael; Rebholz-Schuhmann, Dietrich

    2008-04-11

    In recent years, the recognition of semantic types from the biomedical scientific literature has been focused on named entities like protein and gene names (PGNs) and gene ontology terms (GO terms). Other semantic types like diseases have not received the same level of attention. Different solutions have been proposed to identify disease named entities in the scientific literature. While matching the terminology with language patterns suffers from low recall (e.g., Whatizit) other solutions make use of morpho-syntactic features to better cover the full scope of terminological variability (e.g., MetaMap). Currently, MetaMap that is provided from the National Library of Medicine (NLM) is the state of the art solution for the annotation of concepts from UMLS (Unified Medical Language System) in the literature. Nonetheless, its performance has not yet been assessed on an annotated corpus. In addition, little effort has been invested so far to generate an annotated dataset that links disease entities in text to disease entries in a database, thesaurus or ontology and that could serve as a gold standard to benchmark text mining solutions. As part of our research work, we have taken a corpus that has been delivered in the past for the identification of associations of genes to diseases based on the UMLS Metathesaurus and we have reprocessed and re-annotated the corpus. We have gathered annotations for disease entities from two curators, analyzed their disagreement (0.51 in the kappa-statistic) and composed a single annotated corpus for public use. Thereafter, three solutions for disease named entity recognition including MetaMap have been applied to the corpus to automatically annotate it with UMLS Metathesaurus concepts. The resulting annotations have been benchmarked to compare their performance. The annotated corpus is publicly available at ftp://ftp.ebi.ac.uk/pub/software/textmining/corpora/diseases and can serve as a benchmark to other systems. In addition, we found

  18. AIGO: Towards a unified framework for the Analysis and the Inter-comparison of GO functional annotations

    Directory of Open Access Journals (Sweden)

    Defoin-Platel Michael

    2011-11-01

    Full Text Available Abstract Background In response to the rapid growth of available genome sequences, efforts have been made to develop automatic inference methods to functionally characterize them. Pipelines that infer functional annotation are now routinely used to produce new annotations at a genome scale and for a broad variety of species. These pipelines differ widely in their inference algorithms, confidence thresholds and data sources for reasoning. This heterogeneity makes a comparison of the relative merits of each approach extremely complex. The evaluation of the quality of the resultant annotations is also challenging given there is often no existing gold-standard against which to evaluate precision and recall. Results In this paper, we present a pragmatic approach to the study of functional annotations. An ensemble of 12 metrics, describing various aspects of functional annotations, is defined and implemented in a unified framework, which facilitates their systematic analysis and inter-comparison. The use of this framework is demonstrated on three illustrative examples: analysing the outputs of state-of-the-art inference pipelines, comparing electronic versus manual annotation methods, and monitoring the evolution of publicly available functional annotations. The framework is part of the AIGO library (http://code.google.com/p/aigo for the Analysis and the Inter-comparison of the products of Gene Ontology (GO annotation pipelines. The AIGO library also provides functionalities to easily load, analyse, manipulate and compare functional annotations and also to plot and export the results of the analysis in various formats. Conclusions This work is a step toward developing a unified framework for the systematic study of GO functional annotations. This framework has been designed so that new metrics on GO functional annotations can be added in a very straightforward way.

  19. Saint: a lightweight integration environment for model annotation.

    Science.gov (United States)

    Lister, Allyson L; Pocock, Matthew; Taschuk, Morgan; Wipat, Anil

    2009-11-15

    Saint is a web application which provides a lightweight annotation integration environment for quantitative biological models. The system enables modellers to rapidly mark up models with biological information derived from a range of data sources. Saint is freely available for use on the web at http://www.cisban.ac.uk/saint. The web application is implemented in Google Web Toolkit and Tomcat, with all major browsers supported. The Java source code is freely available for download at http://saint-annotate.sourceforge.net. The Saint web server requires an installation of libSBML and has been tested on Linux (32-bit Ubuntu 8.10 and 9.04).

  20. Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.

    Science.gov (United States)

    Oellrich, Anika; Collier, Nigel; Smedley, Damian; Groza, Tudor

    2015-01-01

    Electronic health records and scientific articles possess differing linguistic characteristics that may impact the performance of natural language processing tools developed for one or the other. In this paper, we investigate the performance of four extant concept recognition tools: the clinical Text Analysis and Knowledge Extraction System (cTAKES), the National Center for Biomedical Ontology (NCBO) Annotator, the Biomedical Concept Annotation System (BeCAS) and MetaMap. Each of the four concept recognition systems is applied to four different corpora: the i2b2 corpus of clinical documents, a PubMed corpus of Medline abstracts, a clinical trails corpus and the ShARe/CLEF corpus. In addition, we assess the individual system performances with respect to one gold standard annotation set, available for the ShARe/CLEF corpus. Furthermore, we built a silver standard annotation set from the individual systems' output and assess the quality as well as the contribution of individual systems to the quality of the silver standard. Our results demonstrate that mainly the NCBO annotator and cTAKES contribute to the silver standard corpora (F1-measures in the range of 21% to 74%) and their quality (best F1-measure of 33%), independent from the type of text investigated. While BeCAS and MetaMap can contribute to the precision of silver standard annotations (precision of up to 42%), the F1-measure drops when combined with NCBO Annotator and cTAKES due to a low recall. In conclusion, the performances of individual systems need to be improved independently from the text types, and the leveraging strategies to best take advantage of individual systems' annotations need to be revised. The textual content of the PubMed corpus, accession numbers for the clinical trials corpus, and assigned annotations of the four concept recognition systems as well as the generated silver standard annotation sets are available from http://purl.org/phenotype/resources. The textual content of the Sh

  1. Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.

    Directory of Open Access Journals (Sweden)

    Anika Oellrich

    Full Text Available Electronic health records and scientific articles possess differing linguistic characteristics that may impact the performance of natural language processing tools developed for one or the other. In this paper, we investigate the performance of four extant concept recognition tools: the clinical Text Analysis and Knowledge Extraction System (cTAKES, the National Center for Biomedical Ontology (NCBO Annotator, the Biomedical Concept Annotation System (BeCAS and MetaMap. Each of the four concept recognition systems is applied to four different corpora: the i2b2 corpus of clinical documents, a PubMed corpus of Medline abstracts, a clinical trails corpus and the ShARe/CLEF corpus. In addition, we assess the individual system performances with respect to one gold standard annotation set, available for the ShARe/CLEF corpus. Furthermore, we built a silver standard annotation set from the individual systems' output and assess the quality as well as the contribution of individual systems to the quality of the silver standard. Our results demonstrate that mainly the NCBO annotator and cTAKES contribute to the silver standard corpora (F1-measures in the range of 21% to 74% and their quality (best F1-measure of 33%, independent from the type of text investigated. While BeCAS and MetaMap can contribute to the precision of silver standard annotations (precision of up to 42%, the F1-measure drops when combined with NCBO Annotator and cTAKES due to a low recall. In conclusion, the performances of individual systems need to be improved independently from the text types, and the leveraging strategies to best take advantage of individual systems' annotations need to be revised. The textual content of the PubMed corpus, accession numbers for the clinical trials corpus, and assigned annotations of the four concept recognition systems as well as the generated silver standard annotation sets are available from http://purl.org/phenotype/resources. The textual content

  2. How Viral Poems are Annotated : On ‘OCD’ by Neil Hilborn

    NARCIS (Netherlands)

    van der Starre, K.A.

    2015-01-01

    In How Viral Poems are Annotated: On ‘OCD’ by Neil Hilborn Kila van der Starre explores how, where and by whom viral poems are annotated. The article focuses on the performance of the poem ‘OCD’ by Neil Hilborn that went viral in the summer of 2013 and has been viewed more than 10 million times on

  3. Current trend of annotating single nucleotide variation in humans--A case study on SNVrap.

    Science.gov (United States)

    Li, Mulin Jun; Wang, Junwen

    2015-06-01

    As high throughput methods, such as whole genome genotyping arrays, whole exome sequencing (WES) and whole genome sequencing (WGS), have detected huge amounts of genetic variants associated with human diseases, function annotation of these variants is an indispensable step in understanding disease etiology. Large-scale functional genomics projects, such as The ENCODE Project and Roadmap Epigenomics Project, provide genome-wide profiling of functional elements across different human cell types and tissues. With the urgent demands for identification of disease-causal variants, comprehensive and easy-to-use annotation tool is highly in demand. Here we review and discuss current progress and trend of the variant annotation field. Furthermore, we introduce a comprehensive web portal for annotating human genetic variants. We use gene-based features and the latest functional genomics datasets to annotate single nucleotide variation (SNVs) in human, at whole genome scale. We further apply several function prediction algorithms to annotate SNVs that might affect different biological processes, including transcriptional gene regulation, alternative splicing, post-transcriptional regulation, translation and post-translational modifications. The SNVrap web portal is freely available at http://jjwanglab.org/snvrap. Copyright © 2014 Elsevier Inc. All rights reserved.

  4. A community effort towards a knowledge-base and mathematical model of the human pathogen Salmonella Typhimurium LT2

    Directory of Open Access Journals (Sweden)

    Shin Sook-Il

    2011-01-01

    Full Text Available Abstract Background Metabolic reconstructions (MRs are common denominators in systems biology and represent biochemical, genetic, and genomic (BiGG knowledge-bases for target organisms by capturing currently available information in a consistent, structured manner. Salmonella enterica subspecies I serovar Typhimurium is a human pathogen, causes various diseases and its increasing antibiotic resistance poses a public health problem. Results Here, we describe a community-driven effort, in which more than 20 experts in S. Typhimurium biology and systems biology collaborated to reconcile and expand the S. Typhimurium BiGG knowledge-base. The consensus MR was obtained starting from two independently developed MRs for S. Typhimurium. Key results of this reconstruction jamboree include i development and implementation of a community-based workflow for MR annotation and reconciliation; ii incorporation of thermodynamic information; and iii use of the consensus MR to identify potential multi-target drug therapy approaches. Conclusion Taken together, with the growing number of parallel MRs a structured, community-driven approach will be necessary to maximize quality while increasing adoption of MRs in experimental design and interpretation.

  5. A community effort towards a knowledge-base and mathematical model of the human pathogen Salmonella Typhimurium LT2.

    Science.gov (United States)

    Thiele, Ines; Hyduke, Daniel R; Steeb, Benjamin; Fankam, Guy; Allen, Douglas K; Bazzani, Susanna; Charusanti, Pep; Chen, Feng-Chi; Fleming, Ronan M T; Hsiung, Chao A; De Keersmaecker, Sigrid C J; Liao, Yu-Chieh; Marchal, Kathleen; Mo, Monica L; Özdemir, Emre; Raghunathan, Anu; Reed, Jennifer L; Shin, Sook-il; Sigurbjörnsdóttir, Sara; Steinmann, Jonas; Sudarsan, Suresh; Swainston, Neil; Thijs, Inge M; Zengler, Karsten; Palsson, Bernhard O; Adkins, Joshua N; Bumann, Dirk

    2011-01-18

    Metabolic reconstructions (MRs) are common denominators in systems biology and represent biochemical, genetic, and genomic (BiGG) knowledge-bases for target organisms by capturing currently available information in a consistent, structured manner. Salmonella enterica subspecies I serovar Typhimurium is a human pathogen, causes various diseases and its increasing antibiotic resistance poses a public health problem. Here, we describe a community-driven effort, in which more than 20 experts in S. Typhimurium biology and systems biology collaborated to reconcile and expand the S. Typhimurium BiGG knowledge-base. The consensus MR was obtained starting from two independently developed MRs for S. Typhimurium. Key results of this reconstruction jamboree include i) development and implementation of a community-based workflow for MR annotation and reconciliation; ii) incorporation of thermodynamic information; and iii) use of the consensus MR to identify potential multi-target drug therapy approaches. Taken together, with the growing number of parallel MRs a structured, community-driven approach will be necessary to maximize quality while increasing adoption of MRs in experimental design and interpretation.

  6. A community effort towards a knowledge-base and mathematical model of the human pathogen Salmonella Typhimurium LT2

    Energy Technology Data Exchange (ETDEWEB)

    Thiele, Ines; Hyduke, Daniel R.; Steeb, Benjamin; Fankam, Guy; Allen, Douglas K.; Bazzani, Susanna; Charusanti, Pep; Chen, Feng-Chi; Fleming, Ronan MT; Hsiung, Chao A.; De Keersmaecker, Sigrid CJ; Liao, Yu-Chieh; Marchal, Kathleen; Mo, Monica L.; Özdemir, Emre; Raghunathan, Anu; Reed, Jennifer L.; Shin, Sook-Il; Sigurbjörnsdóttir, Sara; Steinmann, Jonas; Sudarsan, Suresh; Swainston, Neil; Thijs, Inge M.; Zengler, Karsten; Palsson, Bernhard O.; Adkins, Joshua N.; Bumann, Dirk

    2011-01-01

    Metabolic reconstructions (MRs) are common denominators in systems biology and represent biochemical, genetic, and genomic (BiGG) knowledge-bases for target organisms by capturing currently available information in a consistent, structured manner. Salmonella enterica subspecies I serovar Typhimurium is a human pathogen, causes various diseases and its increasing antibiotic resistance poses a public health problem. Here, we describe a community-driven effort, in which more than 20 experts in S. Typhimurium biology and systems biology collaborated to reconcile and expand the S. Typhimurium BiGG knowledge-base. The consensus MR was obtained starting from two independently developed MRs for S. Typhimurium. Key results of this reconstruction jamboree include i) development and implementation of a community-based workflow for MR annotation and reconciliation; ii) incorporation of thermodynamic information; and iii) use of the consensus MR to identify potential multi-target drug therapy approaches. Finally, taken together, with the growing number of parallel MRs a structured, community-driven approach will be necessary to maximize quality while increasing adoption of MRs in experimental design and interpretation.

  7. Expression profiling of hypothetical genes in Desulfovibrio vulgaris leads to improved functional annotation

    Energy Technology Data Exchange (ETDEWEB)

    Elias, Dwayne A.; Mukhopadhyay, Aindrila; Joachimiak, Marcin P.; Drury, Elliott C.; Redding, Alyssa M.; Yen, Huei-Che B.; Fields, Matthew W.; Hazen, Terry C.; Arkin, Adam P.; Keasling, Jay D.; Wall, Judy D.

    2008-10-27

    Hypothetical and conserved hypothetical genes account for>30percent of sequenced bacterial genomes. For the sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough, 347 of the 3634 genes were annotated as conserved hypothetical (9.5percent) along with 887 hypothetical genes (24.4percent). Given the large fraction of the genome, it is plausible that some of these genes serve critical cellular roles. The study goals were to determine which genes were expressed and provide a more functionally based annotation. To accomplish this, expression profiles of 1234 hypothetical and conserved genes were used from transcriptomic datasets of 11 environmental stresses, complemented with shotgun LC-MS/MS and AMT tag proteomic data. Genes were divided into putatively polycistronic operons and those predicted to be monocistronic, then classified by basal expression levels and grouped according to changes in expression for one or multiple stresses. 1212 of these genes were transcribed with 786 producing detectable proteins. There was no evidence for expression of 17 predicted genes. Except for the latter, monocistronic gene annotation was expanded using the above criteria along with matching Clusters of Orthologous Groups. Polycistronic genes were annotated in the same manner with inferences from their proximity to more confidently annotated genes. Two targeted deletion mutants were used as test cases to determine the relevance of the inferred functional annotations.

  8. Legal Information Sources: An Annotated Bibliography.

    Science.gov (United States)

    Conner, Ronald C.

    This 25-page annotated bibliography describes the legal reference materials in the special collection of a medium-sized public library. Sources are listed in 12 categories: cases, dictionaries, directories, encyclopedias, forms, references for the lay person, general, indexes, laws and legislation, legal research aids, periodicals, and specialized…

  9. Just-in-time : on strategy annotations

    NARCIS (Netherlands)

    J.C. van de Pol (Jaco)

    2001-01-01

    textabstractA simple kind of strategy annotations is investigated, giving rise to a class of strategies, including leftmost-innermost. It is shown that under certain restrictions, an interpreter can be written which computes the normal form of a term in a bottom-up traversal. The main contribution

  10. Snap: an integrated SNP annotation platform

    DEFF Research Database (Denmark)

    Li, Shengting; Ma, Lijia; Li, Heng

    2007-01-01

    Snap (Single Nucleotide Polymorphism Annotation Platform) is a server designed to comprehensively analyze single genes and relationships between genes basing on SNPs in the human genome. The aim of the platform is to facilitate the study of SNP finding and analysis within the framework of medical...

  11. Improved annotation through genome-scale metabolic modeling of Aspergillus oryzae

    DEFF Research Database (Denmark)

    Vongsangnak, Wanwipa; Olsen, Peter; Hansen, Kim

    2008-01-01

    Background: Since ancient times the filamentous fungus Aspergillus oryzae has been used in the fermentation industry for the production of fermented sauces and the production of industrial enzymes. Recently, the genome sequence of A. oryzae with 12,074 annotated genes was released but the number...... to a genome scale metabolic model of A. oryzae. Results: Our assembled EST sequences we identified 1,046 newly predicted genes in the A. oryzae genome. Furthermore, it was possible to assign putative protein functions to 398 of the newly predicted genes. Noteworthy, our annotation strategy resulted...... model was validated and shown to correctly describe the phenotypic behavior of A. oryzae grown on different carbon sources. Conclusion: A much enhanced annotation of the A. oryzae genome was performed and a genomescale metabolic model of A. oryzae was reconstructed. The model accurately predicted...

  12. Supporting Listening Comprehension and Vocabulary Acquisition with Multimedia Annotations: The Students' Voice.

    Science.gov (United States)

    Jones, Linda C.

    2003-01-01

    Extends Mayer's (1997, 2001) generative theory of multimedia learning and investigates under what conditions multimedia annotations can support listening comprehension in a second language. Highlights students' views on the effectiveness of multimedia annotations (visual and verbal) in assisting them in their comprehension and acquisition of…

  13. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation

    DEFF Research Database (Denmark)

    Zhang, Han; Yohe, Tanner; Huang, Le

    2018-01-01

    of plant and plant-associated microbial genomes and metagenomes being sequenced, there is an urgent need of automatic tools for genomic data mining of CAZymes. We developed the dbCAN web server in 2012 to provide a public service for automated CAZyme annotation for newly sequenced genomes. Here, dbCAN2...... (http://cys.bios.niu.edu/dbCAN2) is presented as an updated meta server, which integrates three state-of-the-art tools for CAZome (all CAZymes of a genome) annotation: (i) HMMER search against the dbCAN HMM (hidden Markov model) database; (ii) DIAMOND search against the CAZy pre-annotated CAZyme...

  14. Genome, Functional Gene Annotation, and Nuclear Transformation of the Heterokont Oleaginous Alga Nannochloropsis oceanica CCMP1779

    Science.gov (United States)

    2012-11-15

    development of such an algal model system for basic discovery, we sequenced the genome and two sets of transcriptomes of N. oceanica CCMP1779, assembled...CCMP1779 has a gene encoding a highly conserved violax- anthin de-epoxidase ( VDE ) protein like that found in plants (Table S9). In Arabidopsis, VDE is...HLA3 or LCI1 were present. This result suggests that CCMP1779 might have a plastid Ci transport system similar to that of Chlamydomonas, but a distinct

  15. GRADUATE AND PROFESSIONAL EDUCATION, AN ANNOTATED BIBLIOGRAPHY.

    Science.gov (United States)

    HEISS, ANN M.; AND OTHERS

    THIS ANNOTATED BIBLIOGRAPHY CONTAINS REFERENCES TO GENERAL GRADUATE EDUCATION AND TO EDUCATION FOR THE FOLLOWING PROFESSIONAL FIELDS--ARCHITECTURE, BUSINESS, CLINICAL PSYCHOLOGY, DENTISTRY, ENGINEERING, LAW, LIBRARY SCIENCE, MEDICINE, NURSING, SOCIAL WORK, TEACHING, AND THEOLOGY. (HW)

  16. AnnoLnc: a web server for systematically annotating novel human lncRNAs.

    Science.gov (United States)

    Hou, Mei; Tang, Xing; Tian, Feng; Shi, Fangyuan; Liu, Fenglin; Gao, Ge

    2016-11-16

    Long noncoding RNAs (lncRNAs) have been shown to play essential roles in almost every important biological process through multiple mechanisms. Although the repertoire of human lncRNAs has rapidly expanded, their biological function and regulation remain largely elusive, calling for a systematic and integrative annotation tool. Here we present AnnoLnc ( http://annolnc.cbi.pku.edu.cn ), a one-stop portal for systematically annotating novel human lncRNAs. Based on more than 700 data sources and various tool chains, AnnoLnc enables a systematic annotation covering genomic location, secondary structure, expression patterns, transcriptional regulation, miRNA interaction, protein interaction, genetic association and evolution. An intuitive web interface is available for interactive analysis through both desktops and mobile devices, and programmers can further integrate AnnoLnc into their pipeline through standard JSON-based Web Service APIs. To the best of our knowledge, AnnoLnc is the only web server to provide on-the-fly and systematic annotation for newly identified human lncRNAs. Compared with similar tools, the annotation generated by AnnoLnc covers a much wider spectrum with intuitive visualization. Case studies demonstrate the power of AnnoLnc in not only rediscovering known functions of human lncRNAs but also inspiring novel hypotheses.

  17. Radiocesium bioaccumulation in freshwater plankton: Influences of cation concentrations (K{sup +} and Na{sup +}) on direct uptake of {sup 137}Cs in Chlamydomonas, Scenedesmus and Daphnia. Food-chain transfer of {sup 137}Cs from Chlamydomonas to Daphnia at different K{sup +} concentrations

    Energy Technology Data Exchange (ETDEWEB)

    Hagstroem, J. [Uppsala Univ., Dept. of Limnology, Uppsala (Sweden)

    2002-04-01

    The influences of cation concentrations (K{sup +} and Na{sup +}) on radiocesium ({sup 137}Cs) bioaccumulation in two freshwater phytoplankton species (Scenedesmus quadricauda and Chlamydomonas noctigama) were systematically investigated in batch-cultures monitored during two weeks. Both species were cultured at 9 {mu}E M{sup -2} s{sup -1} constant illumination at 20 deg. C. The exponential growth phase lasted for more than 100 hours ({mu} {approx_equal} 0.02 h{sup -1} for C. noctigama and 0.03 h{sup -1} for S, quadricauda). Over cation concentration ranges encountered in natural fresh waters ([K{sup +}] from 0.1 {mu}M to 3 mM, [Na{sup +}] from 20 {mu}M to 3 mM), a more than three order of magnitude variation was found for both intake rate and observed bioconcentration factors (BCF) at apparent steady-state (from less than 10{sup 3} to 10{sup 6} L (kg C){sup -1}). For both species, the major effector on BCF and uptake rate was external [K{sup +}], which was inversely proportional to these parameters over wide ranges (1-1000 {mu}M for S. quadricauda and 0.1 to 300 {mu}M for C. noctigama). At concentrations above these ranges K{sup +} still reduced {sup 137} Cs bio-uptake, but less effectively. A minor influence of external [Na{sup +}] on {sup 137}Cs bioaccumulation was indicated for S. quadricauda, whereas no such influence was significant for C. noctigama. A biphasic pattern for {sup 137}Cs bioaccumulation was discovered in C. noctigama. A rapid 'quasi-steady state' with an effective equilibration time of less than 100 hours was approached during the exponential growth phase. A surge in the uptake occurred when exponential growth ceased, and this pattern was consistent over the range 30 {mu}M to 1.4 mM external [K{sup +}]. Since depletion of external [K{sup +}] was not detected for these treatments, this pattern can only be explained if there are at least two different cellular compartments involved. Although less certain, a second steady-state BCF

  18. Effects of Annotations and Homework on Learning Achievement: An Empirical Study of Scratch Programming Pedagogy

    Science.gov (United States)

    Su, Addison Y. S.; Huang, Chester S. J.; Yang, Stephen J. H.; Ding, T. J.; Hsieh, Y. Z.

    2015-01-01

    In Taiwan elementary schools, Scratch programming has been taught for more than four years. Previous studies have shown that personal annotations is a useful learning method that improve learning performance. An annotation-based Scratch programming (ASP) system provides for the creation, share, and review of annotations and homework solutions in…

  19. Crowdsourcing image annotation for nucleus detection and segmentation in computational pathology: evaluating experts, automated methods, and the crowd.

    Science.gov (United States)

    Irshad, H; Montaser-Kouhsari, L; Waltz, G; Bucur, O; Nowak, J A; Dong, F; Knoblauch, N W; Beck, A H

    2015-01-01

    The development of tools in computational pathology to assist physicians and biomedical scientists in the diagnosis of disease requires access to high-quality annotated images for algorithm learning and evaluation. Generating high-quality expert-derived annotations is time-consuming and expensive. We explore the use of crowdsourcing for rapidly obtaining annotations for two core tasks in com- putational pathology: nucleus detection and nucleus segmentation. We designed and implemented crowdsourcing experiments using the CrowdFlower platform, which provides access to a large set of labor channel partners that accesses and manages millions of contributors worldwide. We obtained annotations from four types of annotators and compared concordance across these groups. We obtained: crowdsourced annotations for nucleus detection and segmentation on a total of 810 images; annotations using automated methods on 810 images; annotations from research fellows for detection and segmentation on 477 and 455 images, respectively; and expert pathologist-derived annotations for detection and segmentation on 80 and 63 images, respectively. For the crowdsourced annotations, we evaluated performance across a range of contributor skill levels (1, 2, or 3). The crowdsourced annotations (4,860 images in total) were completed in only a fraction of the time and cost required for obtaining annotations using traditional methods. For the nucleus detection task, the research fellow-derived annotations showed the strongest concordance with the expert pathologist- derived annotations (F-M =93.68%), followed by the crowd-sourced contributor levels 1,2, and 3 and the automated method, which showed relatively similar performance (F-M = 87.84%, 88.49%, 87.26%, and 86.99%, respectively). For the nucleus segmentation task, the crowdsourced contributor level 3-derived annotations, research fellow-derived annotations, and automated method showed the strongest concordance with the expert pathologist

  20. nGASP - the nematode genome annotation assessment project

    Energy Technology Data Exchange (ETDEWEB)

    Coghlan, A; Fiedler, T J; McKay, S J; Flicek, P; Harris, T W; Blasiar, D; Allen, J; Stein, L D

    2008-12-19

    While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second place. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy as reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs were the most challenging for gene-finders. While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets for 10 Mb of the C

  1. The standard operating procedure of the DOE-JGI Microbial Genome Annotation Pipeline (MGAP v.4).

    Science.gov (United States)

    Huntemann, Marcel; Ivanova, Natalia N; Mavromatis, Konstantinos; Tripp, H James; Paez-Espino, David; Palaniappan, Krishnaveni; Szeto, Ernest; Pillay, Manoj; Chen, I-Min A; Pati, Amrita; Nielsen, Torben; Markowitz, Victor M; Kyrpides, Nikos C

    2015-01-01

    The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.

  2. Águas com predominância de Eutreptia lanowi steuer e Chlamydomonas reinhardi dangeard no plancton, na enseada de Inhauma, Baía de Guanabara

    Directory of Open Access Journals (Sweden)

    Lejeune P. H. de Oliveira

    1962-03-01

    Full Text Available In brackish waters of a creek of Guanabara Bay, the author points by the first time the presence of Chlamydomonas reinhardi, Eutreptia lanowi, Oscillatoria putrida, O. limosa, O. chlorina that were unknown in our waters; such biologic indicators proved themselves pollutional conditions, so bad a stark-mesosaprobic regime. Other news are plankton analysis by the Standar methods, of two most expressive samples of water masses;also the mobility of the plankters are measured in micra by second.

  3. Towards the VWO Annotation Service: a Success Story of the IMAGE RPI Expert Rating System

    Science.gov (United States)

    Reinisch, B. W.; Galkin, I. A.; Fung, S. F.; Benson, R. F.; Kozlov, A. V.; Khmyrov, G. M.; Garcia, L. N.

    2010-12-01

    Interpretation of Heliophysics wave data requires specialized knowledge of wave phenomena. Users of the virtual wave observatory (VWO) will greatly benefit from a data annotation service that will allow querying of data by phenomenon type, thus helping accomplish the VWO goal to make Heliophysics wave data searchable, understandable, and usable by the scientific community. Individual annotations can be sorted by phenomenon type and reduced into event lists (catalogs). However, in contrast to the event lists, annotation records allow a greater flexibility of collaborative management by more easily admitting operations of addition, revision, or deletion. They can therefore become the building blocks for an interactive Annotation Service with a suitable graphic user interface to the VWO middleware. The VWO Annotation Service vision is an interactive, collaborative sharing of domain expert knowledge with fellow scientists and students alike. An effective prototype of the VWO Annotation Service has been in operation at the University of Massachusetts Lowell since 2001. An expert rating system (ERS) was developed for annotating the IMAGE radio plasma imager (RPI) active sounding data containing 1.2 million plasmagrams. The RPI data analysts can use ERS to submit expert ratings of plasmagram features, such as presence of echo traces resulted from reflected RPI signals from distant plasma structures. Since its inception in 2001, the RPI ERS has accumulated 7351 expert plasmagram ratings in 16 phenomenon categories, together with free-text descriptions and other metadata. In addition to human expert ratings, the system holds 225,125 ratings submitted by the CORPRAL data prospecting software that employs a model of the human pre-attentive vision to select images potentially containing interesting features. The annotation records proved to be instrumental in a number of investigations where manual data exploration would have been prohibitively tedious and expensive

  4. Skin Cancer Education Materials: Selected Annotations.

    Science.gov (United States)

    National Cancer Inst. (NIH), Bethesda, MD.

    This annotated bibliography presents 85 entries on a variety of approaches to cancer education. The entries are grouped under three broad headings, two of which contain smaller sub-divisions. The first heading, Public Education, contains prevention and general information, and non-print materials. The second heading, Professional Education,…

  5. Ten steps to get started in Genome Assembly and Annotation [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Victoria Dominguez Del Angel

    2018-02-01

    Full Text Available As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR.

  6. Small molecule annotation for the Protein Data Bank.

    Science.gov (United States)

    Sen, Sanchayita; Young, Jasmine; Berrisford, John M; Chen, Minyu; Conroy, Matthew J; Dutta, Shuchismita; Di Costanzo, Luigi; Gao, Guanghua; Ghosh, Sutapa; Hudson, Brian P; Igarashi, Reiko; Kengaku, Yumiko; Liang, Yuhe; Peisach, Ezra; Persikova, Irina; Mukhopadhyay, Abhik; Narayanan, Buvaneswari Coimbatore; Sahni, Gaurav; Sato, Junko; Sekharan, Monica; Shao, Chenghua; Tan, Lihua; Zhuravleva, Marina A

    2014-01-01

    The Protein Data Bank (PDB) is the single global repository for three-dimensional structures of biological macromolecules and their complexes, and its more than 100,000 structures contain more than 20,000 distinct ligands or small molecules bound to proteins and nucleic acids. Information about these small molecules and their interactions with proteins and nucleic acids is crucial for our understanding of biochemical processes and vital for structure-based drug design. Small molecules present in a deposited structure may be attached to a polymer or may occur as a separate, non-covalently linked ligand. During curation of a newly deposited structure by wwPDB annotation staff, each molecule is cross-referenced to the PDB Chemical Component Dictionary (CCD). If the molecule is new to the PDB, a dictionary description is created for it. The information about all small molecule components found in the PDB is distributed via the ftp archive as an external reference file. Small molecule annotation in the PDB also includes information about ligand-binding sites and about covalent and other linkages between ligands and macromolecules. During the remediation of the peptide-like antibiotics and inhibitors present in the PDB archive in 2011, it became clear that additional annotation was required for consistent representation of these molecules, which are quite often composed of several sequential subcomponents including modified amino acids and other chemical groups. The connectivity information of the modified amino acids is necessary for correct representation of these biologically interesting molecules. The combined information is made available via a new resource called the Biologically Interesting molecules Reference Dictionary, which is complementary to the CCD and is now routinely used for annotation of peptide-like antibiotics and inhibitors. © The Author(s) 2014. Published by Oxford University Press.

  7. Functional Annotation of Ion Channel Structures by Molecular Simulation.

    Science.gov (United States)

    Trick, Jemma L; Chelvaniththilan, Sivapalan; Klesse, Gianni; Aryal, Prafulla; Wallace, E Jayne; Tucker, Stephen J; Sansom, Mark S P

    2016-12-06

    Ion channels play key roles in cell membranes, and recent advances are yielding an increasing number of structures. However, their functional relevance is often unclear and better tools are required for their functional annotation. In sub-nanometer pores such as ion channels, hydrophobic gating has been shown to promote dewetting to produce a functionally closed (i.e., non-conductive) state. Using the serotonin receptor (5-HT 3 R) structure as an example, we demonstrate the use of molecular dynamics to aid the functional annotation of channel structures via simulation of the behavior of water within the pore. Three increasingly complex simulation analyses are described: water equilibrium densities; single-ion free-energy profiles; and computational electrophysiology. All three approaches correctly predict the 5-HT 3 R crystal structure to represent a functionally closed (i.e., non-conductive) state. We also illustrate the application of water equilibrium density simulations to annotate different conformational states of a glycine receptor. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  8. iPad: Semantic annotation and markup of radiological images.

    Science.gov (United States)

    Rubin, Daniel L; Rodriguez, Cesar; Shah, Priyanka; Beaulieu, Chris

    2008-11-06

    Radiological images contain a wealth of information,such as anatomy and pathology, which is often not explicit and computationally accessible. Information schemes are being developed to describe the semantic content of images, but such schemes can be unwieldy to operationalize because there are few tools to enable users to capture structured information easily as part of the routine research workflow. We have created iPad, an open source tool enabling researchers and clinicians to create semantic annotations on radiological images. iPad hides the complexity of the underlying image annotation information model from users, permitting them to describe images and image regions using a graphical interface that maps their descriptions to structured ontologies semi-automatically. Image annotations are saved in a variety of formats,enabling interoperability among medical records systems, image archives in hospitals, and the Semantic Web. Tools such as iPad can help reduce the burden of collecting structured information from images, and it could ultimately enable researchers and physicians to exploit images on a very large scale and glean the biological and physiological significance of image content.

  9. A Machine Learning Based Analytical Framework for Semantic Annotation Requirements

    OpenAIRE

    Hamed Hassanzadeh; MohammadReza Keyvanpour

    2011-01-01

    The Semantic Web is an extension of the current web in which information is given well-defined meaning. The perspective of Semantic Web is to promote the quality and intelligence of the current web by changing its contents into machine understandable form. Therefore, semantic level information is one of the cornerstones of the Semantic Web. The process of adding semantic metadata to web resources is called Semantic Annotation. There are many obstacles against the Semantic Annotation, such as ...

  10. The influence of annotation in graphical organizers

    NARCIS (Netherlands)

    Bezdan, Eniko; Kester, Liesbeth; Kirschner, Paul A.

    2013-01-01

    Bezdan, E., Kester, L., & Kirschner, P. A. (2012, 29-31 August). The influence of annotation in graphical organizers. Poster presented at the biannual meeting of the EARLI Special Interest Group Comprehension of Text and Graphics, Grenoble, France.

  11. Annotation of mammalian primary microRNAs

    Directory of Open Access Journals (Sweden)

    Enright Anton J

    2008-11-01

    Full Text Available Abstract Background MicroRNAs (miRNAs are important regulators of gene expression and have been implicated in development, differentiation and pathogenesis. Hundreds of miRNAs have been discovered in mammalian genomes. Approximately 50% of mammalian miRNAs are expressed from introns of protein-coding genes; the primary transcript (pri-miRNA is therefore assumed to be the host transcript. However, very little is known about the structure of pri-miRNAs expressed from intergenic regions. Here we annotate transcript boundaries of miRNAs in human, mouse and rat genomes using various transcription features. The 5' end of the pri-miRNA is predicted from transcription start sites, CpG islands and 5' CAGE tags mapped in the upstream flanking region surrounding the precursor miRNA (pre-miRNA. The 3' end of the pri-miRNA is predicted based on the mapping of polyA signals, and supported by cDNA/EST and ditags data. The predicted pri-miRNAs are also analyzed for promoter and insulator-associated regulatory regions. Results We define sets of conserved and non-conserved human, mouse and rat pre-miRNAs using bidirectional BLAST and synteny analysis. Transcription features in their flanking regions are used to demarcate the 5' and 3' boundaries of the pri-miRNAs. The lengths and boundaries of primary transcripts are highly conserved between orthologous miRNAs. A significant fraction of pri-miRNAs have lengths between 1 and 10 kb, with very few introns. We annotate a total of 59 pri-miRNA structures, which include 82 pre-miRNAs. 36 pri-miRNAs are conserved in all 3 species. In total, 18 of the confidently annotated transcripts express more than one pre-miRNA. The upstream regions of 54% of the predicted pri-miRNAs are found to be associated with promoter and insulator regulatory sequences. Conclusion Little is known about the primary transcripts of intergenic miRNAs. Using comparative data, we are able to identify the boundaries of a significant proportion of

  12. SAS- Semantic Annotation Service for Geoscience resources on the web

    Science.gov (United States)

    Elag, M.; Kumar, P.; Marini, L.; Li, R.; Jiang, P.

    2015-12-01

    There is a growing need for increased integration across the data and model resources that are disseminated on the web to advance their reuse across different earth science applications. Meaningful reuse of resources requires semantic metadata to realize the semantic web vision for allowing pragmatic linkage and integration among resources. Semantic metadata associates standard metadata with resources to turn them into semantically-enabled resources on the web. However, the lack of a common standardized metadata framework as well as the uncoordinated use of metadata fields across different geo-information systems, has led to a situation in which standards and related Standard Names abound. To address this need, we have designed SAS to provide a bridge between the core ontologies required to annotate resources and information systems in order to enable queries and analysis over annotation from a single environment (web). SAS is one of the services that are provided by the Geosematnic framework, which is a decentralized semantic framework to support the integration between models and data and allow semantically heterogeneous to interact with minimum human intervention. Here we present the design of SAS and demonstrate its application for annotating data and models. First we describe how predicates and their attributes are extracted from standards and ingested in the knowledge-base of the Geosemantic framework. Then we illustrate the application of SAS in annotating data managed by SEAD and annotating simulation models that have web interface. SAS is a step in a broader approach to raise the quality of geoscience data and models that are published on the web and allow users to better search, access, and use of the existing resources based on standard vocabularies that are encoded and published using semantic technologies.

  13. Argumentation Theory. [A Selected Annotated Bibliography].

    Science.gov (United States)

    Benoit, William L.

    Materials dealing with aspects of argumentation theory are cited in this annotated bibliography. The 50 citations are organized by topic as follows: (1) argumentation; (2) the nature of argument; (3) traditional perspectives on argument; (4) argument diagrams; (5) Chaim Perelman's theory of rhetoric; (6) the evaluation of argument; (7) argument…

  14. Draft Genome Sequence and Gene Annotation of the Entomopathogenic Fungus Verticillium hemipterigenum

    OpenAIRE

    Horn, Fabian; Habel, Andreas; Scharf, Daniel H.; Dworschak, Jan; Brakhage, Axel A.; Guthke, Reinhard; Hertweck, Christian; Linde, J?rg

    2015-01-01

    Verticillium hemipterigenum (anamorph Torrubiella hemipterigena) is an entomopathogenic fungus and produces a broad range of secondary metabolites. Here, we present the draft genome sequence of the fungus, including gene structure and functional annotation. Genes were predicted incorporating RNA-Seq data and functionally annotated to provide the basis for further genome studies.

  15. Annotating patient clinical records with syntactic chunks and named entities: the Harvey Corpus.

    Science.gov (United States)

    Savkov, Aleksandar; Carroll, John; Koeling, Rob; Cassell, Jackie

    The free text notes typed by physicians during patient consultations contain valuable information for the study of disease and treatment. These notes are difficult to process by existing natural language analysis tools since they are highly telegraphic (omitting many words), and contain many spelling mistakes, inconsistencies in punctuation, and non-standard word order. To support information extraction and classification tasks over such text, we describe a de-identified corpus of free text notes, a shallow syntactic and named entity annotation scheme for this kind of text, and an approach to training domain specialists with no linguistic background to annotate the text. Finally, we present a statistical chunking system for such clinical text with a stable learning rate and good accuracy, indicating that the manual annotation is consistent and that the annotation scheme is tractable for machine learning.

  16. CurlySMILES: a chemical language to customize and annotate encodings of molecular and nanodevice structures

    Directory of Open Access Journals (Sweden)

    Drefahl Axel

    2011-01-01

    Full Text Available Abstract CurlySMILES is a chemical line notation which extends SMILES with annotations for storage, retrieval and modeling of interlinked, coordinated, assembled and adsorbed molecules in supramolecular structures and nanodevices. Annotations are enclosed in curly braces and anchored to an atomic node or at the end of the molecular graph depending on the annotation type. CurlySMILES includes predefined annotations for stereogenicity, electron delocalization charges, extra-molecular interactions and connectivity, surface attachment, solutions, and crystal structures and allows extensions for domain-specific annotations. CurlySMILES provides a shorthand format to encode molecules with repetitive substructural parts or motifs such as monomer units in macromolecules and amino acids in peptide chains. CurlySMILES further accommodates special formats for non-molecular materials that are commonly denoted by composition of atoms or substructures rather than complete atom connectivity.

  17. A Flexible Object-of-Interest Annotation Framework for Online Video Portals

    Directory of Open Access Journals (Sweden)

    Robert Sorschag

    2012-02-01

    Full Text Available In this work, we address the use of object recognition techniques to annotate what is shown where in online video collections. These annotations are suitable to retrieve specific video scenes for object related text queries which is not possible with the manually generated metadata that is used by current portals. We are not the first to present object annotations that are generated with content-based analysis methods. However, the proposed framework possesses some outstanding features that offer good prospects for its application in real video portals. Firstly, it can be easily used as background module in any video environment. Secondly, it is not based on a fixed analysis chain but on an extensive recognition infrastructure that can be used with all kinds of visual features, matching and machine learning techniques. New recognition approaches can be integrated into this infrastructure with low development costs and a configuration of the used recognition approaches can be performed even on a running system. Thus, this framework might also benefit from future advances in computer vision. Thirdly, we present an automatic selection approach to support the use of different recognition strategies for different objects. Last but not least, visual analysis can be performed efficiently on distributed, multi-processor environments and a database schema is presented to store the resulting video annotations as well as the off-line generated low-level features in a compact form. We achieve promising results in an annotation case study and the instance search task of the TRECVID 2011 challenge.

  18. BLAST-based structural annotation of protein residues using Protein Data Bank.

    Science.gov (United States)

    Singh, Harinder; Raghava, Gajendra P S

    2016-01-25

    In the era of next-generation sequencing where thousands of genomes have been already sequenced; size of protein databases is growing with exponential rate. Structural annotation of these proteins is one of the biggest challenges for the computational biologist. Although, it is easy to perform BLAST search against Protein Data Bank (PDB) but it is difficult for a biologist to annotate protein residues from BLAST search. A web-server StarPDB has been developed for structural annotation of a protein based on its similarity with known protein structures. It uses standard BLAST software for performing similarity search of a query protein against protein structures in PDB. This server integrates wide range modules for assigning different types of annotation that includes, Secondary-structure, Accessible surface area, Tight-turns, DNA-RNA and Ligand modules. Secondary structure module allows users to predict regular secondary structure states to each residue in a protein. Accessible surface area predict the exposed or buried residues in a protein. Tight-turns module is designed to predict tight turns like beta-turns in a protein. DNA-RNA module developed for predicting DNA and RNA interacting residues in a protein. Similarly, Ligand module of server allows one to predicted ligands, metal and nucleotides ligand interacting residues in a protein. In summary, this manuscript presents a web server for comprehensive annotation of a protein based on similarity search. It integrates number of visualization tools that facilitate users to understand structure and function of protein residues. This web server is available freely for scientific community from URL http://crdd.osdd.net/raghava/starpdb .

  19. DFAST and DAGA: web-based integrated genome annotation tools and resources.

    Science.gov (United States)

    Tanizawa, Yasuhiro; Fujisawa, Takatomo; Kaminuma, Eli; Nakamura, Yasukazu; Arita, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus , obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii , whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp.

  20. Annotation of selection strengths in viral genomes

    DEFF Research Database (Denmark)

    McCauley, Stephen; de Groot, Saskia; Mailund, Thomas

    2007-01-01

    Motivation: Viral genomes tend to code in overlapping reading frames to maximize information content. This may result in atypical codon bias and particular evolutionary constraints. Due to the fast mutation rate of viruses, there is additional strong evidence for varying selection between intra......- and intergenomic regions. The presence of multiple coding regions complicates the concept of Ka/Ks ratio, and thus begs for an alternative approach when investigating selection strengths. Building on the paper by McCauley & Hein (2006), we develop a method for annotating a viral genome coding in overlapping...... may thus achieve an annotation both of coding regions as well as selection strengths, allowing us to investigate different selection patterns and hypotheses. Results: We illustrate our method by applying it to a multiple alignment of four HIV2 sequences, as well as four Hepatitis B sequences. We...

  1. Annotating functional RNAs in genomes using Infernal.

    Science.gov (United States)

    Nawrocki, Eric P

    2014-01-01

    Many different types of functional non-coding RNAs participate in a wide range of important cellular functions but the large majority of these RNAs are not routinely annotated in published genomes. Several programs have been developed for identifying RNAs, including specific tools tailored to a particular RNA family as well as more general ones designed to work for any family. Many of these tools utilize covariance models (CMs), statistical models of the conserved sequence, and structure of an RNA family. In this chapter, as an illustrative example, the Infernal software package and CMs from the Rfam database are used to identify RNAs in the genome of the archaeon Methanobrevibacter ruminantium, uncovering some additional RNAs not present in the genome's initial annotation. Analysis of the results and comparison with family-specific methods demonstrate some important strengths and weaknesses of this general approach.

  2. Linking human diseases to animal models using ontology-based phenotype annotation.

    Directory of Open Access Journals (Sweden)

    Nicole L Washington

    2009-11-01

    Full Text Available Scientists and clinicians who study genetic alterations and disease have traditionally described phenotypes in natural language. The considerable variation in these free-text descriptions has posed a hindrance to the important task of identifying candidate genes and models for human diseases and indicates the need for a computationally tractable method to mine data resources for mutant phenotypes. In this study, we tested the hypothesis that ontological annotation of disease phenotypes will facilitate the discovery of new genotype-phenotype relationships within and across species. To describe phenotypes using ontologies, we used an Entity-Quality (EQ methodology, wherein the affected entity (E and how it is affected (Q are recorded using terms from a variety of ontologies. Using this EQ method, we annotated the phenotypes of 11 gene-linked human diseases described in Online Mendelian Inheritance in Man (OMIM. These human annotations were loaded into our Ontology-Based Database (OBD along with other ontology-based phenotype descriptions of mutants from various model organism databases. Phenotypes recorded with this EQ method can be computationally compared based on the hierarchy of terms in the ontologies and the frequency of annotation. We utilized four similarity metrics to compare phenotypes and developed an ontology of homologous and analogous anatomical structures to compare phenotypes between species. Using these tools, we demonstrate that we can identify, through the similarity of the recorded phenotypes, other alleles of the same gene, other members of a signaling pathway, and orthologous genes and pathway members across species. We conclude that EQ-based annotation of phenotypes, in conjunction with a cross-species ontology, and a variety of similarity metrics can identify biologically meaningful similarities between genes by comparing phenotypes alone. This annotation and search method provides a novel and efficient means to identify

  3. CAGE_peaks_annotation - FANTOM5 | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data List Contact us FANTOM...file File name: CAGE_peaks_annotation File URL: ftp://ftp.biosciencedbc.jp/archive/fantom...on Download License Update History of This Database Site Policy | Contact Us CAGE_peaks_annotation - FANTOM5 | LSDB Archive ...

  4. The Community Junior College: An Annotated Bibliography.

    Science.gov (United States)

    Rarig, Emory W., Jr., Ed.

    This annotated bibliography on the junior college is arranged by topic: research tools, history, functions and purposes, organization and administration, students, programs, personnel, facilities, and research. It covers publications through the fall of 1965 and has an author index. (HH)

  5. The energy balance of the biomass generation of Chlamydomonas acidophila under acidic and neutral conditions and Chlamydomonas reinhardtii; Die Energiebilanz der Biomasseneubildung von Chlamydomonas acidophila unter sauren und neutralen Bedingungen und von Chlamydomonas reinhardtii

    Energy Technology Data Exchange (ETDEWEB)

    Langner, Uwe

    2009-01-16

    In this study the influence of pH < 3 as an extreme environment has been investigated for the eukaryotic green alga Chlamydomonas (C.) acidophila. The limited number of trophic levels, consisting of bacteria, phytoplankton, zooplankton and macrophytes, is a special characteristic of extreme acidic water bodies. C. acidophila was isolated from an extreme acidic mining lake (RL 111) (Bissinger et al. 2000). A special feature of the examined algal species is its wide tolerance range of external pH values from 2 to 7 (Cassin 1874, Gerloff-Elias et al. 2005a). C. acidophila is a dominant species in the acidic mining lakes, it can grow up to chlorophyll maxima of 500 {mu}g L{sup -1} during the summer time (Nixdorf et al. 1998, 2003). The alga can be found elsewhere in extreme acidic water bodies around the world. The hydrochemistry of the acidic mining lakes in the central regions of Germany and Lusatia show clear differences compared to neutral water bodies. Some of the characteristics of acidic mining lakes are high metal and heavy metal (aluminum) concentrations, high ion concentrations, which lead to high conductivity, as well as low phosphate concentrations, ammonium as the only nitrogen source and CO{sub 2} as the only inorganic carbon source. Many eukaryotic microalgae in acidic water bodies including C. acidophila show a neutral cytosolic pH. This is provided by special adaptations of the organisms. Thus, organisms in extreme acidic environments have a positive cell surface charge, a very efficient H{sup +}-ATPase and high internal buffer capacities. These mechanisms work contrary to the proton influx and the acidification of the cytosol and are therefore proof for the physiological impact of osmoregulation by microalgae in extreme acidic environments (Sekler et al.1991, Weiss et al. 1999). Hence, these mechanisms also complicate the access to nutrients for the algal cell. The passive influx of positively charged ions such as potassium or ammonium is reduced by

  6. A Resource of Quantitative Functional Annotation for Homo sapiens Genes.

    Science.gov (United States)

    Taşan, Murat; Drabkin, Harold J; Beaver, John E; Chua, Hon Nian; Dunham, Julie; Tian, Weidong; Blake, Judith A; Roth, Frederick P

    2012-02-01

    The body of human genomic and proteomic evidence continues to grow at ever-increasing rates, while annotation efforts struggle to keep pace. A surprisingly small fraction of human genes have clear, documented associations with specific functions, and new functions continue to be found for characterized genes. Here we assembled an integrated collection of diverse genomic and proteomic data for 21,341 human genes and make quantitative associations of each to 4333 Gene Ontology terms. We combined guilt-by-profiling and guilt-by-association approaches to exploit features unique to the data types. Performance was evaluated by cross-validation, prospective validation, and by manual evaluation with the biological literature. Functional-linkage networks were also constructed, and their utility was demonstrated by identifying candidate genes related to a glioma FLN using a seed network from genome-wide association studies. Our annotations are presented-alongside existing validated annotations-in a publicly accessible and searchable web interface.

  7. Annotating and Interpreting Linear and Cyclic Peptide Tandem Mass Spectra.

    Science.gov (United States)

    Niedermeyer, Timo Horst Johannes

    2016-01-01

    Nonribosomal peptides often possess pronounced bioactivity, and thus, they are often interesting hit compounds in natural product-based drug discovery programs. Their mass spectrometric characterization is difficult due to the predominant occurrence of non-proteinogenic monomers and, especially in the case of cyclic peptides, the complex fragmentation patterns observed. This makes nonribosomal peptide tandem mass spectra annotation challenging and time-consuming. To meet this challenge, software tools for this task have been developed. In this chapter, the workflow for using the software mMass for the annotation of experimentally obtained peptide tandem mass spectra is described. mMass is freely available (http://www.mmass.org), open-source, and the most advanced and user-friendly software tool for this purpose. The software enables the analyst to concisely annotate and interpret tandem mass spectra of linear and cyclic peptides. Thus, it is highly useful for accelerating the structure confirmation and elucidation of cyclic as well as linear peptides and depsipeptides.

  8. Great Basin Experimental Range: Annotated bibliography

    Science.gov (United States)

    E. Durant McArthur; Bryce A. Richardson; Stanley G. Kitchen

    2013-01-01

    This annotated bibliography documents the research that has been conducted on the Great Basin Experimental Range (GBER, also known as the Utah Experiment Station, Great Basin Station, the Great Basin Branch Experiment Station, Great Basin Experimental Center, and other similar name variants) over the 102 years of its existence. Entries were drawn from the original...

  9. FIGENIX: Intelligent automation of genomic annotation: expertise integration in a new software platform

    Directory of Open Access Journals (Sweden)

    Pontarotti Pierre

    2005-08-01

    Full Text Available Abstract Background Two of the main objectives of the genomic and post-genomic era are to structurally and functionally annotate genomes which consists of detecting genes' position and structure, and inferring their function (as well as of other features of genomes. Structural and functional annotation both require the complex chaining of numerous different software, algorithms and methods under the supervision of a biologist. The automation of these pipelines is necessary to manage huge amounts of data released by sequencing projects. Several pipelines already automate some of these complex chaining but still necessitate an important contribution of biologists for supervising and controlling the results at various steps. Results Here we propose an innovative automated platform, FIGENIX, which includes an expert system capable to substitute to human expertise at several key steps. FIGENIX currently automates complex pipelines of structural and functional annotation under the supervision of the expert system (which allows for example to make key decisions, check intermediate results or refine the dataset. The quality of the results produced by FIGENIX is comparable to those obtained by expert biologists with a drastic gain in terms of time costs and avoidance of errors due to the human manipulation of data. Conclusion The core engine and expert system of the FIGENIX platform currently handle complex annotation processes of broad interest for the genomic community. They could be easily adapted to new, or more specialized pipelines, such as for example the annotation of miRNAs, the classification of complex multigenic families, annotation of regulatory elements and other genomic features of interest.

  10. SplicingTypesAnno: annotating and quantifying alternative splicing events for RNA-Seq data.

    Science.gov (United States)

    Sun, Xiaoyong; Zuo, Fenghua; Ru, Yuanbin; Guo, Jiqiang; Yan, Xiaoyan; Sablok, Gaurav

    2015-04-01

    Alternative splicing plays a key role in the regulation of the central dogma. Four major types of alternative splicing have been classified as intron retention, exon skipping, alternative 5 splice sites or alternative donor sites, and alternative 3 splice sites or alternative acceptor sites. A few algorithms have been developed to detect splice junctions from RNA-Seq reads. However, there are few tools targeting at the major alternative splicing types at the exon/intron level. This type of analysis may reveal subtle, yet important events of alternative splicing, and thus help gain deeper understanding of the mechanism of alternative splicing. This paper describes a user-friendly R package, extracting, annotating and analyzing alternative splicing types for sequence alignment files from RNA-Seq. SplicingTypesAnno can: (1) provide annotation for major alternative splicing at exon/intron level. By comparing the annotation from GTF/GFF file, it identifies the novel alternative splicing sites; (2) offer a convenient two-level analysis: genome-scale annotation for users with high performance computing environment, and gene-scale annotation for users with personal computers; (3) generate a user-friendly web report and additional BED files for IGV visualization. SplicingTypesAnno is a user-friendly R package for extracting, annotating and analyzing alternative splicing types at exon/intron level for sequence alignment files from RNA-Seq. It is publically available at https://sourceforge.net/projects/splicingtypes/files/ or http://genome.sdau.edu.cn/research/software/SplicingTypesAnno.html. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  11. Annotation of Regular Polysemy

    DEFF Research Database (Denmark)

    Martinez Alonso, Hector

    Regular polysemy has received a lot of attention from the theory of lexical semantics and from computational linguistics. However, there is no consensus on how to represent the sense of underspecified examples at the token level, namely when annotating or disambiguating senses of metonymic words...... and metonymic. We have conducted an analysis in English, Danish and Spanish. Later on, we have tried to replicate the human judgments by means of unsupervised and semi-supervised sense prediction. The automatic sense-prediction systems have been unable to find empiric evidence for the underspecified sense, even...

  12. Phenex: ontological annotation of phenotypic diversity.

    Directory of Open Access Journals (Sweden)

    James P Balhoff

    2010-05-01

    Full Text Available Phenotypic differences among species have long been systematically itemized and described by biologists in the process of investigating phylogenetic relationships and trait evolution. Traditionally, these descriptions have been expressed in natural language within the context of individual journal publications or monographs. As such, this rich store of phenotype data has been largely unavailable for statistical and computational comparisons across studies or integration with other biological knowledge.Here we describe Phenex, a platform-independent desktop application designed to facilitate efficient and consistent annotation of phenotypic similarities and differences using Entity-Quality syntax, drawing on terms from community ontologies for anatomical entities, phenotypic qualities, and taxonomic names. Phenex can be configured to load only those ontologies pertinent to a taxonomic group of interest. The graphical user interface was optimized for evolutionary biologists accustomed to working with lists of taxa, characters, character states, and character-by-taxon matrices.Annotation of phenotypic data using ontologies and globally unique taxonomic identifiers will allow biologists to integrate phenotypic data from different organisms and studies, leveraging decades of work in systematics and comparative morphology.

  13. Phenex: ontological annotation of phenotypic diversity.

    Science.gov (United States)

    Balhoff, James P; Dahdul, Wasila M; Kothari, Cartik R; Lapp, Hilmar; Lundberg, John G; Mabee, Paula; Midford, Peter E; Westerfield, Monte; Vision, Todd J

    2010-05-05

    Phenotypic differences among species have long been systematically itemized and described by biologists in the process of investigating phylogenetic relationships and trait evolution. Traditionally, these descriptions have been expressed in natural language within the context of individual journal publications or monographs. As such, this rich store of phenotype data has been largely unavailable for statistical and computational comparisons across studies or integration with other biological knowledge. Here we describe Phenex, a platform-independent desktop application designed to facilitate efficient and consistent annotation of phenotypic similarities and differences using Entity-Quality syntax, drawing on terms from community ontologies for anatomical entities, phenotypic qualities, and taxonomic names. Phenex can be configured to load only those ontologies pertinent to a taxonomic group of interest. The graphical user interface was optimized for evolutionary biologists accustomed to working with lists of taxa, characters, character states, and character-by-taxon matrices. Annotation of phenotypic data using ontologies and globally unique taxonomic identifiers will allow biologists to integrate phenotypic data from different organisms and studies, leveraging decades of work in systematics and comparative morphology.

  14. An Annotated Bibliography of Education for Medical Librarianship, 1940-1968

    Science.gov (United States)

    Shirley, Sherrilynne

    1969-01-01

    An attempt has been made in this bibliography to represent the various viewpoints concerning education for medical librarianship equally. The topics covered include: general background reading and readings for those interested in establishing courses in medical librarianship. The former includes annotations on the history and international aspects of the subject. The latter consists of annotations of articles on early courses and present courses in medical librarianship. A final area discussed is the Medical Library Association's Code for the Training and Certification of Medical Librarians. PMID:4898629

  15. Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics.

    Science.gov (United States)

    Chepelev, Leonid L; Riazanov, Alexandre; Kouznetsov, Alexandre; Low, Hong Sang; Dumontier, Michel; Baker, Christopher J O

    2011-07-26

    The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI) framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of our integrative methodology in the context of

  16. Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics

    Directory of Open Access Journals (Sweden)

    Dumontier Michel

    2011-07-01

    Full Text Available Abstract Background The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. Results As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of

  17. MUTAGEN: Multi-user tool for annotating GENomes

    DEFF Research Database (Denmark)

    Brugger, K.; Redder, P.; Skovgaard, Marie

    2003-01-01

    MUTAGEN is a free prokaryotic annotation system. It offers the advantages of genome comparison, graphical sequence browsers, search facilities and open-source for user-specific adjustments. The web-interface allows several users to access the system from standard desktop computers. The Sulfolobus...

  18. CommWalker: correctly evaluating modules in molecular networks in light of annotation bias.

    Science.gov (United States)

    Luecken, M D; Page, M J T; Crosby, A J; Mason, S; Reinert, G; Deane, C M

    2018-03-15

    Detecting novel functional modules in molecular networks is an important step in biological research. In the absence of gold standard functional modules, functional annotations are often used to verify whether detected modules/communities have biological meaning. However, as we show, the uneven distribution of functional annotations means that such evaluation methods favor communities of well-studied proteins. We propose a novel framework for the evaluation of communities as functional modules. Our proposed framework, CommWalker, takes communities as inputs and evaluates them in their local network environment by performing short random walks. We test CommWalker's ability to overcome annotation bias using input communities from four community detection methods on two protein interaction networks. We find that modules accepted by CommWalker are similarly co-expressed as those accepted by current methods. Crucially, CommWalker performs well not only in well-annotated regions, but also in regions otherwise obscured by poor annotation. CommWalker community prioritization both faithfully captures well-validated communities and identifies functional modules that may correspond to more novel biology. The CommWalker algorithm is freely available at opig.stats.ox.ac.uk/resources or as a docker image on the Docker Hub at hub.docker.com/r/lueckenmd/commwalker/. deane@stats.ox.ac.uk. Supplementary data are available at Bioinformatics online.

  19. The AnnoLite and AnnoLyze programs for comparative annotation of protein structures

    Directory of Open Access Journals (Sweden)

    Dopazo Joaquín

    2007-05-01

    Full Text Available Abstract Background Advances in structural biology, including structural genomics, have resulted in a rapid increase in the number of experimentally determined protein structures. However, about half of the structures deposited by the structural genomics consortia have little or no information about their biological function. Therefore, there is a need for tools for automatically and comprehensively annotating the function of protein structures. We aim to provide such tools by applying comparative protein structure annotation that relies on detectable relationships between protein structures to transfer functional annotations. Here we introduce two programs, AnnoLite and AnnoLyze, which use the structural alignments deposited in the DBAli database. Description AnnoLite predicts the SCOP, CATH, EC, InterPro, PfamA, and GO terms with an average sensitivity of ~90% and average precision of ~80%. AnnoLyze predicts ligand binding site and domain interaction patches with an average sensitivity of ~70% and average precision of ~30%, correctly localizing binding sites for small molecules in ~95% of its predictions. Conclusion The AnnoLite and AnnoLyze programs for comparative annotation of protein structures can reliably and automatically annotate new protein structures. The programs are fully accessible via the Internet as part of the DBAli suite of tools at http://salilab.org/DBAli/.

  20. Retrieval-based Face Annotation by Weak Label Regularized Local Coordinate Coding.

    Science.gov (United States)

    Wang, Dayong; Hoi, Steven C H; He, Ying; Zhu, Jianke; Mei, Tao; Luo, Jiebo

    2013-08-02

    Retrieval-based face annotation is a promising paradigm of mining massive web facial images for automated face annotation. This paper addresses a critical problem of such paradigm, i.e., how to effectively perform annotation by exploiting the similar facial images and their weak labels which are often noisy and incomplete. In particular, we propose an effective Weak Label Regularized Local Coordinate Coding (WLRLCC) technique, which exploits the principle of local coordinate coding in learning sparse features, and employs the idea of graph-based weak label regularization to enhance the weak labels of the similar facial images. We present an efficient optimization algorithm to solve the WLRLCC task. We conduct extensive empirical studies on two large-scale web facial image databases: (i) a Western celebrity database with a total of $6,025$ persons and $714,454$ web facial images, and (ii)an Asian celebrity database with $1,200$ persons and $126,070$ web facial images. The encouraging results validate the efficacy of the proposed WLRLCC algorithm. To further improve the efficiency and scalability, we also propose a PCA-based approximation scheme and an offline approximation scheme (AWLRLCC), which generally maintains comparable results but significantly saves much time cost. Finally, we show that WLRLCC can also tackle two existing face annotation tasks with promising performance.

  1. Similarity maps and hierarchical clustering for annotating FT-IR spectral images.

    Science.gov (United States)

    Zhong, Qiaoyong; Yang, Chen; Großerüschkamp, Frederik; Kallenbach-Thieltges, Angela; Serocka, Peter; Gerwert, Klaus; Mosig, Axel

    2013-11-20

    Unsupervised segmentation of multi-spectral images plays an important role in annotating infrared microscopic images and is an essential step in label-free spectral histopathology. In this context, diverse clustering approaches have been utilized and evaluated in order to achieve segmentations of Fourier Transform Infrared (FT-IR) microscopic images that agree with histopathological characterization. We introduce so-called interactive similarity maps as an alternative annotation strategy for annotating infrared microscopic images. We demonstrate that segmentations obtained from interactive similarity maps lead to similarly accurate segmentations as segmentations obtained from conventionally used hierarchical clustering approaches. In order to perform this comparison on quantitative grounds, we provide a scheme that allows to identify non-horizontal cuts in dendrograms. This yields a validation scheme for hierarchical clustering approaches commonly used in infrared microscopy. We demonstrate that interactive similarity maps may identify more accurate segmentations than hierarchical clustering based approaches, and thus are a viable and due to their interactive nature attractive alternative to hierarchical clustering. Our validation scheme furthermore shows that performance of hierarchical two-means is comparable to the traditionally used Ward's clustering. As the former is much more efficient in time and memory, our results suggest another less resource demanding alternative for annotating large spectral images.

  2. Book Reviews, Annotation, and Web Technology.

    Science.gov (United States)

    Schulze, Patricia

    From reading texts to annotating web pages, grade 6-8 students rely on group cooperation and individual reading and writing skills in this research project that spans six 50-minute lessons. Student objectives for this project are that they will: read, discuss, and keep a journal on a book in literature circles; understand the elements of and…

  3. H2DB: a heritability database across multiple species by annotating trait-associated genomic loci.

    Science.gov (United States)

    Kaminuma, Eli; Fujisawa, Takatomo; Tanizawa, Yasuhiro; Sakamoto, Naoko; Kurata, Nori; Shimizu, Tokurou; Nakamura, Yasukazu

    2013-01-01

    H2DB (http://tga.nig.ac.jp/h2db/), an annotation database of genetic heritability estimates for humans and other species, has been developed as a knowledge database to connect trait-associated genomic loci. Heritability estimates have been investigated for individual species, particularly in human twin studies and plant/animal breeding studies. However, there appears to be no comprehensive heritability database for both humans and other species. Here, we introduce an annotation database for genetic heritabilities of various species that was annotated by manually curating online public resources in PUBMED abstracts and journal contents. The proposed heritability database contains attribute information for trait descriptions, experimental conditions, trait-associated genomic loci and broad- and narrow-sense heritability specifications. Annotated trait-associated genomic loci, for which most are single-nucleotide polymorphisms derived from genome-wide association studies, may be valuable resources for experimental scientists. In addition, we assigned phenotype ontologies to the annotated traits for the purposes of discussing heritability distributions based on phenotypic classifications.

  4. BG7: A New Approach for Bacterial Genome Annotation Designed for Next Generation Sequencing Data

    Science.gov (United States)

    Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Pareja, Eduardo; Tobes, Raquel

    2012-01-01

    BG7 is a new system for de novo bacterial, archaeal and viral genome annotation based on a new approach specifically designed for annotating genomes sequenced with next generation sequencing technologies. The system is versatile and able to annotate genes even in the step of preliminary assembly of the genome. It is especially efficient detecting unexpected genes horizontally acquired from bacterial or archaeal distant genomes, phages, plasmids, and mobile elements. From the initial phases of the gene annotation process, BG7 exploits the massive availability of annotated protein sequences in databases. BG7 predicts ORFs and infers their function based on protein similarity with a wide set of reference proteins, integrating ORF prediction and functional annotation phases in just one step. BG7 is especially tolerant to sequencing errors in start and stop codons, to frameshifts, and to assembly or scaffolding errors. The system is also tolerant to the high level of gene fragmentation which is frequently found in not fully assembled genomes. BG7 current version – which is developed in Java, takes advantage of Amazon Web Services (AWS) cloud computing features, but it can also be run locally in any operating system. BG7 is a fast, automated and scalable system that can cope with the challenge of analyzing the huge amount of genomes that are being sequenced with NGS technologies. Its capabilities and efficiency were demonstrated in the 2011 EHEC Germany outbreak in which BG7 was used to get the first annotations right the next day after the first entero-hemorrhagic E. coli genome sequences were made publicly available. The suitability of BG7 for genome annotation has been proved for Illumina, 454, Ion Torrent, and PacBio sequencing technologies. Besides, thanks to its plasticity, our system could be very easily adapted to work with new technologies in the future. PMID:23185310

  5. New in protein structure and function annotation: hotspots, single nucleotide polymorphisms and the 'Deep Web'.

    Science.gov (United States)

    Bromberg, Yana; Yachdav, Guy; Ofran, Yanay; Schneider, Reinhard; Rost, Burkhard

    2009-05-01

    The rapidly increasing quantity of protein sequence data continues to widen the gap between available sequences and annotations. Comparative modeling suggests some aspects of the 3D structures of approximately half of all known proteins; homology- and network-based inferences annotate some aspect of function for a similar fraction of the proteome. For most known protein sequences, however, there is detailed knowledge about neither their function nor their structure. Comprehensive efforts towards the expert curation of sequence annotations have failed to meet the demand of the rapidly increasing number of available sequences. Only the automated prediction of protein function in the absence of homology can close the gap between available sequences and annotations in the foreseeable future. This review focuses on two novel methods for automated annotation, and briefly presents an outlook on how modern web software may revolutionize the field of protein sequence annotation. First, predictions of protein binding sites and functional hotspots, and the evolution of these into the most successful type of prediction of protein function from sequence will be discussed. Second, a new tool, comprehensive in silico mutagenesis, which contributes important novel predictions of function and at the same time prepares for the onset of the next sequencing revolution, will be described. While these two new sub-fields of protein prediction represent the breakthroughs that have been achieved methodologically, it will then be argued that a different development might further change the way biomedical researchers benefit from annotations: modern web software can connect the worldwide web in any browser with the 'Deep Web' (ie, proprietary data resources). The availability of this direct connection, and the resulting access to a wealth of data, may impact drug discovery and development more than any existing method that contributes to protein annotation.

  6. BG7: a new approach for bacterial genome annotation designed for next generation sequencing data.

    Directory of Open Access Journals (Sweden)

    Pablo Pareja-Tobes

    Full Text Available BG7 is a new system for de novo bacterial, archaeal and viral genome annotation based on a new approach specifically designed for annotating genomes sequenced with next generation sequencing technologies. The system is versatile and able to annotate genes even in the step of preliminary assembly of the genome. It is especially efficient detecting unexpected genes horizontally acquired from bacterial or archaeal distant genomes, phages, plasmids, and mobile elements. From the initial phases of the gene annotation process, BG7 exploits the massive availability of annotated protein sequences in databases. BG7 predicts ORFs and infers their function based on protein similarity with a wide set of reference proteins, integrating ORF prediction and functional annotation phases in just one step. BG7 is especially tolerant to sequencing errors in start and stop codons, to frameshifts, and to assembly or scaffolding errors. The system is also tolerant to the high level of gene fragmentation which is frequently found in not fully assembled genomes. BG7 current version - which is developed in Java, takes advantage of Amazon Web Services (AWS cloud computing features, but it can also be run locally in any operating system. BG7 is a fast, automated and scalable system that can cope with the challenge of analyzing the huge amount of genomes that are being sequenced with NGS technologies. Its capabilities and efficiency were demonstrated in the 2011 EHEC Germany outbreak in which BG7 was used to get the first annotations right the next day after the first entero-hemorrhagic E. coli genome sequences were made publicly available. The suitability of BG7 for genome annotation has been proved for Illumina, 454, Ion Torrent, and PacBio sequencing technologies. Besides, thanks to its plasticity, our system could be very easily adapted to work with new technologies in the future.

  7. Evaluating automatically annotated treebanks for linguistic research

    NARCIS (Netherlands)

    Bloem, J.; Bański, P.; Kupietz, M.; Lüngen, H.; Witt, A.; Barbaresi, A.; Biber, H.; Breiteneder, E.; Clematide, S.

    2016-01-01

    This study discusses evaluation methods for linguists to use when employing an automatically annotated treebank as a source of linguistic evidence. While treebanks are usually evaluated with a general measure over all the data, linguistic studies often focus on a particular construction or a group

  8. DIMA – Annotation guidelines for German intonation

    DEFF Research Database (Denmark)

    Kügler, Frank; Smolibocki, Bernadett; Arnold, Denis

    2015-01-01

    This paper presents newly developed guidelines for prosodic annotation of German as a consensus system agreed upon by German intonologists. The DIMA system is rooted in the framework of autosegmental-metrical phonology. One important goal of the consensus is to make exchanging data between groups...

  9. Annotating State of Mind in Meeting Data

    NARCIS (Netherlands)

    Heylen, Dirk K.J.; Reidsma, Dennis; Ordelman, Roeland J.F.; Devillers, L.; Martin, J-C.; Cowie, R.; Batliner, A.

    We discuss the annotation procedure for mental state and emotion that is under development for the AMI (Augmented Multiparty Interaction) corpus. The categories that were found to be most appropriate relate not only to emotions but also to (meta-)cognitive states and interpersonal variables. The

  10. Role of metal mixtures (Ca, Cu and Pb) on Cd bioaccumulation and phytochelatin production by Chlamydomonas reinhardtii.

    Science.gov (United States)

    Abboud, Pauline; Wilkinson, Kevin J

    2013-08-01

    The goal of the study was to determine whether metal uptake and biological effects could be predicted by free ion concentrations when organisms were exposed to Cd and a second metal. Bioaccumulation and algal phytochelatin (PC) concentrations were determined for Chlamydomonas reinhardtii following a 6-h exposure. Bioaccumulation results, after six hours of exposure, showed that Cd uptake decreased in the presence of relatively high concentrations of Ca, Cu and Pb, however, Cd bioaccumulation increased in the presence of ca. equimolar concentrations of Cu. A good correlation was observed between the production of PCs and the amount of metals bioaccumulated for the binary mixtures of Cd-Pb and Cd-Cu, but not the Cd-Ca mixture. Overall, the results suggested that, in the case of mixtures, bioaccumulated metal rather than free ion concentrations would be a better predictor of biological effect. Copyright © 2013 Elsevier Ltd. All rights reserved.

  11. Geothermal wetlands: an annotated bibliography of pertinent literature

    Energy Technology Data Exchange (ETDEWEB)

    Stanley, N.E.; Thurow, T.L.; Russell, B.F.; Sullivan, J.F.

    1980-05-01

    This annotated bibliography covers the following topics: algae, wetland ecosystems; institutional aspects; macrophytes - general, production rates, and mineral absorption; trace metal absorption; wetland soils; water quality; and other aspects of marsh ecosystems. (MHR)

  12. Review of actinide-sediment reactions with an annotated bibliography

    Energy Technology Data Exchange (ETDEWEB)

    Ames, L.L.; Rai, D.; Serne, R.J.

    1976-02-10

    The annotated bibliography is divided into sections on chemistry and geochemistry, migration and accumulation, cultural distributions, natural distributions, and bibliographies and annual reviews. (LK)

  13. Flagellar coordination in Chlamydomonas cells held on micropipettes.

    Science.gov (United States)

    Rüffer, U; Nultsch, W

    1998-01-01

    The two flagella of Chlamydomonas are known to beat synchronously: During breaststroke beating they are generally coordinated in a bilateral way while in shock responses during undulatory beating coordination is mostly parallel [Rüffer and Nultsch, 1995: Botanica Acta 108:169-276]. Analysis of a great number of shock responses revealed that in undulatory beats also periods of bilateral coordination are found and that the coordination type may change several times during a shock response, without concomitant changes of the beat envelope and the beat period. In normal wt cells no coordination changes are found during breaststroke beating, but only short temporary asynchronies: During 2 or 3 normal beats of the cis flagellum, the trans flagellum performs 3 or 4 flat beats with a reduced beat envelope and a smaller beat period, resulting in one additional trans beat. Long periods with flat beats of the same shape and beat period are found in both flagella of the non-phototactic mutant ptx1 and in defective wt 622E cells. During these periods, the coordination is parallel, the two flagella beat alternately. A correlation between normal asynchronous trans beats and the parallel-coordinated beats in the presumably cis defective cells and also the undulatory beats is discussed. In the cis defective cells, a perpetual spontaneous change between parallel beats with small beat periods (higher beat frequency) and bilateral beats with greater beat periods (lower beat frequency) are observed and render questionable the existence of two different intrinsic beat frequencies of the two flagella cis and trans. Asynchronies occur spontaneously but may also be induced by light changes, either step-up or step-down, but not by both stimuli in turn as breaststroke flagellar photoresponses (BFPRs). Asynchronies are not involved in phototaxis. They are independent of the BFPRs, which are supposed to be the basis of phototaxis. Both types of coordination must be assumed to be regulated

  14. A computational platform to maintain and migrate manual functional annotations for BioCyc databases.

    Science.gov (United States)

    Walsh, Jesse R; Sen, Taner Z; Dickerson, Julie A

    2014-10-12

    BioCyc databases are an important resource for information on biological pathways and genomic data. Such databases represent the accumulation of biological data, some of which has been manually curated from literature. An essential feature of these databases is the continuing data integration as new knowledge is discovered. As functional annotations are improved, scalable methods are needed for curators to manage annotations without detailed knowledge of the specific design of the BioCyc database. We have developed CycTools, a software tool which allows curators to maintain functional annotations in a model organism database. This tool builds on existing software to improve and simplify annotation data imports of user provided data into BioCyc databases. Additionally, CycTools automatically resolves synonyms and alternate identifiers contained within the database into the appropriate internal identifiers. Automating steps in the manual data entry process can improve curation efforts for major biological databases. The functionality of CycTools is demonstrated by transferring GO term annotations from MaizeCyc to matching proteins in CornCyc, both maize metabolic pathway databases available at MaizeGDB, and by creating strain specific databases for metabolic engineering.

  15. Chlamydomonas outer arm dynein alters conformation in response to Ca2+.

    Science.gov (United States)

    Sakato, Miho; Sakakibara, Hitoshi; King, Stephen M

    2007-09-01

    We have previously shown that Ca(2+) directly activates ATP-sensitive microtubule binding by a Chlamydomonas outer arm dynein subparticle containing the beta and gamma heavy chains (HCs). The gamma HC-associated LC4 light chain is a member of the calmodulin family and binds 1-2 Ca(2+) with K(Ca) = 3 x 10(-5) M in vitro, suggesting it may act as a Ca(2+) sensor for outer arm dynein. Here we investigate interactions between the LC4 light chain and gamma HC. Two IQ consensus motifs for binding calmodulin-like proteins are located within the stem domain of the gamma heavy chain. In vitro experiments indicate that LC4 undergoes a Ca(2+)-dependent interaction with the IQ motif domain while remaining tethered to the HC. LC4 also moves into close proximity of the intermediate chain IC1 in the presence of Ca(2+). The sedimentation profile of the gamma HC subunit changed subtly upon Ca(2+) addition, suggesting that the entire complex had become more compact, and electron microscopy of the isolated gamma subunit revealed a distinct alteration in conformation of the N-terminal stem in response to Ca(2+) addition. We propose that Ca(2+)-dependent conformational change of LC4 has a direct effect on the stem domain of the gamma HC, which eventually leads to alterations in mechanochemical interactions between microtubules and the motor domain(s) of the outer dynein arm.

  16. Visual Interpretation with Three-Dimensional Annotations (VITA): Three-Dimensional Image Interpretation Tool for Radiological Reporting

    OpenAIRE

    Roy, Sharmili; Brown, Michael S.; Shih, George L.

    2013-01-01

    This paper introduces a software framework called Visual Interpretation with Three-Dimensional Annotations (VITA) that is able to automatically generate three-dimensional (3D) visual summaries based on radiological annotations made during routine exam reporting. VITA summaries are in the form of rotating 3D volumes where radiological annotations are highlighted to place important clinical observations into a 3D context. The rendered volume is produced as a Digital Imaging and Communications i...

  17. Automated Eukaryotic Gene Structure Annotation Using EVidenceModeler and the Program to Assemble Spliced Alignments

    Energy Technology Data Exchange (ETDEWEB)

    Haas, B J; Salzberg, S L; Zhu, W; Pertea, M; Allen, J E; Orvis, J; White, O; Buell, C R; Wortman, J R

    2007-12-10

    EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

  18. Developing national on-line services to annotate and analyse underwater imagery in a research cloud

    Science.gov (United States)

    Proctor, R.; Langlois, T.; Friedman, A.; Davey, B.

    2017-12-01

    Fish image annotation data is currently collected by various research, management and academic institutions globally (+100,000's hours of deployments) with varying degrees of standardisation and limited formal collaboration or data synthesis. We present a case study of how national on-line services, developed within a domain-oriented research cloud, have been used to annotate habitat images and synthesise fish annotation data sets collected using Autonomous Underwater Vehicles (AUVs) and baited remote underwater stereo-video (stereo-BRUV). Two developing software tools have been brought together in the marine science cloud to provide marine biologists with a powerful service for image annotation. SQUIDLE+ is an online platform designed for exploration, management and annotation of georeferenced images & video data. It provides a flexible annotation framework allowing users to work with their preferred annotation schemes. We have used SQUIDLE+ to sample the habitat composition and complexity of images of the benthos collected using stereo-BRUV. GlobalArchive is designed to be a centralised repository of aquatic ecological survey data with design principles including ease of use, secure user access, flexible data import, and the collection of any sampling and image analysis information. To easily share and synthesise data we have implemented data sharing protocols, including Open Data and synthesis Collaborations, and a spatial map to explore global datasets and filter to create a synthesis. These tools in the science cloud, together with a virtual desktop analysis suite offering python and R environments offer an unprecedented capability to deliver marine biodiversity information of value to marine managers and scientists alike.

  19. Experimental annotation of post-translational features and translated coding regions in the pathogen Salmonella Typhimurium

    Energy Technology Data Exchange (ETDEWEB)

    Ansong, Charles; Tolic, Nikola; Purvine, Samuel O.; Porwollik, Steffen; Jones, Marcus B.; Yoon, Hyunjin; Payne, Samuel H.; Martin, Jessica L.; Burnet, Meagan C.; Monroe, Matthew E.; Venepally, Pratap; Smith, Richard D.; Peterson, Scott; Heffron, Fred; Mcclelland, Michael; Adkins, Joshua N.

    2011-08-25

    Complete and accurate genome annotation is crucial for comprehensive and systematic studies of biological systems. For example systems biology-oriented genome scale modeling efforts greatly benefit from accurate annotation of protein-coding genes to develop proper functioning models. However, determining protein-coding genes for most new genomes is almost completely performed by inference, using computational predictions with significant documented error rates (> 15%). Furthermore, gene prediction programs provide no information on biologically important post-translational processing events critical for protein function. With the ability to directly measure peptides arising from expressed proteins, mass spectrometry-based proteomics approaches can be used to augment and verify coding regions of a genomic sequence and importantly detect post-translational processing events. In this study we utilized “shotgun” proteomics to guide accurate primary genome annotation of the bacterial pathogen Salmonella Typhimurium 14028 to facilitate a systems-level understanding of Salmonella biology. The data provides protein-level experimental confirmation for 44% of predicted protein-coding genes, suggests revisions to 48 genes assigned incorrect translational start sites, and uncovers 13 non-annotated genes missed by gene prediction programs. We also present a comprehensive analysis of post-translational processing events in Salmonella, revealing a wide range of complex chemical modifications (70 distinct modifications) and confirming more than 130 signal peptide and N-terminal methionine cleavage events in Salmonella. This study highlights several ways in which proteomics data applied during the primary stages of annotation can improve the quality of genome annotations, especially with regards to the annotation of mature protein products.

  20. The Effect of DNA and Sodium Cholate Dispersed Single-Walled Carbon Nano tubes on the Green Algae Chlamydomonas reinhardtii

    International Nuclear Information System (INIS)

    Williams, R.M.; Cox, Z.; Dolash, B.D.; Sooter, L.J.; Williams, R.M.; Taylor, H.K.; Thomas, J.

    2014-01-01

    Increasing use of single-walled carbon nano tubes (SWCNTs) will lead to their increased release into the environment. Previous work has shown negative effects of SWCNT on growth and survival of model organisms. The aim of the current study was to determine the effect of SWCNT well-dispersed by either DNA or sodium cholate (SC) on the unicellular green algae Chlamydomonas reinhardtii in stagnant water conditions. Growth measurements were taken up to ten days for algae treated with varied levels of DNA:SWCNT or SC:SWCNT or controls, and chlorophyll content after 10 days was determined. Results show no effect on either growth or chlorophyll content of algae at any concentration or duration. This is in contradiction to prior work showing toxicity of SWCNT to environmental model organisms.